Computer Lexicons

Voice Browser Working Group,

Greetings.  Computer lexicons are a component to many natural language systems.  The pronunciation lexicon specification ( is used by both automatic speech recognition and speech synthesis engines in voice browser applications. Computer lexicons are also utilized in some approaches to machine translation.

Some existing computer lexicon formats include: EPUB3 (, TEI (, OLIF (, Apple Dictionary Services Markup (, and WordNet (

With approaches for parallel markup, such as those in MathML3 (, portions of lexicon entries, including for machine-utilizable semantic definitions, could make use of multiple formats such as both text and RDF.

Additionally, integrated development environments can be of use for creating and updating computer lexicon data.  Grammatical Framework (, for example, illustrates a programming language approach to grammatical data, and it is possible that such approaches can be of use to computer lexicon design and development processes.

Informationally, CogAlex-III, a SIGLEX endorsed, post-COLING workshop (, indicative of contemporary lexicography, is upcoming and will be discussing the analysis of the conceptual input of dictionary users, the meaning of words, structure of lexicons, methods for crafting dictionaries or indexes, dictionary access (navigation and search strategies), and interface issues, in December.

Kind regards,

Adam Sobieski 		 	   		  

Received on Thursday, 11 October 2012 15:12:37 UTC