Title: A General Method for Creating a Bilingual Transliteration Dictionary
Authors: Kirschenbaum, Amit/Wintner, Shuly
Year: 2010
Type: Article
Publisher: European Language Resources Association (ELRA)
Place: Valletta, Malta
In: Barbu Mititelu, Verginica/Pekar, Viktor/Barbu, Eduard (eds.): Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valletta, 17-23 May 2010
Pages: 273-276
Languages studied: English - Hebrew
Keywords: automatic speech processing
internet lexicography/online lexicography
orthography/spelling information in dictionaries
phonetics in dictionaries/pronunciation
bilingual/multilingual lexicography
URI: http://www.lrec-conf.org/proceedings/lrec2010/pdf/16_Paper.pdf
Last accessed: 5 December 2018
Abstract: Transliteration is the rendering in one language of terms from another language (and, possibly, another writing system), approximating spelling and/or phonetic equivalents between the two languages. A transliteration dictionary is a crucial resource for a variety of natural language applications, most notably machine translation. We describe a general method for creating bilingual transliteration dictionaries from Wikipedia article titles. The method can be applied to any language pair with Wikipedia presence, independently of the writing systems involved, and requires only a single simple resource that can be provided by any literate bilingual speaker. It was successfully applied to extract a Hebrew-English transliteration dictionary which, when incorporated in a machine translation system, indeed improved its performance.