Title: |
Developing LMF-XML Bilingual Dictionaries for Colloquial Arabic Dialects |
Authors: | Graff, David/Maamouri, Mohamed |
Year: |
2012 |
Type: |
Article |
Publisher: |
European Language Resources Association (ELRA) |
Place: |
Istanbul |
In: |
Calzolari, Nicoletta/Choukri, Khalid/Declerck, Thierry/Doğan, Mehmet U./Maegaard, Bente/Mariani, Joseph/Odijk, Jan/Piperidis, Stelios (eds.): Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, 23-25 May 2012 |
Pages: |
269-274 |
Languages studied: |
Arabic - English - Language Varieties |
Keywords: |
adaptivity
data base
internet lexicography/online lexicography
XML/SGML
bilingual/multilingual lexicography
|
Medium: |
Online |
URI: |
http://www.lrec-conf.org/proceedings/lrec2012/pdf/461_Paper.pdf |
Last visited: |
17.09.2018 |
Abstract: |
The Linguistic Data Consortium and Georgetown University Press are collaborating to create updated editions of bilingual dictionaries originally published in the 1960s for English-speaking learners of Moroccan, Syrian and Iraqi Arabic. In their first editions, these dictionaries used an ad hoc Latin-alphabet orthography for each colloquial Arabic dialect, but adopted some properties of Arabic-based writing (collation order of Arabic headwords, clitic attachment to word forms in example phrases). Despite their common features, there are notable differences among the three books that impede comparisons across the dialects, as well as comparisons of each dialect to Modern Standard Arabic. In updating these volumes, we use both Arabic script and International Phonetic Alphabet orthographies; the former provides a common basis for word recognition across dialects, while the latter provides dialect-specific pronunciations. Our goal is to preserve the full content of the original publications, supplement the Arabic headword inventory with new usages, and produce a uniform lexicon structure expressible via the Lexical Markup Framework (LMF, ISO 24613). To this end, we developed a relational database schema that applies consistently to each dialect, and HTTP-based tools for searching, editing, workflow, review and inventory management. |