Titel: UniDic for Early Middle Japanese: a Dictionary for Morphological Analysis of Classical Japanese
Personen:Ogiso, Toshinobu/Komachi, Mamoru/Den, Yasuharu/Matsumoto, Yuji
Jahr: 2012
Typ: Aufsatz
Verlag: European Language Resources Association (ELRA)
Ortsangabe: Istanbul
In: Calzolari, Nicoletta/Choukri, Khalid/Declerck, Thierry/Doğan, Mehmet U./Maegaard, Bente/Mariani, Joseph/Odijk, Jan/Piperidis, Stelios (Hgg.): Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012), Istanbul, 23 - 25 May 2012
Seiten: 911-915
Untersuchte Sprachen: Japanisch*Japanese
Schlagwörter: Datenmodellierung*data modelling
gegenwartssprachliche Lexikografie*lexicography of contemporary language 
historische Lexikografie*historical lexicography
Internet-Lexikografie/Online-Lexikografie*internet lexicography/online lexicography
korpusbasierte Lexikografie*corpus-based lexicography
Medium: Online
URI: http://www.lrec-conf.org/proceedings/lrec2012/pdf/906_Paper.pdf
Zuletzt besucht: 10.09.2018
Abstract: In order to construct an annotated diachronic corpus of Japanese, we propose to create a new dictionary for morphological analysis of Early Middle Japanese (Classical Japanese) based on UniDic, a dictionary for Contemporary Japanese. Differences between the Early Middle Japanese and Contemporary Japanese, which prevent a naïve adaptation of UniDic to Early Middle Japanese, are found at the levels of lexicon, morphology, grammar, orthography and pronunciation. In order to overcome these problems, we extended dictionary entries and created a training corpus of Early Middle Japanese to adapt UniDic for Contemporary Japanese to Early Middle Japanese. Experimental results show that the proposed UniDic-EMJ, a new dictionary for Early Middle Japanese, achieves as high accuracy (97%) as needed for the linguistic research on lexicon and grammar in Japanese classical text analysis.