Title: GDEX for Slovene
Authors: Kosem, Iztok/Husák, Milos/McCarthy, Diana
Year: 2011
Type: Article
Publisher: Trojina, Institute for Applied Slovene Studies/Lexical Computing Ltd.
Place: Ljubljana/Brighton
In: Kosem, Iztok/Kosem, Karmen (eds.): Electronic lexicography in the 21st Century: New Applications for New Users. Proceedings of eLex2011, Bled, Slovenia, 10-12 November 2011
Pages: 151-159
Languages studied: Slovenian
Keywords: example
usage research
database
corpus-based lexicography
search option
cross-references
URI: http://elex2011.trojina.si/Vsebine/proceedings.html
Last accessed: 10 September 2018
Abstract: Good Dictionary Examples (GDEX) is a tool in the Sketch Engine designed to help lexicographers identify dictionary examples by ranking sentences according to how likely they are to be good candidates. The ranking is done automatically using various syntactic and lexical features. So far, only GDEX for English has been available. This paper presents the design and evaluation of Slovene GDEX, which was used for finding good examples for the new lexical database of Slovene, one of the activities in the Communication in Slovene project. Several different GDEX configurations were designed, evaluated and compared. The evaluation involved examining sentences for lemmas belonging to different word classes. Good sentences were logged for subsequent analysis with the external data-mining software WEKA. The observed behaviour was then used to adjust the parameters of the GDEX classifiers. We believe that the procedure of identifying features of good examples and their values, described in this paper, can be used for the development of GDEX for any language.