Titel: Slovene Lexical Database
Personen:Gantar, Polona/Krek, Simon
Jahr: 2011
Typ: Aufsatz
In: Majchráková, Daniela/Garabík, Radovan: Natural Language Processing, Multilinguality. Sixth International Conference, Modra, Slovakia, 20 - 21 October 2011
Seiten: 72-80
Untersuchte Sprachen: Slowenisch*Slovenian
Schlagwörter: einsprachige Lexikografie*monolingual lexicography
korpusbasierte Lexikografie*corpus-based lexicography
Mikrostruktur*microstructure
zweisprachige bzw. mehrsprachige Lexikografie*bilingual/multilingual lexicography
Medium: Online
URI: http://korpus.juls.savba.sk/~slovko/2011/Proceedings_Slovko_2011.pdf
Zuletzt besucht: 22.10.2018
Abstract: The paper describes the concept of the new Slovene lexical database which is compiled within the “Communication in Slovene” project. The database has a twofold goal: it is intended as the basis for the future compilation of different dictionaries of Slovene, both monolingual and bilingual, and as such its concept is biased towards lexicography. Secondly, it will be used for the enhancement of natural language processing tools for Slovene. The database is organized in six hierarchical levels with lexico-grammatical information which spans from simple morphological data on the top level to semantic, syntactic and collocational data on subordinate levels, with corpus examples at the bottom. Sketch Engine tool with word sketch, tickbox lexicography and GDEX modules is used to enable faster and more efficient extraction of corpus data from the 620-million word FidaPLUS corpus which is used as the source for the data in the database.