Title: |
Slovene Lexical Database |
Authors: | Gantar, Polona/Krek, Simon |
Year: |
2011 |
Type: |
Article |
In: |
Majchráková, Daniela/Garabík, Radovan: Natural Language Processing, Multilinguality. Sixth International Conference, Modra, Slovakia, 20 - 21 October 2011 |
Pages: |
72-80 |
Languages studied: |
Slovenian |
Keywords: |
monolingual lexicography
corpus-based lexicography
microstructure
bilingual/multilingual lexicography
|
Medium: |
Online |
URI: |
http://korpus.juls.savba.sk/~slovko/2011/Proceedings_Slovko_2011.pdf |
Last accessed: |
22.10.2018 |
Abstract: |
The paper describes the concept of the new Slovene lexical database, which is being compiled within the “Communication in Slovene” project. The database has a twofold goal. First, it is intended as the basis for the future compilation of different dictionaries of Slovene, both monolingual and bilingual, and as such its concept is biased towards lexicography. Second, it will be used to enhance natural language processing tools for Slovene. The database is organized in six hierarchical levels of lexico-grammatical information, ranging from simple morphological data on the top level to semantic, syntactic and collocational data on subordinate levels, with corpus examples at the bottom. The Sketch Engine tool, with its word sketch, tickbox lexicography and GDEX modules, is used to enable faster and more efficient extraction of corpus data from the 620-million-word FidaPLUS corpus, which serves as the source of the data in the database. |