Titel: Lexicographic Potential of the Georgian Dialect Corpus
Personen:Beridze, Marina/Nadaraia, David
Jahr: 2016
Typ: Aufsatz
Verlag: Ivane Javakhishvili Tbilisi State University
Ortsangabe: Tbilisi
In: Margalitadze, Tinatin/Meladze, George (Hgg.): Proceedings of the 17th EURALEX International Congress: Lexicography and Linguistic Diversity. Tbilisi, Georgia 6 - 10 September 2016
Seiten: 300-309
Untersuchte Sprachen: Georgisch*Georgian - Varietäten*Language Varieties
Schlagwörter: Datenbank*data base
Internet-Lexikografie/Online-Lexikografie*internet lexicography/online lexicography
korpusbasierte Lexikografie*corpus-based lexicography
lexikografischer Prozess*lexicographic process
Medium: Online
URI: http://euralex.org/category/publications/euralex-2016/
Zuletzt besucht: 22.10.2018
Abstract: The project Linguistic Portrait of Georgia envisages various aspects of documentation of Georgian linguistic reality by means of corpus methodologies. This title is an umbrella for three large-scale projects within the framework of which The Georgian Dialect Corpus – GDC (http://corpora.co) was developed. Presently, the architecture and text base of the corpus have been designed, being permanently developed and updated. Besides, the lexicographic base of the corpus is organized, agglomerating data from printed dialect dictionaries. The lexical stock of the corpus is presented based on text, lexicographic and encyclopaedic data. The total quantity of tokens in the corpus is estimated to be up to 2 000 000, while the lexicographic base has 60 000 items (lemmas with entries) by now; this quantity is considerably increased owing to phonetic and grammatical variations, frequently associated with a single lexical item.