Titel: |
Lexicographic Potential of the Georgian Dialect Corpus |
Personen: | Beridze, Marina/Nadaraia, David |
Jahr: |
2016 |
Typ: |
Aufsatz |
Verlag: |
Ivane Javakhishvili Tbilisi State University |
Ortsangabe: |
Tbilisi |
In: |
Margalitadze, Tinatin/Meladze, George (Hgg.): Proceedings of the 17th EURALEX International Congress: Lexicography and Linguistic Diversity. Tbilisi, Georgia 6 - 10 September 2016 |
Seiten: |
300-309 |
Untersuchte Sprachen: |
Georgisch*Georgian - Varietäten*Language Varieties |
Schlagwörter: |
Datenbank*data base
Internet-Lexikografie/Online-Lexikografie*internet lexicography/online lexicography
korpusbasierte Lexikografie*corpus-based lexicography
lexikografischer Prozess*lexicographic process
|
Medium: |
Online |
URI: |
http://euralex.org/category/publications/euralex-2016/ |
Zuletzt besucht: |
22.10.2018 |
Abstract: |
The project Linguistic Portrait of Georgia envisages various aspects of documentation of Georgian linguistic reality by means of corpus methodologies. This title is an umbrella for three large-scale projects within the framework of which The Georgian Dialect Corpus – GDC (http://corpora.co) was developed.
Presently, the architecture and text base of the corpus have been designed, being permanently developed and updated. Besides, the lexicographic base of the corpus is organized, agglomerating data from printed dialect dictionaries. The lexical stock of the corpus is presented based on text, lexicographic and encyclopaedic data. The total quantity of tokens in the corpus is estimated to be up to 2 000 000, while the lexicographic base has 60 000 items (lemmas with entries) by now; this quantity is considerably increased owing to phonetic and grammatical variations, frequently associated with a single lexical item. |