Titel: |
Automation of Lexicographic Work Using General and Specialized Corpora: Two Case Studies |
Personen: | Kosem, Iztok/Gantar, Polona/Logar, NataĊĦa/Krek, Simon |
Jahr: |
2014 |
Typ: |
Aufsatz |
Verlag: |
Institute for Specialised Communication and Multilingualism |
Ortsangabe: |
Bolzano/Bozen |
In: |
Abel, Andrea/Vettori, Chiara/Ralli, Natascia: Proceedings of the 16th EURALEX International Congress: The User in Focus, Bolzano/Bozen, Italien 15 - 19 July 2014 |
Seiten: |
355-364 |
Untersuchte Sprachen: |
Slowenisch*Slovenian |
Schlagwörter: |
Benutzungsforschung*usage research
Datenbank*data base
Fachlexikografie*specialised lexicography/LSP lexicography
Kollokationen/Phraseologismen/Wortverbindungen*collocations/phraseologisms/multi word items
korpusbasierte Lexikografie*corpus-based lexicography
|
Medium: |
Online |
URI: |
http://euralex.org/category/publications/euralex-2014/ |
Zuletzt besucht: |
22.10.2018 |
Abstract: |
Due to increasingly large amounts of authentic data to analyse, lexicographers are nowadays looking to language technologies to
provide them with not only the tools to analyse the data, but also with tools and methods that ease and speed up the data analysis. One
of the most promising avenues of research has been the automation of early stages of the corpus data analysis, with the aim to
summarize, and consequently reduce, the amount of corpus data that the lexicographers need to examine. However, most of this
research deals with general lexicography; terminology is yet to extensively test these methods. This paper attempts to address this gap
by presenting two separate Slovene research projects, one lexicographic (Slovene Lexical Database) and the other terminological
(Termis), that used the same method of automatic extraction of corpus data (presented in Kosem et al. 2013). After describing the
projects and the corpora use, similarities and differences in the parameter settings and the quality of extracted data in the two projects
are presented. We conclude with discussing the further potential of automation in both general and specialised lexicography. |