Title: Term candidate extraction for terminography and CAT: an overview of TTC
Authors: Heid, Ulrich/Gojun, Anita
Year: 2012
Type: Article
Publisher: Universitetet i Oslo, Institutt for lingvistiske og nordiske studier
Place: Oslo
In: Fjeld, Ruth V./Torjusen, Julie M. (eds.): Proceedings of the 15th EURALEX International Congress 2012, Oslo, Norway, 7-11 August 2012
Pages: 585-594
Languages studied: German - English
Keywords: corpus-based lexicography
translation
Medium: Online
URI: http://euralex.org/category/publications/euralex-oslo-2012/
Last accessed: 17.09.2018
Abstract: In this paper, we present a tool chain for terminology extraction and term alignment that is under development in the EU project TTC. Its components cover crawling domain-specific text from the internet in different languages, linguistic pre-processing of the resulting corpus, and extraction of term candidates. Extracted term candidates from two languages are aligned into pairs of source and target term equivalents. This output can be used both in interactive translation setups (e.g. computer-aided translation) and in machine translation.