Title: A Corpus-assisted Approach to Paronym Categorisation
Authors: Mell, Ruth M./Storjohann, Petra
Year: 2017
Type: Article
Publisher: Lexical Computing CZ s.r.o.
Place: Brno, Czech Republic
In: Kosem, Iztok/Tiberius, Carole/Jakubíček, Miloš/Kallas, Jelena/Krek, Simon/Baisa, Vít (eds.): Electronic lexicography in the 21st century. Lexicography from scratch. Proceedings of eLex 2017 conference, 19-21 September 2017, Leiden, the Netherlands
Pages: 342-354
Languages studied: German
Keywords: internet lexicography/online lexicography
corpus-based lexicography
semantic/sense relations in dictionaries
Medium: Online
URI: https://elex.link/elex2017/proceedings-download/
Last accessed: 22 October 2018
Abstract: In this paper, we present a first attempt to classify commonly confused words in German by examining their communicative functions in corpora. Although so-called paronyms frequently cause uncertainty because of their similarities in spelling, sound and semantics, the phenomenon has so far attracted little attention from either corpus linguistics or cognitive linguistics. Existing investigations rely on structuralist models, which do not account for empirical evidence. Still, they have developed an elaborate model based on formal criteria, primarily word formation (cf. Lăzărescu 1999). From a corpus perspective, such classifications are incompatible with language in use and with the cognitive elements of misuse. This article sketches first lexicological insights into a classification model derived from semantic analyses of written communication. First, the project is briefly described. Second, corpus-assisted paronym detection is discussed. Third, the main section describes the datasets used for paronym classification and the classification procedures. As this is work in progress, the insights will be extended continually once spoken and CMC data are added to the investigations.