Titel: |
A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier |
Personen: | Pedersen, Bolette S./Nimb, Sanni/Søgaard, Anders/Hartmann, Mareike/Olsen, Sussi |
Jahr: |
2018 |
Typ: |
Aufsatz |
Verlag: |
European Language Resources Association (ELRA) |
Ortsangabe: |
Miyazaki |
In: |
Calzolari, Nicoletta/Choukri, Khalid/Cieri, Christopher/Declerck, Thierry/Goggi, Sara/Hasida, Koiti/Isahara, Hitoshi/Maegaard, Bente/Mariani, Joseph/Mazo, Hélène/Moreno, Asunción/Odijk, Jan/Piperidis, Stelios/Tokunaga, Takenobu (Hgg.): Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, 7 - 12 May 2018 |
Seiten: |
2382-2386 |
Untersuchte Sprachen: |
Dänisch*Danish |
Schlagwörter: |
Datenbank*data base
Internet-Lexikografie/Online-Lexikografie*internet lexicography/online lexicography
korpusbasierte Lexikografie*corpus-based lexicography
lexikografischer Prozess*lexicographic process
semantische Relationen im Wörterbuch*semantic/sense relations in dictionaries
|
Medium: |
Online |
URI: |
http://www.lrec-conf.org/proceedings/lrec2018/papers.html |
Zuletzt besucht: |
21.11.2018 |
Abstract: |
In this paper, we present an approach to efficiently compile a Danish FrameNet based on the Danish Thesaurus, focusing in particular on cognition and communication frames. The Danish FrameNet uses the frame and role inventory of the English FrameNet. We present the corresponding corpus annotations of frames and roles and show how our corpus can be used for training and evaluating a semantic frame classifier for cognition and communication frames. We also present results of cross-language transfer of a model trained on the English FrameNet. Our approach is significantly faster than building a lexicon from scratch, and we show that it is feasible to annotate Danish with frames developed for English, and finally, that frame annotations - even if limited in size at the current stage - are useful for automatic frame classification. |