Title: A Danish FrameNet Lexicon and an Annotated Corpus Used for Training and Evaluating a Semantic Frame Classifier
Authors: Pedersen, Bolette S./Nimb, Sanni/Søgaard, Anders/Hartmann, Mareike/Olsen, Sussi
Year: 2018
Type: Article
Publisher: European Language Resources Association (ELRA)
Place: Miyazaki
In: Calzolari, Nicoletta/Choukri, Khalid/Cieri, Christopher/Declerck, Thierry/Goggi, Sara/Hasida, Koiti/Isahara, Hitoshi/Maegaard, Bente/Mariani, Joseph/Mazo, Hélène/Moreno, Asunción/Odijk, Jan/Piperidis, Stelios/Tokunaga, Takenobu (eds.): Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018), Miyazaki, 7-12 May 2018
Pages: 2382-2386
Languages studied: Danish
Keywords: database
internet lexicography/online lexicography
corpus-based lexicography
lexicographic process
semantic/sense relations in dictionaries
Medium: Online
URI: http://www.lrec-conf.org/proceedings/lrec2018/papers.html
Last accessed: 21.11.2018
Abstract: In this paper, we present an approach to efficiently compile a Danish FrameNet based on the Danish Thesaurus, focusing in particular on cognition and communication frames. The Danish FrameNet uses the frame and role inventory of the English FrameNet. We present the corresponding corpus annotations of frames and roles and show how our corpus can be used for training and evaluating a semantic frame classifier for cognition and communication frames. We also present results of cross-language transfer of a model trained on the English FrameNet. Our approach is significantly faster than building a lexicon from scratch, and we show that it is feasible to annotate Danish with frames developed for English, and finally, that frame annotations - even if limited in size at the current stage - are useful for automatic frame classification.