Titel: The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures
Personen:Zesch, Torsten/Gurevych, Iryna
Jahr: 2010
Typ: Aufsatz
Verlag: European Language Resources Association (ELRA)
Ortsangabe: Valletta, Malta
In: Barbu Mititelu, Verginica/Pekar, Viktor/Barbu, Eduard (Hgg.): Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010), Valetta, 17 - 23 May 2010
Untersuchte Sprachen: Englisch*English
Schlagwörter: Internet-Lexikografie/Online-Lexikografie*internet lexicography/online lexicography
lexikografischer Prozess*lexicographic process
Nutzerbeteiligung*user contribution
URI: http://www.lrec-conf.org/proceedings/lrec2010/pdf/93_Paper.pdf
Zuletzt besucht: 10.09.2018
Abstract: Wikipedia has been used as a knowledge source in many areas of natural language processing. As most studies only use a certain Wikipedia snapshot, the influence of Wikipedia's massive growth on the results is largely unknown. For the first time, we perform an in-depth analysis of this influence using semantic relatedness as an example application that tests a wide range of Wikipedia's properties. We find that the growth of Wikipedia has almost no effect on the correlation of semantic relatedness measures with human judgments, while the coverage steadily increases.