Titel: The DANTE Database (Database of ANalysed Texts of English)
Personen:Convery, Cathal/Ó Mianáin, Pádraig/Ó Raghallaigh, Muiris/Atkins, Beryl T. S./Kilgarriff, Adam/Rundell, Michael
Jahr: 2010
Typ: Aufsatz
Verlag: Fryske Akademy
Ortsangabe: Leeuwarden
In: Dykstra, Anne/Schoonheim, Tanneke (Hgg.): Proceedings of the 14th EURALEX International Congress, Leeuwarden, Netherlands, 6 - 10 July 2010
Seiten: 293-295
Untersuchte Sprachen: Englisch*English - Irisch*Irish - Verschiedene*various
Schlagwörter: Datenbank*data base
korpusbasierte Lexikografie*corpus-based lexicography
Maschinenlesbarkeit*machine readability
Wortschatz*vocabulary
URI: http://euralex.org/category/publications/euralex-leeuwarden-2010/
Zuletzt besucht: 10.09.2018
Abstract: This database (www.webDante.com) was designed and created for Foras na Gaeilge by the Lexicography MasterClass and their 15-strong team led by Valerie Grundy (Managing Editor); textfow is managed by Diana Rawlinson (Project Administrator). The corpus of 1.7 bn words of current English, custom-built in 2007, was queried using the Sketch Engine (www.sketchengine.co.uk/), and the database was compiled in IDM's Dictionary Production System (DPS: www.idm.fr). The present volume contains a fuller description of this project (Atkins, Kilgarrif and Rundell "Database of ANalysed Texts of English (DANTE): the NEID database project") and of its use in a bilingual dictionary (Convery, Ó Mianáin and Ó Raghallaigh "Covering all bases: Regional Marking of material in the New English-Irish Dictionary"). The 95,000 or so DANTE entries cover approximately 50,000 headwords and 45,000 compounds, idioms and phrasal verbs, using over 40 datatypes. The lexical entry is subdivided into lexical units, each a sense of a single- or multi-word lemma. Almost every linguistic fact recorded is accompanied by full corpus sentences illustrating its use in text. Apart from the definitions and the corpus-derived example sentences, all the significant information is machine-retrievable. Functionality demonstrated here includes simple and complex searches over various combinations of datatypes and the automatic insertion of empty translation felds for use in dictionary building. DANTE was created as the initial stage of compilation of the New English-Irish Dictionary. Its long-term potential is much more far-reaching: it ofers publishers world-wide a comprehensive launchpad for bilingual dictionaries with English as the source language or the draft stage of a learners' dictionary of English; a source of updating material for an existing dictionary, etc. It ofers software developers, universities and other research institutions a resource for improved word sense disambiguation, the creation or enhancement of online lexicons, and other uses in software applications such as machine-assisted translation, information retrieval systems, etc. More details from info@webDante.com.