Title |
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary |
Authors |
Torsten Zesch, Christof Müller and Iryna Gurevych |
Abstract |
Recently, collaboratively constructed resources such as Wikipedia and Wiktionary have been discovered as valuable lexical semantic knowledge bases with a high potential in diverse Natural Language Processing (NLP) tasks. Collaborative knowledge bases however significantly differ from traditional linguistic knowledge bases in various respects, and this constitutes both an asset and an impediment for research in NLP. This paper addresses one such major impediment, namely the lack of suitable programmatic access mechanisms to the knowledge stored in these large semantic knowledge bases. We present two application programming interfaces for Wikipedia and Wiktionary which are especially designed for mining the rich lexical semantic information dispersed in the knowledge bases, and provide efficient and structured access to the available knowledge. As we believe them to be of general interest to the NLP community, we have made them freely available for research purposes. |
Language |
Multiple languages |
Topics |
LR Infrastructures and Architectures, Lexicon, lexical database, Tools, systems, applications |
Full paper |
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary |
Slides |
Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary |
Bibtex |
@InProceedings{ZESCH08.420,
author = {Torsten Zesch, Christof Müller and Iryna Gurevych},
title = {Extracting Lexical Semantic Knowledge from Wikipedia and Wiktionary},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |