Summary of the paper

Title Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources
Authors Darja Fišer, Senja Pollak and Špela Vintar
Abstract The paper presents an innovative approach to extracting Slovene definition candidates from domain-specific corpora using morphosyntactic patterns, automatic terminology recognition and semantic tagging with wordnet senses. A classification model was first trained on examples from Slovene Wikipedia and then used to find well-formed definitions among the extracted candidates. The results of the experiment are encouraging, with accuracy ranging from 67% to 71%. The paper also addresses some drawbacks of the approach and suggests ways to overcome them in future work.
Topics Knowledge Discovery/Representation, Lexicon, lexical database, Text mining
Bibtex @InProceedings{FIER10.141,
  author = {Darja Fišer and Senja Pollak and Špela Vintar},
  title = {Learning to Mine Definitions from Slovene Structured and Unstructured Knowledge-Rich Resources},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
}