Title |
Learning Patterns for Building Resources about Semantic Relations in the Medical Domain |
Authors |
Mehdi Embarek and Olivier Ferret |
Abstract |
In this article, we present a method for extracting automatically semantic relations from texts in the medical domain using linguistic patterns. These patterns refer to three levels of information about words: inflected form, lemma and part-of-speech. The method we present consists first in identifying the entities that are part of the relations to extract, that is to say diseases, exams, treatments, drugs or symptoms. Thereafter, sentences that contain couples of entities are extracted and the presence of a semantic relation is validated by applying linguistic patterns. These patterns were previously learnt automatically from a manually annotated corpus by relying onan algorithm based on the edit distance. We first report the results of an evaluation of our medical entity tagger for the five types of entities we have mentioned above and then, more globally, the results of an evaluation of our extraction method for four relations between these entities. Both evaluations were done for French. |
Language |
Single language |
Topics |
Ontologies, Semantics, Acquisition, Machine Learning |
Full paper |
Learning Patterns for Building Resources about Semantic Relations in the Medical Domain |
Slides |
- |
Bibtex |
@InProceedings{EMBAREK08.514,
author = {Mehdi Embarek and Olivier Ferret},
title = {Learning Patterns for Building Resources about Semantic Relations in the Medical Domain},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |