Title |
Adapting and evaluating a generic term extraction tool |
Authors |
Anita Gojun, Ulrich Heid, Bernd Weißbach, Carola Loth and Insa Mingers |
Abstract |
We present techniques for monolingual term candidate extraction that are being developed in the EU project TTC. We designed an application for German and English data that serves as a first evaluation of the terminology extraction methods used in the project. The application scenario highlighted the need for tools that handle lemmatization errors and remove incomplete word sequences from multi-word term candidate lists. It also showed that providing German citation forms requires more morphological knowledge than TTC's slim approach can provide. We give a detailed evaluation of our extraction results and discuss the method for evaluating terminology extraction systems. |
Topics |
Lexicon, lexical database; Tools, systems, applications; Evaluation methodologies |
Full paper |
Adapting and evaluating a generic term extraction tool |
Bibtex |
@InProceedings{GOJUN12.746,
  author    = {Anita Gojun and Ulrich Heid and Bernd Weißbach and Carola Loth and Insa Mingers},
  title     = {Adapting and evaluating a generic term extraction tool},
  booktitle = {Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)},
  year      = {2012},
  month     = {may},
  date      = {23-25},
  address   = {Istanbul, Turkey},
  editor    = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn      = {978-2-9517408-7-7},
  language  = {english}
} |