Title |
Massively Increasing TIMEX3 Resources: A Transduction Approach |
Authors |
Leon Derczynski, Hector Llorens and Estela Saquete |
Abstract |
Automatic annotation of temporal expressions is a research challenge of great interest in the field of information extraction. Gold standard temporally-annotated resources are limited in size, which makes research using them difficult. Standards have also evolved over the past decade, so not all temporally annotated data is in the same format. We vastly increase available human-annotated temporal expression resources by converting older format resources to TimeML/TIMEX3. This task is difficult due to differing annotation methods. We present a robust conversion tool and a new, large temporal expression resource. Using this, we evaluate our conversion process by using it as training data for an existing TimeML annotation tool, achieving a 0.87 F1 measure - better than any system in the TempEval-2 timex recognition exercise. |
Topics |
Corpus (creation, annotation, etc.), Tools, systems, applications, Discourse annotation, representation and processing |
Full paper |
Massively Increasing TIMEX3 Resources: A Transduction Approach |
Bibtex |
@InProceedings{DERCZYNSKI12.451,
author = {Leon Derczynski and Hector Llorens and Estela Saquete}, title = {Massively Increasing TIMEX3 Resources: A Transduction Approach}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |