Title |
Extending HeidelTime for Temporal Expressions Referring to Historic Dates |
Authors |
Jannik Strötgen, Thomas Bögel, Julian Zell, Ayser Armiti, Tran Van Canh and Michael Gertz |
Abstract |
Research on temporal tagging has achieved a lot of attention during the last years. However, most of the work focuses on processing news-style documents. Thus, references to historic dates are often not well handled by temporal taggers although they frequently occur in narrative-style documents about history, e.g., in many Wikipedia articles. In this paper, we present the AncientTimes corpus containing documents about different historic time periods in eight languages, in which we manually annotated temporal expressions. Based on this corpus, we explain the challenges of temporal tagging documents about history. Furthermore, we use the corpus to extend our multilingual, cross-domain temporal tagger HeidelTime to extract and normalize temporal expressions referring to historic dates, and to demonstrate HeidelTime's new capabilities. Both, the AncientTimes corpus as well as the new HeidelTime version are made publicly available. |
Topics |
Named Entity Recognition, Multilinguality |
Full paper |
Extending HeidelTime for Temporal Expressions Referring to Historic Dates |
Bibtex |
@InProceedings{STRTGEN14.849,
author = {Jannik Strötgen and Thomas Bögel and Julian Zell and Ayser Armiti and Tran Van Canh and Michael Gertz}, title = {Extending HeidelTime for Temporal Expressions Referring to Historic Dates}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |