Title |
HamleDT: To Parse or Not to Parse? |
Authors |
Daniel Zeman, David Mareček, Martin Popel, Loganathan Ramasamy, Jan Štěpánek, Zdeněk Žabokrtský and Jan Hajič |
Abstract |
We propose HamleDT ― HArmonized Multi-LanguagE Dependency Treebank. HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. While the license terms prevent us from directly redistributing the corpora, most of them are easily acquirable for research purposes. What we provide instead is the software that normalizes tree structures in the data obtained by the user from their original providers. |
Topics |
Corpus (creation, annotation, etc.), Grammar and Syntax, Evaluation methodologies |
Full paper |
HamleDT: To Parse or Not to Parse? |
Bibtex |
@InProceedings{ZEMAN12.429,
author = {Daniel Zeman and David Mareček and Martin Popel and Loganathan Ramasamy and Jan Štěpánek and Zdeněk Žabokrtský and Jan Hajič}, title = {HamleDT: To Parse or Not to Parse?}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |