Title |
Terra: a Collection of Translation Error-Annotated Corpora |
Authors |
Mark Fishel, Ondřej Bojar and Maja Popović |
Abstract |
Recently the first methods of automatic diagnostics of machine translation have emerged; since this area of research is relatively young, the efforts are not coordinated. We present a collection of translation error-annotated corpora, consisting of automatically produced translations and their detailed manual translation error analysis. Using the collected corpora we evaluate the available state-of-the-art methods of MT diagnostics and assess, how well the methods perform, how they compare to each other and whether they can be useful in practice. |
Topics |
Evaluation methodologies, Corpus (creation, annotation, etc.), Machine Translation, SpeechToSpeech Translation |
Full paper |
Terra: a Collection of Translation Error-Annotated Corpora |
Bibtex |
@InProceedings{FISHEL12.481,
author = {Mark Fishel and Ondřej Bojar and Maja Popović}, title = {Terra: a Collection of Translation Error-Annotated Corpora}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |