LREC 2016 Proceedings

Summary of the paper

Title	Phrase Level Segmentation and Labelling of Machine Translation Errors
Authors	Frédéric Blain, Varvara Logacheva and Lucia Specia
Abstract	This paper presents our work towards a novel approach for Quality Estimation (QE) of machine translation based on sequences of adjacent words, the so-called phrases. This new level of QE aims to provide a natural balance between QE at word and sentence-level, which are either too fine grained or too coarse levels for some applications. However, phrase-level QE implies an intrinsic challenge: how to segment a machine translation into sequence of words (contiguous or not) that represent an error. We discuss three possible segmentation strategies to automatically extract erroneous phrases. We evaluate these strategies against annotations at phrase-level produced by humans, using a new dataset collected for this purpose.
Topics	Machine Translation, SpeechToSpeech Translation, Corpus (Creation, Annotation, etc.), Parsing
Full paper	Phrase Level Segmentation and Labelling of Machine Translation Errors
Bibtex	@InProceedings{BLAIN16.1194, author = {Frédéric Blain and Varvara Logacheva and Lucia Specia}, title = {Phrase Level Segmentation and Labelling of Machine Translation Errors}, booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)}, year = {2016}, month = {may}, date = {23-28}, location = {Portorož, Slovenia}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, address = {Paris, France}, isbn = {978-2-9517408-9-1}, language = {english} }