Title |
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation |
Authors |
Violeta Seretan, Pierrette Bouillon and Johanna Gerlach |
Abstract |
The user-generated content represents an increasing share of the information available today. To make this type of content instantly accessible in another language, the ACCEPT project focuses on developing pre-editing technologies for correcting the source text in order to increase its translatability. Linguistically-informed pre-editing rules have been developed for English and French for the two domains considered by the project, namely, the technical domain and the healthcare domain. In this paper, we present the evaluation experiments carried out to assess the impact of the proposed pre-editing rules on translation quality. Results from a large-scale evaluation campaign show that pre-editing helps indeed attain a better translation quality for a high proportion of the data, the difference with the number of cases where the adverse effect is observed being statistically significant. The ACCEPT pre-editing technology is freely available online and can be used in any Web-based environment to enhance the translatability of user-generated content so that it reaches a broader audience. |
Topics |
Authoring Tools, Evaluation Methodologies |
Full paper |
A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation |
Bibtex |
@InProceedings{SERETAN14.676,
author = {Violeta Seretan and Pierrette Bouillon and Johanna Gerlach}, title = {A Large-Scale Evaluation of Pre-editing Strategies for Improving User-Generated Content Translation}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |