SUMMARY : Session O25-WE Machine Translation and Evaluation
Title | A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation |
---|---|
Authors | A. Popescu-belis, P. Estrella, M. King, N. Underwood |
Abstract | In this paper, we propose a formal framework that takes into account the influence of the intended context of use of an NLP system on the procedure and the metrics used to evaluate the system. We introduce in particular the notion of a context-dependent quality model and explain how it can be adapted to a given context of use. More specifically, we define vector-space representations of contexts of use and of quality models, which are connected by a generic contextual quality model (GCQM). For each domain, experts in evaluation are needed to build a GCQM based on analytic knowledge and on previous evaluations, using the mechanism proposed here. The main inspiration source for this work is the FEMTI framework for the evaluation of machine translation, which implements partly the present model, and which is described briefly along with insights from other domains. |
Keywords | Evaluation, Context-based evaluation, Evaluation tools, Machine translation evaluation |
Full paper | A Model for Context-Based Evaluation of Language Processing Systems and its Application to Machine Translation Evaluation |