SUMMARY : Session O22-GW Infrastructures, Methodologies and Standards
Title | Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology |
---|---|
Authors | M. Gavrilidou, P. Labropoulou, S. Piperidis, V. Giouli, N. Calzolari, M. Monachini, C. Soria, K. Choukri |
Abstract | This paper reports on the multilingual Language Resources (MLRs), i.e. parallel corpora and terminological lexicons for less widely digitally available languages, that have been developed in the INTERA project and the methodology adopted for their production. Special emphasis is given to the reality factors that have influenced the MLRs development approach and their final constitution. Building on the experience gained in the project, a production model has been elaborated, suggesting ways and techniques that can be exploited in order to improve LRs production taking into account realistic issues. |
Keywords | language resources, multilingual parallel corpora, term extraction, language resources production models, less widely digitally available languages, eContent users |
Full paper | Language Resources Production Models: the Case of the INTERA Multilingual Corpus and Terminology |