Title |
Designing a Tool for Exploiting Bilingual Comparable Corpora |
Authors |
Bennison Peter (Dublin City University, Dublin 9, Ireland, pbenn@hotmail.com) Bowker Lynne (Dublin City University, Dublin 9, Ireland, lynne.bowker@dcu.ie) |
Keywords |
Bilingual Comparable Corpora, Computer-Assisted Translation Tools, Corpus Design, ExTrECC, Term Extraction, Translation Equivalents |
Session |
Session WP3 - Multilingual Corpora |
Full Paper |
20.ps, 20.pdf |
Abstract |
Translators have a real need for a tool that will allow them to exploit information contained in bilingual comparable corpora. ExTrECC is designed to be a semi-automatic tool that processes bilingual comparable corpora and presents a translator with a list of potential equivalents (in context) of the search term. The task of identifying translation equivalents in a non-aligned, non-translated corpus is a difficult one, and ExTrECC makes use of a number of techniques, some of which are simple and others more sophisticated. The basic design of ExTrECC (graphical user interface, architecture, algorithms) is outlined in this paper. |