Title |
TermWise: A CAT-tool with Context-Sensitive Terminological Support. |
Authors |
Kris Heylen, Stephen Bond, Dirk De Hertog, Ivan Vulić and Hendrik Kockaert |
Abstract |
Increasingly, large bilingual document collections are being made available online, especially in the legal domain. This type of Big Data is a valuable resource that specialized translators exploit to search for informative examples of how domain-specific expressions should be translated. However, general-purpose search engines are not optimized to retrieve previous translations that are maximally relevant to a translator. In this paper, we report on the TermWise project, a cooperation of terminologists, corpus linguists and computer scientists that aims to leverage big online translation data for terminological support to legal translators at the Belgian Federal Ministry of Justice. The project developed dedicated knowledge extraction algorithms and a server-based tool to provide translators with the most relevant previous translations of domain-specific expressions relative to the current translation assignment. The functionality is implemented as an extra database, a Term&Phrase Memory, that is meant to be integrated with existing Computer Assisted Translation tools. In the paper, we give an overview of the system, give a demo of the user interface, present a user-based evaluation by translators and discuss how the tool is part of the general evolution towards exploiting Big Data in translation. |
Topics |
MultiWord Expressions & Collocations, Authoring Tools |
Full paper |
TermWise: A CAT-tool with Context-Sensitive Terminological Support. |
Bibtex |
@InProceedings{HEYLEN14.706,
  author    = {Kris Heylen and Stephen Bond and Dirk De Hertog and Ivan Vulić and Hendrik Kockaert},
  title     = {TermWise: A CAT-tool with Context-Sensitive Terminological Support.},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year      = {2014},
  month     = {may},
  date      = {26-31},
  address   = {Reykjavik, Iceland},
  editor    = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn      = {978-2-9517408-8-4},
  language  = {english}
} |