Title |
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars |
Authors |
Germán Sanchis and Joan Andreu Sánchez |
Abstract |
An important problem when using Stochastic Inversion Transduction Grammars is their computational cost. More specifically, when dealing with corpora such as Europarl. only one iteration of the estimation algorithm becomes prohibitive. In this work, we apply a reduction of the cost by taking profit of the bracketing information in parsed corpora and show machine translation results obtained with a bracketed Europarl corpus, yielding interresting improvements when increasing the number of non-terminal symbols. |
Language |
Multiple languages |
Topics |
Machine Translation, SpeechToSpeech Translation, Grammars, Syntax |
Full paper |
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars |
Slides |
Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars |
Bibtex |
@InProceedings{SANCHIS08.465,
author = {Germán Sanchis and Joan Andreu Sánchez},
title = {Using Parsed Corpora for Estimating Stochastic Inversion Transduction Grammars},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |