Summary of the paper

Title Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
Authors Lorenza Russo, Sharid Loáiciga and Asheesh Gulati
Abstract Thanks to their rich morphology, Italian and Spanish allow pro-drop pronouns, i.e., non lexically-realized subject pronouns. Here we distinguish between two different types of null subjects: personal pro-drop and impersonal pro-drop. We evaluate the translation of these two categories into French, a non pro-drop language, using Its-2, a transfer-based system developed at our laboratory; and Moses, a statistical system. Three different corpora are used: two subsets of the Europarl corpus and a third corpus built using newspaper articles. Null subjects turn out to be quantitatively important in all three corpora, but their distribution varies depending on the language and the text genre though. From a MT perspective, translation results are determined by the type of pro-drop and the pair of languages involved. Impersonal pro-drop is harder to translate than personal pro-drop, especially for the translation from Italian into French, and a significant portion of incorrect translations consists of missing pronouns.
Topics Corpus (creation, annotation, etc.), Machine Translation, SpeechToSpeech Translation, Multilinguality
Full paper Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.
Bibtex @InProceedings{RUSSO12.813,
  author = {Lorenza Russo and Sharid Loáiciga and Asheesh Gulati},
  title = {Italian and Spanish Null Subjects. A Case Study Evaluation in an MT Perspective.},
  booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)},
  year = {2012},
  month = {may},
  date = {23-25},
  address = {Istanbul, Turkey},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-7-7},
  language = {english}
 }
Powered by ELDA © 2012 ELDA/ELRA