Summary of the paper

Title Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies
Authors Alexis Nasr, Frederic Bechet, Benoit Favre, Thierry Bazillon, Jose Deulofeu and andre Valli
Abstract Syntactic parsing of speech transcriptions faces the problem of the presence of disfluencies that break the syntactic structure of the utterances. We propose in this paper two solutions to this problem. The first one relies on a disfluencies predictor that detects disfluencies and removes them prior to parsing. The second one integrates the disfluencies in the syntactic structure of the utterances and train a disfluencies aware parser.
Topics Parsing
Full paper Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies
Bibtex @InProceedings{NASR14.816,
  author = {Alexis Nasr and Frederic Bechet and Benoit Favre and Thierry Bazillon and Jose Deulofeu and andre Valli},
  title = {Automatically Enriching Spoken Corpora with Syntactic Information for Linguistic Studies},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = {may},
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
 }
Powered by ELDA © 2014 ELDA/ELRA