LREC 2016 Proceedings

Summary of the paper

Title	Palabras: Crowdsourcing Transcriptions of L2 Speech
Authors	Eric Sanders, Pepi Burgos, Catia Cucchiarini and Roeland van Hout
Abstract	We developed a web application for crowdsourcing transcriptions of Dutch words spoken by Spanish L2 learners. In this paper we discuss the design of the application and the influence of metadata and various forms of feedback. Useful data were obtained from 159 participants, with an average of over 20 transcriptions per item, which seems a satisfactory result for this type of research. Informing participants about how many items they still had to complete, and not how many they had already completed, turned to be an incentive to do more items. Assigning participants a score for their performance made it more attractive for them to carry out the transcription task, but this seemed to influence their performance. We discuss possible advantages and disadvantages in connection with the aim of the research and consider possible lessons for designing future experiments.
Topics	Crowdsourcing, Speech Resource/Database, Speech Recognition/Understanding
Full paper	Palabras: Crowdsourcing Transcriptions of L2 Speech
Bibtex	@InProceedings{SANDERS16.46, author = {Eric Sanders and Pepi Burgos and Catia Cucchiarini and Roeland van Hout}, title = {Palabras: Crowdsourcing Transcriptions of L2 Speech}, booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)}, year = {2016}, month = {may}, date = {23-28}, location = {Portorož, Slovenia}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, address = {Paris, France}, isbn = {978-2-9517408-9-1}, language = {english} }