Title |
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech |
Authors |
Juan María Garrido, Yesika Laplaza, Montse Marquina, Andrea Pearman, José Gregorio Escalada, Miguel Ángel Rodríguez and Ana Armenta |
Abstract |
In this article the I3Media corpus is presented, a trilingual (Catalan, English, Spanish) speech database of neutral and emotional material collected for analysis and synthesis purposes. The corpus is actually made up of six different subsets of material: a neutral subcorpus, containing emotionless utterances; a dialog' subcorpus, containing typical call center utterances; an emotional' corpus, a set of sentences representative of pure emotional states; a football' subcorpus, including utterances imitating a football broadcasting situation; a SMS' subcorpus, including readings of SMS texts; and a paralinguistic elements' corpus, including recordings of interjections and paralinguistic sounds uttered in isolation. The corpus was read by professional speakers (male, in the case of Spanish and Catalan; female, in the case of the English corpus), carefully selected to meet criteria of language competence, voice quality and acting conditions. It is the result of a collaboration between the Speech Technology Group at Telefónica Investigación y Desarrollo (TID) and the Speech and Language Group at Barcelona Media Centre d'Innovació (BM), as part of the I3Media project. |
Topics |
Emotion Recognition/Generation, Corpus (creation, annotation, etc.), Speech resource/database |
Full paper |
The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech |
Bibtex |
@InProceedings{GARRIDO12.865,
author = {Juan María Garrido and Yesika Laplaza and Montse Marquina and Andrea Pearman and José Gregorio Escalada and Miguel Ángel Rodríguez and Ana Armenta}, title = {The I3MEDIA speech database: a trilingual annotated corpus for the analysis and synthesis of emotional speech}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |