LREC 2018 Proceedings

Summary of the paper

Title	A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
Authors	Thi Lan Ngo, Pham Khac Linh and Takeda Hideaki
Abstract	Voice-based human-machine interaction systems such as personal virtual assistants, chatbots, and automatic contact centres are becoming increasingly popular. As part of this trend, conversation mining is attracting the attention of many researchers, and standardized data play an important role in it. In this paper, we present a new Vietnamese corpus annotated for dialog acts using the ISO 24617-2 standard (2012), for emotions using Ekman's six primitives (1972), and for sentiment using the tags "positive", "negative" and "neutral". Emotion and sentiment are tagged at the functional segment level. We show how the corpus is constructed and provide a brief statistical description of the data. This is the first Vietnamese dialog act corpus.
Topics	Opinion Mining / Sentiment Analysis, Corpus (Creation, Annotation, Etc.), Other
Full paper	A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard
Bibtex	@InProceedings{NGO18.942, author = {Thi Lan Ngo and Pham Khac Linh and Takeda Hideaki}, title = "{A Vietnamese Dialog Act Corpus Based on ISO 24617-2 standard}", booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)}, year = {2018}, month = {May 7-12, 2018}, address = {Miyazaki, Japan}, editor = {Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga}, publisher = {European Language Resources Association (ELRA)}, isbn = {979-10-95546-00-9}, language = {english} }