Title |
Investigating the Image of Entities in Social Media: Dataset Design and First Results |
Authors |
Julien Velcin, Young-Min Kim, Caroline Brun, Jean-Yves Dormagen, Eric Sanjuan, Leila Khouas, Anne Peradotto, Stéphane Bonnevay, Claude Roux, Julien Boyadjian, Alejandro Molina and Marie Neihouser |
Abstract |
The objective of this paper is to describe the design of a dataset that deals with the image (i.e., representation, web reputation) of various entities populating the Internet: politicians, celebrities, companies, brands etc. Our main contribution is to build and provide an original annotated French dataset. This dataset consists of 11527 manually annotated tweets expressing the opinion on specific facets (e.g., ethic, communication, economic project) describing two French policitians over time. We believe that other researchers might benefit from this experience, since designing and implementing such a dataset has proven quite an interesting challenge. This design comprises different processes such as data selection, formal definition and instantiation of an image. We have set up a full open-source annotation platform. In addition to the dataset design, we present the first results that we obtained by applying clustering methods to the annotated dataset in order to extract the entity images. |
Topics |
Opinion Mining / Sentiment Analysis, Knowledge Discovery/Representation |
Full paper |
Investigating the Image of Entities in Social Media: Dataset Design and First Results |
Bibtex |
@InProceedings{VELCIN14.302,
author = {Julien Velcin and Young-Min Kim and Caroline Brun and Jean-Yves Dormagen and Eric Sanjuan and Leila Khouas and Anne Peradotto and Stéphane Bonnevay and Claude Roux and Julien Boyadjian and Alejandro Molina and Marie Neihouser}, title = {Investigating the Image of Entities in Social Media: Dataset Design and First Results}, booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)}, year = {2014}, month = {may}, date = {26-31}, address = {Reykjavik, Iceland}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-8-4}, language = {english} } |