Summary of the paper

Title TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
Authors Ben Verhoeven, Walter Daelemans and Barbara Plank
Abstract Personality profiling is the task of detecting personality traits of authors based on writing style. Several personality typologies exist, however, the Briggs-Myer Type Indicator (MBTI) is particularly popular in the non-scientific community, and many people use it to analyse their own personality and talk about the results online. Therefore, large amounts of self-assessed data on MBTI are readily available on social-media platforms such as Twitter. We present a novel corpus of tweets annotated with the MBTI personality type and gender of their author for six Western European languages (Dutch, German, French, Italian, Portuguese and Spanish). We outline the corpus creation and annotation, show statistics of the obtained data distributions and present first baselines on Myers-Briggs personality profiling and gender prediction for all six languages.
Topics Profiling, Corpus (Creation, Annotation, etc.), Social Media Processing
Full paper TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling
Bibtex @InProceedings{VERHOEVEN16.644,
  author = {Ben Verhoeven and Walter Daelemans and Barbara Plank},
  title = {TwiSty: A Multilingual Twitter Stylometry Corpus for Gender and Personality Profiling},
  booktitle = {Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {23-28},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
 }
Powered by ELDA © 2016 ELDA/ELRA