Title |
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis |
Authors |
Robert Remus, Uwe Quasthoff and Gerhard Heyer |
Abstract |
SentimentWortschatz, or SentiWS for short, is a publicly available German-language resource for sentiment analysis, opinion mining etc. It lists positive and negative sentiment bearing words weighted within the interval of [-1; 1] plus their part of speech tag, and if applicable, their inflections. The current version of SentiWS (v1.8b) contains 1,650 negative and 1,818 positive words, which sum up to 16,406 positive and 16,328 negative word forms, respectively. It not only contains adjectives and adverbs explicitly expressing a sentiment, but also nouns and verbs implicitly containing one. The present work describes the resources structure, the three sources utilised to assemble it and the semi-supervised method incorporated to weight the strength of its entries. Furthermore the resources contents are extensively evaluated using a German-language evaluation set we constructed. The evaluation set is verified being reliable and its shown that SentiWS provides a beneficial lexical resource for German-language sentiment analysis related tasks to build on. |
Topics |
Emotion Recognition/Generation, Lexicon, lexical database, Acquisition |
Full paper |
SentiWS - A Publicly Available German-language Resource for Sentiment Analysis |
Slides |
- |
Bibtex |
@InProceedings{REMUS10.490,
author = {Robert Remus and Uwe Quasthoff and Gerhard Heyer}, title = {SentiWS - A Publicly Available German-language Resource for Sentiment Analysis}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |