Title |
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning |
Authors |
Aleksander Wawer |
Abstract |
Existing approaches to classifying documents by sentiment include machine learning with features created from n-grams and part of speech. This paper explores a different approach and examines performance of one selected machine learning algorithm, Support Vector Machines, with features computed using existing lexical resources. Special attention has been paid to fine tuning of the algorithm regarding number of features. The immediate purpose of this experiment is to evaluate lexical and sentiment resources in document-level sentiment classification task. Results described in the paper are also useful to indicate how lexicon design, different language dimensions and semantic categories contribute to document-level sentiment recognition. In a less direct way (through the examination of evaluated resources), the experiment analyzes adequacy of lexemes, word senses and synsets as different possible layers for ascribing sentiment, or as candidates for sentiment carriers. The proposed approach of machine learning word category frequencies instead of n-grams and part of speech features can potentially exhibit improvements in domain independency, but this hypothesis has to be verified in future works. |
Topics |
Document Classification, Text categorisation, Emotion Recognition/Generation, Lexicon, lexical database |
Full paper |
Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning |
Slides |
- |
Bibtex |
@InProceedings{WAWER10.149,
author = {Aleksander Wawer}, title = {Is Sentiment a Property of Synsets? Evaluating Resources for Sentiment Classification using Machine Learning}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |