Summary of the paper

Title Comprehensive Annotation of Multiword Expressions in a Social Web Corpus
Authors Nathan Schneider, Spencer Onuffer, Nora Kazour, Emily Danchik, Michael T. Mordowanec, Henrietta Conrad and Noah A. Smith
Abstract Multiword expressions (MWEs) are quite frequent in languages such as English, but their diversity, the scarcity of individual MWE types, and contextual ambiguity have presented obstacles to corpus-based studies and NLP systems addressing them as a class. Here we advocate for a comprehensive annotation approach: proceeding sentence by sentence, our annotators manually group tokens into MWEs according to guidelines that cover a broad range of multiword phenomena. Under this scheme, we have fully annotated an English web corpus for multiword expressions, including those containing gaps.
Topics MultiWord Expressions & Collocations, Social Media Processing
Bibtex @InProceedings{SCHNEIDER14.521,
  author = {Nathan Schneider and Spencer Onuffer and Nora Kazour and Emily Danchik and Michael T. Mordowanec and Henrietta Conrad and Noah A. Smith},
  title = {Comprehensive Annotation of Multiword Expressions in a Social Web Corpus},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
  year = {2014},
  month = may,
  date = {26-31},
  address = {Reykjavik, Iceland},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {978-2-9517408-8-4},
  language = {english}
}