Summary of the paper

Title Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Authors Andre Quispersaravia and Walter Perez and Marco Sobrevilla and Fernando Alva-Manchengo
Abstract Text Complexity Analysis is an useful task in Education. For example, it can help teachers select appropriate texts for their students according to their educational level. This task requires the analysis of several text features that people do mostly manually (e.g. syntactic complexity, words variety, etc.). In this paper, we present a tool useful for Complexity Analysis, called Coh-Metrix-Esp. This is the Spanish version of Coh-Metrix and is able to calculate 45 readability indices. We analyse how these indices behave in a corpus of “simple” and “complex” documents, and also use them as features in a complexity binary classifier for texts in Spanish. After some experiments with machine learning algorithms, we got 0.9 F-measure for a corpus that contains tales for kids and adults and 0.82 F-measure for a corpus with texts written for students of Spanish as a foreign language.
Topics Other, Tools, Systems, Applications, Document Classification, Text categorisation
Full paper Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish
Bibtex @InProceedings{QUISPERSARAVIA16.1115,
  author = {Andre Quispersaravia and Walter Perez and Marco Sobrevilla and Fernando Alva-Manchengo},
  title = {Coh-Metrix-Esp: A Complexity Analysis Tool for Documents Written in Spanish},
  booktitle = {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2016)},
  year = {2016},
  month = {may},
  date = {26-31},
  location = {Portorož, Slovenia},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Sara Goggi and Marko Grobelnik and Bente Maegaard and Joseph Mariani and Helene Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {978-2-9517408-9-1},
  language = {english}
 }
Powered by ELDA © 2016 ELDA/ELRA