Summary of the paper

Title Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
Authors Juyeon Kang and Jungyeul Park
Abstract In this work, we aim at identifying potential problems of ambiguity, completeness, conformity, singularity and readability in system and software requirements specifications. Those problems arise particularly when they are written in a natural language. While we describe them from a linguistic point of view, the business impacts of each potential error are also considered in system engineering context. We investigate and explore error patterns for requirements quality analysis by manually analyzing the corpus. This analysis is based on the requirements grammar that we developed in our previous work. In addition, this paper extends our previous work in a two-fold way: (1) we increase more than twice the number of evaluation data (1K sentences) through a manual verification process, and (2) we anonymize all sensible and confidential entities in evaluation data to make our data publicly available. We also provide the baseline system using conditional random fields for requirements quality analysis, and we obtain 79.47\% for the F$_1$ score on proposed evaluation data.
Topics Language Modelling, Corpus (Creation, Annotation, Etc.), Other
Full paper Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task
Bibtex @InProceedings{KANG18.72,
  author = {Juyeon Kang and Jungyeul Park},
  title = "{Data Anonymization for Requirements Quality Analysis: a Reproducible Automatic Error Detection Task}",
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year = {2018},
  month = {May 7-12, 2018},
  address = {Miyazaki, Japan},
  editor = {Nicoletta Calzolari (Conference chair) and Khalid Choukri and Christopher Cieri and Thierry Declerck and Sara Goggi and Koiti Hasida and Hitoshi Isahara and Bente Maegaard and Joseph Mariani and Hélène Mazo and Asuncion Moreno and Jan Odijk and Stelios Piperidis and Takenobu Tokunaga},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {979-10-95546-00-9},
  language = {english}
  }
Powered by ELDA © 2018 ELDA/ELRA