Title |
Construction of Text Summarization Corpus for the Credibility of Information on the Web |
Authors |
Masahiro Nakano, Hideyuki Shibuki, Rintaro Miyazaki, Madoka Ishioroshi, Koichi Kaneko and Tatsunori Mori |
Abstract |
Recently, the credibility of information on the Web has become an important issue. In addition to conveying the content of source documents, indicating how to interpret that content, especially showing how to interpret the relation between statements that appear to contradict each other, is important for helping a user judge the credibility of information. In this paper, we describe the purpose and method of constructing a text summarization corpus. Our purpose in constructing the corpus includes the following three points: to collect Web documents relevant to several query sentences, to prepare gold standard data for evaluating smaller sub-processes in the extraction process and the summary generation process, and to investigate the summaries made by human summarizers. The constructed corpus contains six query sentences, 24 manually constructed summaries, and 24 collections of source Web documents. We also investigated how descriptions of interpretation, which help a user judge the credibility of other descriptions in the summary, appear in the corpus. As a result, we confirmed that showing interpretation of conflicts is important for helping a user judge the credibility of information. |
Topics |
Corpus (creation, annotation, etc.), Summarisation, Text mining |
Full paper |
Construction of Text Summarization Corpus for the Credibility of Information on the Web |
Slides |
- |
Bibtex |
@InProceedings{NAKANO10.135,
  author = {Masahiro Nakano and Hideyuki Shibuki and Rintaro Miyazaki and Madoka Ishioroshi and Koichi Kaneko and Tatsunori Mori},
  title = {Construction of Text Summarization Corpus for the Credibility of Information on the Web},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
} |