Title |
Adjudicator Agreement and System Rankings for Person Name Search |
Authors |
Mark Arehart, Chris Wolf and Keith J. Miller |
Abstract |
We have analyzed system rankings for person name search algorithms using a data set for which several versions of ground truth were developed by employing different means of resolving adjudicator conflicts. Thirteen algorithms were ranked by F-score, using bootstrap resampling for significance testing, on a dataset containing 70,000 romanized names from various cultures. We found some disagreement among the four adjudicators, with kappa ranging from 0.57 to 0.78. Truth sets based on a single adjudicator, and on the intersection or union of positive adjudications produced sizeable variability in scoring sensitivity - and to a lesser degree rank order - compared to the consensus truth set. However, results on truth sets constructed by randomly choosing an adjudicator for each item were highly consistent with the consensus. The implication is that an evaluation where one adjudicator has judged each item is nearly as good as a more expensive and labor-intensive one where multiple adjudicators have judged each item and conflicts are resolved through voting. |
Language |
|
Topics |
Evaluation methodologies, Information Extraction, Information Retrieval, Corpus (creation, annotation, etc.) |
Full paper |
Adjudicator Agreement and System Rankings for Person Name Search |
Slides |
Adjudicator Agreement and System Rankings for Person Name Search |
Bibtex |
@InProceedings{AREHART08.647,
author = {Mark Arehart, Chris Wolf and Keith J. Miller},
title = {Adjudicator Agreement and System Rankings for Person Name Search},
booktitle = {Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)},
year = {2008},
month = {may},
date = {28-30},
address = {Marrakech, Morocco},
editor = {Nicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Daniel Tapias},
publisher = {European Language Resources Association (ELRA)},
isbn = {2-9517408-4-0},
note = {http://www.lrec-conf.org/proceedings/lrec2008/},
language = {english}
} |