Title |
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development |
Authors |
Paul Felt, Owen Merkling, Marc Carmen, Eric Ringger, Warren Lemmon, Kevin Seppi and Robbie Haertel |
Abstract |
We introduce CCASH (Cost-Conscious Annotation Supervised by Humans), an extensible web application framework for cost-efficient annotation. CCASH provides a framework in which cost-efficient annotation methods such as Active Learning can be explored via user studies and afterwards applied to large annotation projects. CCASHs architecture is described as well as the technologies that it is built on. CCASH allows custom annotation tasks to be built from a growing set of useful annotation widgets. It also allows annotation methods (such as AL) to be implemented in any language. Being a web application framework, CCASH offers secure centralized data and annotation storage and facilitates collaboration among multiple annotations. By default it records timing information about each annotation and provides facilities for recording custom statistics. The CCASH framework has been used to evaluate a novel annotation strategy presented in a concurrently published paper, and will be used in the future to annotate a large Syriac corpus. |
Topics |
Tools, systems, applications, Corpus (creation, annotation, etc.), Part of speech tagging |
Full paper |
CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development |
Slides |
- |
Bibtex |
@InProceedings{FELT10.360,
author = {Paul Felt and Owen Merkling and Marc Carmen and Eric Ringger and Warren Lemmon and Kevin Seppi and Robbie Haertel}, title = {CCASH: A Web Application Framework for Efficient, Distributed Language Resource Development}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |