Title |
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank |
Authors |
Mridul Gupta, Vineet Yadav, Samar Husain and Dipti Misra Sharma |
Abstract |
This paper describes an approach to expedite the manual annotation of a Hindi dependency treebank that is currently under development. We propose a way to improve consistency among a set of manual annotators. We also show that our setup is useful for judging when an inexperienced annotator is ready to start participating in the production of the treebank. We test the approach on sample data sets drawn from the ongoing work on this treebank and report the results of a semi-automated dependency annotation experiment, which support our proposal. We measure the rate of agreement between annotators using Cohen's Kappa. We also compare the total time taken to annotate the sample data sets using a completely manual approach with the time taken using the semi-automated approach. The results show that the semi-automated approach, when carried out by experienced and trained human annotators, improves the overall quality of treebank annotation and also speeds up the process. |
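The abstract mentions Cohen's Kappa as the agreement measure but does not show how it is computed. The following is a minimal sketch of the standard two-annotator formula, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e the agreement expected by chance. The function name `cohens_kappa` and the labels in the example are hypothetical and are not taken from the paper or its data.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's Kappa for two annotators labelling the same items.

    Illustrative helper (not from the paper): labels_a and labels_b are
    equal-length sequences of category labels, one label per item.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("need two non-empty label sequences of equal length")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: for each label, the product of the two annotators'
    # marginal probabilities of using it, summed over all labels.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[lab] * counts_b[lab] for lab in counts_a) / (n * n)

    # Degenerate case: both annotators used a single identical label.
    if expected == 1.0:
        return 1.0

    return (observed - expected) / (1.0 - expected)


# Hypothetical dependency labels assigned by two annotators to six tokens.
annotator_1 = ["subj", "obj", "subj", "mod", "obj", "subj"]
annotator_2 = ["subj", "obj", "obj", "mod", "obj", "subj"]
kappa = cohens_kappa(annotator_1, annotator_2)
```

A value of 1 means perfect agreement and 0 means agreement no better than chance; the paper's actual label set and data are not reproduced here.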
Topics |
Tools, systems, applications, Corpus (creation, annotation, etc.), Evaluation methodologies |
Full paper |
Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank |
Bibtex |
@InProceedings{GUPTA10.739,
  author = {Mridul Gupta and Vineet Yadav and Samar Husain and Dipti Misra Sharma},
  title = {Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank},
  booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)},
  year = {2010},
  month = {may},
  date = {19-21},
  address = {Valletta, Malta},
  editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias},
  publisher = {European Language Resources Association (ELRA)},
  isbn = {2-9517408-6-7},
  language = {english}
} |