Title |
ANC2Go: A Web Application for Customized Corpus Creation |
Authors |
Nancy Ide, Keith Suderman and Brian Simms |
Abstract |
We describe a web application called ANC2Go that enables the user to select data from the Open American National Corpus (OANC) and the Manually Annotated Sub-corpus (MASC) together with some or all of the annotations available. The user also may select from among a variety of options for output format, or may receive the selected portions of the corpus and annotations in their original GrAF XML standoff format.. The request is processed by merging the annotations selected and rendering them in the desired output format, then bundling the results and making it available for download. Thus, users can create a customized corpus with data and annotations of their choosing, delivered in the format that is most convenient for their use. ANC2Go will be released as a web service in the near future. Both the OANC and MASC are freely available for any use from the American National Corpus website and may be accessed through the ANC2Go application, or they may downloaded in their entirety. |
Topics |
Corpus (creation, annotation, etc.), Web Services, Standards for LRs |
Full paper |
ANC2Go: A Web Application for Customized Corpus Creation |
Slides |
ANC2Go: A Web Application for Customized Corpus Creation |
Bibtex |
@InProceedings{IDE10.745,
author = {Nancy Ide and Keith Suderman and Brian Simms}, title = {ANC2Go: A Web Application for Customized Corpus Creation}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |