Title |
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) |
Authors |
James Clarke, Vivek Srikumar, Mark Sammons and Dan Roth |
Abstract |
Natural Language Processing continues to grow in popularity in a range of research and commercial applications, yet managing the wide array of potential NLP components remains a difficult problem. This paper describes Curator, an NLP management framework designed to address some common problems and inefficiencies associated with building NLP process pipelines; and Edison, an NLP data structure library in Java that provides streamlined interactions with Curator and offers a range of useful supporting functionality. |
Topics |
Tools, systems, applications, LR Infrastructures and Architectures, Other |
Full paper |
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines) |
Bibtex |
@InProceedings{CLARKE12.664,
author = {James Clarke and Vivek Srikumar and Mark Sammons and Dan Roth}, title = {An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |