Title |
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation |
Authors |
Kazuaki Maeda, Haejoong Lee, Stephen Grimes, Jonathan Wright, Robert Parker, David Lee and Andrea Mazzucchi |
Abstract |
Linguistic Data Consortium (LDC) at the University of Pennsylvania has participated as a data provider in a variety of governmentsponsored programs that support development of Human Language Technologies. As the number of projects increases, the quantity and variety of the data LDC produces have increased dramatically in recent years. In this paper, we describe the technical infrastructure, both hardware and software, that LDC has built to support these complex, large-scale linguistic data creation efforts at LDC. As it would not be possible to cover all aspects of LDCs technical infrastructure in one paper, this paper focuses on recent development. We also report on our plans for making our custom-built software resources available to the community as open source software, and introduce an initiative to collaborate with software developers outside LDC. We hope that our approaches and software resources will be useful to the community members who take on similar challenges. |
Topics |
Tools, systems, applications, LR Infrastructures and Architectures |
Full paper |
Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation |
Slides |
- |
Bibtex |
@InProceedings{MAEDA10.857,
author = {Kazuaki Maeda and Haejoong Lee and Stephen Grimes and Jonathan Wright and Robert Parker and David Lee and Andrea Mazzucchi}, title = {Technical Infrastructure at Linguistic Data Consortium: Software and Hardware Resources for Linguistic Data Creation}, booktitle = {Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)}, year = {2010}, month = {may}, date = {19-21}, address = {Valletta, Malta}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Bente Maegaard and Joseph Mariani and Jan Odijk and Stelios Piperidis and Mike Rosner and Daniel Tapias}, publisher = {European Language Resources Association (ELRA)}, isbn = {2-9517408-6-7}, language = {english} } |