LREC 2012 Proceedings

Summary of the paper

Title	The Australian National Corpus: National Infrastructure for Language Resources
Authors	Steve Cassidy, Michael Haugh, Pam Peters and Mark Fallu
Abstract	The Australian National Corpus has been established in an effort to make currently scattered and relatively inaccessible data available to researchers through an online portal. In contrast to other national corpora, it is conceptualised as a linked collection of many existing and future language resources representing language use in Australia, unified through common technical standards. This approach allows us to bootstrap a significant collection and add value to existing resources by providing a unified, online tool-set to support research in a number of disciplines. This paper provides an outline of the technical platform being developed to support the corpus and a brief overview of some of the collections that form part of the initial version of the Australian National Corpus.
Topics	Corpus (creation, annotation, etc.), LR Infrastructures and Architectures, LR national/international projects, organizational/policy issues
Full paper	The Australian National Corpus: National Infrastructure for Language Resources
Bibtex	@InProceedings{CASSIDY12.400, author = {Steve Cassidy and Michael Haugh and Pam Peters and Mark Fallu}, title = {The Australian National Corpus: National Infrastructure for Language Resources}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} }