Title |
The Political Speech Corpus of Bulgarian |
Authors |
Petya Osenova and Kiril Simov |
Abstract |
The paper introduces the Political Speech Corpus of Bulgarian. First, its current state has been discussed with respect to its size, coverage, genre specification and related online services. Then, the focus goes to the annotation details. On the one hand, the layers of linguistic annotation are presented. On the other hand, the compatibility with CLARIN technical Infrastructure is explained. Also, some user-based scenarios are mentioned to demonstrate the corpus services and applicability. |
Topics |
Corpus (creation, annotation, etc.), Document Classification, Text categorisation |
Full paper |
The Political Speech Corpus of Bulgarian |
Bibtex |
@InProceedings{OSENOVA12.956,
author = {Petya Osenova and Kiril Simov}, title = {The Political Speech Corpus of Bulgarian}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |