Title |
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese |
Authors |
Patricia Gonçalves, Rita Santos and António Branco |
Abstract |
This paper presents CINTIL-QATreebank, a treebank composed of Portuguese sentences that can be used to support the development of Question Answering systems. To create this treebank, we use declarative sentences from the pre-existing CINTIL-Treebank and manually transform their syntactic structure into a non-declarative sentence. Our corpus includes two clause types: interrogative and imperative clauses. CINTIL-QATreebank can be used in language science and techology general research, but it was developed particularly for the development of automatic Question Answering systems. The non-declarative entences are annotated with several layers of linguistic information, namely (i) trees with information on constituency and grammatical function; (ii) sentence type; (iii) interrogative pronoun; (iv) question type; and (v) semantic type of expected answer. Moreover, these non-declarative sentences are paired with their declarative counterparts and associated with the expected answer snippets. |
Topics |
Corpus (creation, annotation, etc.), Grammar and Syntax, Question Answering |
Full paper |
Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese |
Bibtex |
@InProceedings{GONALVES12.460,
author = {Patricia Gonçalves and Rita Santos and António Branco}, title = {Treebanking by Sentence and Tree Transformation: Building a Treebank to support Question Answering in Portuguese}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |