Title |
Grammatical Error Annotation for Korean Learners of Spoken English |
Authors |
Hongsuck Seo, Kyusong Lee, Gary Geunbae Lee, Soo-Ok Kweon and Hae-Ri Kim |
Abstract |
The goal of our research is to build a grammatical error-tagged corpus for Korean learners of Spoken English dubbed Postech Learner Corpus. We collected raw story-telling speech from Korean university students. Transcription and annotation using the Cambridge Learner Corpus tagset were performed by six Korean annotators fluent in English. For the annotation of the corpus, we developed an annotation tool and a validation tool. After comparing human annotation with machine-recommended error tags, unmatched errors were rechecked by a native annotator. We observed different characteristics between the spoken language corpus built in this study and an existing written language corpus. |
Topics |
Corpus (creation, annotation, etc.), Grammar and Syntax, Other |
Full paper |
Grammatical Error Annotation for Korean Learners of Spoken English |
Bibtex |
@InProceedings{SEO12.168,
author = {Hongsuck Seo and Kyusong Lee and Gary Geunbae Lee and Soo-Ok Kweon and Hae-Ri Kim}, title = {Grammatical Error Annotation for Korean Learners of Spoken English}, booktitle = {Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC'12)}, year = {2012}, month = {may}, date = {23-25}, address = {Istanbul, Turkey}, editor = {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Mehmet Uğur Doğan and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis}, publisher = {European Language Resources Association (ELRA)}, isbn = {978-2-9517408-7-7}, language = {english} } |