Title |
Applying Computational Linguistic Techniques in a Documentary Project for Q’anjob’al (Mayan, Guatemala) |
Author(s) |
Jonas Kuhn, B’alam Mateo-Toledo; The University of Texas at Austin, Department of Linguistics |
Session |
P19-SW |
Abstract |
This paper reports on a number of experiments in which we applied standard NLP techniques in the context of the documentation of endangered languages. We concentrated on the use of existing, freely available toolkits. Specifically, we explored the use of Finite-State Morphological Analysis, Maximum Entropy Part-of-Speech Tagging, and N-Gram Language Modeling. |
Keyword(s) |
Endangered languages, corpora, finite-state morphology, Maximum Entropy tagging, N-gram language models |
Language(s) |
Q’anjob’al (Mayan, Guatemala) |
Full Paper |