SUMMARY: Session P22-W
Title | A Part-of-speech tagger for Irish using Finite-State Morphology and Constraint Grammar Disambiguation |
---|---|
Authors | E. Dhonnchadha, J. Genabith |
Abstract | This paper describes the methodology used to develop a part-of-speech tagger for Irish, which is used to annotate a corpus of 30 million words of text with part-of-speech tags and lemmas. The tagger is evaluated on a manually disambiguated test corpus and currently achieves 95% accuracy on unrestricted text. To our knowledge, this is the first part-of-speech tagger for Irish. |