| Title | Finite-State Morphological Transducers for Three Kypchak Languages | 
  
  | Authors | Jonathan Washington, Ilnar Salimzyanov and Francis Tyers | 
  
  | Abstract | This paper describes the development of free/open-source finite-state morphological transducers for three Turkic languages―Kazakh, Tatar, and Kumyk―representing one language from each of the three sub-branches of the Kypchak branch of Turkic. The finite-state toolkit used for the work is the Helsinki Finite-State Toolkit (HFST). This paper describes how the development of a transducer for each subsequent closely-related language took less development time. An evaluation is presented which shows that the transducers all have a reasonable coverage―around 90\%―on freely available corpora of the languages, and high precision over a manually verified test set. | 
  
  | Topics | Part-of-Speech Tagging, Endangered Languages | 
  
  | Full paper  | Finite-State Morphological Transducers for Three Kypchak Languages | 
  
  | Bibtex | @InProceedings{WASHINGTON14.1207, author =  {Jonathan Washington and Ilnar Salimzyanov and Francis Tyers},
 title =  {Finite-State Morphological Transducers for Three Kypchak Languages},
 booktitle =  {Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)},
 year =  {2014},
 month =  {may},
 date =  {26-31},
 address =  {Reykjavik, Iceland},
 editor =  {Nicoletta Calzolari (Conference Chair) and Khalid Choukri and Thierry Declerck and Hrafn Loftsson and Bente Maegaard and Joseph Mariani and Asuncion Moreno and Jan Odijk and Stelios Piperidis},
 publisher =  {European Language Resources Association (ELRA)},
 isbn =  {978-2-9517408-8-4},
 language =  {english}
 }
 |