W11 2018 Proceedings

Summary of the paper

Title	Automatic Language Identification System for Hindi and Magahi
Authors	Priya Rani, Atul Kr. Ojha and Girish Nath Jha
Abstract	Language identification has become a prerequisite for all kinds of automated text processing systems. In this paper, we present a rule-based language identifier tool for two closely related Indo-Aryan languages: Hindi and Magahi. The system currently achieves an accuracy of approximately 86.34%, which we hope to improve in the future. Automatic language identification is significant for the accuracy of web crawler output.
Topics	Rule-Based Approach, Language Identification, Hindi and Magahi
Full paper	Automatic Language Identification System for Hindi and Magahi
Bibtex	@InProceedings{RANI18.16,
  author = {Priya Rani and Atul Kr. Ojha and Girish Nath Jha},
  title = {Automatic Language Identification System for Hindi and Magahi},
  booktitle = {Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)},
  year = {2018},
  month = {may},
  date = {7-12},
  location = {Miyazaki, Japan},
  editor = {Girish Nath Jha and Kalika Bali and Sobha L and Atul Kr. Ojha},
  publisher = {European Language Resources Association (ELRA)},
  address = {Paris, France},
  isbn = {979-10-95546-09-2},
  language = {english}
}