Comparison of Various Models in the Context of Language Identification (Indo Aryan Languages)
Journal: International Journal of Science and Research (IJSR) (Vol.10, No. 3)Publication Date: 2021-03-05
Authors : Salman Alam;
Page : 185-188
Keywords : Hindi; Magahi; Bhojpuri;
Abstract
Automatic language detection is a text classification task in which language is identified in a given multilingual text by the machine. This paper compares the different models of machine learning algorithm in the context of language identification. The corpus includes five major Indo-Aryan Language which are closely related to each other like Hindi, Bhojpuri, Awadhi, Maghahi and Braj. In this paper I have compared models like Random forest classifier, SVC, SGD Classifier, Multi-nominal logistic Regression, Gaussian Naïve Bayes and Bernoulli Naïve Bayes. Out of these models Multi-nominal Naïve Bayes has attained the best accuracy of 74 %.
Other Latest Articles
- Experiences in the Commercial Application of GSHP Systems in Kosovo
- A New Hope - Teledentistry
- To Measure Patient Satisfaction with Physical Therapy - A Cross Sectional Study
- Assessment of Analgesic Activity of the Extract of Sesbania genus Plants in Animal Models
- Spontaneous Resolution of Idiopathic Hydrops in One Twin of a Dichorionic Twin Pregnancy
Last modified: 2021-06-26 18:42:03