A Comprehensive Analysis of Handling Imbalanced Dataset
Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.10, No. 2)Publication Date: 2021-04-09
Authors : Adeoti Babajide Ebenezer Boyinbode O.K Oladunjoye Michael Idowu;
Page : 454-463
Keywords : Accuracy paradox; Cost sensitive; Dataset; Ensemble; Imbalanced; Machine Learning; Sampling;
Abstract
Classification is a major obstacle in Machine Learning generally and also specific when tackling class imbalance problem. A dataset is said to be imbalanced if a class we are interested in falls to the minority class and appears scanty when compared to the majority class, the minority class is also known as the positive class while the majority class is also known as the negative class. Class imbalance has been a major bottleneck for Machine Learning scientist as it often leads to using wrong model for different purposes, this Survey will lead researchers to choose the right model and the best strategies to handle imbalance dataset in the course of tackling machine learning problems. Proper handling of class imbalance dataset could leads to accurate and good result. Handling class imbalance data in a conventional manner, especially when the level of imbalance is high may leads to accuracy paradox (an assumption of realizing 99% accuracy during evaluation process when the class distribution is highly imbalanced), hence imbalance class distribution requires special consideration, and for this purpose we dealt extensively on handling and solving imbalanced class problem in machine learning, such as Data Sampling Approach, Cost sensitive learning approach and Ensemble Approach. Key words: A
Other Latest Articles
- Identification of Online Statistical Translation and Text Issues in Communication Technologies
- A Proposed Ideal Network Design for Collaborative Workspace Businesses
- Eco-friendly Management of Harmful Colonization of Striped Mealybug (Ferrisia virgata) over plants by using Herbal Pesticide
- Miocardiopatia Dilatada após uso de Adalimumabe em paciente com Pioderma Gangrenoso
- Qualidade de vida dos pacientes com chikungunya: fatores associados durante uma epidemia ocorrida no nordeste do Brasil
Last modified: 2021-04-10 14:11:57