A hybrid random forest and k-nearest neighbors approach for breast cancer detection
Journal: International Journal of Advanced Computer Research (IJACR) (Vol.14, No. 66)Publication Date: 2024-03-30
Authors : Om Prakash Kumar; Animesh Kumar Dubey;
Page : 42-49
Keywords : Breast cancer detection; Random forest; k-nearest neighbors; Hybrid model; Machine learning.;
Abstract
In this paper, a novel hybrid approach was presented combining random forest (RF) and k-nearest neighbors (kNN) for the classification of breast cancer data. RF is selected for its robustness against overfitting and its ability to handle high-dimensional data effectively, providing a measure of feature importance and generalizing well due to its ensemble nature. kNN is chosen for its simplicity and effectiveness in capturing local data patterns. Our hybrid RF-kNN approach involves feature importance weighting in kNN, dynamic k selection, polynomial feature expansion, and ensemble output combination. The Wisconsin breast cancer database (WBCD) is used for experimentation, evaluated using 10-fold cross-validation. Performance metrics include accuracy, precision, recall, and F1-score. The results demonstrate that the hybrid RF-kNN model outperforms individual models, achieving superior performance across all metrics and data splits. This highlights the robustness and effectiveness of the hybrid model in reducing false positives and correctly identifying patients with breast cancer, making it a reliable model for breast cancer detection.
Other Latest Articles
- Facets related to challenges and prospects of cloud computing adoption in resource-constrained settings: systematic review analysis
- Greek sign language recognition using machine learning
- Breathing easy: Navigating Lung Health in a Changing Atmosphere
- Checklist of Aphidophagous Neuroptera (Insecta: Orthoptera) in India
- A brief overview of the relationship between Global warming and Insect outbreaks
Last modified: 2024-07-04 16:44:12