ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Breast Cancer Diagnosis

Journal: IMPACT : International Journal of Research in Applied, Natural and Social Sciences ( IMPACT : IJRANSS ) (Vol.8, No. 7)

Publication Date:

Authors : ; ;

Page : 7-16

Keywords : K-Nearest Neighbors; Support Vector Machine; Confusion Matrix; Feature Extraction; Principle Component Analysis; Algorithm Comparision and Optimization;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

After lung cancer, Breast Cancer is the most intricate disease diagnosed among 8% of women. Breast Cancer is characterized by the mutation of genes, constant pain, changes in the size, color (redness), skin texture of breasts. The detection and treatment of breast cancer involves screening at an early stage by which the mortality can be reduced by a quarter. Data Mining techniques are used to predict the probability of a person who are affected by Breast Cancer using certain Data Mining Techniques. The techniques include KNN Classification, Support Vector Machine Algorithm, and Confusion Matrix Algorithm. The project is implemented using Python programming and deploys the analytical results as a comparision plot. The data-set has been fetched from the UCI repository, and it contains different attributes or factors such as patient_id, unif_cell-size, unif_cell-shape, marg_adhesion and bare_nuclei. In this project, the results are deployed as a comparision chart, which provides better visualization. Various techniques have been used to understand the behavior patterns and to predict the percentage of occurrence of breast cancer. We infer from our study that KNN is well suited algorithm for diagnosis with a accuracy of 97%.

Last modified: 2020-07-03 21:58:13