An interpretable machine learning-based breast cancer classification using XGBoost, SHAP, and LIME
Monoronjon Dutta, Khondokar Md. Mehedi Hasan, Alifa Akter, Md. Hasibur Rahman, Md. Assaduzzaman
Abstract
Globally, breast cancer is among the most prevalent and deadly tumors that affect women. Early and accurate identification of breast cancer is essential for effective treatment planning and improving patient outcomes. This research focuses on improving breast cancer classification accuracy through machine learning (ML) methodologies, emphasizing interpretability. The study utilized the chi-square method to enhance model testing performance by pinpointing the most significant features for further analysis. The study also improved data quality by identifying and removing outliers, thus minimizing the influence of data irregularities on the performance of the models. For classification, the study evaluated six different ML algorithms—namely extreme gradient boosting (XGBoost), decision tree (DT), AdaBoost (AB), support vector machine (SVM), gradient boosting (GB), and K-nearest neighbors (KNN)—each applied to distinguish between the two variants of breast cancer. Among these, the XGBoost classifier emerged as the most accurate, achieving an impressive 99.30% accuracy rate. Moreover, the research incorporated shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME) methods to boost the interpretability of the proposed model, offering crucial insights into the model’s decision-making process. Applying these interpretability techniques provided significant insights into the predictive factors influencing healthcare outcomes, ensuring the classification approach’s transparency and reliability.
Keywords
Breast cancer detection; Chi-square; Feature selection; SHAP and LIME; Machine learning techniques
DOI:
https://doi.org/10.11591/eei.v13i6.7866
Refbacks
There are currently no refbacks.
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License .
<div class="statcounter"><a title="hit counter" href="http://statcounter.com/free-hit-counter/" target="_blank"><img class="statcounter" src="http://c.statcounter.com/10241695/0/5a758c6a/0/" alt="hit counter"></a></div>
Bulletin of EEI Stats
Bulletin of Electrical Engineering and Informatics (BEEI) ISSN: 2089-3191, e-ISSN: 2302-9285 This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU) .