An interpretable machine learning-based breast cancer classification using XGBoost, SHAP, and LIME

Monoronjon Dutta, Khondokar Md. Mehedi Hasan, Alifa Akter, Md. Hasibur Rahman, Md. Assaduzzaman

Abstract


Globally, breast cancer is among the most prevalent and deadly tumors that affect women. Early and accurate identification of breast cancer is essential for effective treatment planning and improving patient outcomes. This research focuses on improving breast cancer classification accuracy through machine learning (ML) methodologies, emphasizing interpretability. The study utilized the chi-square method to enhance model testing performance by pinpointing the most significant features for further analysis. The study also improved data quality by identifying and removing outliers, thus minimizing the influence of data irregularities on the performance of the models. For classification, the study evaluated six different ML algorithms—namely extreme gradient boosting (XGBoost), decision tree (DT), AdaBoost (AB), support vector machine (SVM), gradient boosting (GB), and K-nearest neighbors (KNN)—each applied to distinguish between the two variants of breast cancer. Among these, the XGBoost classifier emerged as the most accurate, achieving an impressive 99.30% accuracy rate. Moreover, the research incorporated shapley additive explanations (SHAP) and local interpretable model-agnostic explanations (LIME) methods to boost the interpretability of the proposed model, offering crucial insights into the model’s decision-making process. Applying these interpretability techniques provided significant insights into the predictive factors influencing healthcare outcomes, ensuring the classification approach’s transparency and reliability.

Keywords


Breast cancer detection; Chi-square; Feature selection; SHAP and LIME; Machine learning techniques

Full Text:

PDF


DOI: https://doi.org/10.11591/eei.v13i6.7866

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of EEI Stats

Bulletin of Electrical Engineering and Informatics (BEEI)
ISSN: 2089-3191, e-ISSN: 2302-9285
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).