Enhancing classification in high-dimensional data with robust rMI-SVM feature selection
Fung Yuen Chin, Yong Kheng Goh
Abstract
Dealing with high-dimensional datasets presents notable challenges for classification modelling, primarily due to complexity and susceptibility to overfitting. Traditional feature selection methods frequently struggle to guarantee improved classification performance by including more features. Instead, they often rely on utilising the entire feature set. To address these challenges, a robust feature selection algorithm known as ranked mutual information for support vector machines (rMI-SVM) has been introduced. This approach mitigates the risk of overfitting by selecting features that augment the classification model with additional information, thereby ensuring enhanced performance as more features are selected. rMI-SVM can accommodate datasets with missing values regardless of data linearity as it does not require additional parameters or preset the number of features needed. The proposed method offers a solution to the challenges posed by high-dimensional data, and explicitly identifies the optimal number of features required for a classification model, thus circumventing the necessity of using the full feature set. These findings are supported by receiver operating characteristic (ROC) curves, which highlight the effectiveness of rMI-SVM in outperforming existing baselines and delivering a superior classification model performance.
Keywords
Classification; Feature selection; Machine learning; Mutual information; Support vector machine
DOI:
https://doi.org/10.11591/eei.v13i5.7938
Refbacks
There are currently no refbacks.
This work is licensed under a
Creative Commons Attribution-ShareAlike 4.0 International License .
<div class="statcounter"><a title="hit counter" href="http://statcounter.com/free-hit-counter/" target="_blank"><img class="statcounter" src="http://c.statcounter.com/10241695/0/5a758c6a/0/" alt="hit counter"></a></div>
Bulletin of EEI Stats
Bulletin of Electrical Engineering and Informatics (BEEI) ISSN: 2089-3191, e-ISSN: 2302-9285 This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU) .