The effect of feature selection with optimization on taxi fare prediction

Amany A. Naim, Asmaa Hekal Omar, Asmaa A. Ibrahim, Asmaa Mohamed, Naglaa M. Mostafa

Abstract


Feature selection plays a key influence in machine learning (ML); the main objective of feature selection is to eliminate irrelevant and redundant variables in different classification problems to improve the performance of the learning algorithms. Classification accuracy is improved by reducing the number of selected features. Many real-world problems, such as taxi fare can be predicted by ML. This paper proposes feature selection using genetic algorithm (GA) optimization to predict taxi fare. Experiments are performed on real datasets of taxi fare, and this paper uses eight classifiers to evaluate the selected features. The performance of the classifiers is assessed using various performance metrics. The results are compared with feature selection without optimization. The proposed method records high classification accuracy when evaluated by three types of classifiers (random forest, AdaBoost, and Gradient Boost). The results indicate that the prediction accuracy of the proposed method is 99.7% on taxi fare dataset.

Keywords


Fare prediction; Feature selection; Genetic algorithm; Machine learning; Optimization; Predictive analysis; Supervised learning

Full Text:

PDF


DOI: https://doi.org/10.11591/eei.v14i3.8658

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of EEI Stats

Bulletin of Electrical Engineering and Informatics (BEEI)
ISSN: 2089-3191, e-ISSN: 2302-9285
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).