An effective classification approach for big data with parallel generalized Hebbian algorithm

Ahmed Hussein Ali, Royida A. Ibrahem Alhayali, Mostafa Abdulghafoor Mohammed, Tole Sutikno

Abstract


Advancements in information technology is contributing to the excessive rate of big data generation recently. Big data refers to datasets that are huge in volume and consumes much time and space to process and transmit using the available resources. Big data also covers data with unstructured and structured formats. Many agencies are currently subscribing to research on big data analytics owing to the failure of the existing data processing techniques to handle the rate at which big data is generated. This paper presents an efficient classification and reduction technique for big data based on parallel generalized Hebbian algorithm (GHA) which is one of the commonly used principal component analysis (PCA) neural network (NN) learning algorithms. The new method proposed in this study was compared to the existing methods to demonstrate its capabilities in reducing the dimensionality of big data. The proposed method in this paper is implemented using Spark Radoop platform.

Keywords


Big data; Generalized Hebbian algorithm; Machine learning; Neural network; Principal component analysis; Spark Radoop

Full Text:

PDF


DOI: https://doi.org/10.11591/eei.v10i6.3135

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of EEI Stats