An in-depth exploration of Bangla blog post classification

Tanvirul Islam; Ashik Iqbal Prince; Md. Mehedee Zaman Khan; Md. Ismail Jabiullah; Md. Tarek Habib

doi:10.11591/eei.v10i2.2873

Abstract

Bangla blogs are growing rapidly in the information era, and consequently they have diverse layouts and categorizations. In such a situation, automated blog post classification is a comparatively efficient way to organize Bangla blog posts in a standard way so that users can easily find the articles that interest them. In this research, nine supervised learning models, namely support vector machine (SVM), multinomial naïve Bayes (MNB), multi-layer perceptron (MLP), k-nearest neighbours (k-NN), stochastic gradient descent (SGD), decision tree, perceptron, ridge classifier, and random forest, are used and compared for the classification of Bangla blog posts. Moreover, to evaluate performance in predicting blog posts across eight categories, three feature extraction techniques are applied: unigram TF-IDF (term frequency-inverse document frequency), bigram TF-IDF, and trigram TF-IDF. The majority of the classifiers achieve above 80% accuracy. Other performance evaluation metrics also show good results when comparing the selected classifiers.
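
To make the three feature extraction settings concrete, the following is a minimal sketch of word-level n-gram TF-IDF in plain Python. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not say which TF-IDF variant or toolkit was used, so this sketch assumes raw term frequency, an unsmoothed logarithmic IDF, and n-grams of exactly size n (1 for unigram, 2 for bigram, 3 for trigram). The function names `word_ngrams` and `tfidf` and the toy documents are hypothetical placeholders.

```python
import math
from collections import Counter


def word_ngrams(text, n):
    """Return the word-level n-grams of `text`, each joined by a space."""
    tokens = text.split()  # assumption: whitespace tokenization
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf(docs, n):
    """Return one {n-gram: weight} dict per document.

    Assumed weighting (not confirmed by the abstract):
    tf = count / total n-grams in the document, idf = ln(N / df).
    """
    counts_per_doc = [Counter(word_ngrams(doc, n)) for doc in docs]

    # Document frequency: in how many documents each n-gram occurs.
    doc_freq = Counter()
    for counts in counts_per_doc:
        doc_freq.update(counts.keys())

    num_docs = len(docs)
    weights = []
    for counts in counts_per_doc:
        total = sum(counts.values())
        weights.append({
            gram: (count / total) * math.log(num_docs / doc_freq[gram])
            for gram, count in counts.items()
        })
    return weights


# Hypothetical toy corpus; real input would be Bangla blog post text.
docs = [
    "cricket match today",
    "election result today",
    "cricket team wins match",
]
unigram_features = tfidf(docs, 1)
bigram_features = tfidf(docs, 2)
trigram_features = tfidf(docs, 3)
```

Each resulting dictionary can be turned into a feature vector and passed to any of the compared classifiers. Library implementations often use smoothed IDF and vector normalization, so their weights will generally differ from this sketch.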

Keywords

Bangla blog; Bangla text classification; Bigram; Supervised machine learning; TF-IDF; Trigram; Unigram

DOI: https://doi.org/10.11591/eei.v10i2.2873

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of Electrical Engineering and Informatics (BEEI)
ISSN: 2089-3191, e-ISSN: 2302-9285
This journal is published by the Institute of Advanced Engineering and Science (IAES) in collaboration with Intelektual Pustaka Media Utama (IPMU).
