Identifying clickbait in online news using deep learning

Andry Chowanda, Nadia Nadia, Lie Maximilianus Maria Kolbe

Abstract


Several industries use clickbait techniques as their strategy to increase the number of readers for their news. Some news companies implement catchy headlines and images in their news article links, with the expectation that the readers will be interested in reading the news and click the provided link. The majority of the news is not hoax news. However, the content might not be as grand as the catchy headlines and images provided to the readers. This research aims to explore the classification model using machine learning to identify if the headlines are classified as clickbait in online news. This research explores several machine learning techniques to classify clickbait in online news and comprehensively explain the results. Several popular machine learning techniques were implemented and explored in this research. The results demonstrate that the model trained with fast large margin provides the best accuracy and classification error (90% and 10%, respectively). Moreover, to improve the performance, bidirectional encoder representations from transformers architecture was used to model clickbait in online news. The best BERT model achieved 98.86% in the test accuracy. BERT model requires more time to train (0.9 hour) compared to machine learning (0.4 hour).

Keywords


BERT; Clickbait classification; Deep learning; Machine learning; Online news

Full Text:

PDF


DOI: https://doi.org/10.11591/eei.v12i3.4444

Refbacks

  • There are currently no refbacks.


Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

Bulletin of EEI Stats