
Hoaxes are something that can not be avoided, especially in Indonesia, where the literacy rate in Indonesia is quite low, they are easy to believe in news without doing fact check. The worst thing is that news that is trusted by the public may not be read completely through the entire content. They believe, from the title alone could already covers the entire content of the news. Media in the other side, are also competing to make controversial titles so that their traffic is improved. The research that will be carried out by us is where we can classify a news whether it is a fact or a hoax from the title alone. The RoBERTa (A Robustly Optimized BERT Pretraining Approach) model will be used in this study, because in several previous studies RoBERTa has proven to be good for classification. The accuracy achieved in this study also reached 99.52% with an accuracy validation of 93.84% which shows that even with an imbalanced dataset the classification shows a promising result by using the RoBERTa model which data is balanced using the undersampling method.
DOI: https://doi.org/10.1109/aidas56890.2022.9918747
Publish Year: 2022