Comparison of User Review Sentiments of Brainly and Ruangguru Applications Using Naïve Bayes, KNN, Decision Tree
DOI:
https://doi.org/10.62712/juktisi.v5i1.1294Abstract
Sentiment analysis is an important way to understand user opinions about digital education apps, as the number of reviews on the Google Play Store is too large to be manually analyzed one by one. This study compares three machine learning methods, namely Naïve Bayes, K-Nearest Neighbor (KNN), and Decision Tree, to classify sentiments from user reviews of the Brainly and Ruang Guru apps. Data were collected by scraping 8,000 reviews from the Google Play Store, i.e., 4,000 reviews per app, from May to June 2026; after removing duplicate reviews, 6,151 reviews remained, consisting of 2,836 reviews for Brainly and 3,315 reviews for Ruang Guru. Sentiment labels were arranged based on the number of stars (1–3 means negative, 4–5 means positive), resulting in an unbalanced distribution of 79.8% positive and 20.2% negative. The text was processed through nine pre-processing stages specifically used for informal Indonesian. Features were then extracted using the TF-IDF method, resulting in 2,398 features and a viewing rate of 99.78%. The training data was quantity-equalized using the SMOTE technique, and the model was optimized with GridSearchCV using StratifiedKFold with 5 data splits. In the tuning and SMOTE scenarios, the Naïve Bayes method showed the best performance with an accuracy of 82.78%, an F1-Score of 83.79%, and an ROC-AUC of 88.44%, which was better than Decision Tree and KNN. Interestingly, the Naïve Bayes method without using SMOTE actually achieved the highest overall accuracy of 88.95%, indicating that using SMOTE on high-dimensional TF-IDF data does not always improve model performance. Differentiating keyword analysis helps to identify positive sentiments such as 'helpful', 'easy', and 'best', as well as negative sentiments such as 'trash', 'ads', and 'error', which can be used as a benchmark in providing service quality by the second application developer.
Downloads
References
[1] R. F. Chandra and D. A. Putri, "Implementasi Metode Naïve Bayes Pada Ulasan Pengguna Aplikasi Dana Di Google Play Store," Jurnal Infortech, vol. 7, no. 1, pp. 64–69, 2025, doi: 10.31294/infortech.v7i1.12366.
[2] S. Nurhikmah, R. Ramadani, and G. Triyono, "Analisis Sentimen pada Ulasan Aplikasi Wondr di Play Store dengan Metode Naive Bayes," Jurnal Algoritma, vol. 22, no. 2, pp. 1919–1930, 2025, doi: 10.33364/algoritma/v.22-2.2507.
[3] M. Ulfa, R. H. Kusumodestoni, and A. Sucipto, "Analisis Sentimen Review Aplikasi Identitas Kependudukan Digital di Google Play Store Menggunakan KNN," Jurnal Informatika Teknologi dan Sains (Jinteks), vol. 6, no. 4, pp. 1155–1165, 2024, doi: 10.51401/jinteks.v6i4.4963.
[4] A. Azmi, Y. Hendriyani, I. P. Dewi, dan K. Budayawan, "Analisis Sentimen Ulasan Pengguna Aplikasi Bibit Menggunakan Algoritma Naive Bayes dan K-Nearest Neighbors (KNN)," Jurnal Pendidikan Tambusai, vol. 9, no. 2, pp. 14040–14048, 2025, doi: 10.31004/jptam.v9i2.27464.
[5] A. D. Sugiarto and M. S. Utomo, "Analisis Sentimen Ulasan Pengguna BCA Mobile di Google Play Store Menggunakan Metode Decision Tree," JATI (Jurnal Mahasiswa Teknik Informatika), vol. 9, no. 5, 2025, doi: 10.36040/jati.v9i5.14969.
[6] M. Iqbal, A. D. Wiranata, R. Suwito, and R. F. Ananda, "Perbandingan Algoritma Naïve Bayes, KNN, dan Decision Tree terhadap Ulasan Aplikasi Threads dan Twitter," KLIK: Kajian Ilmiah Informatika dan Komputer, vol. 4, no. 3, pp. 1799–1807, 2023, doi: 10.30865/klik.v4i3.1402.
[7] G. S. Al-Husna, D. Asmarajati, I. A. Ihsannuddin, and R. Mahmudati, "Perbandingan Metode Naïve Bayes dan Support Vector Machine untuk Analisis Sentimen pada Ulasan Pengguna Aplikasi LinkedIn," STORAGE: Jurnal Ilmiah Teknik dan Ilmu Komputer, vol. 3, no. 2, pp. 139–144, 2024, doi: 10.55123/storage.v3i2.3602.
[8] T. Wiratama Putra, A. Triayudi, and Andrianingsih, "Analisis Sentimen Pembelajaran Daring menggunakan Metode Naïve Bayes, KNN, dan Decision Tree," Jurnal JTIK (Jurnal Teknologi Informasi dan Komunikasi), vol. 6, no. 1, pp. 20–26, 2022, doi: 10.35870/jtik.v6i1.368.
[9] A. Pinkan M. et al., "Perbandingan Metode Naïve Bayes, Decision Tree, dan KNN dalam Analisis Sentimen Aplikasi Gojek di Playstore," ZONAsi: Jurnal Sistem Informasi, vol. 7, no. 2, pp. 725–734, 2025, doi: 10.31849/zn.v7i2.26566.
[10] A. Syukron, Sardiarinto, E. Saputro, dan P. Widodo, "Penerapan Metode Smote Untuk Mengatasi Ketidakseimbangan Kelas Pada Prediksi Gagal Jantung," Jurnal Teknologi Informasi dan Terapan (J-TIT), vol. 10, no. 1, pp. 47–50, 2023, doi: 10.25047/jtit.v10i1.313.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Adarianus Windi, Panny Agustia Rahayuningsih, Muhammad Rezki

This work is licensed under a Creative Commons Attribution 4.0 International License.















