Klasifikasi Wilayah WHO Berdasarkan Data HIV/AIDS Global Menggunakan Pendekatan Machine Learning
DOI:
https://doi.org/10.62712/juktisi.v5i1.1140Keywords:
HIV/AIDS Global, Klasifikasi Wilayah WHO, Logistic Regression, Support Vector Machine, GridSearchCVAbstract
Pemetaan epidemiologi HIV/AIDS secara global berdasarkan wilayah World Health Organization (WHO) menjadi salah satu langkah strategis dalam mendukung distribusi bantuan medis yang tepat sasaran. Namun, perbedaan karakteristik data antar-wilayah yang sangat bervariasi membuat proses klasifikasi menjadi tantangan tersendiri. Penelitian ini membandingkan tiga model machine learning, yaitu Logistic Regression, Support Vector Machine (SVM) dasar, dan SVM teroptimasi, untuk mengklasifikasikan data HIV/AIDS ke dalam enam wilayah WHO: Africa, Americas, Eastern Mediterranean, Europe, South-East Asia, dan Western Pacific. Data yang digunakan mencakup estimasi jumlah penderita (nilai median, minimum, dan maksimum) serta persentase cakupan pengobatan antiretroviral (ART coverage %). Proses pra-pemrosesan meliputi analisis data eksploratif, imputasi median untuk menangani nilai kosong, dan normalisasi menggunakan StandardScaler. Evaluasi model dilakukan melalui validasi silang 5-fold dan matriks konfusi. Hasil menunjukkan bahwa data memiliki tingkat tumpang tindih antar-kelas yang cukup tinggi. Logistic Regression menghasilkan akurasi pengujian 39,29% dengan CV Mean 44,05%, sementara SVM teroptimasi dengan parameter C=10, gamma='scale', dan kernel RBF mencapai CV Mean tertinggi sebesar 49,05%. Analisis lebih lanjut mengungkapkan bahwa ART coverage % merupakan fitur paling dominan dalam membedakan karakteristik beban epidemiologi antar-wilayah.
Downloads
References
WHO, "HIV data and statistics," World Health Organization, 2025. [Online]. Available: https://www.who.int/teams/global-hiv-hepatitis-and-stis-programmes/hiv/strategic-information/hiv-data-and-statistics
UNAIDS/WHO, "HIV statistics, globally and by WHO region, 2025," WHO Information Sheet, 2025. [Online]. Available: https://cdn.who.int/media/docs/default-source/hq-hiv-hepatitis-and-stis-library/who-ias-hiv-statistics_2025-new.pdf
S. Rasheed et al., "Heart Disease Prediction Using GridSearchCV and Random Forest," EAI Endorsed Transactions on Pervasive Health and Technology, vol. 10, Mar. 2024. https://doi.org/10.4108/eetpht.10.5523
A. Khalid et al., "Influence of Optimal Hyperparameters on the Performance of Machine Learning Algorithms for Predicting Heart Disease," Processes, vol. 11, no. 3, p. 734, 2023. https://doi.org/10.3390/pr11030734
S. Domínguez-Rodríguez et al., "Scalable and robust machine learning framework for HIV classification using clinical and laboratory data," Scientific Reports, 2025. https://doi.org/10.1038/s41598-025-00085-4
B. Olatosi et al., "Machine Learning Approaches to Identify Communities with High HIV Prevalence in Resource-Limited Settings," medRxiv, 2025. https://doi.org/10.1101/2025.11.10.25339949
T. Wongvorachan, S. He, and O. Bulut, "A Comparison of Undersampling, Oversampling, and SMOTE Methods for Dealing with Imbalanced Classification in Educational Data Mining," Information, vol. 14, no. 1, p. 54, 2023. https://doi.org/10.3390/info14010054
Z. Zhang et al., "Multi-Class Imbalanced Learning with Support Vector Machines via Differential Evolution," arXiv, 2025. https://arxiv.org/abs/2502.14597
J. Hemmatian et al., "Addressing imbalanced data classification with Cluster-Based Reduced Noise SMOTE," PLOS ONE, 2025. https://doi.org/10.1371/journal.pone.0317396
N. U. Maulidevi and K. Surendro, "SMOTE-LOF for noise identification in imbalanced data classification," Journal of King Saud University – Computer and Information Sciences, vol. 34, no. 6, pp. 3413–3423, 2022. https://doi.org/10.1016/j.jksuci.2020.02.004
N. V. Chawla, K. W. Bowyer, L. O. Hall, and W. P. Kegelmeyer, "SMOTE: Synthetic Minority Over-sampling Technique," Journal of Artificial Intelligence Research, vol. 16, pp. 321–357, 2002. https://doi.org/10.1613/jair.953
T. Wongvorachan, "An Investigation of SMOTE Based Methods for Imbalanced Datasets With Data Complexity Analysis," IEEE Transactions on Knowledge and Data Engineering, vol. 35, 2023. https://doi.org/10.1109/TKDE.2022.3179381
H. Abid et al., "Data oversampling and imbalanced datasets: an investigation of performance for machine learning and feature engineering," Journal of Big Data, Springer, 2024. https://doi.org/10.1186/s40537-024-00943-4
P. M. Nyarige et al., "The successful scaling-up of antiretroviral therapy globally has many lessons for advancing universal health coverage," BMC Global and Public Health, 2025. https://pmc.ncbi.nlm.nih.gov/articles/PMC12860156/
Global Burden of Disease 2017 HIV Collaborators, "Global, regional, and national incidence, prevalence, and mortality of HIV, 1980–2017, and forecasts to 2030," The Lancet HIV, vol. 6, no. 12, pp. e831–e859, 2019. https://doi.org/10.1016/S2352-3018(19)30196-1
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Muhamad Agry Sugiarto, Hafiyyan Putra Pratama

This work is licensed under a Creative Commons Attribution 4.0 International License.















