Differentiated Thyroid Cancer Recurrence Prediction Using Boosting Algorithms

Authors

  • Mucahid Mustafa Saritas Selcuk University
  • Muslume Beyza Yildiz Selcuk University
  • Talha Alperen Cengel Selcuk University
  • Murat Koklu Selcuk University

DOI:

https://doi.org/10.62712/juktisi.v4i2.490

Keywords:

Differentiated Thyroid Cancer, AdaBoost, Gradient Boosting, CatBoost, machine learning

Abstract

This study aims to compare the performance of AdaBoost, Gradient Boosting, and CatBoost algorithms in predicting the recurrence risk of Differentiated Thyroid Cancer (DTC). DTC is the most common type of thyroid cancer, and due to its recurrence risk, accurate and effective prediction models are needed. In this study, a dataset containing clinical and pathological data of patients diagnosed with DTC was used. The performance of the models was evaluated using metrics such as accuracy, precision, recall, and F1 score. The results revealed that the CatBoost algorithm achieved the highest performance, with an accuracy of 98.70% and an F1 score of 98.69% on the test data. The Gradient Boosting algorithm ranked second with an accuracy of 97.40% and an F1 score of 97.40%, while the AdaBoost algorithm showed the lowest performance, with an accuracy of 96.10% and an F1 score of 96.14%. These findings indicate that the CatBoost algorithm outperforms the other algorithms in predicting DTC recurrence risk and is a suitable candidate for use in clinical decision support systems.

Downloads

Download data is not yet available.

References

REFERENCES

A. Shokoohi et al., "Treatment for recurrent differentiated thyroid cancer: a Canadian population based experience," Cureus, vol. 12, no. 2, 2020.

M. E. Cabanillas, D. G. McFadden, and C. Durante, "Thyroid cancer," The Lancet, vol. 388, no. 10061, pp. 2783-2795, 2016.

W. R. Burns and M. A. Zeiger, "Differentiated thyroid cancer," in Seminars in oncology, 2010, vol. 37, no. 6: Elsevier, pp. 557-566, doi: 10.1053/j.seminoncol.2010.10.008.

M. Schlumberger and S. Leboulleux, "Current practice in patients with differentiated thyroid cancer," Nature Reviews Endocrinology, vol. 17, no. 3, pp. 176-188, 2021, doi: 10.1038/s41574-020-00448-z.

B. Schmidbauer, K. Menhart, D. Hellwig, and J. Grosse, "Differentiated thyroid cancer—treatment: state of the art," International journal of molecular sciences, vol. 18, no. 6, p. 1292, 2017, doi: 10.3390/ijms18061292.

Y. Habchi et al., "Ai in thyroid cancer diagnosis: Techniques, trends, and future directions," Systems, vol. 11, no. 10, p. 519, 2023, doi: 10.3390/systems11100519.

Y. S. Taspinar, I. Cinar, and M. Koklu, "Classification by a stacking model using CNN features for COVID-19 infection diagnosis," Journal of X-ray science and technology, vol. 30, no. 1, pp. 73-88, 2022, doi: 10.3233/XST-211031.

M. Koklu and K. Tutuncu, "Classification of chronic kidney disease with most known data mining methods," Int. J. Adv. Sci. Eng. Technol, vol. 5, no. 2, pp. 14-18, 2017.

M. M. Saritas, M. Koklu, and I. A. Ozkan, "A Novel Embedded System on Cold Box Design for The Cold Chain," International Journal of Applied Mathematics Electronics and Computers, vol. 5, no. 4, pp. 67-70, 2017, doi: 10.18100/ijamec.2017436077.

N. Schwalbe and B. Wahl, "Artificial intelligence and the future of global health," The Lancet, vol. 395, no. 10236, pp. 1579-1586, 2020.

T. Panch, P. Szolovits, and R. Atun, "Artificial intelligence, machine learning and health systems," Journal of global health, vol. 8, no. 2, 2018, doi: 10.7189/jogh.08.020303.

C. Bentéjac, A. Csörgő, and G. Martínez-Muñoz, "A comparative analysis of gradient boosting algorithms," Artificial Intelligence Review, vol. 54, pp. 1937-1967, 2021, doi: 10.1007/s10462-020-09896-5.

C. Ying, M. Qi-Guang, L. Jia-Chen, and G. Lin, "Advance and prospects of AdaBoost algorithm," Acta Automatica Sinica, vol. 39, no. 6, pp. 745-758, 2013, doi: 10.1016/S1874-1029(13)60052-X.

C. Zhang, X. Chen, S. Wang, J. Hu, C. Wang, and X. Liu, "Using CatBoost algorithm to identify middle-aged and elderly depression, national health and nutrition examination survey 2011–2018," Psychiatry Research, vol. 306, p. 114261, 2021, doi: 10.1016/j.psychres.2021.114261.

S. Borzooei and A. Tarokhian. "Differentiated Thyroid Cancer Recurrence," UCI Machine Learning Repository, 2023. [Online]. Available: https://doi.org/10.24432/C5632J.

S. Yadav and S. Shukla, "Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification," in 2016 IEEE 6th International conference on advanced computing (IACC), 2016: IEEE, pp. 78-83, doi: 10.1109/IACC.2016.25.

Y. S. Taspinar, M. Koklu, and M. Altin, "Identification of the english accent spoken in different countries by the k-nearest neighbor method," International Journal of Intelligent Systems and Applications in Engineering, 2020. [Online]. Available: https://hdl.handle.net/20.500.13091/1351.

Y. S. Taspinar, M. M. Saritas, İ. Cinar, and M. Koklu, "Gender determination using voice data," International Journal of Applied Mathematics Electronics and Computers, vol. 8, no. 4, pp. 232-235, 2020, doi: 10.18100/ijamec.809476.

M. Koklu and K. Tutuncu, "Tree based classification methods for occupancy detection," in IOP Conference Series: Materials Science and Engineering, 2019, vol. 675, no. 1: IOP Publishing, p. 012032, doi: 10.1088/1757-899X/675/1/012032.

I. Cinar, Y. S. Taspinar, R. Kursun, and M. Koklu, "Identification of corneal ulcers with pre-trained AlexNet based on transfer learning," in 2022 11th Mediterranean conference on embedded computing (MECO), 2022: IEEE, pp. 1-4, doi: 10.1109/MECO55406.2022.9797218.

R. Kursun, I. Cinar, Y. S. Taspinar, and M. Koklu, "Flower recognition system with optimized features for deep features," in 2022 11th Mediterranean conference on embedded computing (MECO), 2022: IEEE, pp. 1-4, doi: 10.1109/MECO55406.2022.9797103.

H. Isik et al., "Maize seeds forecasting with hybrid directional and bi‐directional long short‐term memory models," Food Science & Nutrition, vol. 12, no. 2, pp. 786-803, 2024, doi: 10.1002/fsn3.3783.

R. Kursun, E. T. Yasin, and M. Koklu, "Machine learning-based classification of infected date palm leaves caused by dubas insects: a comparative analysis of feature extraction methods and classification algorithms," in 2023 Innovations in Intelligent Systems and Applications Conference (ASYU), 2023: IEEE, pp. 1-6, doi: 10.1109/ASYU58738.2023.10296641.

R. Schönhof, A. Werner, J. Elstner, B. Zopcsak, R. Awad, and M. Huber, "Feature visualization within an automated design assessment leveraging explainable artificial intelligence methods," Procedia CIRP, vol. 100, pp. 331-336, 2021, doi: 10.1016/j.procir.2021.05.075.

T. Hastie, S. Rosset, J. Zhu, and H. Zou, "Multi-class adaboost," Statistics and its Interface, vol. 2, no. 3, pp. 349-360, 2009.

J. T. Hancock and T. M. Khoshgoftaar, "CatBoost for big data: an interdisciplinary review," Journal of big data, vol. 7, no. 1, p. 94, 2020, doi: 10.1186/s40537-020-00369-8.

S. Shastri, P. Kour, S. Kumar, K. Singh, and V. Mansotra, "GBoost: A novel Grading-AdaBoost ensemble approach for automatic identification of erythemato-squamous disease," International Journal of Information Technology, vol. 13, pp. 959-971, 2021, doi: 10.1007/s41870-020-00589-4.

D. T. Nguyen, J. K. Kang, T. D. Pham, G. Batchuluun, and K. R. Park, "Ultrasound image-based diagnosis of malignant thyroid nodule using artificial intelligence," Sensors, vol. 20, no. 7, p. 1822, 2020, doi: 10.3390/s20071822.

H. Ye et al., "An intelligent platform for ultrasound diagnosis of thyroid nodules," Scientific Reports, vol. 10, no. 1, p. 13223, 2020, doi: 10.1038/s41598-020-70159-y.

H. Zhou et al., "Differential diagnosis of benign and malignant thyroid nodules using deep learning radiomics of thyroid ultrasound images," European Journal of Radiology, vol. 127, p. 108992, 2020, doi: 10.1016/j.ejrad.2020.108992.

S. Tsantis, N. Dimitropoulos, D. Cavouras, and G. Nikiforidis, "Morphological and wavelet features towards sonographic thyroid nodules evaluation," Computerized Medical Imaging and Graphics, vol. 33, no. 2, pp. 91-99, 2009, doi: 10.1016/j.compmedimag.2008.10.010.

Y.-y. Guo et al., "Machine learning for identifying benign and malignant of thyroid tumors: A retrospective study of 2,423 patients," Frontiers in Public Health, vol. 10, p. 960740, 2022, doi: 10.3389/fpubh.2022.960740.

S. Borzooei, G. Briganti, M. Golparian, J. R. Lechien, and A. Tarokhian, "Machine learning for risk stratification of thyroid cancer patients: a 15-year cohort study," European Archives of Oto-Rhino-Laryngology, vol. 281, no. 4, pp. 2095-2104, 2024, doi: 10.1007/s00405-023-08299-w.

Downloads

Published

2025-07-25

How to Cite

Saritas, M. M., Yildiz, M. B., Cengel, T. A., & Koklu, M. (2025). Differentiated Thyroid Cancer Recurrence Prediction Using Boosting Algorithms. Jurnal Komputer Teknologi Informasi Sistem Informasi (JUKTISI), 4(2), 663–676. https://doi.org/10.62712/juktisi.v4i2.490