Leakage-Aware Benchmarking of Lightweight Models for Robust Handwritten Hanacaraka Character Recognition
DOI:
https://doi.org/10.62712/juktisi.v5i1.1039Keywords:
Hanacaraka, Javanese script, handwritten recognition, leakage-aware evaluation, lightweight modelsAbstract
Handwritten Hanacaraka character recognition is important for preserving Javanese script, but reported results can be difficult to compare because datasets, preprocessing procedures, and train-test separation protocols vary across studies. This study presents a leakage-aware benchmark of lightweight models on a public handwritten Hanacaraka dataset containing 20 basic character classes. A data audit removed 17 unreadable image files and retained 1,562 valid images. Two experimental settings were evaluated: a perceptual-hash grouped split for leakage-aware testing and a random-stratified split as an optimistic upper-bound scenario. The leakage-aware benchmark compared HOG with SVM, HOG with Random Forest, MobileNetV2 head-only training, fine-tuned MobileNetV2, and a confusion-aware MobileNetV2 variant. Fine-tuned MobileNetV2 achieved the best leakage-aware result with 53.82% accuracy and 49.59% macro-F1, while robustness testing under image distortions produced 47.85% accuracy and 44.53% macro-F1. In the optimistic random-stratified experiment, an ensemble of EfficientNetB0 and MobileNetV2 with test-time augmentation reached 74.11% accuracy and 74.24% macro-F1. The results indicate that stricter evaluation substantially lowers performance and that visually similar classes remain difficult. Therefore, future Hanacaraka recognition work should report leakage control, robustness, and confusion analysis, not only clean-set accuracy.
Downloads
References
C. K. Dewa, A. L. Fadhilah, and Afiahayati, "Convolutional neural networks for handwritten Javanese character recognition," IJCCS (Indonesian Journal of Computing and Cybernetics Systems), vol. 12, no. 1, pp. 83-94, 2018, doi: 10.22146/ijccs.31144.
M. A. Rasyidi, T. Bariyah, Y. I. Riskajaya, and A. D. Septyani, "Classification of handwritten Javanese script using random forest algorithm," Bulletin of Electrical Engineering and Informatics, vol. 10, no. 3, pp. 1308-1315, 2021, doi: 10.11591/eei.v10i3.3036.
F. T. Anggraeny, Y. V. Via, and R. Mumpuni, "Image preprocessing analysis in handwritten Javanese character recognition," Bulletin of Electrical Engineering and Informatics, vol. 12, no. 2, pp. 860-867, 2023, doi: 10.11591/eei.v12i2.4172.
A. Susanto, I. U. W. Mulyono, C. A. Sari, E. H. Rachmawanto, D. R. I. M. Setiadi, and M. K. Sarker, "Handwritten Javanese script recognition method based 12-layers deep convolutional neural network and data augmentation," IAES International Journal of Artificial Intelligence, vol. 12, no. 3, pp. 1448-1458, 2023, doi: 10.11591/ijai.v12.i3.pp1448-1458.
A. Susanto, I. U. W. Mulyono, C. A. Sari, E. H. Rachmawanto, D. R. I. M. Setiadi, and M. K. Sarker, "Improved Javanese script recognition using custom model of convolution neural network," International Journal of Electrical and Computer Engineering, vol. 13, no. 6, pp. 6629-6636, 2023, doi: 10.11591/ijece.v13i6.pp6629-6636.
E. D. B. Sudewo, M. K. Biddinika, and A. Fadlil, "DenseNet architecture for efficient and accurate recognition of Javanese script Hanacaraka character," MATRIK: Jurnal Manajemen, Teknik Informatika dan Rekayasa Komputer, vol. 23, no. 2, pp. 453-464, 2024, doi: 10.30812/matrik.v23i2.3855.
E. D. B. Sudewo, M. K. Biddinika, and A. Fadlil, "Javanese script Hanacaraka character prediction with ResNet-18 architecture," JURTEKSI (Jurnal Teknologi dan Sistem Informasi), vol. 10, no. 2, pp. 363-370, 2024, doi: 10.33330/jurteksi.v10i2.3017.
M. F. Naufal, J. Siswantoro, and J. T. Soebroto, "Transliterating Javanese script images to Roman script using convolutional neural network with transfer learning," JOIV: International Journal on Informatics Visualization, vol. 8, no. 3, pp. 1460-1468, 2024, doi: 10.62527/joiv.8.3.2566.
Y. Harjoseputro, Y. D. Handarkho, and H. T. R. Adie, "The Javanese letters classifier with mobile client-server architecture and convolution neural network method," International Journal of Interactive Mobile Technologies, vol. 13, no. 12, pp. 67-80, 2019, doi: 10.3991/ijim.v13i12.11492.
A. Susanto, C. A. Sari, E. H. Rachmawanto, I. U. W. Mulyono, and N. M. Yaacob, "A comparative study of Javanese script classification with GoogleNet, DenseNet, ResNet, VGG16 and VGG19," Scientific Journal of Informatics, vol. 11, no. 1, pp. 31-40, 2024, doi: 10.15294/sji.v11i1.47305.
M. Sandler, A. Howard, M. Zhu, A. Zhmoginov, and L.-C. Chen, "MobileNetV2: Inverted residuals and linear bottlenecks," in Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2018, pp. 4510-4520, doi: 10.1109/CVPR.2018.00474.
M. Tan and Q. V. Le, "EfficientNet: Rethinking model scaling for convolutional neural networks," in Proc. 36th International Conference on Machine Learning, 2019, pp. 6105-6114, doi: 10.48550/arXiv.1905.11946.
T.-Y. Lin, P. Goyal, R. Girshick, K. He, and P. Dollár, "Focal loss for dense object detection," in Proc. IEEE International Conference on Computer Vision, 2017, pp. 2999-3007, doi: 10.1109/ICCV.2017.324.
N. Dalal and B. Triggs, "Histograms of oriented gradients for human detection," in Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005, pp. 886-893, doi: 10.1109/CVPR.2005.177.
C. Cortes and V. Vapnik, "Support-vector networks," Machine Learning, vol. 20, no. 3, pp. 273-297, 1995, doi: 10.1007/BF00994018.
L. Breiman, "Random forests," Machine Learning, vol. 45, no. 1, pp. 5-32, 2001, doi: 10.1023/A:1010933404324.
F. Pedregosa et al., "Scikit-learn: Machine learning in Python," Journal of Machine Learning Research, vol. 12, pp. 2825-2830, 2011, doi: 10.5555/1953048.2078195.
R. P. Nugroho, "Aksara Jawa / Hanacaraka Dataset," Kaggle, accessed May 13, 2026. [Online]. Available: https://www.kaggle.com/datasets/vzrenggamani/hanacaraka. DOI: N/A (online dataset).
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Ferian Fauzi Abdulloh, Sharazita Dyah Anggita, Ikmah, Ali Mustopa, Majid Rahardi, Devi Wulandari

This work is licensed under a Creative Commons Attribution 4.0 International License.















