Development of An Application Transforming Handwriting into Digital Form using CNN

Authors

DOI:

https://doi.org/10.61098/jarcis.v1i2.87

Keywords:

Computer vision, Convolutional Neural Network, Handwriting recognition, Handwriting Classification, ResNet50

Abstract

This study aims to develop an application to recognize and predict handwriting using a Convolutional Neural Network (CNN) with ResNet50 architecture. The software development life cycle (SDLC) is an incremental model with two increments. The first increment is used to build the model, and the second increment is used to build the user interface. The data used in this study is handwritten images of Latin uppercase, Latin lowercase, and Arabian numerals with 62 classes. The training data used English Handwritten Characters by Dhruvil Dave from Kaggle Dataset. Data was trained and validated using k-fold cross-validation with tenfold and ten epochs for each fold. The model has an accuracy, precision, recall, and f1-score of 66.33%, 73.4%, 66.2%, and 66%, respectively. The functional application can work as expected based on the black box testing. The developed application can predict handwriting with up to 50% accuracy.

Downloads

Download data is not yet available.

References

A. Prasojo, A. Hidayatno, and R. R. Isnanto, “Pengenalan Karakter Alfabet Menggunakan Jaringan Saraf Tiruan,” Universitas Diponegoro, 2011.

S. Namiki, K. Yokoyama, S. Yachida, T. Shibata, H. Miyano, and M. Ishikawa, “Online Object Recognition Using CNN-based Algorithm on High-speed Camera Imaging: Framework for fast and robust high-speed camera object recognition based on population data cleansing and data ensemble,” in 2020 25th International Conference on Pattern Recognition (ICPR), IEEE, Jan. 2021, pp. 2025–2032. doi: 10.1109/ICPR48806.2021.9413042.

P. Roy, S. Ghosh, and U. Pal, “A CNN Based Framework for Unistroke Numeral Recognition in Air-Writing,” in 2018 16th International Conference on Frontiers in Handwriting Recognition (ICFHR), IEEE, Aug. 2018, pp. 404–409. doi: 10.1109/ICFHR-2018.2018.00077.

M. Mukhopadhyay, A. Dey, R. N. Shaw, and A. Ghosh, “Facial emotion recognition based on Textural pattern and Convolutional Neural Network,” in 2021 IEEE 4th International Conference on Computing, Power and Communication Technologies (GUCON), IEEE, Sep. 2021, pp. 1–6. doi: 10.1109/GUCON50781.2021.9573860.

P. V Bhagyasree, A. James, and C. Saravanan, “A Proposed Framework for Recognition of Handwritten Cursive English Characters using DAG-CNN,” in 2019 1st International Conference on Innovations in Information and Communication Technology (ICIICT), IEEE, Apr. 2019, pp. 1–4. doi: 10.1109/ICIICT1.2019.8741412.

J. Butdee, W. Kongprawechnon, H. Nakahara, N. Chayopitak, C. Kingkan, and R. Pupadubsin, “Pattern Recognition of Partial Discharge Faults Using Convolutional Neural Network (CNN),” in 2023 8th International Conference on Control and Robotics Engineering (ICCRE), IEEE, Apr. 2023, pp. 61–66. doi: 10.1109/ICCRE57112.2023.10155616.

X. Wan, H. Song, L. Luo, Z. Li, G. Sheng, and X. Jiang, “Pattern Recognition of Partial Discharge Image Based on One-dimensional Convolutional Neural Network,” in 2018 Condition Monitoring and Diagnosis (CMD), IEEE, Sep. 2018, pp. 1–4. doi: 10.1109/CMD.2018.8535761.

D. S. Wita and D. Y. Liliana, “Klasifikasi Identitas Dengan Citra Telapak Tangan Menggunakan Convolutional Neural Network (CNN),” Jurnal Rekayasa Teknologi Informasi (JURTI), vol. 6, no. 1, p. 1, Jul. 2022, doi: 10.30872/jurti.v6i1.7100.

N. Kasim and G. S. Nugraha, “Pengenalan Pola Tulisan Tangan Aksara Arab Menggunakan Metode Convolution Neural Network,” Jurnal Teknologi Informasi, Komputer, dan Aplikasinya (JTIKA ), vol. 3, no. 1, pp. 85–95, Apr. 2021, doi: 10.29303/jtika.v3i1.136.

D. Fitriati, “PERBANDINGAN KINERJA CNN LeNet 5 DAN EXTREME LEARNING MACHINE PADA PENGENALAN CITRA TULISAN TANGAN ANGKA,” Jurnal Teknologi Terpadu, vol. 2, no. 1, Jul. 2016, doi: 10.54914/jtt.v2i1.45.

C. Umam and L. Handoko, “CONVOLUTIONAL NEURAL NETWORK (CNN) UNTUK IDENTIFKASI KARAKTER HIRAGANA,” PROSIDING SEMINAR NASIONAL LPPM UMP, vol. 2, no. 0, pp. 527–533, 2021, [Online]. Available: http://semnaslppm.ump.ac.id/index.php/semnaslppm/article/view/199

“CNN Architectures: LeNet, AlexNet, VGG, GoogLeNet, ResNet and more… | by Siddharth Das | Analytics Vidhya | Medium.” Accessed: Nov. 08, 2023. [Online]. Available: https://medium.com/analytics-vidhya/cnns-architectures-lenet-alexnet-vgg-googlenet-resnet-and-more-666091488df5

S. Li, J. Jiao, Y. Han, and T. Weissman, “Demystifying ResNet.” 2017.

R. Jayakanthan, A. H. Kumar, N. Sankarram, B. S. Charulatha, and A. Ramesh, “Handwritten tamil character recognition using ResNet,” International Journal of Research in Engineering, Science and Management, vol. 3, no. 3, pp. 133–137, 2020.

V. Atliha and D. Sesok, “Comparison of VGG and ResNet used as Encoders for Image Captioning,” in 2020 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream), IEEE, Apr. 2020, pp. 1–4. doi: 10.1109/eStream50540.2020.9108880.

A. Rachman, Andreansyah, and Rahmi, “Implementation of Incremental Models on Development of Web-Based Loan Cooperative Applications,” International Journal of Education, Science, Technology, and Engineering, vol. 3, no. 1, pp. 26–34, May 2020, doi: 10.36079/lamintang.ijeste-0301.105.

“English Handwritten Characters Dataset | Kaggle.” Accessed: Oct. 06, 2023. [Online]. Available: https://www.kaggle.com/discussions/general/221624

Y. Jung, “Multiple predicting K -fold cross-validation for model selection,” J Nonparametr Stat, vol. 30, no. 1, pp. 197–215, Jan. 2018, doi: 10.1080/10485252.2017.1404598.

“Incremental Model in SDLC: Use, Advantage & Disadvantage.” Accessed: Nov. 02, 2021. [Online]. Available: https://www.guru99.com/what-is-incremental-model-in-sdlc-advantages-disadvantages.html

“Incremental Model in SDLC: Use, Advantage & Disadvantage.” Accessed: Oct. 08, 2023. [Online]. Available: https://www.guru99.com/what-is-incremental-model-in-sdlc-advantages-disadvantages.html

V. Atliha and D. Sesok, “Comparison of VGG and ResNet used as Encoders for Image Captioning,” in 2020 IEEE Open Conference of Electrical, Electronic and Information Sciences (eStream), IEEE, Apr. 2020, pp. 1–4. doi: 10.1109/eStream50540.2020.9108880.

E. S. Marquez, J. S. Hare, and M. Niranjan, “Deep Cascade Learning,” IEEE Trans Neural Netw Learn Syst, vol. 29, no. 11, pp. 5475–5485, Nov. 2018, doi: 10.1109/TNNLS.2018.2805098.

“Vanishing Gradient Problem | What is Vanishing Gradient Problem?” Accessed: Oct. 10, 2023. [Online]. Available: https://www.mygreatlearning.com/blog/the-vanishing-gradient-problem/

K. He, X. Zhang, S. Ren, and J. Sun, “Deep Residual Learning for Image Recognition,” Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2016-December, pp. 770–778, Dec. 2015, doi: 10.1109/CVPR.2016.90.

O. Caelen, “A Bayesian interpretation of the confusion matrix,” Ann Math Artif Intell, vol. 81, no. 3–4, pp. 429–450, Dec. 2017, doi: 10.1007/S10472-017-9564-8/METRICS.

S. Nidhra, “Black Box and White Box Testing Techniques - A Literature Review,” International Journal of Embedded Systems and Applications, vol. 2, no. 2, pp. 29–50, Jun. 2012, doi: 10.5121/ijesa.2012.2204.

Downloads

Published

2023-12-29

How to Cite

Josulin, C., & Kurniawati, Y. E. (2023). Development of An Application Transforming Handwriting into Digital Form using CNN. Journal of Applied Research In Computer Science and Information Systems, 1(2), 86–99. https://doi.org/10.61098/jarcis.v1i2.87