High Impact Factor : 4.396 icon | Submit Manuscript Online icon |

Automating Ledger Digitization: Performance Evaluation of On-Device OCR Models for Recognizing Handwritten Financial Scripts

Author(s):

Sanskar Sanjay Dikondwar , PES Modern College of Engineering, Pune; Dr. Swati Ghule, PES Modern College of Engineering, Pune

Keywords:

OCR, Handwritten Text Recognition, Ledger Digitization, On-Device Machine Learning, FinTech, Financial Scripts, CER, WER, Bharat Bachat, Indic Scripts

Abstract

The digitization of handwritten financial records, such as ledgers, cash books, and account registers, represents a significant challenge in the domain of intelligent document processing and financial technology. Traditional Optical Character Recognition (OCR) engines, while effective for printed text, frequently fail to accurately interpret cursive, regional-language, or domain-specific handwritten financial scripts. This research investigates and benchmarks the performance of three on-device OCR models — Google ML Kit Text Recognition v2, Apple Vision Framework, and Tesseract OCR v5 — for recognizing handwritten financial text under real-world conditions. A curated dataset of 1,200 handwritten ledger images sourced from small and medium enterprises across India, encompassing Hindi, English, and mixed-script entries, is used for evaluation. Performance is assessed using Character Error Rate (CER), Word Error Rate (WER), and token-level F1-Score. Results demonstrate that fine-tuned Google ML Kit achieves the best CER of 6.3%, with further improvement to 4.5% after applying a post-processing language correction layer. The findings carry direct implications for FinTech platforms such as the Bharat Bachat application that seek to automate bookkeeping for self-employed individuals in emerging markets. A hybrid four-stage pipeline combining on-device OCR with domain-specific natural language correction is proposed for practical mobile deployment.

Other Details

Paper ID: IJSRDV14I30125
Published in: Volume : 14, Issue : 3
Publication Date: 01/06/2026
Page(s): 192-195

Article Preview

Download Article