FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR

Chinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to syste...

Full description

Saved in:
Bibliographic Details
Main Authors: Xinyue Tao, Yueyue Han, Yakai Jin, Yunzhi Wu
Format: Article
Language:English
Published: MDPI AG 2025-07-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/13/15/2372
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849405994423025664
author Xinyue Tao
Yueyue Han
Yakai Jin
Yunzhi Wu
author_facet Xinyue Tao
Yueyue Han
Yakai Jin
Yunzhi Wu
author_sort Xinyue Tao
collection DOAJ
description Chinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to systematic recognition errors that limit practical deployment accuracy. This study develops FLIP (Feedback Learning-based Intelligent Plugin), a lightweight post-processing plugin designed to improve Chinese OCR accuracy across different systems without external dependencies. The plugin operates through three core components as follows: UTF-8 encoding-based output parsing that converts OCR results into mathematical representations, error correction using information entropy and weighted similarity measures to identify and fix character-level errors, and adaptive feedback learning that optimizes parameters through user interactions. The approach functions entirely through mathematical calculations at the character encoding level, ensuring universal compatibility with existing OCR systems while effectively handling complex Chinese character similarities. The plugin’s modular design enables seamless integration without requiring modifications to existing OCR algorithms, while its feedback mechanism adapts to domain-specific terminology and user preferences. Experimental evaluation on 10,000 Chinese document images using four state-of-the-art OCR models demonstrates consistent improvements across all tested systems, with precision gains ranging from 1.17% to 10.37% and overall Chinese character recognition accuracy exceeding 98%. The best performing model achieved 99.42% precision, with ablation studies confirming that feedback learning contributes additional improvements from 0.45% to 4.66% across different OCR architectures.
format Article
id doaj-art-c053c2f04c9d41c09cf049fbb99add1f
institution Kabale University
issn 2227-7390
language English
publishDate 2025-07-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj-art-c053c2f04c9d41c09cf049fbb99add1f2025-08-20T03:36:31ZengMDPI AGMathematics2227-73902025-07-011315237210.3390/math13152372FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCRXinyue Tao0Yueyue Han1Yakai Jin2Yunzhi Wu3School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaChinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to systematic recognition errors that limit practical deployment accuracy. This study develops FLIP (Feedback Learning-based Intelligent Plugin), a lightweight post-processing plugin designed to improve Chinese OCR accuracy across different systems without external dependencies. The plugin operates through three core components as follows: UTF-8 encoding-based output parsing that converts OCR results into mathematical representations, error correction using information entropy and weighted similarity measures to identify and fix character-level errors, and adaptive feedback learning that optimizes parameters through user interactions. The approach functions entirely through mathematical calculations at the character encoding level, ensuring universal compatibility with existing OCR systems while effectively handling complex Chinese character similarities. The plugin’s modular design enables seamless integration without requiring modifications to existing OCR algorithms, while its feedback mechanism adapts to domain-specific terminology and user preferences. Experimental evaluation on 10,000 Chinese document images using four state-of-the-art OCR models demonstrates consistent improvements across all tested systems, with precision gains ranging from 1.17% to 10.37% and overall Chinese character recognition accuracy exceeding 98%. The best performing model achieved 99.42% precision, with ablation studies confirming that feedback learning contributes additional improvements from 0.45% to 4.66% across different OCR architectures.https://www.mdpi.com/2227-7390/13/15/2372optical character recognitionpost-processingtext recognitionmachine learning
spellingShingle Xinyue Tao
Yueyue Han
Yakai Jin
Yunzhi Wu
FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
Mathematics
optical character recognition
post-processing
text recognition
machine learning
title FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
title_full FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
title_fullStr FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
title_full_unstemmed FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
title_short FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
title_sort flip a novel feedback learning based intelligent plugin towards accuracy enhancement of chinese ocr
topic optical character recognition
post-processing
text recognition
machine learning
url https://www.mdpi.com/2227-7390/13/15/2372
work_keys_str_mv AT xinyuetao flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr
AT yueyuehan flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr
AT yakaijin flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr
AT yunzhiwu flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr