FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR
Chinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to syste...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-07-01
|
| Series: | Mathematics |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2227-7390/13/15/2372 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849405994423025664 |
|---|---|
| author | Xinyue Tao Yueyue Han Yakai Jin Yunzhi Wu |
| author_facet | Xinyue Tao Yueyue Han Yakai Jin Yunzhi Wu |
| author_sort | Xinyue Tao |
| collection | DOAJ |
| description | Chinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to systematic recognition errors that limit practical deployment accuracy. This study develops FLIP (Feedback Learning-based Intelligent Plugin), a lightweight post-processing plugin designed to improve Chinese OCR accuracy across different systems without external dependencies. The plugin operates through three core components as follows: UTF-8 encoding-based output parsing that converts OCR results into mathematical representations, error correction using information entropy and weighted similarity measures to identify and fix character-level errors, and adaptive feedback learning that optimizes parameters through user interactions. The approach functions entirely through mathematical calculations at the character encoding level, ensuring universal compatibility with existing OCR systems while effectively handling complex Chinese character similarities. The plugin’s modular design enables seamless integration without requiring modifications to existing OCR algorithms, while its feedback mechanism adapts to domain-specific terminology and user preferences. Experimental evaluation on 10,000 Chinese document images using four state-of-the-art OCR models demonstrates consistent improvements across all tested systems, with precision gains ranging from 1.17% to 10.37% and overall Chinese character recognition accuracy exceeding 98%. The best performing model achieved 99.42% precision, with ablation studies confirming that feedback learning contributes additional improvements from 0.45% to 4.66% across different OCR architectures. |
| format | Article |
| id | doaj-art-c053c2f04c9d41c09cf049fbb99add1f |
| institution | Kabale University |
| issn | 2227-7390 |
| language | English |
| publishDate | 2025-07-01 |
| publisher | MDPI AG |
| record_format | Article |
| series | Mathematics |
| spelling | doaj-art-c053c2f04c9d41c09cf049fbb99add1f2025-08-20T03:36:31ZengMDPI AGMathematics2227-73902025-07-011315237210.3390/math13152372FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCRXinyue Tao0Yueyue Han1Yakai Jin2Yunzhi Wu3School of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaSchool of Information and Artificial Intelligence, Anhui Agricultural University, Hefei 230036, ChinaChinese Optical Character Recognition (OCR) technology is essential for digital transformation in Chinese regions, enabling automated document processing across various applications. However, Chinese OCR systems struggle with visually similar characters, where subtle stroke differences lead to systematic recognition errors that limit practical deployment accuracy. This study develops FLIP (Feedback Learning-based Intelligent Plugin), a lightweight post-processing plugin designed to improve Chinese OCR accuracy across different systems without external dependencies. The plugin operates through three core components as follows: UTF-8 encoding-based output parsing that converts OCR results into mathematical representations, error correction using information entropy and weighted similarity measures to identify and fix character-level errors, and adaptive feedback learning that optimizes parameters through user interactions. The approach functions entirely through mathematical calculations at the character encoding level, ensuring universal compatibility with existing OCR systems while effectively handling complex Chinese character similarities. The plugin’s modular design enables seamless integration without requiring modifications to existing OCR algorithms, while its feedback mechanism adapts to domain-specific terminology and user preferences. Experimental evaluation on 10,000 Chinese document images using four state-of-the-art OCR models demonstrates consistent improvements across all tested systems, with precision gains ranging from 1.17% to 10.37% and overall Chinese character recognition accuracy exceeding 98%. The best performing model achieved 99.42% precision, with ablation studies confirming that feedback learning contributes additional improvements from 0.45% to 4.66% across different OCR architectures.https://www.mdpi.com/2227-7390/13/15/2372optical character recognitionpost-processingtext recognitionmachine learning |
| spellingShingle | Xinyue Tao Yueyue Han Yakai Jin Yunzhi Wu FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR Mathematics optical character recognition post-processing text recognition machine learning |
| title | FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR |
| title_full | FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR |
| title_fullStr | FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR |
| title_full_unstemmed | FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR |
| title_short | FLIP: A Novel Feedback Learning-Based Intelligent Plugin Towards Accuracy Enhancement of Chinese OCR |
| title_sort | flip a novel feedback learning based intelligent plugin towards accuracy enhancement of chinese ocr |
| topic | optical character recognition post-processing text recognition machine learning |
| url | https://www.mdpi.com/2227-7390/13/15/2372 |
| work_keys_str_mv | AT xinyuetao flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr AT yueyuehan flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr AT yakaijin flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr AT yunzhiwu flipanovelfeedbacklearningbasedintelligentplugintowardsaccuracyenhancementofchineseocr |