Curved Text Line Rectification via Bresenham’s Algorithm and Generalized Additive Models

This paper presents a methodology for rectifying curved text lines, a crucial process in optical character recognition (OCR) and computer vision. Utilizing generalized additive models (GAMs), the proposed method accurately estimates text curvature and rectifies it into a straight format for improved...

Full description

Saved in:
Bibliographic Details
Main Authors: Thomas Stogiannopoulos, Ilias Theodorakopoulos
Format: Article
Language:English
Published: MDPI AG 2024-10-01
Series:Signals
Subjects:
Online Access:https://www.mdpi.com/2624-6120/5/4/39
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:This paper presents a methodology for rectifying curved text lines, a crucial process in optical character recognition (OCR) and computer vision. Utilizing generalized additive models (GAMs), the proposed method accurately estimates text curvature and rectifies it into a straight format for improved text recognition. The process includes image binarization techniques like Otsu’s thresholding, morphological operations, curve estimation, and the Bresenham line drawing algorithm. The results show significant improvements in OCR accuracy among different challenging distortion scenarios. The implementation, written in Python, demonstrates the potential for enhancing text alignment and rectification in scanned text line images utilizing a flexible, robust, and customizable framework.
ISSN:2624-6120