Which OCR toolset is good and why: A comparative study

Bibliographic Details
Main Authors: Pooja Jain, Kavita Taneja, Harmunish Taneja
Format: Article
Language: English
Published: Elsevier 2021-04-01
Series: Kuwait Journal of Science
Online Access: https://journalskuwait.org/kjs/index.php/KJS/article/view/9589
Description
Summary: Optical Character Recognition (OCR) is an active research area in many scientific disciplines, including pattern recognition, natural language processing (NLP), computer vision, biomedical informatics, machine learning, and artificial intelligence. The technology extracts text in editable formats (MS Word/Excel, text files, etc.) from PDF files, scanned or handwritten documents, and images (photographs, advertisements, etc.) for further processing. It has been used in many real-world applications, including banking, education, insurance, finance, healthcare, and keyword-based search in documents. Many OCR toolsets are available in various categories, including open-source, proprietary, and online services. This research paper provides a comparative study of various OCR toolsets across a variety of parameters.
ISSN: 2307-4108, 2307-4116