Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights

Information extraction from financial document images is crucial in computer vision and NLP, as financial data often exists in image or PDF format, enabling organizations to analyze and make informed business decisions using OCR advancements. The table contents of financial document images are one o...

Full description

Saved in:

Bibliographic Details
Main Authors:	Iftakhar Ali Khandokar, Priya Deshpande
Format:	Article
Language:	English
Published:	IEEE 2025-01-01
Series:	IEEE Access
Subjects:	Computer vision deep learning information extraction transformer model
Online Access:	https://ieeexplore.ieee.org/document/10813363/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850274089285976064
author	Iftakhar Ali Khandokar Priya Deshpande
author_facet	Iftakhar Ali Khandokar Priya Deshpande
author_sort	Iftakhar Ali Khandokar
collection	DOAJ
description	Information extraction from financial document images is crucial in computer vision and NLP, as financial data often exists in image or PDF format, enabling organizations to analyze and make informed business decisions using OCR advancements. The table contents of financial document images are one of the prominent structures to confine important portions of data of the document and many Deep learning-based methods have been proposed to detect Table regions inside document images. The shortcomings of the current approach are that it is bounded within the detection of the table region and struggles in cases such as handling different layouts and preserving the relation among the different attributes of the table. Therefore, in this work, we proposed an end-to-end architecture to extract information from Financial table images while preserving the column row structures of the attributes within the table. We divided the task into four modules and generated synthesized data with different augmentation techniques to overcome data scarcity challenges and boost the performance of the pipeline modules. In terms of information extraction, the proposed method acquired 85% accuracy in the target invoice dataset.
format	Article
id	doaj-art-1e841ee628fc426a9d7ecab5ed4f28e3
institution	OA Journals
issn	2169-3536
language	English
publishDate	2025-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj-art-1e841ee628fc426a9d7ecab5ed4f28e32025-08-20T01:51:14ZengIEEEIEEE Access2169-35362025-01-0113177061772310.1109/ACCESS.2024.352214110813363Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial InsightsIftakhar Ali Khandokar0https://orcid.org/0000-0002-7354-5736Priya Deshpande1https://orcid.org/0000-0002-2631-5751Department of Electrical and Computer Engineering (EECE), Marquette University, Milwaukee, WI, USADepartment of Electrical and Computer Engineering (EECE), Marquette University, Milwaukee, WI, USAInformation extraction from financial document images is crucial in computer vision and NLP, as financial data often exists in image or PDF format, enabling organizations to analyze and make informed business decisions using OCR advancements. The table contents of financial document images are one of the prominent structures to confine important portions of data of the document and many Deep learning-based methods have been proposed to detect Table regions inside document images. The shortcomings of the current approach are that it is bounded within the detection of the table region and struggles in cases such as handling different layouts and preserving the relation among the different attributes of the table. Therefore, in this work, we proposed an end-to-end architecture to extract information from Financial table images while preserving the column row structures of the attributes within the table. We divided the task into four modules and generated synthesized data with different augmentation techniques to overcome data scarcity challenges and boost the performance of the pipeline modules. In terms of information extraction, the proposed method acquired 85% accuracy in the target invoice dataset.https://ieeexplore.ieee.org/document/10813363/Computer visiondeep learninginformation extractiontransformer model
spellingShingle	Iftakhar Ali Khandokar Priya Deshpande Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights IEEE Access Computer vision deep learning information extraction transformer model
title	Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights
title_full	Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights
title_fullStr	Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights
title_full_unstemmed	Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights
title_short	Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights
title_sort	computer vision based framework for data extraction from heterogeneous financial tables a comprehensive approach to unlocking financial insights
topic	Computer vision deep learning information extraction transformer model
url	https://ieeexplore.ieee.org/document/10813363/
work_keys_str_mv	AT iftakharalikhandokar computervisionbasedframeworkfordataextractionfromheterogeneousfinancialtablesacomprehensiveapproachtounlockingfinancialinsights AT priyadeshpande computervisionbasedframeworkfordataextractionfromheterogeneousfinancialtablesacomprehensiveapproachtounlockingfinancialinsights

Computer Vision-Based Framework for Data Extraction From Heterogeneous Financial Tables: A Comprehensive Approach to Unlocking Financial Insights

Similar Items