Analysis of printed document identification based on Deep Learning

In this study, we investigate the effectiveness of ResNet, a deep neural network architecture, for a deep learning approach to address the problem of printed document identification. ResNet is known for its ability to handle the vanishing gradient problem and learn highly representative features. M...

Full description

Saved in:
Bibliographic Details
Main Authors: Dinh Thong Nguyen, Phu Quang Nguyen, Hoang Bao An Mai
Format: Article
Language:English
Published: Can Tho University Publisher 2023-10-01
Series:CTU Journal of Innovation and Sustainable Development
Subjects:
Online Access:http://web2010.thanhtoan/index.php/ctujs/article/view/705
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In this study, we investigate the effectiveness of ResNet, a deep neural network architecture, for a deep learning approach to address the problem of printed document identification. ResNet is known for its ability to handle the vanishing gradient problem and learn highly representative features. Multiple variations of ResNet have been applied, including ResNet50, ResNet101, and ResNet152, which provide the backbone architecture of our classification model and are trained on a comprehensive dataset of microscopic printed images containing some microscopic printing patterns from various source printers. We also incorporate Mix-up augmentation, a technique that generates virtual training samples by interpolating pairs of images and labels, to further enhance the performance and generalization capability of the model. The experimental results showed that ResNet101 and ResNet152 variants outperformed in accurately distinguishing printer sources based on microscopic printed patterns. We developed a mobile app to test the feasibility of our findings in practice. In conclusion, this study aims to lay the groundwork for creating a sufficiently pre-trained model with accurate performance of identification that can be deployed on mobile devices to detect the printed sources of documents.
ISSN:2588-1418
2815-6412