Optimizing the performance of a server-based classification for a large business document flow
The problem of categorizing documents in a large business document flow is considered. Textual and visual embeddings were used for classification. Textual embeddings were extracted using Tesseract OCR, and visual embeddings were generated with the Viola-Jones method. This paper descr...
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: | Belarusian National Technical University, 2023-02-01 |
Series: | Системный анализ и прикладная информатика |
Subjects: | |
Online Access: | https://sapi.bntu.by/jour/article/view/595 |