Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries
Accurate book genre classification is essential for library organization, information retrieval, and personalized recommendations. Traditional classification methods, often reliant on manual categorization and metadata-based approaches, struggle with the complexities of hybrid genres and evolving li...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
PeerJ Inc.
2025-06-01
|
| Series: | PeerJ Computer Science |
| Subjects: | |
| Online Access: | https://peerj.com/articles/cs-2934.pdf |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849470550525607936 |
|---|---|
| author | Xinting Yang Zehua Zhang |
| author_facet | Xinting Yang Zehua Zhang |
| author_sort | Xinting Yang |
| collection | DOAJ |
| description | Accurate book genre classification is essential for library organization, information retrieval, and personalized recommendations. Traditional classification methods, often reliant on manual categorization and metadata-based approaches, struggle with the complexities of hybrid genres and evolving literary trends. To address these limitations, this study proposes a hybrid deep learning model that integrates visual and textual features for enhanced genre classification. Specifically, we employ InceptionV3, an advanced convolutional neural network architecture, to extract visual features from book cover images and bidirectional encoder representations from transformers (BERT) to analyze textual data from book titles. A scaled dot-product attention mechanism is used to effectively fuse these multimodal features, dynamically weighting their contributions based on contextual relevance. Experimental results on the BookCover30 dataset demonstrate that our proposed model outperforms baseline approaches, achieving a balanced accuracy of 0.7951 and an F1-score of 0.7920, surpassing both standalone image- and text-based classifiers. This study highlights the potential of deep learning in improving automated genre classification, offering a scalable and adaptable solution for libraries and digital platforms. Future research may focus on expanding dataset diversity, optimizing computational efficiency, and addressing biases in classification models. |
| format | Article |
| id | doaj-art-ff9d50acfba04adc94dd05b377a684ea |
| institution | Kabale University |
| issn | 2376-5992 |
| language | English |
| publishDate | 2025-06-01 |
| publisher | PeerJ Inc. |
| record_format | Article |
| series | PeerJ Computer Science |
| spelling | doaj-art-ff9d50acfba04adc94dd05b377a684ea2025-08-20T03:25:07ZengPeerJ Inc.PeerJ Computer Science2376-59922025-06-0111e293410.7717/peerj-cs.2934Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for librariesXinting Yang0Zehua Zhang1Library, Lanzhou University, Lanzhou, Gansu Province, ChinaDepartment of Physics Science and Technology, Lanzhou University, Lanzhou, Gansu Province, ChinaAccurate book genre classification is essential for library organization, information retrieval, and personalized recommendations. Traditional classification methods, often reliant on manual categorization and metadata-based approaches, struggle with the complexities of hybrid genres and evolving literary trends. To address these limitations, this study proposes a hybrid deep learning model that integrates visual and textual features for enhanced genre classification. Specifically, we employ InceptionV3, an advanced convolutional neural network architecture, to extract visual features from book cover images and bidirectional encoder representations from transformers (BERT) to analyze textual data from book titles. A scaled dot-product attention mechanism is used to effectively fuse these multimodal features, dynamically weighting their contributions based on contextual relevance. Experimental results on the BookCover30 dataset demonstrate that our proposed model outperforms baseline approaches, achieving a balanced accuracy of 0.7951 and an F1-score of 0.7920, surpassing both standalone image- and text-based classifiers. This study highlights the potential of deep learning in improving automated genre classification, offering a scalable and adaptable solution for libraries and digital platforms. Future research may focus on expanding dataset diversity, optimizing computational efficiency, and addressing biases in classification models.https://peerj.com/articles/cs-2934.pdfBook genre classificationLibraryDeep learningBERTInceptionV3 |
| spellingShingle | Xinting Yang Zehua Zhang Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries PeerJ Computer Science Book genre classification Library Deep learning BERT InceptionV3 |
| title | Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries |
| title_full | Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries |
| title_fullStr | Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries |
| title_full_unstemmed | Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries |
| title_short | Enhancing book genre classification with BERT and InceptionV3: a deep learning approach for libraries |
| title_sort | enhancing book genre classification with bert and inceptionv3 a deep learning approach for libraries |
| topic | Book genre classification Library Deep learning BERT InceptionV3 |
| url | https://peerj.com/articles/cs-2934.pdf |
| work_keys_str_mv | AT xintingyang enhancingbookgenreclassificationwithbertandinceptionv3adeeplearningapproachforlibraries AT zehuazhang enhancingbookgenreclassificationwithbertandinceptionv3adeeplearningapproachforlibraries |