Image captioning deep learning model using ResNet50 encoder and hybrid LSTM–GRU decoder optimized with beam search

Image captioning is a fascinating and fast-evolving research project that integrates two domains: Natural Language Processing and Computer Vision. Creating appropriate captions is a difficult task due to the many activities portrayed in the backdrop image. To mitigate these drawbacks, the envisioned...

Full description

Saved in:
Bibliographic Details
Main Authors: P. V. Kavitha, V. Karpagam
Format: Article
Language:English
Published: Taylor & Francis Group 2025-07-01
Series:Automatika
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/00051144.2025.2485695
Tags: Add Tag
No Tags, Be the first to tag this record!