Architectural Synergies in Bi-Modal and Bi-Contrastive Learning

The integration of visual and linguistic elements within artificial intelligence research is increasingly emphasized, spurred by advancements in pre-trained model technologies. Traditionally, such models have been developed independently, using methods like contrastive learning and image-captioning...

Full description

Saved in:
Bibliographic Details
Main Authors: Yujia Gu, Brian Liu, Tianlong Zhang, Xinye Sha, Shiyong Chen
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10670389/
Tags: Add Tag
No Tags, Be the first to tag this record!