One Picture and One Thousand Words: Toward integrated multimodal generative models

Thanks to independent advances in language and image generation, we could soon be in the position to have systems that communicate with us by combining language and images in their output, a skill that humans do not possess (we receive, but we do not produce images at high speed). This paper explore...

Full description

Saved in:
Bibliographic Details
Main Author: Roberto Zamparelli
Format: Article
Language:English
Published: Accademia University Press 2024-12-01
Series:IJCoL
Online Access:https://journals.openedition.org/ijcol/1432
Tags: Add Tag
No Tags, Be the first to tag this record!