Zhao, X., Xu, M., Silamu, W., & Li, Y. CLIP-Llama: A New Approach for Scene Text Recognition with a Pre-Trained Vision-Language Model and a Pre-Trained Language Model. MDPI AG.
Chicago Style (17th ed.) CitationZhao, Xiaoqing, Miaomiao Xu, Wushour Silamu, and Yanbing Li. CLIP-Llama: A New Approach for Scene Text Recognition with a Pre-Trained Vision-Language Model and a Pre-Trained Language Model. MDPI AG.
MLA (9th ed.) CitationZhao, Xiaoqing, et al. CLIP-Llama: A New Approach for Scene Text Recognition with a Pre-Trained Vision-Language Model and a Pre-Trained Language Model. MDPI AG.
Warning: These citations may not always be 100% accurate.