A Survey on Multimodal Large Language Models
Shukang Yin1* , Chaoyou Fu2∗‡† , Sirui Zhao1∗‡, Ke Li2, Xing Sun2, Tong Xu1, Enhong Chen1‡
1School of CST., USTC & State Key Laboratory of Cognitive Intelligence
更多
2Tencent YouTu Lab
{xjtupanda,sirui}@mail.ustc.edu.cn, {tongxu,cheneh}@ustc.edu.cn
{bradyfu24}@gmail.com, {tristanli,winfredsun}@tencent.com
收起
文档评论