全部版块 我的主页
论坛 提问 悬赏 求职 新闻 读书 功能一区 经管文库(原现金交易版)
142 0
2024-07-20
Taming Transformers for High-Resolution Image Synthesis.pdf Sequential Modeling Enables Scalable Learning for Large Vision Models.pdf
NExT-GPT.Any-to-Any Multimodal LLM.pdf Visual Instruction Tuning.pdf
PROGRESS MEASURES FOR GROKKING VIA MECHANISTIC INTERPRETABILITY.pdf
MiniGPT-v2 Large Language Model As a Unified Interface for Vision-Language Multi-task Learning.pdf
Swin Transformer Hierarchical Vision Transformer using Shifted Windows.pdf
IMAGEBIND One Embedding Space To Bind Them All.pdf
CoDi-2 In-Context,Interleaved,and Interactive Any-to-Any Generation.pdf
Meta-Transformer.A Unified Framework for Multimodal Learning.pdf
Neural Discrete Representation Learning.pdf
Learning Transferable Visual Models From Natural Language Supervision.pdf
MINIGPT-4 ENHANCING VISION-LANGUAGE UNDERSTANDING WITH ADVANCED LARGE LANGUAGE MODELS.pdf
AN IMAGE IS WORTH 16 16 WORDS TRANSFORMERS FOR IMAGE RECOGNITION AT SCALE.pdf
InstructBLIP Towards General-purpose Vision-Language Models with Instruction Tuning.pdf
BLIP-2 Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models.pdf
Improved Baselines with Visual Instruction Tuning.pdf
阿里巴巴「AI剧组」--大模型驱动的影视短视频智能生产实践.pdf

多模态.part1.rar
大小:(98 MB)

只需: RMB 10元  马上下载

多模态.part2.rar
大小:(25.73 MB)

只需: RMB 10元  马上下载




二维码

扫码加我 拉你入群

请注明:姓名-公司-职位

以便审核进群资格,未注明则拒绝

栏目导航
热门文章
推荐文章

说点什么

分享

扫码加好友,拉您进群
各岗位、行业、专业交流群