TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment
Jianwei Yang Yonatan Bisk Jianfeng Gao
Microsoft Research Carnegie Mellon University Microsoft Research
jianwyan@microsoft.com ybisk@cs.cmu.edu jfgao@microsoft.com
Abstract ′ hard negatives
Contrastive Loss 3
Contr ...
附件列表