Vision-Language Transformer and Query Generation for Referring
Segmentation
Henghui Ding* Chang Liu* Suchen Wang Xudong Jiang
Nanyang Technological University, Singapore
{ding0093, liuc0058, wang.sc, exdjiang}@ntu.edu.sg
Vision-Guided
Abstract Attention Query Vectors
Input:
...
附件列表