Connective Cognition Network for Directional Visual Commonsense Reasoning | Aming Wu · Linchao Zhu · Yahong Han · Yi Yang |
Heterogeneous Graph Learning for Visual Commonsense Reasoning | Weijiang Yu · Jingwen Zhou · Weihao Yu · Xiaodan Liang · Nong Xiao |
Learning Dynamics of Attention: Human Prior for Interpretable Machine Reasoning | Wonjae Kim · Yoonho Lee |
RUBi: Reducing Unimodal Biases for Visual Question Answering | Remi Cadene · Corentin Dancette · Hedi Ben-younes · Matthieu Cord · Devi Parikh |
Self-Critical Reasoning for Robust Visual Question Answering | Jialin Wu · Raymond Mooney |
Variational Structured Semantic Inference for Diverse Image Captioning | Fuhai Chen · Rongrong Ji · Jiayi Ji · Xiaoshuai Sun · Baochang Zhang · Xuri Ge · Yongjian Wu · Feiyue Huang · Yan Wang |
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks | Jiasen Lu · Dhruv Batra · Devi Parikh · Stefan Lee |
Visual Concept-Metaconcept Learning | Chi Han · Jiayuan Mao · Chuang Gan · Josh Tenenbaum · Jiajun Wu |