2020   • 2019   • 2018   • 2017 and before   •
Date & Time Speakers Topic & Slides

May 2020-05-06

Jinyu Chen visual commonsense R-CNN

May 2020-05-06

Wentao Xie AI CITY CHALLENGE

May 2020-05-06

Chen Gao Iterative Context-Aware Graph Inference for Visual Dialog

May 2020-05-06

Zongheng Tang Unbiased_Scene_Graph_Generation_by_Biased_Learning

May 2020-05-06

Renda Bao Deconfounded Image Captioning

May 2020-05-06

Zitian Wang Two Causal Principles for Improving Visual Dialog

April 2020-04-28

Zhaokai Wang Pointer Networks

April 2020-04-19

Jinyu Chen interbert

April 2020-04-19

Wentao Jiang ActBERT

April 2020-04-19

Tianrui Hui PixelBERT

April 2020-04-19

Zongheng Tang VLBERT

March 2020-03-29

Zongheng Tang Recursive Visual Attention in Visual Dialog

March 2020-03-29

Zitian Wang unbiased scene graph generation from biased training

March 2020-03-25

Lejian Ren CS224n--lecture15

March 2020-03-25

Tianyu YU Dense-Caption

March 2020-03-25

Yier Shu Siamese Box Adaptive Network for Visual Tracking

March 2020-03-25

Yulu Gao SOLOv2

March 2020-03-25

Renda Bao In Defense of Grid Features for Visual Question Answering

March 2020-03-25

Yixuan Qiao TVQA+

March 2020-03-25

Wentao Xie VIDVRD

March 2020-03-25

Zongheng Tang Visual Grounding in Video for UnsupervisedWord Translation

March 2020-03-22

Guanghui Ren ABCNet

March 2020-03-15

Zongheng Tang Sentence Specified Dynamic Video Thumbnail Gener

March 2020-03-15

Wentao Jiang StackGAN-StackGAN++

March 2020-03-15

Renda Bao Survey of VQA

March 2020-03-15

Jinyu Chen REVIE

March 2020-03-15

Chen Gao Virtually Trying on New Clothing with Arbitrary Poses

March 2020-03-15

Guanghui Ren automl_zero

March 2020-03-15

Shaofei Huang temporal segment network

March 2020-03-15

Zongheng Tang Exploiting Temporal Relationships in Video Moment Localization with Natural Language

March 2020-03-08

Wentao Xie A Multigrid Method for Efficiently TrainA Multigrid Method for Efficiently Training Video Modelsing Video Models

March 2020-03-08

Yue Liao End-to-End Learning of Visual Representations from Uncurated Instructional Videos

March 2020-03-01

Jinyu Chen Vision-and-Language Navigation V2

February 2020-02-25

Zhaokai Wang Report on TextVQA Challenge

February 2020-02-25

Wentao Xie TSM: Temporal Shift Module for Efficient Video Understanding

February 2020-02-25

Zongheng Tang STEP: Spatio-Temporal Progressive Learning for Video Action Detection

January 2020-01-12

Wentao Jiang Embodied Question Answering

January 2020-01-12

Zitian Wang Audio-Visual Embodied Navigation

January 2020-01-12

Chen Gao Gibson Env Real-World Perception for Embodied Agents

January 2020-01-12

Jinyu Chen Learning to Navigate Using Mid-Level Visual Priors

January 2020-01-12

Defa Zhu Reinforced Cross-Modal Matching and Self-Supervised Imitation Learning for Vision-Language Navigation

January 2020-01-12

Tianyu Yu Phrase Grouding

January 2020-01-10

Renda Bao Licheng Yu Phd thesis

January 2020-01-10

Lejian Ren Bayesian Relational Memory for Semantic Visual Navigation

January 2020-01-10

Renda Bao From Two Graphs to N Questions A VQA Dataset for Compositional Reasoning on Vision and Commonsense

January 2020-01-05

Guanghui Ren cs224_seq2seq_att

January 2020-01-05

Yixuan Qiao STEP´╝Üspatial-temporal learning for video action detection

January 2020-01-05

Renda Bao From Two Graphs to N Questions A VQA Dataset for Compositional Reasoning on Vision and Commonsense

January 2020-01-05

Shaofei Huang self-supervised lvn

January 2020-01-05

Jinyu Chen Towards Learning a Generic Agent for.pptx

January 2020-01-05

Guanghui Ren Situational Fusion of Visual Representation for Visual Navigation

January 2020-01-05

Zongheng Tang Talk2Nav

January 2020-01-05

Tianrui Hui VLN

January 2020-01-03

Renda Bao up-down VQA

January 2020-01-03

Renda Bao Visual Question Answering as Reading Comprehension

January 2020-01-02

Jinyu Chen YOLACT++

January 2020-01-02

Zongheng Tang anchor free recent work report