A Fast and Accurate One-Stage Approach to Visual Grounding
Presenter: Tianlang Chen Zhengyuan Yang Boqing Gong Liwei Wang Wenbing Huang Dong Yu Jiebo Luo
A Fast and Accurate One-Stage Approach to Visual Grounding - - PowerPoint PPT Presentation
A Fast and Accurate One-Stage Approach to Visual Grounding Zhengyuan Yang Boqing Gong Liwei Wang Wenbing Huang Dong Yu Jiebo Luo Presenter: Tianlang Chen Visual grounding Grounding a language query onto a region of the image
Presenter: Tianlang Chen Zhengyuan Yang Boqing Gong Liwei Wang Wenbing Huang Dong Yu Jiebo Luo
Visual grounding
Query: bottom right grass
–
Phrase localization
–
Referring expression comprehension
Existing framework
Query: center building
Existing framework
One-stage visual grounding
Why one-stage visual grounding
Architecture overview
Architecture
Architecture
– Multiple resolutions – Three parts of input features
Architecture
Datasets
the black backpack on the bottom right
Flickr 30K Entities ReferItGame
Comparison to other methods
Qualitative results
Pred. gt Ours Two- stage
Code & models: https://github.com/zyang-ur/onestage_grounding Poster: #26 Contact: zyang39@cs.rochester.edu