Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models



SLIDE 1

Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models

Jiuxiang Gu Jianfei Cai Shafiq Joty Li Niu Gang Wang

SLIDE 2

Goal

Image-to-Text Retrieval (image query, retrieved captions):

  • A young man doing a skateboard trick while others watch
  • A man doing a skate trick during a competition event with a audience
  • Guys on a course made for skate boarding
  • A group of people doing skateboarding tricks on a car
  • A boy riding on his skateboard at a skate park while other guys watch
  • …

Text-to-Image Retrieval (text query, retrieved images):

  • Bright room with a couch and various different dressers
  • …

SLIDE 3

Classical Pipeline

Figure: a caption ("Bright room with a couch and various different dressers") is fed to a Text Encoder and candidate images to an Image Encoder; the resulting Text Feature and Image Feature are compared by a Similarity score.
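The classical pipeline above can be illustrated with a minimal sketch. The slide only says "Similarity", so the use of cosine similarity here is an assumption, and the function names (`cosine_similarity`, `rank_by_similarity`) and the three-dimensional toy vectors are hypothetical stand-ins for real encoder outputs.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def rank_by_similarity(query_feature, candidate_features):
    """Return candidate indices sorted from most to least similar to the query."""
    scores = [cosine_similarity(query_feature, c) for c in candidate_features]
    return sorted(range(len(scores)), key=lambda k: scores[k], reverse=True)

# Toy, hand-written vectors standing in for encoder outputs (not real data).
text_feature = [0.9, 0.1, 0.0]   # assumed output of the text encoder
image_features = [
    [0.1, 0.9, 0.2],             # image 0
    [0.8, 0.2, 0.1],             # image 1
    [0.0, 0.1, 1.0],             # image 2
]

# Text-to-image retrieval: rank images against the caption feature.
ranking = rank_by_similarity(text_feature, image_features)
print(ranking)  # [1, 0, 2]
```

Image-to-text retrieval would use the same ranking function with an image feature as the query and caption features as the candidates.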

SLIDE 4

Motivation: Look → Imagine → Match

Figure: two panels, Image-to-Text Retrieval and Text-to-Image Retrieval. Each panel shows an Imagine step together with Global Similarity and Local Similarity matching.
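As a rough illustration of matching with both a global and a local similarity, the sketch below ranks candidates by a weighted sum of the two scores. The slides do not state how the two similarities are combined, so the weighted sum, the `local_weight` parameter, the function names, and the toy scores are all assumptions made for this example only.

```python
def combined_score(global_sim, local_sim, local_weight=0.5):
    """Hypothetical fusion rule: weighted sum of the two similarity scores."""
    return global_sim + local_weight * local_sim

def rank_candidates(global_sims, local_sims, local_weight=0.5):
    """Return candidate indices sorted by combined score, best first."""
    scores = [
        combined_score(g, l, local_weight)
        for g, l in zip(global_sims, local_sims)
    ]
    return sorted(range(len(scores)), key=lambda k: scores[k], reverse=True)

# Made-up scores for three candidates, for illustration only.
global_sims = [0.70, 0.72, 0.30]
local_sims = [0.90, 0.20, 0.40]

with_local = rank_candidates(global_sims, local_sims)          # uses both scores
global_only = rank_candidates(global_sims, local_sims, 0.0)    # ignores local score
print(with_local)   # [0, 1, 2]
print(global_only)  # [1, 0, 2]
```

In this toy case the local score changes which candidate is ranked first, which is the kind of effect a second similarity term is meant to have.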

SLIDE 5

Look → Imagine

SLIDE 6

Match

SLIDE 7

Look → Imagine

SLIDE 8

Match

SLIDE 9

Proposed Approach

SLIDE 10

Cross-Modal Retrieval with Generative Learning

SLIDE 11

Cross-Modal Retrieval with Generative Learning

SLIDE 12

Results

SLIDE 13

Results (Classical Pipeline)

SLIDE 14

Results (Ours)

SLIDE 15

At the Poster:

  • Additional details
  • Quantitative results
  • Discussion