Active Perception in Vision-Language Navigation Task - - PowerPoint PPT Presentation

active perception in vision language navigation task
SMART_READER_LITE
LIVE PREVIEW

Active Perception in Vision-Language Navigation Task - - PowerPoint PPT Presentation

Student Webinar Active Perception in Vision-Language Navigation Task (MC (MCIS Lab) (B (Beijing ng Ins nsti titute


slide-1
SLIDE 1

Active Perception in Vision-Language Navigation Task

汪汗青

媒体计算与智能系统实验室 媒体计算与智能系统实验室 (MC (MCIS Lab) 北京理工大学 北京理工大学 (B (Beijing ng Ins nsti titute tute of T echno hnology) Hom Home Page: https://hanqingwa wangai.github.io Em Email: hanqingwang@bit. t.edu.cn

Student Webinar

slide-2
SLIDE 2

Outl tline

  • Introduction to VLN Task
  • Challenges of VLN Task
  • Baseline & Our Approach
  • Learning method
  • Conclusion
slide-3
SLIDE 3

Refe ference

Active Visual Information Gathering for Vision-Language Navigation

Hanqing Wang, Wenguan Wang, Tianmin Shu, Wei Liang, and Jianbing Shen

slide-4
SLIDE 4

VLN Task sk

[Anderson, P., Wu, Q. et al., Vision-and-Language Navigation: Interpreting visually-grounded navigation instructions in real environments, CVPR 2018]

slide-5
SLIDE 5

VLN Task sk

[Manolis S., Abhishek K., Oleksandr M. and et al., Habitat: A Platform for Embodied AI Research, ICCV 2019]

slide-6
SLIDE 6

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Observation:

Ground Truth Path

VLN Task sk

slide-7
SLIDE 7

Observation:

Ground Truth Path

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

VLN Task sk

slide-8
SLIDE 8

Observation:

Ground Truth Path

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

VLN Task sk

slide-9
SLIDE 9

Observation:

Ground Truth Path

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

VLN Task sk

slide-10
SLIDE 10

Challenge of f VLN Task sk

Observation: Instruction: Leave the bathroom and walk forward along the pool.

Take a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Basic Agent Door 1 Door 2

slide-11
SLIDE 11

Observation:

Basic Agent

Failed

Instruction: Leave the bathroom and walk forward along the pool.

Take a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Challenge of f VLN Task sk

slide-12
SLIDE 12

Observation: Instruction: Leave the bathroom and walk forward along the pool.

Take a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Our Agent Door 1 Door 2

Insufficient information

Challenge of f VLN Task sk

slide-13
SLIDE 13

Observation:

Our Agent

Instruction: Leave the bathroom and walk forward along the pool.

Take a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

inconsistent

Challenge of f VLN Task sk

slide-14
SLIDE 14

Observation:

Our Agent

Instruction: Leave the bathroom and walk forward along the pool.

Take a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Challenge of f VLN Task sk

slide-15
SLIDE 15

Observation:

Our Agent

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Challenge of f VLN Task sk

slide-16
SLIDE 16

Observation:

Our Agent

Instruction: Leave the bathroom and walk forward along the pool. Take

a left at the stairs and go up the stairs. Go up another set of stairs, and enter the massage room first on the right. Stop in the doorway to the massage room.

Succeed!

Challenge of f VLN Task sk

slide-17
SLIDE 17

Starting VP Observed VP Unvisited VP !" #",%

Baseline model &",' = softmax'(#",'

1 23")

3" = LSTM( 9, !", :";% , 3";%)

#",<

Base seline

slide-18
SLIDE 18

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP !",$ %

"

&",$ Collected information

Na Naïve Ex Exploration Module

slide-19
SLIDE 19

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP !",$ %",& ' %",$ Collected information Updated candidate feature

Na Naïve Ex Exploration Module

slide-20
SLIDE 20

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP ! "#,% ! "#,& "#,& '#,& Updated candidate feature Collected information

Na Naïve Ex Exploration Module

slide-21
SLIDE 21

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP ! "#,% ! "#,& Navigation decision making Updated candidate feature

Na Naïve Ex Exploration Module

slide-22
SLIDE 22

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP

Na Naïve Ex Exploration Module

slide-23
SLIDE 23

Need more exploration. Enough. STOP and return.

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP ℎ"

#$/&'

ℎ(

#$/&'

Exploration decision making

Few & Deeper Ex Exploration

slide-24
SLIDE 24

Stop Exploration Just make navigation.

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP Exploration decision making

Few & Deeper Ex Exploration

slide-25
SLIDE 25

Starting VP Observed VP Explored VP Navigated VP Target VP Unvisited VP ! "#,% "#,& Updated candidate feature

Few & Deeper Ex Exploration

slide-26
SLIDE 26

Over vervi view ew

slide-27
SLIDE 27

Training Signals:

+ + +

Imitation Learning Reinforcement Learning

Tr Training

slide-28
SLIDE 28

Navigation Part: Exploration Part:

Imit Imitat atio ion Learn Learnin ing

slide-29
SLIDE 29

Navigation Part:

Actor Loss Critic Loss Immediate reward

= change of distance, if != STOP, +3, if = STOP and SUCCEED,

  • 3, if = STOP and FAIL.

Accumulated discount reward

Reinfo forcement Learning

slide-30
SLIDE 30

Exploration Part:

Exploration-assisted relative reward shaping

Accumulated discount reward Immediate reward

Reinfo forcement Learning

slide-31
SLIDE 31

Quantitative Resu sults

slide-32
SLIDE 32

Observation:

Qualitative Resu sults

slide-33
SLIDE 33

Observation:

Qualitative Resu sults

slide-34
SLIDE 34

Observation:

Qualitative Resu sults

slide-35
SLIDE 35

Observation:

Qualitative Resu sults

slide-36
SLIDE 36

Observation:

Qualitative Resu sults

slide-37
SLIDE 37

Observation:

Failed

Qualitative Resu sults

slide-38
SLIDE 38

Observation:

Qualitative Resu sults

slide-39
SLIDE 39

Observation:

Qualitative Resu sults

slide-40
SLIDE 40

Observation:

Qualitative Resu sults

slide-41
SLIDE 41

Observation:

Qualitative Resu sults

slide-42
SLIDE 42

Observation:

Qualitative Resu sults

slide-43
SLIDE 43

Observation:

Succeed!

Qualitative Resu sults

slide-44
SLIDE 44
  • 1. Active perception is effective in embodied vision tasks.
  • 2. IL warm up + RL is powerful for some problems that are hard to learn

by supervised learning.

  • 3. VLN task is far from been solved, how to fill the performance gap

between seen and new environments.

Conclusi sions

slide-45
SLIDE 45

Thanks for your listening!

Project Page: https://github.com/HanqingWangAI/Active_VLN Email: hanqingwang@bit.edu.cn Advisor: Wei Liang, Wenguan Wang, Jianbing Shen