Domain Adaptation & Transfer: All You Need to Use Simulation “for Real”
Boqing Gong
Tecent AI Lab
Department of Computer Science
Simulation for Real Boqing Gong Tecent AI Lab Department of - - PowerPoint PPT Presentation
Domain Adaptation & Transfer: All You Need to Use Simulation for Real Boqing Gong Tecent AI Lab Department of Computer Science An intelligent robot Semantic segmentation of urban scenes Assign each pixel a semantic label An
Department of Computer Science
Image credit: https://www.cityscapes-dataset.com/
Long, J., Shelhamer, E., & Darrell, T. (2015). Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.
Image credit: https://www.cityscapes-dataset.com/
Image credit: http://synthia-dataset.net/
15 30 45 60
8
[Zhang et al., ICCV’17]
[Jamal et al., CVPR’18]
[Gan et al., CVPR’17]
1 0.5 0.5 1 0.5 0.5 0.5 0.5
Car Children Drink Flowers Street Area Food Water
(a) Input: Video & Query (c) Output: Summary (b) Algorithm: Sequential & Hierarchical Determinantal Point Process (SH-DPP)
Important & diverse shots à Query-relevant, important, & diverse shots à
[Sharghi et al., ECCV’16, CVPR’17, ECCV’18]
[Gan et al., ECCV’16, CVPR’18] [Ding et al., WACV’18]
14
[Shimodaira, ’00] [Huang et al., Bickel et al., ’07] [Sugiyama et al., ’08] [Sethy et al., ’06] [Sethy et al., ’09]
[Evgeniou and Pontil, ’05] [Duan et al., ’09] [Duan et al., Daumé III et al., Saenko et al., ’10] [Kulis et al., Chen et al., ’11]
+
[Pan et al., ’09] [Blitzer et al., ’06] [Gopalan et al., ’11] [Chen et al., ’12] [Daumé III, ’07] [Argyriou et al, ’08] [Gong et al., ’12] [Muandet et al., ’13]
Image Baseline Ours Groundtruth
0% 10% 20% 30% 40% Sky Road Pedestrian Traffic Sign Tree
Θ
s : Source, t : Target
0% 10% 20% 30% 40% Sky Road Pedestrian Traffic Sign Tree
[ICCV’17]
20
0% 10% 20% 30% 40% Sky Road Pedestrian Traffic Sign Tree
15 30 45 60
[Zhang et al., ICCV’17]
15 30 45 60
2 (x,a) 1 (x,a) C (x,a) C+2 C+1
(x,a)m1
m1=1,2,…
(x,a)m2
m2=1,2,…
(x,a)mC
mC=1,2,…
(x,?)n
n=1,2,…
Training data sampled from C related domains Test data from both seen & unseen domains
!
Unseen Seen
M scenes N tasks Setting 3 es
Synthesize Policy for Transfer and Adaptation across Environments and Tasks