Generative Modeling of Infinite Occluded Objects for Compositional - PowerPoint PPT Presentation

Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation Jinyang Yuan, Bin Li, Xiangyang Xue Fudan University { yuanjinyang, libin, xyxue } @fudan.edu.cn Jun 12, 2019 Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 1 / 10

Compositional Scene Representation Scenes are composed of objects and background The combinations of objects and background are diverse A single representation for the entire scene is relatively complex Single Object Multiple Objects … … … Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 2 / 10

Compositional Scene Representation Compositional scene representation is desirable Lower representation complexity Higher generalizability to novel scenes Object 1 Object 2 Background Object 1 Object 2 Object 3 Background Scene Scene Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 3 / 10

Generative Modeling of Infinite Occluded Objects Two major difficulties The number of objects is unknown The perceived objects may be incomplete due to occlusions Normalized Scale and Complete Perceived Appearance Shape Translation Shape Shape Generated Scene Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 4 / 10

Generative Modeling of Infinite Occluded Objects Background: k = 0 , Objects: k ≥ 1 � ˜ � σ 2 ) Latent Representation s · k ∼ N µ , diag(˜ k ≥ 0 , �� k k ′ =1 ν k ′ � z ind Presence (number of objects) ν k ∼ Beta( α, 1) , ∼ Ber , k ≥ 1 k � � z dep f stn ( f shp ( s shp , s stn Complete Shape n,k ∼ Ber · k ) ) n , k ≥ 1 · k � �� scale and translation normalized shape � � k − 1 � � k z dep k ′ z dep z ind 1 − z ind k ≥ 1 , n,k k ′ =1 n,k ′ Perceived Shape (occlusions) ρ n,k = 1 − � ∞ k ′ =1 ρ n,k ′ , k = 0 � apc ( s apc f obj k ≥ 1 · k ) , Appearance a n,k = apc ( s apc f back · k ) , k = 0 ∞ � σ 2 I ) Generated Scene x n ∼ ρ n,k N ( a n,k , ˆ k =0 Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 5 / 10

Variational Inference Parameters are inferred by long short-term memories ( LSTMs ) Each object and background are updated sequentially and iteratively The LSTMs imitate the procedure of coordinate ascent � � K N � � q ( h | x ) = q ( s apc q ( s stn · k ) q ( s shp · k | s stn · k ) q ( s apc · k | s stn · k ) q ( ν k | s stn · k ) q ( z ind k | s stn q ( z dep n,k | s shp · k , s stn · 0 ) · k ) · k ) k =1 n =1 · k | s stn 2 )) q ( s ∗ · k ) = N ( s ∗ · k ; µ ∗ · k , diag( σ ∗ · k q ( ν k | s stn · k ) = Beta( ν k ; τ 1 ,k , τ 2 ,k ) q ( z ind k | s stn · k ) = Ber( z ind k ; ζ k ) q ( z dep n,k | s shp · k ) = Ber( z dep · k , s stn n,k ; ξ n,k ) Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 6 / 10

Experimental Results Gray-S/M RGB1-S/M RGB2-S/M RGB3-S/M RGB4-S/M scene recon obj 1 obj 2 obj 3 obj 4 segre Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 7 / 10

Experimental Results Table: Comparison of segregation and counting performance with existence of occlusion. N-EM [Greff et al., 2017] AIR [Eslami et al., 2016] Proposed Data set AMI MSE OCA AMI MSE OCA AMI MSE OCA Gray-S 77.3% 10e-3 56.2% 85.4% 6.5e-3 80.9% 94.6 % 2.9 e-3 90.5 % Gray-M 30.5% 22e-3 13.5% 62.8% 9.0e-3 66.0% 71.1 % 7.5 e-3 77.6 % RGB1-S 81.8% 5.6e-3 74.2% 95.3% 2.4e-3 88.8% 98.3 % 1.1 e-3 95.1 % RGB1-M 57.0% 9.4e-3 16.3% 78.2% 3.5e-3 67.9% 82.0 % 3.1 e-3 74.8 % RGB2-S 66.2% 9.0e-3 60.8% 85.7% 3.7e-3 84.4% 92.3 % 2.2 e-3 86.3 % RGB2-M 34.9% 13e-3 12.5% 64.1% 4.8e-3 69.8% 67.9 % 4.7 e-3 71.0 % RGB3-S 29.6% 21e-3 7.44% 91.3% 3.9e-3 90.3% 97.4 % 1.4 e-3 92.5 % RGB3-M 15.4% 22e-3 2.30% 67.5% 5.4e-3 60.5% 77.9 % 3.8 e-3 68.6 % RGB4-S 24.7% 20e-3 10.3% 86.7% 4.0e-3 78.3% 90.7 % 2.5 e-3 83.3 % RGB4-M 3.82% 32e-3 2.35% 56.9% 6.3e-3 58.2% 67.9 % 4.6 e-3 77.3 % Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 8 / 10

References Eslami, S., Heess, N., Weber, T., Tassa, Y., Szepesvari, D., Kavukcuoglu, K., and Hinton, G. E. (2016). Attend, infer, repeat: Fast scene understanding with generative models. In Advances in Neural Information Processing Systems (NeurIPS) , pages 3225–3233. Greff, K., van Steenkiste, S., and Schmidhuber, J. (2017). Neural expectation maximization. In Advances in Neural Information Processing Systems (NeurIPS) , pages 6691–6701. Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 9 / 10

Thank You! Jinyang Yuan, Bin Li, Xiangyang Xue Infinite Occluded Objects Jun 12, 2019 10 / 10

Generative Modeling of Infinite Occluded Objects for Compositional - PowerPoint PPT Presentation

Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation Jinyang Yuan, Bin Li, Xiangyang Xue Fudan University { yuanjinyang, libin, xyxue } @fudan.edu.cn Jun 12, 2019 Jinyang Yuan, Bin Li, Xiangyang Xue Infinite

generative design systems Generative Brief Design Definitions Workshop Processes

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Infinite graphs P eter Komj ath LC12 P eter Komj ath Infinite graphs Infinite

Generative networks part 2: GANs 23 / 54 Recap on generative networks Generative networks provide

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Generative Adversarial Nets(GANs) Troy Cary and Chenzhi Zhao A generative adversarial net is

CSC421/2516 Lecture 18: Generative Adversarial Networks Roger Grosse and Jimmy Ba Roger Grosse

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Infinite Campus Parent Portal Scan and Go https://goo.gl/kNtHrw Infinite Campus Parent Portal

Happy 103rd birthday, Richard Guy Karl Dilcher Infinite products Infinite products involving

Infinite dimensional sub-Riemannian geometry Sylvain Arguill` ere (CIS, Johns Hopkins

Conditional Generative Adversarial Networks (and a brief look at image-to-image translation)

An overview of Boosting Yoav Freund UCSD Plan of talk Generative vs. non-generative

CSC321 Lecture 19: Generative Adversarial Networks Roger Grosse Roger Grosse CSC321 Lecture 19:

LEARNING GENERATIVE MODELS ACROSS INCOMPARABLE SPACES Cha harlot otte Bunne unne , David

QCD$Library$for$GPU$Cluster$with$ Proprietary$Interconnect$for$ GPU$Direct$Communica<on

Shared Governance at Colorado State University: Administrative Professional Council Classified

Modelling Biochemical Reaction Networks Lecture 18: Modeling the cell cycle, Part II Marc R.

Auxiliary Mixture Sampling for Age-Period-Cohort Models Andrea Riebler & Leonhard Held

Topics in f ( R ) THEORIES OF GRAVITY Nathalie Deruelle APC-CNRS, Paris ihes, April 22th 2010 2

Input Evidence Causal Leverage Analysis Lance Shea Partner BakerHostetler Washington, DC

A full sky, low foreground, high resolution CMB map from WMAP Jacques Delabrouille APC Paris

Concordance among Holdouts Scott Duke Kominers Department of Economics, Harvard University and

Sambuz

Useful Links

Newsletter

Mail Us

Generative Modeling of Infinite Occluded Objects for Compositional - PowerPoint PPT Presentation

Generative Modeling of Infinite Occluded Objects for Compositional Scene Representation Jinyang Yuan, Bin Li, Xiangyang Xue Fudan University { yuanjinyang, libin, xyxue } @fudan.edu.cn Jun 12, 2019 Jinyang Yuan, Bin Li, Xiangyang Xue Infinite

generative design systems Generative Brief Design Definitions Workshop Processes

Mutable Values Announcements Objects (Demo) Objects 4 Objects Objects represent

61A Lecture 12 Announcements Objects (Demo) Objects 4 Objects Objects represent

Infinite graphs P eter Komj ath LC12 P eter Komj ath Infinite graphs Infinite

Generative networks part 2: GANs 23 / 54 Recap on generative networks Generative networks provide

Objects &amp; Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

Generative Adversarial Nets(GANs) Troy Cary and Chenzhi Zhao A generative adversarial net is

CSC421/2516 Lecture 18: Generative Adversarial Networks Roger Grosse and Jimmy Ba Roger Grosse

Live Objects Live Objects Live Objects Live Objects Krzys Ostrowski, Ken Birman, Danny Dolev

Infinite Campus Parent Portal Scan and Go https://goo.gl/kNtHrw Infinite Campus Parent Portal

Happy 103rd birthday, Richard Guy Karl Dilcher Infinite products Infinite products involving

Infinite dimensional sub-Riemannian geometry Sylvain Arguill` ere (CIS, Johns Hopkins

Conditional Generative Adversarial Networks (and a brief look at image-to-image translation)

An overview of Boosting Yoav Freund UCSD Plan of talk Generative vs. non-generative

CSC321 Lecture 19: Generative Adversarial Networks Roger Grosse Roger Grosse CSC321 Lecture 19:

LEARNING GENERATIVE MODELS ACROSS INCOMPARABLE SPACES Cha harlot otte Bunne unne , David

QCD$Library$for$GPU$Cluster$with$ Proprietary$Interconnect$for$ GPU$Direct$Communica&lt;on

Shared Governance at Colorado State University: Administrative Professional Council Classified

Modelling Biochemical Reaction Networks Lecture 18: Modeling the cell cycle, Part II Marc R.

Auxiliary Mixture Sampling for Age-Period-Cohort Models Andrea Riebler &amp; Leonhard Held

Topics in f ( R ) THEORIES OF GRAVITY Nathalie Deruelle APC-CNRS, Paris ihes, April 22th 2010 2

Input Evidence Causal Leverage Analysis Lance Shea Partner BakerHostetler Washington, DC

A full sky, low foreground, high resolution CMB map from WMAP Jacques Delabrouille APC Paris

Concordance among Holdouts Scott Duke Kominers Department of Economics, Harvard University and

Sambuz

Useful Links

Newsletter

Mail Us

Objects & Inheritance Section 7 Implementing Objects in 401 Ways of implementing objects:

QCD$Library$for$GPU$Cluster$with$ Proprietary$Interconnect$for$ GPU$Direct$Communica<on

Auxiliary Mixture Sampling for Age-Period-Cohort Models Andrea Riebler & Leonhard Held