CSC 2547: Machine Learning for Vision as Inverse Graphics Anthony - PowerPoint PPT Presentation

CSC 2547: Machine Learning for Vision as Inverse Graphics Anthony Bonner www.cs.toronto.edu/~bonner

Paper Presentations • Each week will focus on one topic, as listed on the course web page (soon). • You can vote for your choice of topic/week (soon). • I will assign you to a week (soon). • Papers on each topic will be listed on the course web page. • If you have a particular paper you would like to add to the list, please let me know.

Paper Presentations • Goal: high quality, accessible tutorials. • 7 weeks and 44 students = 6 or 7 students per week and about 15 minutes per student. • 2-week planning cycle: – 2 weeks before your presentation, meet me after class to discuss and assign papers. – The following week, meet the TA for a practice presentation (required). – Present in class under strict time constraints.

Team Presentatations • Papers may be presented in teams of two or more with longer presentations (15 minutes per team member). • Unless a paper is particularly difficult or long, a team will be expected to cover more than one paper (one paper per team member). • A team may cover one of the listed papers and one or more of its references (but see me first).

Tentative Topics • Discriminative approaches. • Generative approaches. • Differentiable rendering. • Capsule networks • Group symmetries and equivariance • Visual attention mechanisms • Adversarial methods

Project Ideas • Improve upon the work in a paper – Even a small improvement is OK • For example, – Make a generative model conditional – Disentangle (some) latent variables – Adapt a method to new circumstances • Different kinds of data • Missing or noisy data – Make a supervised method semi-supervised

Project Ideas • Examples (continued) – Modify the cost function • Introduce learnable parameters into a cost function • Use an adversarial cost • Try a variation on KL divergence – Modify the latent priors • Make the prior learnable • Do not assume Gaussianity – Modify the variational assumptions • Do not assume complete independence • Do not assume Gaussianity

Project Ideas • Implement and compare different methods for the same problem (e.g., different methods for inferring 3D structure) – Clearly and succinctly describe each method – Clearly articulate their differences – Describe their strengths and weaknesses – Ideally, include experiments highlighting the differences between the methods on realistic problems.

Project Considerations • Is your idea sensible? • Can you download all the necessary data? • Do you have the computational resources (GPUs)? • Do you have time to complete it? • Start by duplicating the results in the paper (if the paper gives enough details).

Project Dates • Proposal due February 18 – about 2 pages – include preliminary literature search • Project presentations: March 24 and 31 – about 5 minutes per student (like “spotlight presentations” at a conference) • Project due: April 12 – project report (4-8 pages) and code

Generative Approaches • Given a scene, s, a graphics program, G, produces an image, G(s). • Given an image, x, find s such that G(s) ≈ x • More generally, find P(s|x),. • P(s|x) is high when G(s) is close to x.

Variational Approximations • Finding P(s|x) is intractable in general. • Use variational approximations. • Variational auto-encoders work very well. • G can be a neural net that we learn (unsupervised). • Computationally intensive.

Variational Autoencoders Volume Generator Perspective Transformer 64x64x3 32x32x64 1x32x32 1x32x32x32 1x32x32x32 256x6x6x6 96x15x15x15 16x16x128 8x8x256 512x3x3x3 latent unit 1x1x1024 Sampler 1x1x1024 1x1x 512 4x4x4 conv 5x5x5 conv Target projection Grid generator 5x5 conv 6x6x6 conv 5x5 conv 5x5 conv 4x4 Input image Τ θ (G) transformation Encoder Decoder From Yan et al, Perspective Transformer Nets , arXiv 2017

Disentangled Representations Disentangled Representation y (digit label) z (handwriting style) From Siddharth et al, Semi-supervised Deep Generative Models , NIPS 2017 the

Input images Disentangled Representations Learning Identity manifold Pose manifold coordinates Fixed ID coordinates Fixed Pose Input From Reed et al , Learning to Disentangle Factors of Variation , ICML 2014

Learning 3D Shape From Yan et al, Perspective Transformer Nets , arXiv 2017

Learning 3D Structure From Niu et al , Im2Struct: recovering 3D Shape Structure , CVPR 2018

Scene Understanding From Wu et al , Neural Scene De-rendering , CVPR 2017

Scene Understanding From Huang et al, Occlusion Aware Generative Models , ICLR 2016

Conditional Image Generation ground -truth NN CVAE From Sohn et al , Deep Conditional Generative Models , NIPS 2015

Conditional Image Generation From Ivanov et al , Variational Autoencoder with Arbitrary Conditioning , ICLR 2019

Attribute Conditioned Image Generation viewpoint ? background ? lighting ? … ? age: young 0.9 0.9 gender: female 1.3 1.3 brown hair color: brown -0.4 0.4 expression: smile 0.8 0.8 From Yan et al, Attribute2Image: Conditional Image Generation , arXiv 2016

Infer Relationship Making Visual Analogies Transform query • Given images A, B, C, generate image D so that D is to C as B is to A. From Reed et al, Deep Visual Analogy-Making , NIPS 2015

CSC 2547: Machine Learning for Vision as Inverse Graphics Anthony - PowerPoint PPT Presentation

CSC 2547: Machine Learning for Vision as Inverse Graphics Anthony Bonner www.cs.toronto.edu/~bonner Paper Presentations Each week will focus on one topic, as listed on the course web page (soon). You can vote for your choice of

Graphics Murray Cole Graphics 1 Graphics 2 Graphics 3 Graphics 4 Graphics 5 Graphics 6

STA 4273 / CSC 2547 Spring 2018 Learning Discrete Latent Structure What recently became easy in

Unification of CSC and SE ABET Effor ts Similarity of CSC and SE Programs Similarity of CSC and

Lecture 2: Gradient Estimators CSC 2547 Spring 2018 David Duvenaud Based mainly on slides by Will

CS378 - Mobile Computing 3D Graphics 2D Graphics android.graphics library for 2D graphics

Dynamic Inverse Problems: Schmitt Efficient Algorithms and Approximate Inverse Problems

Statistical Inverse Problems and abstract inverse problems examples Instrumental Variables

1. Algorithms for Inverse Reinforcement Learning 2. Apprenticeship learning via Inverse

3D GRAPHICS design animate render Computer Graphics 3D animation movies Computer Graphics

CSC Effectiveness Review CSC Effectiveness Review Team October 2018 ICANN63 Need for Review of

1 Articulated Figures Inverse Kinematics Articulated Figures Inverse Kinematics Composite

Inverse Kinematics Inverse Kinematics Inverse Kinematics Carnegie Carnegie Sebastian Grassia

Course on Inverse Problems Albert Tarantola Lesson VI: a) General Formulation of the Inverse

Introduction to Machine Learning Introduction to Machine Learning Introduction to Machine

Graphics Processing CS418 Computer Graphics John C. Hart Graphics Processing Graphics

Quantum Machine Learning Adam Brown, HEP-AI Quantum Computing Machine Learning Quantum

Full-information best-choice game with two stops Anna A. Ivashko Institute of Applied

s rs srs r r

Sunu 3 Hazrlayan: Do.Dr. Hakan niz APALAR

Can Demonstratives be Discourse Topics? Evidence from German Right Dislocation We argue for an

DJJ Special Education and IEPs Sylvia Starkey Director Division of IDEA Monitoring and Results

Functional Programming March 16, 2019 Functional Programming March 16, 2019 1 / 12 Mayer

QRPA based calculations for neutrino scattering and electroweak excitations of nuclei. Arturo R.

Microservices and OSGi running with Apache Karaf Agenda No free Lunch - microservices