From processing to learning on graphs Patrick Prez Maths and Images - - PowerPoint PPT Presentation

from processing to learning on graphs
SMART_READER_LITE
LIVE PREVIEW

From processing to learning on graphs Patrick Prez Maths and Images - - PowerPoint PPT Presentation

From processing to learning on graphs Patrick Prez Maths and Images in Paris IHP, 2 March 2017 Signals on graphs Natural graph: mesh, network, etc., Instrumental graph: derived from a related to a real structure, various


slide-1
SLIDE 1

From processing to learning on graphs

Patrick Pérez

Maths and Images in Paris IHP, 2 March 2017

slide-2
SLIDE 2

► Instrumental graph: derived from a collection or a signal, captures its structure, other signals leverage it

2

Signals on graphs

► Natural graph: mesh, network, etc., related to a “real” structure, various signals can live on it

slide-3
SLIDE 3

3

Playing with graph signals

Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer

slide-4
SLIDE 4

4

Playing with graph signals

Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer

Puy 2016-2017

slide-5
SLIDE 5

5

Playing with graph signals

Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer

Puy 2017

slide-6
SLIDE 6

6

Playing with graph signals

Coding Compress Sample Reconstruct Processing Transform Enhance Edit Learning Cluster Label Infer

Garrido 2016

slide-7
SLIDE 7

7

Undirected weighted graph

slide-8
SLIDE 8

8

Graph Laplacian(s)

Vertex degree and degree matrix Symmetric p.s.d. Laplacians

► Combinatorial Laplacian ► Normalized Laplacian

slide-9
SLIDE 9

9

Graph signal and smoothness

Signals / functions on graph

► Scalar ► Multi-dim.

Graph smoothness

► Scalar ► Multi-dimensional

slide-10
SLIDE 10

10

Spectral graph analysis

Laplacian diagonalization and graph harmonics of increasing “frequencies” Graph Fourier transform and its inverse Smooth (k-bandlimited) signals

slide-11
SLIDE 11

11

Spectral graph analysis

slide-12
SLIDE 12

12

Spectral vertex embedding

Rows of truncated Fourier basis ⇒ k-dim embedding of vertices Clustered with k-means in spectral clustering

slide-13
SLIDE 13

► Through frequency filtering

13

Linear filters and convolutions

Filtering in the spectral domain

► With filter Fourier transform Polynomial filtering: from spectral to vertex domain

► Controlled locality and complexity Issues

  • locality on graph
  • computational complexity
slide-14
SLIDE 14

14

Sampling graph signals

Random sampling

► Define vertex sampling distribution ► Draw signal samples accordingly

Problems

► Reconstruction of smooth signals ► Performance as function of m ► Best sampling distribution [Puy et al. 2016]

slide-15
SLIDE 15

15

Reconstructing smooth signals from samples

Smooth interpolation / approximation (noisy measures) k-bandlimited approximation: exact or approximate

[Puy et al. 2016]

slide-16
SLIDE 16

16

Reconstruction quality (1)

Assuming RIP*

► Noisy measurements: ► Noiseless measurements: exact recovery * m large enough, for now

slide-17
SLIDE 17

17

Reconstruction quality (2)

Assuming RIP*

* m large enough, for now

slide-18
SLIDE 18

18

Optimizing sampling

Some vertices are more important

► Norm of spectral embedding: max. energy fraction on vertex from k-bandlimited signal Exists a k-bandlimited signal concentrated on this node; should be sampled Exists no k-bandlimited signal concentrated on this node; can be ignored ► Graph weighted coherence of distribution should be as small as possible

slide-19
SLIDE 19

19

Restricted Isometry Property (RIP)

► vertices are enough to sample all k-bandlimited signals ► In best case, suffice ► Once selected, vertices can be used to sample all k-bandlimited signals

slide-20
SLIDE 20

20

Empirical RIP

slide-21
SLIDE 21

21

Optimal and practical sampling

Optimal sampling distribution

► measurements suffice, but requires computation of harmonics

Efficient approximation

► Rapid computation of alternative vertex embedding of similar norms with columns of R obtained by polynomial filtering of suitable Gaussian signals ► Can serve also for efficient spectral clustering [Tremblay et al. 2016]

slide-22
SLIDE 22

22

Optimal and practical sampling

slide-23
SLIDE 23

23

Extension to group sampling

Given a suitable partition of vertices

► Smooth graph signals almost piece-wise constant on groups

Interest

► Speed and memory gains (working on reduced signal versions) ► Interactive systems: propose sampled groups for user to annotate Random sampling? Reconstruction? [Puy and Pérez 2017] under submission

slide-24
SLIDE 24

24

Extension to group sampling

Given a suitable partition of vertices

► Smooth graph signals almost piece-wise constant on groups

Interest

► Speed and memory gains (working on reduced signal versions) ► Interactive systems: propose sampled groups for user to annotate Random sampling? Reconstruction? [Puy and Pérez 2017] under submission

slide-25
SLIDE 25

25

Group sampling and group coherence

Reasoning at group level

► Group sampling ► Local group coherence: max energy fraction in group from a k-bandlimited signal* ► Group coherence:

*

slide-26
SLIDE 26

26

Restricted Isometry Property (RIP)

► groups are enough to sample all k-bandlimited signals ► In best case, groups suffice

slide-27
SLIDE 27

27

Smooth piece-wise constant reconstruction

slide-28
SLIDE 28

28

Smooth piece-wise constant reconstruction

Assuming RIP

slide-29
SLIDE 29

29

Empirical RIP

slide-30
SLIDE 30

30

Group sampling distributions

slide-31
SLIDE 31

31

slide-32
SLIDE 32

32

Convolutional Neural Nets (CNNs) on graph

CNNs

► Immensely successful for image-related task (recognition, prediction, processing, editing) ► Layers: Convolutions, non-linearities and pooling

Extension to graph signals?

► No natural convolution and pooling ► Graph structure may vary (not only size as with lattices) ► Computational complexity ► A simple proposal [Puy et al. 2017]

slide-33
SLIDE 33

33

Graph-CNNs

Convolution in spectral domain [Bruna et al. 2013]

► Computation and use of Fourier basis not scalable ► Difficult handling of graph changes across inputs

Convolution with polynomial filters [Defferrard et al. 2016, Kipf et al. 2016]

► Better control of complexity and locality ► Not clear handling of graph changes across inputs ► Lack of filter diversity (e.g., rotation invariance on 2D lattice)

Direct convolutions [Monti et al. 2016, Niepert et al. 2016, Puy et al. 2017]

► Local or global pseudo-coordinates ► Include convolution on regular grid as special case

slide-34
SLIDE 34

► Dot product with filter kernel

34

Direct convolution on weighted graph

At each vertex

► Extract a fixed-size signal “patch” Order, Weigh, Assemble

1 2 3 4 5 6

[Puy et al. 2017]

slide-35
SLIDE 35

Weight-based ordering and weighting

35

Direct convolution on weighted graph

Back to classic convolution

► Lexicographical order, no weighting

1 2 6 4 3 5 1 2 3 4 5

slide-36
SLIDE 36

Feature-based nearest neighbor graph

► Given an image, one feature vector at each pixel ► Connect each pixel to its d nearest neighbor in feature space ► Weigh with exponential of feature similarity

36

Non-local weighted pixel graph

slide-37
SLIDE 37

37

One graph convolutional layer

ReLU

slide-38
SLIDE 38

38

Style transfer

Neural example-based stylization [Gatys et al. 2015]

► Iterative modification of noise to fit “statistics” of style image and “content” of target image ► Neural statistics: Gram matrix of feature maps at a layer of a pre-trained deep CNN

slide-39
SLIDE 39

39

Style transfer

Using only a single random graph convolution layer

► Input image only used to build the graph

slide-40
SLIDE 40

40

Style transfer

Using only a single random graph convolution layer

► Input image only used to build the graph

Non-local graph only

slide-41
SLIDE 41

41

Style transfer

Using only a single random graph convolution layer

► Input image only used to build the graph

Non-local graph + Local graph

slide-42
SLIDE 42

42

XXX

► XXX

slide-43
SLIDE 43

43

Color palette transfer

Using only a single random graph convolution layer

target image proposed

  • ptimal transport

source palette

slide-44
SLIDE 44

44

Signal denoising

Trained 3-layer graph CNN

► Local and non-local graphs from noisy input

  • loc. weighted graph

soft thresholding 1 20 local or not no non-linearity 20

  • loc. weighted graph

no non-linearity 1

slide-45
SLIDE 45

45

Image denoising

non local 2nd layer local 2nd layer

Noisy 23.10dB Trained – Local 29.13dB Trained – Non-local 29.42dB Haar soft thresh. 26.78dB

slide-46
SLIDE 46

46

Triangular 3D mesh

Graph

► Vertices: points in 3D space ► Edges: forming triangulated graph ► Weights (if any): associated to local 3D shape

Signals

► Colors ► Normals ► Mesh deformations

slide-47
SLIDE 47

47

Face capture from single video

[Cao et al., 2015] [Suwajanakorn et al., 2014] [Garrido et al., 2016] Detailed 3D face rig

slide-48
SLIDE 48

48

Parametric face model

Two-level coarse linear modelling

► Inter-individual variations: linear space around average neutral face (AAM) ► Expressions: linear space of main modes of deformations around neutral (blendshapes)

Reconstruction and tracking from raw measurements

► Extract person’s neutral shape (morphology) ► Extract/track main deformations (expression/performance) ► Mitigate model limitations through smooth corrections ► Recover person-specific fine scale details

slide-49
SLIDE 49

49

Parametric face model

Two-level coarse linear modelling

► Inter-individual variations: linear space around average neutral face (AAM) ► Expressions: linear space of main modes of deformations around neutral (blendshapes)

Reconstruction and tracking from raw measurements

► Extract person’s neutral shape (morphology) ► Extract/track main deformations (expression/performance) ► Mitigate model limitations through smooth corrections ► Recover person-specific fine scale details

slide-50
SLIDE 50

50

Parametric face model

Two-level coarse linear modelling

► Inter-individual variations: linear space around average neutral face (AAM) ► Expressions: linear space of main modes of deformations around neutral (blendshapes)

Reconstruction and tracking from raw measurements

► Extract person’s neutral shape (morphology) ► Extract/track main deformations (expression/performance) ► Mitigate model limitations through smooth corrections ► Recover person-specific fine scale details

slide-51
SLIDE 51

51

Parametric face model

Two-level coarse linear modelling

► Inter-individual variations: linear space around average neutral face (AAM) ► Expressions: linear space of main modes of deformations around neutral (blendshapes)

Reconstruction and tracking from raw measurements

► Extract person’s neutral shape (morphology) ► Extract/track main deformations (expression/performance) ► Mitigate model limitations through smooth corrections ► Recover person-specific fine scale details

slide-52
SLIDE 52

52

Smooth correction

Layered mesh model Graph harmonics on each coordinate [Vallet and Levy 2008][Li et al. 2013]

[Garrido et al. 2016]

slide-53
SLIDE 53

53

Model personalization and tracking in single video

Monocular video Generic face prior

Multi-layer performance capture

Fine Coarse Medium

slide-54
SLIDE 54

54

Multi-layer performance capture

slide-55
SLIDE 55

55

From capture to animation

Detail learning

Monocular video Generic face prior Fine Coarse Medium

X

Multi-layer performance capture

Fine Coarse Medium Personalized face rig

slide-56
SLIDE 56

56

Personalized face rig

Turn model into a face rig (puppet)

► Ridge regression

slide-57
SLIDE 57

57

Personalized face rig

Turn model into a face rig (puppet)

► Ridge regression fixed editable regression

slide-58
SLIDE 58

58

Rig animation from capture

slide-59
SLIDE 59

59

Rig animation from capture

slide-60
SLIDE 60

From processing to learning on graphs

Patrick Pérez

Maths and Images in Paris IHP, 2 March 2017

► G. Puy, P. Pérez. Structured sampling and fast reconstruction of smooth graph signals. Submitted to Information and Inference ► G. Puy, S. Kitic, P. Pérez. Unifying local and non-local signal processing with graph CNNs. arXiv:1702.07759 ► P. Garrido, M. Zollhoefer, D. Casas, L. Valgaerts, K. Varanasi, P. Pérez, Ch. Theobalt. Reconstruction of personalized 3D face rigs from monocular video. ACM Trans. on Graghics, 35(3), 2016