CreativeAI 3D (Geometric) Domain Niloy Mitra Iasonas Kokkinos - - PowerPoint PPT Presentation

creativeai 3d geometric domain
SMART_READER_LITE
LIVE PREVIEW

CreativeAI 3D (Geometric) Domain Niloy Mitra Iasonas Kokkinos - - PowerPoint PPT Presentation

CreativeAI 3D (Geometric) Domain Niloy Mitra Iasonas Kokkinos Paul Guerrero Nils Thuerey Tobias Ritschel UCL UCL UCL TUM UCL Timetable Niloy Paul Nils Introduction 2:15 pm X X X 2:25 pm X Machine Learning Basics Theory


slide-1
SLIDE 1

Niloy Mitra Iasonas Kokkinos Paul Guerrero Nils Thuerey Tobias Ritschel UCL UCL UCL TUM UCL

CreativeAI
 3D (Geometric) Domain

slide-2
SLIDE 2

SIGGRAPH Asia Course CreativeAI: Deep Learning for Graphics

Timetable

2

Niloy Paul Nils Introduction 2:15 pm X X X Machine Learning Basics ∼ 2:25 pm X Neural Network Basics ∼ 2:55 pm X Feature Visualization ∼ 3:25 pm X Alternatives to Direct Supervision ∼ 3:35 pm X 15 min. break Image Domains 4:15 pm X 3D Domains ∼ 4:40 pm X Motion and Physics ∼ 5:05 pm X Discussion ∼ 5:30 pm X X X Theory + Basics State of the Art

slide-3
SLIDE 3
  • Modeling by example, revisited

Application #1: 3D Modeling

3

[Sung et al. 2017]

Deep neural network predicts the next best part to add and its position to enable non-expert users to create novel shapes.

slide-4
SLIDE 4
  • Modeling by example, revisited

Application #1: 3D Modeling

3

[Sung et al. 2017]

Deep neural network predicts the next best part to add and its position to enable non-expert users to create novel shapes.

slide-5
SLIDE 5
  • Modeling by example, revisited

Application #1: 3D Modeling

3

[Sung et al. 2017]

Deep neural network predicts the next best part to add and its position to enable non-expert users to create novel shapes.

slide-6
SLIDE 6
  • Modeling by example, revisited

Application #1: 3D Modeling

3

[Sung et al. 2017]

Deep neural network predicts the next best part to add and its position to enable non-expert users to create novel shapes.

slide-7
SLIDE 7
  • Modeling by example, revisited

Application #1: 3D Modeling

3

[Sung et al. 2017]

Deep neural network predicts the next best part to add and its position to enable non-expert users to create novel shapes.

slide-8
SLIDE 8

Application #2: Image Understanding

4

[Zhang et al. 2017]

understanding 3D shapes can benefit image understanding

slide-9
SLIDE 9

Application #2: Image Understanding

4

[Zhang et al. 2017]

understanding 3D shapes can benefit image understanding

slide-10
SLIDE 10

Application #3: Semantic Scene Understanding

5

[Song et al. 2017]

slide-11
SLIDE 11

6

Motivating Applications: Semantic Scene Understanding

[Kelly et al. 2017, Kelly and Guerrero et al. 2018]

Application #4: 3D Asset Creation

slide-12
SLIDE 12

6

Motivating Applications: Semantic Scene Understanding

[Kelly et al. 2017, Kelly and Guerrero et al. 2018]

Application #4: 3D Asset Creation

slide-13
SLIDE 13

What’s Different in 3D?

7

slide-14
SLIDE 14
  • Number of Voxels grows as versus occupied surface

What’s Different in 3D?

7

O(n3)

<latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit>

O(n2)

<latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit>
slide-15
SLIDE 15
  • Number of Voxels grows as versus occupied surface

What’s Different in 3D?

7

O(n3)

<latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit><latexit sha1_base64="pPG/+UG2ht75Yz5Ajs7KOd+Azk=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZsYIui27cWcFeoB1LJs20sZlkSDJCGfoOblwo4tb3cefbmLaz0NYfAh/OYec8wcxZ9q47reTW1ldW9/Ibxa2tnd294r7B0tE0Vog0guVTvAmnImaMw2k7VhRHAaetYHQ9rbeqNJMinszjqkf4YFgISPYWKt5WxYP1dNeseRW3JnQMngZlCBTvVf86vYlSIqDOFY647nxsZPsTKMcDopdBNY0xGeEA7FgWOqPbT2bYTdGKdPgqlsk8YNHN/T6Q40nocBbYzwmaoF2tT879aJzHhpZ8yESeGCjL/KEw4MhJNT0d9pigxfGwBE8XsrogMscLE2IAKNgRv8eRlaJ5VPMt356XaVRZHo7gGMrgwQXU4Abq0ACj/AMr/DmSOfFeXc+5q05J5s5hD9yPn8AYnCOVQ=</latexit>

O(n2)

<latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit><latexit sha1_base64="63ak468AXCor9nBfBLOpfhlLeqM=">AB7XicbZDLSgMxFIbP1Fut6pLN8Ei1E2ZKQVdFt24s4K9QDuWTJpYzPJkGSEMvQd3LhQxK3v4863MW1noa0/BD7+cw45w9izrRx3W8nt7a+sbmV3y7s7O7tHxQPj1paJorQJpFcqk6ANeVM0KZhtNOrCiOAk7bwfh6Vm8/UaWZFPdmElM/wkPBQkawsVbrtiwequf9YsmtuHOhVfAyKEGmRr/41RtIkRUGMKx1l3PjY2fYmUY4XRa6CWaxpiM8ZB2LQocUe2n82n6Mw6AxRKZ8waO7+nkhxpPUkCmxnhM1IL9dm5n+1bmLCSz9lIk4MFWTxUZhwZCSanY4GTFi+MQCJorZXREZYWJsQEVbAje8smr0KpWPMt3tVL9KosjDydwCmXw4ALqcAMNaAKBR3iGV3hzpPivDsfi9ack80cwx85nz9g645U</latexit>
slide-16
SLIDE 16

Data Representation .. Many Possibilities!

8

points

slide-17
SLIDE 17

Data Representation .. Many Possibilities!

8

points voxels cells patches

slide-18
SLIDE 18

Challenges

9

slide-19
SLIDE 19
  • 1. Representation


Challenges

9

slide-20
SLIDE 20
  • 1. Representation

  • 2. Neighborhood information
  • who are the neighbouring elements
  • how are the elements ordered


Challenges

9

slide-21
SLIDE 21
  • 1. Representation

  • 2. Neighborhood information
  • who are the neighbouring elements
  • how are the elements ordered

  • 3. Extrinsic versus intrinsic representation


Challenges

9

slide-22
SLIDE 22
  • 1. Representation

  • 2. Neighborhood information
  • who are the neighbouring elements
  • how are the elements ordered

  • 3. Extrinsic versus intrinsic representation

  • 4. Simplicity versus memory/runtime tradeoff

Challenges

9

slide-23
SLIDE 23
  • Image-based

  • Volumetric

  • Surface-based

  • Point-based

Representation for 3D

10

slide-24
SLIDE 24
  • Image-based

  • Volumetric

  • Surface-based

  • Point-based

Representation for 3D

11

slide-25
SLIDE 25

Representation for 3D: Multi-view CNN

12

[Kalogerakis et al. 2015]

slide-26
SLIDE 26

Representation for 3D: Multi-view CNN

12

[Kalogerakis et al. 2015]

slide-27
SLIDE 27

Representation for 3D: Multi-view CNN

12

[Kalogerakis et al. 2015]

slide-28
SLIDE 28

Representation for 3D: Multi-view CNN

12

regular image analysis networks

[Kalogerakis et al. 2015]

slide-29
SLIDE 29

13

3DMV: Joint 3D-Multi-View Prediction for 3D Semantic Scene Segmentation

slide-30
SLIDE 30

Integrating View Information

14

slide-31
SLIDE 31

Representation for 3D: Local Multi-view CNN

15

[Huang et al. 2018]

slide-32
SLIDE 32

Representation for 3D: Local Multi-view CNN

15

Segmentation Correspondence Feature matching Predicting semantic functions

[Huang et al. 2018]

slide-33
SLIDE 33

Representation for 3D: Local Multi-view CNN

15

Segmentation Correspondence Feature matching Predicting semantic functions

[Huang et al. 2018]

localized renderings for point-wise features

slide-34
SLIDE 34

Tangent Convolutions

16

[Tatarchenko et al. 2018] loses information due to occlusion

slide-35
SLIDE 35

Tangent Convolutions

16

[Tatarchenko et al. 2018] loses information due to occlusion project to local patches
 (contrast with PCPNet construction)

slide-36
SLIDE 36

Tangent Convolutions

16

[Tatarchenko et al. 2018] loses information due to occlusion project to local patches
 (contrast with PCPNet construction)

slide-37
SLIDE 37

Dealing with Sparse Points

17

slide-38
SLIDE 38

Dealing with Sparse Points

18

slide-39
SLIDE 39

Improved Performance

19

slide-40
SLIDE 40
  • Image-based
  • PROS: directly use image networks, good performance
  • CONS: rendering is slow and memory-heavy, not very geometric
  • Volumetric

  • Point-based

  • Surface-based

Representation for 3D

20

slide-41
SLIDE 41
  • Image-based

  • Volumetric

  • Surface-based

  • Point-based

Representation for 3D

21

slide-42
SLIDE 42

3D CNNs : Direct Approach

22

[Xiao et al. 2014]

slide-43
SLIDE 43

VoxNet [Maturana et al. 15]

23

▸ Binary occupancy, density grid, etc.

rotational invariance

slide-44
SLIDE 44

Visualization of First Level Filters

24

slide-45
SLIDE 45

Visualization of First Level Filters

24

slide-46
SLIDE 46

Representation for 3D: Volumetric Deformation

25

[Yumer and Mitra 2016]

slide-47
SLIDE 47

Representation for 3D: Volumetric Deformation

25

[Yumer and Mitra 2016]

slide-48
SLIDE 48

Efficient Volumetric Datastructures

26

[Wang et al. 2017]

slide-49
SLIDE 49

Data Structure and CNN Operations

27 shuffled keys (encode position in space) labels (parent label → child indices) downsampling example (“where there is an octant, there is CNN computation”) faster neighbor access

slide-50
SLIDE 50

Data Structure and CNN Operations

27 shuffled keys (encode position in space) labels (parent label → child indices) downsampling example (“where there is an octant, there is CNN computation”) faster neighbor access

slide-51
SLIDE 51

Data Structure and CNN Operations

27 shuffled keys (encode position in space) labels (parent label → child indices) downsampling example (“where there is an octant, there is CNN computation”) faster neighbor access

slide-52
SLIDE 52

Efficient Volumetric Datastructures

28

[Hane et al. 2018]

  • nly generate non-empty voxels

Wang et al. 2017 Encoder Decoder/generator

slide-53
SLIDE 53

Efficient Volumetric Datastructures

29

[Hane et al. 2018]

slide-54
SLIDE 54

Lower Memory Footprint

30

slide-55
SLIDE 55

Adaptive O-CNN

image to planar patch-based shapes [Wang et al. 2018]

slide-56
SLIDE 56

First-order Patches

32

OCNN Adaptive OCNN

slide-57
SLIDE 57

Field Probing Neural Networks for 3D Data

33

[Li et al. 2016]

slide-58
SLIDE 58

Field Probing Neural Networks for 3D Data

33

[Li et al. 2016]

slide-59
SLIDE 59

Spatial Probes

34

slide-60
SLIDE 60

Spatial Probes

34

slide-61
SLIDE 61

Spatial Probes

34

slide-62
SLIDE 62

Method Details

35

slide-63
SLIDE 63

Method Details

35

slide-64
SLIDE 64

Method Details

35

slide-65
SLIDE 65

Method Details

35

slide-66
SLIDE 66
  • Image-based

  • Volumetric
  • PROS: adaptations of image networks
  • CONS: special layers for hierarchical datastructures, still too coarse

  • Surface-based

  • Point-based

Representation for 3D

36

slide-67
SLIDE 67
  • Image-based

  • Volumetric

  • Surface-based

  • Point-based

Representation for 3D

37

slide-68
SLIDE 68

Local/Global Parameterizations

38

slide-69
SLIDE 69

Local/Global Parameterizations

38

[Sinha et al. 2016]

Geometry Image

slide-70
SLIDE 70

Local/Global Parameterizations

38

[Sinha et al. 2016]

Geometry Image Metric Alignment (GWCNN)

[Ezuz et al. 2017]

slide-71
SLIDE 71

Shape Surfaces using Geometry Images

39

slide-72
SLIDE 72

Shape Surfaces using Geometry Images

39

slide-73
SLIDE 73

Shape Surfaces using Geometry Images

39

slide-74
SLIDE 74

Using Geodesic Patches: GCNN

40

[Masci et al. 2015]

slide-75
SLIDE 75

Using Geodesic Patches: GCNN

40

(f ? a)(x) := X

θ,r

a(✓ + ∆✓, r)(D(x)f)(r, ✓)

<latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit>

[Masci et al. 2015]

slide-76
SLIDE 76

Using Geodesic Patches: GCNN

40

(f ? a)(x) := X

θ,r

a(✓ + ∆✓, r)(D(x)f)(r, ✓)

<latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit>

[Masci et al. 2015]

slide-77
SLIDE 77

Using Geodesic Patches: GCNN

40

(f ? a)(x) := X

θ,r

a(✓ + ∆✓, r)(D(x)f)(r, ✓)

<latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit><latexit sha1_base64="TquvtCTwYBxPgA56TL/7lpYdxY0=">ACO3icbZDLSxBEMZ7NA8z5rHRYy6FizBDRGYkEAkIknjwaIKrws6y1PTWuI09D7prxGXY/yuX/BO5eIlh0jINf07s4hPgoafnxfFdX1pZVWlqPoh7e0/Ojxk6crz/zV5y9evuq8XjuxZW0k9WSpS3OWoiWtCuqxYk1nlSHMU02n6cWnmX96ScaqsjmSUWDHM8LlSmJ7KRh50uQWIZDWAIwVUIH/bAT2ydD5uEx8S4ZaAwYLhLSQHpB20HpjQDw7cWBYGZmshsNON9qO5gX3IW6hK9o6Gna+J6NS1jkVLDVa24+jigcNGlZS09RPaksVygs8p7DAnOyg2Z+xQ2nTKCrDTuFQxz9f+JBnNrJ3nqOnPksb3rzcSHvH7N2e6gUVMxVysSirNXAJsyBhpAxJ1hMHKI1yfwU5RoOSXdy+CyG+e/J9ONnZjh1/ftfd/9jGsSLeiA0RiFi8F/viUByJnpDiq7gWv8SN98376f32/ixal7x2Zl3cKu/vP2AxqYM=</latexit>

[Masci et al. 2015]

slide-78
SLIDE 78

GCNN Architecture

41

slide-79
SLIDE 79

Handling Rotational Ambiguity

42

slide-80
SLIDE 80

map 3D surface to 2D domain

Parameterization for Surface Analysis

43

[Maron et al. 2017]

slide-81
SLIDE 81

map 3D surface to 2D domain

Parameterization for Surface Analysis

43

[Maron et al. 2017]

slide-82
SLIDE 82

Parameterization for Surface Analysis

44

[Maron et al. 2017]

slide-83
SLIDE 83

Parameterization for Surface Analysis

44

[Maron et al. 2017]

slide-84
SLIDE 84
  • Map 3D surface to 2D domain

  • One such mapping: flat torus (seamless => translation-invariant)

  • Many mappings exists: sample a few and average result

  • Which functions to map? 


XYZ, normals, curvature, …

Parameterization for Surface Analysis

45

[Maron et al. 2017]

slide-85
SLIDE 85

Parameterization for Surface Analysis

46

[Maron et al. 2017]

slide-86
SLIDE 86

Texture Transfer (Parameterization + Alignment)

47

[Wang et al. 2016]

slide-87
SLIDE 87

AtlasNet for Surface Generation

48

[Groueix et al. 2018]

condition decoded points on 2D patches

slide-88
SLIDE 88

AtlasNet for Surface Generation

49

Latent representation can be inferred from images or point clouds [Groueix et al. 2018]

slide-89
SLIDE 89

AtlasNet for Surface Generation

50

Latent representation can be inferred from images or point clouds Quad Mesh is generated by mapping a regular grid in 2D domain to 3D points [Groueix et al. 2018]

slide-90
SLIDE 90

AtlasNet for Surface Generation

51

Latent representation can be inferred from images or point clouds texture coordinates come for free!!

slide-91
SLIDE 91
  • Image-based

  • Volumetric

  • Surface-based
  • PROS: parameterize + image networks (instrinsic representation)
  • CONS: suffers from parameterisation artefacts (local versus global distortion), 


requires good quality mesh


  • Point-based

Representation for 3D

52

slide-92
SLIDE 92
  • Image-based

  • Volumetric

  • Surface-based

  • Point-based

Representation for 3D

53

slide-93
SLIDE 93
  • Common representation: native representation

  • Easy to obtain from meshes, depth scans, laser scans

Representation for 3D: Point-based

54

slide-94
SLIDE 94
  • Common representation

  • Easy to obtain from meshes, depth scans, laser scans

  • Unstructured (e.g., any permutation of points gives same shape!)

In Original Representation

55

[Qi et al. 2017]

slide-95
SLIDE 95

PointNet for Point Cloud Analysis

56

permutation-invariant functions

[Qi et al. 2017]

slide-96
SLIDE 96

PointNet for Point Cloud Analysis

57

Use MLPs (h) and max-pooling (g) as simple symmetric functions

[Qi et al. 2017]

slide-97
SLIDE 97

PointNet Architecture

58

[Qi et al. 2017]

slide-98
SLIDE 98

PointNet for Point Cloud Analysis

59

slide-99
SLIDE 99

PointNet++

60

[Qi et al. 2018]

slide-100
SLIDE 100

PCPNet for Local Point Cloud Analysis

61

[Guerrero et al. 2018]

slide-101
SLIDE 101

PCPNet Architecture

62

slide-102
SLIDE 102

PCPNet Architecture

62

slide-103
SLIDE 103

PCPNet Architecture

62

slide-104
SLIDE 104

PCPNet Architecture

62

slide-105
SLIDE 105

PointNet for Point Cloud Synthesis

63

[Su et al. 2017]

slide-106
SLIDE 106

PointNet for Point Cloud Synthesis

63

[Su et al. 2017] Earth Mover Distance as loss function

slide-107
SLIDE 107

SIGGRAPH Asia Course CreativeAI: Deep Learning for Graphics 64

Course Information (slides/code/comments)

http://geometry.cs.ucl.ac.uk/creativeai/