Machine learning for energy landscapes Tristan Bereau Van t Hoff - - PowerPoint PPT Presentation

machine learning for energy landscapes
SMART_READER_LITE
LIVE PREVIEW

Machine learning for energy landscapes Tristan Bereau Van t Hoff - - PowerPoint PPT Presentation

Machine learning for energy landscapes Tristan Bereau Van t Hoff Institute for Molecular Sciences & Informatics Institute University of Amsterdam Introduction to kernel-based ML Today Incorporation of physical symmetries,


slide-1
SLIDE 1

Machine learning for energy landscapes

Tristan Bereau Van ’t Hoff Institute for Molecular Sciences & Informatics Institute University of Amsterdam

slide-2
SLIDE 2

Machine learning for energy landscapes

Tristan Bereau Van ’t Hoff Institute for Molecular Sciences & Informatics Institute University of Amsterdam

  • Introduction to kernel-based ML
  • Incorporation of physical symmetries, conservation laws

Today

slide-3
SLIDE 3

Machine learning for energy landscapes

Tristan Bereau Van ’t Hoff Institute for Molecular Sciences & Informatics Institute University of Amsterdam

  • Introduction to kernel-based ML
  • Incorporation of physical symmetries, conservation laws

Today Supervised ML in chemistry and materials science Thursday

Mohamad Moosavi Kevin Jablonka

slide-4
SLIDE 4

Molecular dynamics

2

slide-5
SLIDE 5

Molecular dynamics

2

F = ma

slide-6
SLIDE 6

Molecular dynamics

2

F = ma

Specify interparticle forces: “force field”

slide-7
SLIDE 7

Molecular dynamics

2

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

slide-8
SLIDE 8

Molecular dynamics

2

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

Wikipedia

slide-9
SLIDE 9

Molecular dynamics

2

fs ps ns s

µ

ms s

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

Wikipedia

slide-10
SLIDE 10

Molecular dynamics

2

fs ps ns s

µ

ms s integration time step

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

Wikipedia

slide-11
SLIDE 11

Molecular dynamics

2

fs ps ns s

µ

ms s integration time step Timescales of interest

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

Wikipedia

slide-12
SLIDE 12

Emergent complexity

Molecular dynamics

2

fs ps ns s

µ

ms s integration time step Timescales of interest

F = ma

Specify interparticle forces: “force field” Numerically integrate particle positions

Wikipedia

slide-13
SLIDE 13

Data as the 4th pillar of science

3

  • A. Agrawal and A. Choudhary APL Mater. 4 053208 (2016); https://www.bigmax.mpg.de
slide-14
SLIDE 14

Data science

4

Databases Machine learning Hardware

slide-15
SLIDE 15

Data science

4

Databases Machine learning Hardware

DeepMind Google

slide-16
SLIDE 16

Links to machine learning

5

Potential energy surface

Can we build a more accurate PES? Can we easily build an accurate PES? Can we make the numerical integration faster and/or more efficient? …

Interpolation of a high-dimensional function

slide-17
SLIDE 17

Teaser: let’s fit data points

6

Good Not good

slide-18
SLIDE 18

Teaser: let’s fit data points

6

Good Not good

slide-19
SLIDE 19

Teaser: let’s fit data points

6

Good Not good I want to be here

slide-20
SLIDE 20

Teaser: let’s fit data points

6

Good Not good I want to be here

ruthlessly stolen from A. von Lilienfeld

slide-21
SLIDE 21

Multivariate function approximation

7

Sparse data infer smooth function

slide-22
SLIDE 22

Multivariate function approximation

7

Sparse data infer smooth function “inverse problems”

slide-23
SLIDE 23

Multivariate function approximation

7

Sparse data infer smooth function

Rupp, International Journal of Quantum Chemistry 115 (2015)

property

Ê Ê Ê Ê Ê Ê

molecular structure

property descriptor sparse data ground truth interpolation

“inverse problems”

slide-24
SLIDE 24

Multivariate function approximation

7

Sparse data infer smooth function

Rupp, International Journal of Quantum Chemistry 115 (2015)

property

Ê Ê Ê Ê Ê Ê

molecular structure

property descriptor sparse data ground truth interpolation

“inverse problems” Regression: prediction of f : ℝd → ℝ based on noisy data points D = (X, y) = {xn, yn}N

n=1

yn = f(xn) + ε

What is f ? x y

slide-25
SLIDE 25

Kernel methods are vintage

8

Kernel Deep learning

  • needs a representation
  • linear algebra
  • can be efficient with small

data

{ {

  • learns the representation
  • complex mathematical

structure

  • data hungry
slide-26
SLIDE 26

Learning from experience

9

  • [7] Visualization of high-dimensional space

(genera (snake seems like overfitting, fitting to well, cf. Lecture 2)

  • Inductive (based on examples)

source: xkcd

slide-27
SLIDE 27

Extrapolation in machine learning

10

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>
slide-28
SLIDE 28

Extrapolation in machine learning

10

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>
slide-29
SLIDE 29

Bayesian inference

11

Prediction { Prior beliefs Sampled data

slide-30
SLIDE 30

Bayesian inference

11

Prediction { Prior beliefs Sampled data

f f*

training values test values

y

  • bservations
slide-31
SLIDE 31

Bayesian inference

11

Prediction { Prior beliefs Sampled data

f f*

training values test values

y

  • bservations

Bayes’ formula

p(f, f*|y) = p(y|f) p(f, f*) p(y)

likelihood prior posterior normalization

slide-32
SLIDE 32

Bayesian inference

12

Bayes’ formula

p(f, f*|y) = p(y|f) p(f, f*) p(y)

likelihood prior posterior normalization

  • +

+ +

  • Rasmussen, Advanced lectures on machine learning. Springer, 63-71 (2004)
slide-33
SLIDE 33

Gaussian processes

13

f ∼ GP(m, k)

<latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit>

Rasmussen, Advanced lectures on machine learning. Springer, 63-71 (2004)

slide-34
SLIDE 34

Gaussian processes

13

f ∼ GP(m, k)

<latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit>

random variable: value of the stochastic function at x mean covariance

f(x)

<latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit>

Σij = k(xi, xj)

<latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit>

µi = m(xi)

<latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit>

Rasmussen, Advanced lectures on machine learning. Springer, 63-71 (2004)

slide-35
SLIDE 35

Gaussian processes

13

f ∼ GP(m, k)

<latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit><latexit sha1_base64="Doab9rHmtdx101r/HIygjLjGtcs=">ACA3icbVBNS8NAEJ3Ur1q/ot70sliEClISEfRY9KDHCvYDmlI2027dDcJuxuhIX/4oXD4p49U9489+4aXPQ1gcDj/dmJnx5wp7TjfVmFpeWV1rbhe2tjc2t6xd/eaKkokoQ0S8Ui2fawoZyFtaKY5bceSYuFz2vJH15nfeqBSsSi81+OYdgUehCxgBGsj9eyDAHmKCeQJrIcE8/SmPkEVcYpGJz27FSdKdAicXNShz1nv3l9SOSCBpqwrFSHdeJdTfFUjPC6aTkJYrGmIzwgHYMDbGgqptOf5igY6P0URBJU6FGU/X3RIqFUmPhm87sVDXvZeJ/XifRwWU3ZWGcaBqS2aIg4UhHKAsE9ZmkRPOxIZhIZm5FZIglJtrEVjIhuPMvL5LmWdV1qu7debl2lcdRhEM4gq4cAE1uIU6NIDAIzDK7xZT9aL9W59zFoLVj6zD39gf4A08qWVA=</latexit>

random variable: value of the stochastic function at x mean covariance

f(x)

<latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit><latexit sha1_base64="Apt6dOASLoU0puF9XJna1LvMt4=">AB63icbVBNS8NAEJ3Ur1q/qh69LBahXkoigh6LXjxWMG2hDWz3bRLdzdhdyOW0L/gxYMiXv1D3vw3btoctPXBwO9GWbmhQln2rjut1NaW9/Y3CpvV3Z29/YPqodHbR2nilCfxDxW3RBrypmkvmG026iKBYhp51wcpv7nUeqNIvlg5kmNB4JFnECDa5FNWfzgfVmtw50CrxCtIDQq0BtWv/jAmqaDSEI617nluYoIMK8MIp7NKP9U0wWSCR7RnqcSC6iCb3zpDZ1YZoihWtqRBc/X3RIaF1lMR2k6BzVgve7n4n9dLTXQdZEwmqaGSLBZFKUcmRvnjaMgUJYZPLcFEMXsrImOsMDE2noNwVt+eZW0Lxqe2/DuL2vNmyKOMpzAKdTBgytowh20wAcCY3iGV3hzhPivDsfi9aSU8wcwx84nz9sX43R</latexit>

Σij = k(xi, xj)

<latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit><latexit sha1_base64="B38NMmyREyUD9XF68oO/vbLsTCw=">ACAXicbVDLSgMxFM3UV62vUTeCm2ARKkiZEUE3QtGNy4r2Ae0wZNJMmzbJDElGWoa68VfcuFDErX/hzr8xbWeh1QMXDufcy73BDGjSjvOl5VbWFxaXsmvFtbWNza37O2duoSiUkNRySzQApwqgNU01I81YEsQDRhrB4GriN+6JVDQSd3oUE4+jrqAhxUgbybf32re0y5Gf0v4YXsBaejT46HfP/LtolN2poB/iZuRIshQ9e3PdifCSdCY4aUarlOrL0USU0xI+NCO1EkRniAuqRlqECcKC+dfjCGh0bpwDCSpoSGU/XnRIq4UiMemE6OdE/NexPxP6+V6PDcS6mIE0Eni0KEwZ1BCdxwA6VBGs2MgRhSc2tEPeQRFib0AomBHf+5b+kflJ2nbJ7c1qsXGZx5ME+OAl4IzUAHXoApqAIMH8ARewKv1aD1b9b7rDVnZTO74Besj28OApX4</latexit>

µi = m(xi)

<latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit><latexit sha1_base64="D0Sep4kp8I3o60t3thmND8Z0x+o=">AB9XicbVBNSwMxEJ31s9avqkcvwSLUS9kVQS9C0YvHCvYD2nXJptk2NMkuSVYtS/+HFw+KePW/ePfmLZ70NYHA4/3ZpiZFyacaeO6387S8srq2npho7i5tb2zW9rb+o4VYQ2SMxj1Q6xpxJ2jDMcNpOFMUi5LQVDq8nfuBKs1ieWdGCfUF7ksWMYKNle67Ig0YukSi8hSwk6BUdqvuFGiReDkpQ456UPrq9mKSCioN4Vjrjucmxs+wMoxwOi52U0TIa4TzuWSiyo9rPp1WN0bJUeimJlSxo0VX9PZFhoPRKh7RTYDPS8NxH/8zqpiS78jMkNVS2aIo5cjEaBIB6jFieEjSzBRzN6KyArTIwNqmhD8OZfXiTN06rnVr3bs3LtKo+jAIdwBXw4BxqcAN1aABc/wCm/Oo/PivDsfs9YlJ585gD9wPn8A4WRhw=</latexit>

Rasmussen, Advanced lectures on machine learning. Springer, 63-71 (2004)

Kα = p

kernel target property

slide-36
SLIDE 36

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

slide-37
SLIDE 37

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

slide-38
SLIDE 38

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

in general: m ⌧ N

slide-39
SLIDE 39

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

in general: m ⌧ N

N N N N

slide-40
SLIDE 40

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

in general: m ⌧ N

N N N N

Kij =Kij(xi, xj) =Kij(|xi − xj|) = exp ✓ −|xi − xj| σ ◆

slide-41
SLIDE 41

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

in general: m ⌧ N

N N N N

Kij =Kij(xi, xj) =Kij(|xi − xj|) = exp ✓ −|xi − xj| σ ◆

error # training points

k e r n e l linear

slide-42
SLIDE 42

Linear-ridge vs kernel-ridge regression

14

Linear-ridge regression kernel-ridge regression (ML)

Ax = b

Kα = p

N m m N

in general: m ⌧ N

N N N N

Kij =Kij(xi, xj) =Kij(|xi − xj|) = exp ✓ −|xi − xj| σ ◆

error # training points

k e r n e l linear

  • +

+ +

  • measures similarity

between data points

slide-43
SLIDE 43

Kernel machine learning 101

15

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 ) 1) Define representation and kernel

slide-44
SLIDE 44

Kernel machine learning 101

15

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 ) 1) Define representation and kernel

Kα = U α = (K + λ𝕁)−1U

2) Train your model:

(K + λI) α = U

Regularization: “hyperparameter” scales noise level Inverse is ill defined Optimize weight coefficients on training set

slide-45
SLIDE 45

Kernel machine learning 101

15

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 ) 1) Define representation and kernel

Training points Query sample Predicted energy

U(r) = ∑

i

αiK(r*

i , r)

3) Make a prediction:

Kα = U α = (K + λ𝕁)−1U

2) Train your model:

(K + λI) α = U

Regularization: “hyperparameter” scales noise level Inverse is ill defined Optimize weight coefficients on training set

slide-46
SLIDE 46

Extrapolation in machine learning

16

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 )

U(r) = ∑

i

αiK(r*

i , r)

Define kernel

Training points Query sample Predicted energy

α = (K + λ𝕁)−1U

Make a prediction: Train your model:

slide-47
SLIDE 47

Extrapolation in machine learning

16

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 )

U(r) = ∑

i

αiK(r*

i , r)

Define kernel

Training points Query sample Predicted energy

α = (K + λ𝕁)−1U

Make a prediction: Train your model: Conformational space missing from training

slide-48
SLIDE 48

Extrapolation in machine learning

16

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

Uniform prior

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

K(r, r′ ) = exp (− (r − r′ )2 2σ2 )

U(r) = ∑

i

αiK(r*

i , r)

Define kernel

Training points Query sample Predicted energy

α = (K + λ𝕁)−1U

Make a prediction: Train your model: Conformational space missing from training

slide-49
SLIDE 49

0.2 0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤]

ULJ(r) Training points Prediction

Conformational space missing from training

Linking conformational and interpolation spaces

17

  • [7] Visualization of high-dimensional space

(genera (snake seems like overfitting, fitting to well, cf. Lecture 2)

  • ML training set size is limited

(kernels!) Use physics to reduce the interpolation space

slide-50
SLIDE 50

Symmetries and conservation laws

slide-51
SLIDE 51

Mechanics 101: Principle of least action

19

𝒯[x(t)] = ∫

t2 t1

dt L[x(t), · x(t), t]

action microtrajectory Lagrangian L = T − V kinetic energy potential energy

slide-52
SLIDE 52

Mechanics 101: Principle of least action

19

𝒯[x(t)] = ∫

t2 t1

dt L[x(t), · x(t), t] Hamilton’s principle: system minimizes action (variational principle)

· 𝒯[x*(t)] = 0

action microtrajectory Lagrangian L = T − V kinetic energy potential energy

slide-53
SLIDE 53

Mechanics 101: Principle of least action

19

𝒯[x(t)] = ∫

t2 t1

dt L[x(t), · x(t), t] Hamilton’s principle: system minimizes action (variational principle)

· 𝒯[x*(t)] = 0

action microtrajectory Lagrangian

stationarity under small perturbations leads to Euler-Lagrange equations

δ𝒯 = ∫

t2 t1

dt L(x* + ε, · x* + · ε, t) − L(x*, · x*, t) = ∫

t2 t1

dt (ε ∂L ∂x + · ε ∂L ∂x ) = ∫

t2 t1

dt (ε ∂L ∂x − ε d dt ∂L ∂· x ) = 0

L = T − V kinetic energy potential energy

slide-54
SLIDE 54

Mechanics 101: Principle of least action

19

𝒯[x(t)] = ∫

t2 t1

dt L[x(t), · x(t), t] Hamilton’s principle: system minimizes action (variational principle)

· 𝒯[x*(t)] = 0

action microtrajectory Lagrangian

stationarity under small perturbations leads to Euler-Lagrange equations

δ𝒯 = ∫

t2 t1

dt L(x* + ε, · x* + · ε, t) − L(x*, · x*, t) = ∫

t2 t1

dt (ε ∂L ∂x + · ε ∂L ∂x ) = ∫

t2 t1

dt (ε ∂L ∂x − ε d dt ∂L ∂· x ) = 0

integration by parts &

ε(t1) = ε(t2) = 0

L = T − V kinetic energy potential energy

slide-55
SLIDE 55

From symmetries, to invariants, to conserved quantities

20

𝒯[x(t), y(t), z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz

slide-56
SLIDE 56

From symmetries, to invariants, to conserved quantities

20

𝒯[x(t), y(t), z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz Introduce constant translations along and :

x y

𝒯[x(t) + x0, y(t) + y0, z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz = 𝒯[x(t), y(t), z(t)]

slide-57
SLIDE 57

From symmetries, to invariants, to conserved quantities

20

𝒯[x(t), y(t), z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz Introduce constant translations along and :

x y

𝒯[x(t) + x0, y(t) + y0, z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz = 𝒯[x(t), y(t), z(t)] (Translational) symmetry leaves the action invariant. It leaves the Euler-Lagrange equation unchanged: ∂L ∂x − d dt ∂L ∂· x = 0

slide-58
SLIDE 58

From symmetries, to invariants, to conserved quantities

20

𝒯[x(t), y(t), z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz Introduce constant translations along and :

x y

𝒯[x(t) + x0, y(t) + y0, z(t)] = ∫ dt m 2 (· x2 + · y2 + · z2) − mgz = 𝒯[x(t), y(t), z(t)] (Translational) symmetry leaves the action invariant. It leaves the Euler-Lagrange equation unchanged: ∂L ∂x − d dt ∂L ∂· x = 0 ∂L ∂x = 0 ∂L ∂· x = m · x = const . Translational invariance implies linear momentum conversation

slide-59
SLIDE 59

From symmetries to conserved quantities (cont’d)

21

𝒯[r(t)] = ∫ dt m 2 · r2 − V(r) Rotational symmetry Apply transformation r → r′ where r′ (t) = Rr(t) = r(t) + α × r(t) One can show that 𝒯[r(t) + α × r(t)] = 𝒯[r(t)] Conservation of angular momentum Time translation Apply transformation r → r′ where r′ (t + ϵ) = r(t) One can show that 𝒯[r′ (t + ϵ)] = 𝒯[r(t)] Conservation of energy

(up to a boundary term) Bañados & Reyes, arXiv:1601.03616v3

slide-60
SLIDE 60

Noether’s theorem

22

3 examples:

  • Translational symmetry: Linear momentum conservation
  • Rotational symmetry: Angular momentum conservation
  • Time translation: Energy conservation

To every differentiable symmetry generated by local actions there corresponds a conserved quantity

slide-61
SLIDE 61

Noether’s theorem

22

3 examples:

  • Translational symmetry: Linear momentum conservation
  • Rotational symmetry: Angular momentum conservation
  • Time translation: Energy conservation

To every differentiable symmetry generated by local actions there corresponds a conserved quantity

Ceriotti, JCP 150 (2019)

slide-62
SLIDE 62

2 ways of encoding symmetries:

  • Representation
  • ML model f
slide-63
SLIDE 63

Encoding symmetries in the representation

slide-64
SLIDE 64

Translational and rotational symmetries

25

Behler-Parrinello Coulomb matrix

G1

i

X

all ji

eRijRs2fcRij:

G2

i 21 X all j;ki

1 cosijk eR2

ijR2 ikR2 jkfcRijfcRikfcRjk;

Behler & Parrinello, Phys Rev Lett 98 (2007) Cartesian coordinates Symmetry functions Distances Angles

Rupp, Tkatchenko, Müller, von Lilienfeld, Phys Rev Lett, 108 (2012)

  • =

∀ = | − | ∀ ≠ ⎧ ⎨ ⎪ ⎪ ⎩ ⎪ ⎪ C Z i j ZZ i j R R 1 2

ij i i j i j 2.4

Distances

slide-65
SLIDE 65

Representation: the Coulomb matrix

26

Hansen et al., J Chem Theory Comput, 9 (2013)

Symmetries of the representation should emulate symmetries of the system

  • 1. Translation
  • 2. Rotations
  • 3. Mirror reflection

~ Coulomb’s law E = qiqj |ri − rj|

slide-66
SLIDE 66

Representation: the Coulomb matrix

26

Hansen et al., J Chem Theory Comput, 9 (2013)

Symmetries of the representation should emulate symmetries of the system

  • 1. Translation
  • 2. Rotations
  • 3. Mirror reflection

Problems:

  • 1. Dimensionality from # atoms

2. Ordering of the atoms ~ Coulomb’s law E = qiqj |ri − rj|

slide-67
SLIDE 67

Optimizing the representation links to the physics

27

Huang and von Lilienfeld, J Chem Phys 145 (2016)

Learning a Gaussian function Learning atomization energies

slide-68
SLIDE 68

Optimizing the representation links to the physics

27

Huang and von Lilienfeld, J Chem Phys 145 (2016)

Learning a Gaussian function Learning atomization energies Non-unique representation!

slide-69
SLIDE 69

Optimizing the representation links to the physics

27

Huang and von Lilienfeld, J Chem Phys 145 (2016)

Learning a Gaussian function Learning atomization energies Non-unique representation!

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>
slide-70
SLIDE 70

Optimizing the representation links to the physics

27

Huang and von Lilienfeld, J Chem Phys 145 (2016)

Learning a Gaussian function Learning atomization energies Non-unique representation!

ULJ(r) = 4✏ ⇣ r ⌘12 − ⇣ r ⌘6

<latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit><latexit sha1_base64="G3h4pfP2Yj/buXVPhz0Vw6uTCrc=">ACWHichVFNa9wFJSd7+3XJj328shS2By62CG0zaEQ2kspPaRQN4G1u8ja510RSTbSc2Ex/pM5FEL+Sg7R7vrQJoUOCA0z85A0yislHUXRTRBubG5t7+zu9Z48fb8RX/4IcrayswEaUq7WXOHSpMCFJCi8ri1znCi/yq09L/+IXWidL850WFWaz4wspODkpUm/SiZNajV8/dIO7RF8gBNIsXJSlQZShQWN19sQ0sJyAamTM83BQmrlbE5HP5v4uIU3/wu9bTueTfqDaBStAI9J3JEB63A+6V+n01LUGg0JxZ0bx1FWcMtSaGw7aW1w4qLKz7DsaeGa3RZs2qmhdemUJRWr8MwUr9c6Lh2rmFzn1Sc5q7h95S/Jc3rql4nzXSVDWhEeuDiloBlbCsGabSoiC18IQLK/1dQcy5L4f8Z/R8CfHDJz8myfHodBR/Oxmcfeza2GWv2CEbspi9Y2fsMztnCRPsN7sLtoLt4DYMwp1wbx0Ng27mJfsL4cE9X2yxCQ=</latexit>

Empirically test for relevant physics

slide-71
SLIDE 71

Encoding symmetries in the ML model

slide-72
SLIDE 72

Encoding symmetries in ML models using group theory

29

Risi Kondor, Group theoretical methods in machine learning, PhD thesis (2008)

Action of group on input sample

G

x ↦ Tg(x)

slide-73
SLIDE 73

Encoding symmetries in ML models using group theory

29

Risi Kondor, Group theoretical methods in machine learning, PhD thesis (2008)

Action of group on input sample

G

x ↦ Tg(x)

Can we find a kernel that is invariant to this group action?

f(Tg(x)) = f(x)∀g ∈ G k(x, x′ ) = k(Tg(x), Tg′ (x′ ))

slide-74
SLIDE 74

Encoding symmetries in ML models using group theory

29

Risi Kondor, Group theoretical methods in machine learning, PhD thesis (2008)

Action of group on input sample

G

x ↦ Tg(x)

Can we find a kernel that is invariant to this group action?

f(Tg(x)) = f(x)∀g ∈ G k(x, x′ ) = k(Tg(x), Tg′ (x′ ))

To ensure invariance, symmetrize the kernel

kG(x, x′ ) = 1 |G| ∑

g∈G

k(x, Tg(x′ ))

slide-75
SLIDE 75

Example of symmetrized kernel

30

Vanilla/naïve kernel

k(ρ, ρ′ ) = ∫ drρ(r)ρ(r′ )

slide-76
SLIDE 76

Example of symmetrized kernel

30

Vanilla/naïve kernel

k(ρ, ρ′ ) = ∫ drρ(r)ρ(r′ )

SOAP kernel/representation*

Bartók, Kondor, Csányi, Phys Rev B 87 (2013) *Smooth Overlap of Atomic Positions: is a distance metric between two samples

k(ρ, ρ0) = Z

  • S(ρ, ˆ

Rρ0)

  • n

d ˆ R = = Z d ˆ R

  • Z

ρ(r)ρ0( ˆ Rr)dr

  • n

̂ R

slide-77
SLIDE 77

Invariant vs. covariant properties

31

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

Tensorial property (e.g., dipole moment, force) rotates with the sample

slide-78
SLIDE 78

Invariant vs. covariant properties

31

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

Tensorial property (e.g., dipole moment, force) rotates with the sample

slide-79
SLIDE 79

Invariant vs. covariant properties

31

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

Tensorial property (e.g., dipole moment, force) rotates with the sample

slide-80
SLIDE 80

Invariant vs. covariant properties

31

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

Tensorial property (e.g., dipole moment, force) rotates with the sample “Build kernel so as to encode the rotational properties of the target property”

slide-81
SLIDE 81

Covariant kernels

32

Encode rotational properties of the target property in the kernel

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

slide-82
SLIDE 82

Covariant kernels

32

Encode rotational properties of the target property in the kernel

ˆ f(Sρ | D) = Sˆ f(ρ | D).

Descriptor Training data Transformation (rotation/inversion) Force prediction

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

slide-83
SLIDE 83

Covariant kernels

32

Encode rotational properties of the target property in the kernel

ˆ f(Sρ | D) = Sˆ f(ρ | D).

Descriptor Training data Transformation (rotation/inversion) Force prediction “Transform the configuration, and the prediction transforms with it”

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

slide-84
SLIDE 84

Covariant kernels

33

K(Sρ, S0ρ0) = SK(ρ, ρ0)S0T.

Configurations Transformations (rotation/inversion) Kernel

Glielmo, Sollich, De Vita, Phys Rev B 95 (2017)

K(ρ,ρ0) =

Z

dRRkb(ρ,Rρ0)

Kµ(ρ, ρ0) = 1 L X

ij

φ(ri, rj)ri ⌦ r0T

j ,

⇣ ⌘

ri rj

slide-85
SLIDE 85

Conclusions

34

Take advantage of symmetries Noether: symmetry leads to conservation law

  • [7] Visualization of high-dimensional space

(genera (snake seems like overfitting, fitting to well, cf. Lecture 2)

  • 0.2
0.4 0.6 0.8 1.0 1.2 1.4 1.6 1.8 2.0 r [æ] 5 10 15 20 25 30 35 ULJ(r) [≤] ULJ(r) Training points Prediction

Extrapolation in ML models of energy landscapes Can lead to catastrophic physics

K(Sρ, S0ρ0) = SK(ρ, ρ0)S0T.

Build symmetries in ML model Work with subset of kernels that a priori satisfy conservation law