Electron Density-Based Machine Learning for Accelerating Quantum - - PowerPoint PPT Presentation

electron density based machine learning for accelerating
SMART_READER_LITE
LIVE PREVIEW

Electron Density-Based Machine Learning for Accelerating Quantum - - PowerPoint PPT Presentation

Electron Density-Based Machine Learning for Accelerating Quantum Calculations Joshua Lansford and D. G. Vlachos 2019 Blue Waters Symposium, Sunriver OR June 5, 2019 Materials Gap in Catalysis: Theory and Experiments [2] [1] 8.29 5.75 Pt Au


slide-1
SLIDE 1

Electron Density-Based Machine Learning for Accelerating Quantum Calculations

Joshua Lansford and D. G. Vlachos

2019 Blue Waters Symposium, Sunriver OR

June 5, 2019

slide-2
SLIDE 2

[1] J. Feng, and J. L. Lansford et al. AIP Adv. 8, 035021 (2018). [2] M. Núñez, J. L. Lansford, and D.G. Vlachos, Nat. Chem. – Under Review [3] Liu et al., ACS Cat (2018) [4] C. A. Koval et al., Basic Research Needs: Catalysis Science to Transform Energy Technologies 2017).

2

Materials Gap in Catalysis: Theory and Experiments

Physics + Data science[4] is needed to understand both dynamic changes[4] and static properties of complex materials ! " 𝐼|𝛺 = ⟩ 𝐹|𝛺

(111) (100)

Pt Au

*OOH form. *OH des. 8.29 5.75

[1] [2] [3]

?

[3]

slide-3
SLIDE 3

[1] K. Ding et al., Science 350, 189 (2015). [2] F. Huth et al., Nature Materials 10, 352 (2011). [3] J. F. Li et al., Nature 464, 392 (2010).

3

Materials Gap in Catalysis: Vibrational Spectroscopy

Infrared (IR) spectroscopy of dispersed Pt atoms and nanoparticles for CO oxidation[1] 2-D Infrared (IR) spectroscopy

  • f a semiconductor[2]
  • Vibrational spectroscopy is a precise (<1% uncertainties)

surface technique that is rapidly advancing.

  • Spectra are relatively insensitive to temperature and can

be used in-situ or operando[3]

slide-4
SLIDE 4

[1] Dabo et al., J. Am. Chem. Soc. 129 (2007) [2] RK Brandt et al. Surf Sci. 271 (1992)

The Argument for CO as a Probe Molecule

4

  • Exp. CO Frequency on Pt(111)[1]

Site-type

  • Freq. [cm-1]

atop 2070 bridge 1830 fcc 1760

  • C-O frequency depends on both site-type and site coordination
  • C-O has well defined peaks that can be visually identified by the human eye and brain
  • There are no quantitative methods to determine surface structure from vibrational spectra

C-O frequency (atop bound CO) vs. Pt CN (■) Low coverage adsorption, at various temperatures[2] ω=1,997 + 10.0CN cm-1

slide-5
SLIDE 5

Outline

5

Goal

  • Determine local microstructure of Pt nanoparticles from experimental

vibrational spectra using CO as a probe molecule

Plan

  • Assess accuracy of DFT in recreating IR spectra
  • Provide an overview of surrogate modeling
  • Combine data science techniques with expert knowledge to better

understand data and improve sampling, highlighting data visualization

  • Illustrate key details of the surrogate models for generating synthetic IR

spectra and learning the corresponding local structure

  • Show model results and provide an application to experimental

vibrational spectra

slide-6
SLIDE 6

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

Frequency Scales with Generalized Coordination Number

6

C-O stretch frequencies for CO at an atop site Generalized Coordination Number (GCN) is a coordination number weighted by second nearest neighbors[1]

[1] F. Calle-Vallejo et al., Angew. Chem. Int.

  • Ed. 53, 8316 (2014).

C-O frequency is a descriptor of local structure but in experiments we must untangle spectra generated from many CO molecules on many different GCNs – We need intensities!

slide-7
SLIDE 7

[1] Porezag and Pederson. Phys. Rev. B. 54, 11 (1996) [2] G. Kresse and J. Furthmüller, Phys. Rev. B. 54, 11169 (1996). [3] T. A. Manz and N. G. Limas, RSC Advances 6, 47771 (2016).

7

Compute Intensities using the derivative in dipole moment μ (dynamic dipole moment) with respect to the normal mode displacement (Q).[1]

  • Normal mode (hessian of the forces) for identifying peak locations (frequencies)
  • VASP[2] for computing electron densities
  • CHARGEMOL[3] for integrating over the electron densities to get the dipoles
  • Matrix product of the dipole Jacobian and the normal mode vectors to compute intensities

𝑒𝝂 𝑒𝑅- = .

/01 23 𝜖𝝂

𝜖𝑆/ 𝑌/- Methods: Generating Spectra from First Principles

slide-8
SLIDE 8

[1] I. Dabo et al., J. Am. Chem. Soc. 129, 11045 (2007). [2] J. L. Lansford, A. V. Mironenko, and D. G. Vlachos, Nature Communications 8, 1842 (2017). Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

IR Spectra of CO on Pt(111) with a c(4x2) Overlayer

8

DFT generated spectra reproduces experimental spectra (frequencies and intensities)

1) Existing literature supports accuracy of measuring and computing frequencies on surfaces[1,2]

0.25 ML at the atop position 0.25 ML at the bridge position

Pt-CO stretch frequency 2) There is not always a

  • ne-to-one

correspondence between intensity and concentration 3) There are more frequencies than just the C-O stretch

slide-9
SLIDE 9

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

9

Surrogate Model Overview: Iterative Design

Simulated Spectra Surrogate Spectral Model Multinomial Regression Local Structure DFT Data Synthesizing Spectra

  • Outlier removal
  • Harmonic approx.
  • Lateral

interactions

  • Spectral mixing
  • Convolution
slide-10
SLIDE 10

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

10

Surrogate Model Overview: Iterative Design

Simulated Spectra Surrogate Spectral Model Surrogate Structure Model Local Structure DFT Data

slide-11
SLIDE 11

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

11

Outliers inhibit learning both because they result in large gradients during training and because there are not enough samples with similar feature values to predict them.

Data Visualization: Site-type Data

slide-12
SLIDE 12

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

12

Removing samples that are not local minima on the potential energy surface applies expert knowledge to remove unphysical outliers

Data Visualization: Site-type Data

slide-13
SLIDE 13

13 Bias Node X1 Xn a1 Bias Node ak

Input Layer (501 intensities) Hidden Layer(s) 𝒈𝒋 = 𝒇𝒃𝑼𝒙𝒋 ∑𝒍0𝟐

𝑳0𝟓 𝒇𝒃𝑼𝒙𝒍

Output Layer (site-type/GCN range)

f1(X) f2(X) f3(X) f4(X)

% Atop % Bridge % 3-fold % 4-fold

Surrogate Model Details: The Activation Function

slide-14
SLIDE 14

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted [1] L. Hou, C.-P. Yu, and D. Samaras, arXiv preprint arXiv:1611.05916 (2016).

14

Wasserstein loss with [0,0,0,1] for three probability sets compared to the single-valued kl-divergence

Surrogate Model Details: The Loss Function

Kl-divergence compares probabilities between two distributions at each index (pi and ti) while Wasserstein compares the cumulative probability at each index (CDF(P)i and CDF(T)i) and takes into account inter-class relationships[1]

𝑋C = .

D01 E

.

  • 01

D

𝑞- − .

  • 01

D

𝑢-

C

slide-15
SLIDE 15

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

15

3-fold and 4-fold Bridge Atop

Model Results: Site-type Histogram

slide-16
SLIDE 16

16

GCN group GCN values 7 5.5-6.1 8 6.1-6.6 9 6.6-7.2 10 7.2-7.9 11 7.9-8.5 12 High Coverage Low-index planes

Model Results: Generalized Coordination Histograms

GCN group GCN values 1 0-1.8 2 1.8-2.8 3 2.8-3.7 4 3.7-4.3 5 4.3-4.9 6 4.9-5.5 GCN Groups Determined by Clustering

slide-17
SLIDE 17

[1] H. Steininger, S. Lehwald, and H. Ibach, Surf. Sci. 123, 264 (1982). [2] C. Klünker et al., Surf. Sci. 360, 104 (1996). [3] P. Zhang et al., J. Phys. Chem. C 113, 17518 (2009). [4] W. Chen et al., J. Phys. Chem. B 107, 9808 (2003).

17

  • r

Pt(111) c(4x2) 0.5 ML[1] Pt(110) 1 ML[2] STM of 55 nm Au @0.7 nm Pt/Pt[3]

*A voltage of -0.1 V will only shift the C-O frequency by 2.9 cm-1.[4] CO saturated 0.5 M H2SO4 at

  • 0.1 V*

UHV

Experimental Application: Spectra from Literature

slide-18
SLIDE 18

[1] H. Steininger, S. Lehwald, and H. Ibach, Surf. Sci. 123, 264 (1982). [2] S. Karakatsani, et al., Surf. Sci. 606, 383 (2012). [3] P. Zhang et al., J. Phys. Chem. C 113, 17518 (2009).

18

Experimental Application: Expert Information

Pt(111) c(4x2) 0.5 ML[1] A combination of LEED and TPD measurements tell us that at 0.5 ML this c(4x2) overlayer results in 50% atop and 50% ridge sites. At high pressures this spectra could correspond 62% atop and 38% bridge. Pt(111) 0.17 ML[1] Trends in LEED studies suggest at low coverages almost all CO is adsorbed at atop sites on Pt(111) Pt(110) 1.0 ML[2] Pt(110) can undergo reconstruction, however, at the maximum coverage of 1 ML it is observed to deconstruct with all CO in the atop position.

  • r

STM of 55 nm Au @0.7 nm Pt/Pt[3] Because the nanoparticle system is in liquid, coverages are low. This would preclude ordered high spatial overlayers of the low-index

  • planes. The uniformity of the nanoparticles would suggest that most
  • ccupied sites are at a low-index plane of the same site.
slide-19
SLIDE 19

Infrared-spectroscopy Data- and Physics-driven Machine Learning Reveal Surface Microstructure of Complex Materials - submitted

19

High coverage low index planes Pt(111) low coverage The supposed high-coverage Pt(110) surface has significant 4-fold contribution. This is unexpected.

Experimental Application: Predicted Histograms

slide-20
SLIDE 20

20 Slight bump Extended tail

The parts of the spectra resulting in predicted adsorption at 4-fold sites for Pt(110) (yellow line) is likely due to the extended tail below 400 cm-1 and the slight bump at 1700 cm-1

Experimental Application: A New Insight

slide-21
SLIDE 21

Conclusions

21

  • We are able to synthesize spectra with a surrogate

model efficiently

  • We successfully implemented a multinomial neural

network to predict the proportion of occupied site- types and GCN histograms of synthetic spectra

  • We demonstrated the applicability of this model to

experimental data

  • We iteratively used data science tools and

philosophies with expert knowledge to identify areas

  • f our combined {target, feature} space that needed

more data and to generalize our model to high coverage systems with varying convoluting functions

slide-22
SLIDE 22

[1] J. Feng, and J. L. Lansford et al. AIP Adv. 8, 035021 (2018). [2] M. Núñez, J. L. Lansford, and D.G. Vlachos, Nat. Chem. (2019) [3] M. Salciccioli et al., Chem. Eng. Sci. 66, 4319 (2011). [4] H. Pritchard, J. Phys. Chem. A. (2005).

22

Future Work on Blue Waters

! " 𝐼|𝛺 = ⟩ 𝐹|𝛺

(111) (100)

Pt Au

*OOH form. *OH des. 8.29 5.75

[1] [2]

𝑠

  • = 𝑙- K

L

𝐷

L

r = κ ∗ 𝑙Q𝑈 ℎ ∗ exp −∆𝐻-

𝑙Q𝑈 K

L

𝐷

L

Transition State Theory[3,4]

slide-23
SLIDE 23

23

Transition State Theory and the Potential Energy Surface

local minima Reactants local minima Products Transition State Saddle Point ΔH‡ H: Enthalpy

slide-24
SLIDE 24

[1] D.A. McQarrie, Statistical Mechanics, University Science Books (2000).

24

Transition State Theory and the Potential Energy Surface

local minima local minima Transition State Reactants Products Saddle Point ΔG‡ 𝐻 = 𝐼 − 𝑈𝑇 [1] 𝑇 = 𝑔𝑣𝑜𝑑𝑢𝑗𝑝𝑜(𝑤𝑗𝑐𝑠𝑏𝑢𝑗𝑝𝑜𝑡) H: Enthalpy G: Gibbs Energy S: Entropy

slide-25
SLIDE 25

25

Transition State Theory Computational Complexity

𝑃 𝑂2 𝑃 𝑂2 𝑃 𝑂2 𝑃 𝑂i

Vibrational Calculations

slide-26
SLIDE 26

[1] W. Kohn, Rev. of Mod. Phys. 71, 1253 (1999). [2] Anderson, A.B. and R.G. Parr, J. Chem. Phys., (1970).

26

Outline of Future Work

Issues with the current technique for addressing the materials gap

  • 1. Need more data
  • 2. Frequency calculations are very slow!
  • The electronic density distribution completely specifies the energy of a chemical system’s

state and can be calculated using density functional theory (DFT) based on the Kohn Sham equation[1]

  • Frequencies at equilibrium can be computed directly from equilibrium (ground state)

electron density[2] Combining geometric and electronic density information we should be able to generate a chemical representation that facilitates extrapolation.

  • 1. Need more data – automatic structure generation for generative adversarial networks
  • 2. Frequency calculations are very slow! – deep neural networks trained on electron density
slide-27
SLIDE 27

Funding from DARPA and RAPID and funding from the Blue Waters Graduate Fellowship for the next phase of my work Professor Dionisios G. Vlachos for advisement

27

The Vlachos group

Acknowledgements