Neural Network Model Chemistries
RIKEN 3/25/2017 John Parkhill Department of Chemistry and Biochemistry, Notre Dame
Neural Network Model Chemistries RIKEN 3/25/2017 John Parkhill - - PowerPoint PPT Presentation
Neural Network Model Chemistries RIKEN 3/25/2017 John Parkhill Department of Chemistry and Biochemistry, Notre Dame Parkhill Group 5 Students (K Yao) Research Areas: Non-equilibrium realtime 35 . 5 30 quantum electronic dynamics.
RIKEN 3/25/2017 John Parkhill Department of Chemistry and Biochemistry, Notre Dame
5 Students (K Yao) Research Areas: Non-equilibrium realtime quantum electronic dynamics. Applications of Neural Networks to electronic structure theory.
0. 2.5 5. 7.5 10. 5 10 15 20 25 (eV) t (fs) . . . . . . . . . . . . . . . . . . 200 250 300 350 400 450 500 550 5 10 15 20 25 30 35 Wavelength (nm) Absorbance (au) 20 25 Absorbance (au) (a) (b)
d dtγp = 1 2{γsγtV (s)pr
st V (t)st prηpηr + γsγtV (s)st prV (t)pr st ηpηr
−γpγtV (s)pt
rsV (t)rs ptηrηs − γpγtV (s)rs ptV (t)pt rsηrηs}
0 ps 0.5 ps 1 ps 1.5 ps 2 ps 2.5 ps 3 ps 4 ps 6 ps 8 ps 10 ps 12 ps 16 ps 20 ps
!4 !2 2 4 ΔA (a) (b)
Realtime Transient Abs. spectra (Expt, Vauthey left)
Electronic timescale ~ 4*10-4 fs 1ps = 107 fock builds 114 days at 1 build/second Dynamics is sequential and parallelization in time is weak.
Force-Fields
Today
Density Functional Ab-Initio 2000+ Atoms <100 Atoms <40 Atoms Force-Fields Density Functional Ab-Initio 2000+ Atoms <100 Atoms <40 Atoms
Soon
Neural Network
Hemoglobin : 16000 Daltons ~ 16000 orbitals vs Limit of Most KS ~ 3000 With orbital free-DFT you only need 1
But you must know a mysterious ‘functional’ which maps the density to kinetic energy….
500 1000 1500 2000 100 200 300 400 Number of Al atoms Time per SCF Orbital-Free Kohn-Sham
Ekin = Z T(n(r))dr
1.8 2.0 2.2 2.4 2.6 2.8
0.0 N-N distance HbohrL Energy HhartreeL TW4 LLP GE2 TF KS
D Garcia-Aldea, JE Alvarellos Journal of chemical physics 127.14 (2007): 144109.
TGGA = Z τTF(n(r))F ✓|rn(r)| n(r)3/4 ◆ Accuracy ~ 1 % No Bonding ! No shell structure
1 2 3 4
5 10 15 20 Distance from nucleus HbohrL Enhancement factor Fplus-VW Fsch-VW Fplus-TF Fsch-TF
Local kinetic energy Tiomas Fermi and Von Weisacker
structure
distance, while VW based converge
Pseudo 2-d input motivated by computational limitations Density along lines fed into convolutional neural network. ~1 million quadrature points per atom. ~2000 inputs per quadrature point. ~Barely tractable quadrature point
T{n(r)} = Z F{n(r), r0}τtf(r0)dr0
yi
mn = f(bi + a=p
X
a=0 b=q
X
b=0
yi−1
(m+a)(n+b)wi−1,i ab
) f(x) = max(0, x)
τT F = 3 10(3π2)
5/3n(r)2/3dr
Future thoughts: 3D Convolutional Networks Basis sets ~4000 samples per grid point 106 samples per atom
T{n(r)} = Z F{n(r), r0}τtf(r0)dr0
Learn F as a function of n(r), given as a slice of the surface.
Kohn-Sham Neural Network
C2H2 C2H4 C2H6 Accurate (KS) Neural Net Error(NN-KS) Enhancement Factor in C-C bonding plane
Errors of the NN’s optimal density in Hydrogen
electronic structure and the NN.
Sample Digest Train
Evaluate
Functions, Radial*Spherical Harmonics)
Coulomb integrals etc.
Symmetry functions
G1H G1O G2HH G2HO G2OO G2HH’
G(r1, r2, r3 ...)
G2HO’ G2OO’
How to parameterize a molecule with invariances and retain invertibility? How to express atomic number differences avoiding separate channels. Depth Map Coulomb Matrix? Sorted by distance or atomic number? Symmetry Functions?
fnlm(x, y, z) = RnYl,m Rn = e−(r−r0)2/(2σ2)
Embedded Space Real Space Tiis embedding is reversible! can go between geometry and embedded geometry reversibly.
|rlmi = X
atoms
fr(x, y, z)Y l
m(x, y, z) ⇤ (Atm.Number)
E = X
Atoms
Eatom
E = X
Molecules
Emol + X
pairs
Epair + ... E = X
Bonds
Ebond
Back propagates atom networks for each element with only 1 energy Uses separate monomer dimer etc. training data Like Behler-Parinello but bond energies vary less
N1 N3 N2
Eone− body Ethree −body Etwo−body Etotal
2000 4000 6000 8000 1 2 3 4 5 6 30 60 90 120 150 180 Number of Atoms Second Day NN-MP2 MBE RI-MP2 MBE
wall time
0.000 0.002
0.000 0.002 MP2 Three-body Energy (a.u.) NN Three-body Energy (a.u.)
0.0 5×10-5 1×10-4 Error of Many-Body Energies (a.u.)
2 3 4 5 6
0.000 0.005 0.010 0.015 Distance (Å) Energy (a.u.)
NN MP2 AMEOBA09
0.00 0.02 Energies (a.u.) RIMP2 MP2-MBE NN-MBE AMOEBA09
10 Error (×10-4 a.u.)
200 400 600 800 1000 1200 1400 3 6 9 12 15 18 90 180 270 360 450 540 Number of molecules NN-MBE/AMOEBA09 (Seconds) MP2-MBE (Days)
AMOEBA09 NN-MBE / MP2-MBE
Provided by TensorFlow We Code
water geometry optimization
100 200 300 400 500
Optimization Step Energy (a.u.)
Expresses the total molecular energy as a sum of bonds
O O O H N H H H H H H H H H H H H
bond energy (kcal / mol)
EDFT : -939.6928 Ha EDIM-NN: -939.6927 Ha
Similar errors for vitamin B12, D3 etc…
700,000 Carbon Carbon bond energies.
Because of GPU dependencies and large datasets required, the most powerful ML PES methods are not in common use in chemistry Tiey will be soon. Over the next year TensorMol and several other packages will appear where users can “Roll their own” ML-PES’s with minimal effort. Tiese will compete heavily with DFT Tie domain of chemical space which can be explored in a weekend is about to exponentially increase. THANKS!