Text and Data Mining for Material Synthesis Elsa Olivetti, MIT - - PowerPoint PPT Presentation

text and data mining for material synthesis
SMART_READER_LITE
LIVE PREVIEW

Text and Data Mining for Material Synthesis Elsa Olivetti, MIT - - PowerPoint PPT Presentation

Text and Data Mining for Material Synthesis Elsa Olivetti, MIT Gerbrand Ceder, UC Berkeley Departments of Materials Science & Engineering Andrew McCallum, UMass Amherst Department of Computer Science & Engineering 1 Challenges for


slide-1
SLIDE 1

Text and Data Mining for Material Synthesis

Elsa Olivetti, MIT Gerbrand Ceder, UC Berkeley Departments of Materials Science & Engineering Andrew McCallum, UMass Amherst Department of Computer Science & Engineering

1

slide-2
SLIDE 2

1980 1991 2008 1954 1963 1982

Silicon Solar Cells Lithium Ion Batteries

1st practical silicon solar cell invented at Bell Labs Sharp produces 1st practical solar module of silicon solar cells Kyocera 1st mass produces polysilicon cells by today’s standard process Oxford demonstrates 1st viable rechargeable lithium battery Sony sells 1st commercially available Li-ion batteries for high price consumer electronics 1st uses of Li-ion battery in production vehicles

Challenges for technology development: Timeline for development is long

slide-3
SLIDE 3

Modern data‐driven and first‐principles materials design accelerates pace of wha what to make…

3

+ + H 

i

2 i1 Ne

 Vnuclear(r

i) i1 Ne

 1 2 1 rj ri

ji Ne

i Ne

3.2V

  • 3. 86

V 3.7 V 3.76 V 4.09 V

Phase diagrams Surfaces Bandgaps

slide-4
SLIDE 4

4

slide-5
SLIDE 5
slide-6
SLIDE 6

Text extraction workflow

slide-7
SLIDE 7

Status of data dissemination

slide-8
SLIDE 8

8

slide-9
SLIDE 9

9

Example: suggesting synthesis conditions for a specific morphology in titania

Edward Kim et al., Chemistry of Materials 2017

Experimentally‐accessible (and reported) variables to facilitate practical synthesis route planning.

*Grey circled points = accuracy test points All other points = training data points

slide-10
SLIDE 10

Virtual synthesis screening is hard: data is sparse & scarce

“Sparse” = high‐dimensional vector of synthesis actions “Scarce” = materials of interest  not many papers published to train on Can deep learning / generative models be useful for synthesis screening?

slide-11
SLIDE 11

Schematic of Variational Autoencoder

Variational autoencoder:

  • Loss = reconstruction + f(Gaussian)
  • Also a generative model

Edward Kim et al., npj Computational Materials 2017

Collaborator, Stefanie Jegelka, CSAIL, MIT

slide-12
SLIDE 12

Data augmentation with text & data mining

Edward Kim et al., npj Computational Materials 2017

slide-13
SLIDE 13

Example: suggesting synthesis conditions for stabilizing desired materials

Polymorphs for MnO2

  • verlaid with most probable

alkali‐ion use in synthesis (intercalation‐based phase stability)

Edward Kim et al., npj Computational Materials 2017

Photocatalysts Lithium‐ion batteries Molecular sieves Alkaline batteries

10,200 articles

slide-14
SLIDE 14

Exploratory: Rare phase in common material

Clustering of latent space shows driving conditions for polymorph of TiO2 for photocatalysis

Edward Kim et al., npj Computational Materials 2017

slide-15
SLIDE 15

Comparison of literature / virtual samples for SrTiO3 synthesis

Calcination Sintering Annealing NaOH (M) Reference 800C, 2h ‐ ‐ 1 Ye et al, 2016 800C, 2h 1250C, 2h ‐ ‐ Zhao et al, 2004 1000C, 12h ‐ 500C, 2h ‐ Zhao et al, 2015 600‐750C, 4h ‐ ‐ ‐ Puangpetch et al, 2008 721C, 1.8h ‐ 468C, 0.4h ‐ N/A ‐ ‐ 450C, 0.9h 1 N/A 955C, 6h 1182C, 7.5h ‐ ‐ N/A

Edward Kim et al., npj Computational Materials 2017

One cannot train a model exclusively on literature data and classify something as successful or not, since there are no negative examples in the literature

slide-16
SLIDE 16

Key elements in machine learning for energy technology

Ramprasad et al npj computational materials, 2017

slide-17
SLIDE 17

Applicability and Next steps

  • Continue to improve pipeline and disseminate information to the

community

  • Inform structure for data going forward
  • Use cases in:
  • Solid state synthesis, hydrothermal and sol gel methods
  • Alloy design
  • Electrolyte performance
slide-18
SLIDE 18

Thank you

  • livetti.mit.edu

synthesisproject.org

18