text and data mining for material synthesis
play

Text and Data Mining for Material Synthesis Elsa Olivetti, MIT - PowerPoint PPT Presentation

Text and Data Mining for Material Synthesis Elsa Olivetti, MIT Gerbrand Ceder, UC Berkeley Departments of Materials Science & Engineering Andrew McCallum, UMass Amherst Department of Computer Science & Engineering 1 Challenges for


  1. Text and Data Mining for Material Synthesis Elsa Olivetti, MIT Gerbrand Ceder, UC Berkeley Departments of Materials Science & Engineering Andrew McCallum, UMass Amherst Department of Computer Science & Engineering 1

  2. Challenges for technology development: Timeline for development is long 1954 1963 1982 Silicon Solar Cells Kyocera 1 st mass 1 st practical silicon solar Sharp produces 1 st practical solar produces polysilicon cells cell invented at Bell Labs module of silicon by today’s standard solar cells process 1980 1991 2008 Lithium Ion Batteries Sony sells 1 st commercially 1 st uses of Li-ion Oxford demonstrates 1 st viable rechargeable available Li-ion batteries for battery in production lithium battery high price consumer electronics vehicles

  3. Modern data ‐ driven and first ‐ principles materials design accelerates pace of wha what to make… + + H  N e N e N e N e      1 1  i  2 V nuclear ( r i ) r j  r i 2 i  1 i  1 j  i i Surfaces Phase diagrams Bandgaps 3.7 V 4.09 3. 86 V V 3.2V 3.76 V 3

  4. 4

  5. Text extraction workflow

  6. Status of data dissemination

  7. 8

  8. Example: suggesting synthesis conditions for a specific morphology in titania *Grey circled points = accuracy test points All other points = training data points Experimentally ‐ accessible (and reported) variables to facilitate practical synthesis route planning. 9 Edward Kim et al., Chemistry of Materials 2017

  9. Virtual synthesis screening is hard: data is sparse & scarce “Sparse” = high ‐ dimensional vector of synthesis actions “Scarce” = materials of interest  not many papers published to train on Can deep learning / generative models be useful for synthesis screening?

  10. Schematic of Variational Autoencoder Variational autoencoder: • Loss = reconstruction + f(Gaussian) • Also a generative model Collaborator, Stefanie Jegelka, CSAIL, MIT Edward Kim et al., npj Computational Materials 2017

  11. Data augmentation with text & data mining Edward Kim et al., npj Computational Materials 2017

  12. Example: suggesting synthesis conditions for stabilizing desired materials Alkaline batteries Polymorphs for MnO 2 overlaid with most probable Photocatalysts alkali ‐ ion use in synthesis (intercalation ‐ based phase stability) 10,200 articles Molecular sieves Lithium ‐ ion batteries Edward Kim et al., npj Computational Materials 2017

  13. Exploratory: Rare phase in common material Clustering of latent space shows driving conditions for polymorph of TiO 2 for photocatalysis Edward Kim et al., npj Computational Materials 2017

  14. Comparison of literature / virtual samples for SrTiO 3 synthesis Calcination Sintering Annealing NaOH (M) Reference 800C, 2h ‐ ‐ 1 Ye et al, 2016 800C, 2h 1250C, 2h ‐ ‐ Zhao et al, 2004 1000C, 12h ‐ 500C, 2h ‐ Zhao et al, 2015 600 ‐ 750C, 4h ‐ ‐ ‐ Puangpetch et al, 2008 721C, 1.8h ‐ 468C, 0.4h ‐ N/A ‐ ‐ 450C, 0.9h 1 N/A 955C, 6h 1182C, 7.5h ‐ ‐ N/A One cannot train a model exclusively on literature data and classify something as successful or not, since there are no negative examples in the literature Edward Kim et al., npj Computational Materials 2017

  15. Key elements in machine learning for energy technology Ramprasad et al npj computational materials, 2017

  16. Applicability and Next steps • Continue to improve pipeline and disseminate information to the community • Inform structure for data going forward • Use cases in: • Solid state synthesis, hydrothermal and sol gel methods • Alloy design • Electrolyte performance

  17. Thank you olivetti.mit.edu synthesisproject.org 18

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend