www.data61.csiro.au
DATA ANALYTICS IN NANOMATERIALS DISCOVERY
Michael Fernandez | OCE-Postdoctoral Fellow September 2016
NANOMATERIALS DISCOVERY Michael Fernandez | OCE-Postdoctoral Fellow - - PowerPoint PPT Presentation
DATA ANALYTICS IN NANOMATERIALS DISCOVERY Michael Fernandez | OCE-Postdoctoral Fellow September 2016 www.data61.csiro.au Materials Discovery Process Materials Genome Project Integrating computational methods and information with sophisticated
www.data61.csiro.au
Michael Fernandez | OCE-Postdoctoral Fellow September 2016
Data Analytics for Nanomaterials| Michael Fernandez 3 |
Integrating computational methods and information with sophisticated computational and analytical tools to shorten the duration
materials development from 10-20 years to 2 or 3 years.
Data Analytics for Nanomaterials| Michael Fernandez 2 |
Chris Watkins
Piotr Szul Yulia Arzhaeva
Amanda Barnard Sun Baichuan
Data Analytics for Nanomaterials| Michael Fernandez 2 |
Data Analytics for Nanomaterials| Michael Fernandez 4 |
ππ πΌπ 2 + π ππ£ππππ π π + 1 2 Οπβ π ππ 1 π πβπ π
chemistry
elements
deployable
Data Analytics for Nanomaterials| Michael Fernandez 5 |
nanodiamonds
Barnard, A. S. et al. Nanoscale 3, 958β62 (2011)
The arrow indicates (111)|(111) interface between two 4 nm sized nanodiamonds. DFTB simulations of the surface electrostatic potential of dodecahedral diamond nanoparticles of a) 2.2 nm and b) 2.5 nm
Data Analytics for Nanomaterials| Michael Fernandez 6 |
Materials libraries Experiment design
Database
Performance Measurement Lead materials scale up Knowledge for rational design Banks of materials Hypothetical materials In-silico screening
complexity and diversity
problem
Potyrailo, R. et al. ACS Comb. Sci. 13, 579β633 (2011).
Theory, modeling and informatics
Data Analytics for Nanomaterials| Michael Fernandez 7 |
Departing from the Edisonian approach
We can accurately predict a property, so it can be computed for entire materials spaces Combinatorial In silico design
In silico structure generation
Data Analytics for Nanomaterials| Michael Fernandez 7 |
Data Analytics for Nanomaterials| Michael Fernandez 7 |
Data Analytics for Nanomaterials| Michael Fernandez 9 |
Polydispersive sample Quasi-monodisperse sample Purification Controlled synthesis
Polydispersity can be detrimental for high-performing applications Purification of polydispersive nanoparticles samples is expensive
Data Analytics for Nanomaterials| Michael Fernandez 10 |
Nanodiamonds Graphenes
Data Analytics for Nanomaterials| Michael Fernandez 10 |
The predictors of Xi are finite mixtures of archetypes Zj, which are convex combinations
Finds a kο΄ m matrix Z that corresponds to the archetypal or βpure patternsβ in the data in such a way that each data point can be represented as a mixture of those archetypes. In other words, the archetypal analysis yields the two n ο΄ k coefficient matrices Ξ± and Ξ², which minimize the residual sum of squares:
Cutler, A. & Breiman, L. Archetypal Analysis. Technometrics 36, 338β347 (1994).
Data Analytics for Nanomaterials| Michael Fernandez 11 |
Nanodiamonds
Fernandez, M. & Barnard, A. S. ACS Nano 9, 11980β11992 (2015).
Graphene nanoflakes
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Nanodiamonds prototypes
Fernandez, M. & Barnard, A. S. ACS Nano 9, 11980β11992 (2015).
Graphene prototypes
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Fernandez, M. & Barnard, A. S. ACS Nano 9, 11980β11992 (2015).
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Defects, oxidation and edge passivation yield large structural diversity
Data Analytics for Nanomaterials| Michael Fernandez 13 |
P P P
Single Si substitution by P yields a large structural diversity
Data Analytics for Nanomaterials| Michael Fernandez 13 |
nitroimidazole benzimidazole
ZIF-68
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Modification with of 35 functional groups gives a total of ~1.5 million (354) unique combinations
MOF ZBP Organic Linkers
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Fernandez, M., Shi, H. & Barnard, A. S Carbon (2016). doi:10.1016/j.carbon.2016.03.005
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Estimation of the graphene Band Gap from topological features
Pi and Pj , are the values of a bond order of the carbon atoms in graphene, while L is the topological distance, whilst ο€L
ij is a delta function delta function
Fernandez, M. et al. ACS Comb. Sci. (2016) doi:10.1021/acscombsci.6b00094
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Fernandez, M.; Shi, H.; Barnard, A et al.
the summation is over the N atom pairs in the graphene structure, and rij is the distance of these pairs and B is a smoothing parameter set to 10.
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Fernandez, M.; Shi, H.; Barnard, A et al. J. Chem. Inf. Model. (2015), 55, 2500-2506
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Fernandez, M., et al. J. Phys. Chem. C 117, 14095β14105 (2013).
Data Analytics for Nanomaterials| Michael Fernandez 13 |
System size Accuracy of electronic calculations methods vs. system size
TBDF/ Semiempirical Density Functional
Accuracy
Quantum Monte Carlo Coupled Cluster
Data Analytics for Nanomaterials| Michael Fernandez 15 |
Machine learning predictions:
structure Machine Learning
Partial Database Screening Full Database Predictions
Ramakrishnan, R. et al. J. Chem. Theory Comput. 11, 2087β2096 (2015). Fernandez, M et al. J. Chem. Inf. Model. (2015), 55, 2500-2506
Data Analytics for Nanomaterials| Michael Fernandez 13 |
big gap big gap
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Predictions Machine learning calibration
Data storage Input jobs Queue management Computational resources Resubmit or kill failed runs Finished runs Accuracy refinement
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Data Analytics for Nanomaterials| Michael Fernandez 13 |
Ag-NP
Electrostatic Potential
Data Analytics for Nanomaterials| Michael Fernandez 13 |
C2 Feature maps 10x10
5x5 convolution 2x2 subsampling 5x5 convolution 2x2 subsampling Fully connected
input 32x32 C1 Feature maps 28x28 S1 Feature maps 14x14 S2 Feature maps 5x5 B1 B2 Output
Image 32x32
www.data61.csiro.au
Data61 Michael Fernandez e michael.fernandezllamosa@csiro.au w https://research.csiro.au/mmm/