classification of X-ray Diffraction Patterns Shreyaa Raghavan Tonio - - PowerPoint PPT Presentation

classification of x ray diffraction patterns
SMART_READER_LITE
LIVE PREVIEW

classification of X-ray Diffraction Patterns Shreyaa Raghavan Tonio - - PowerPoint PPT Presentation

Tackling Data Scarcity in Materials Research: Using Semi-supervised, Adversarial Training to Improve classification of X-ray Diffraction Patterns Shreyaa Raghavan Tonio Buonassisi, Zhe Liu MIT Photovoltaic Research Lab Contact:


slide-1
SLIDE 1

Shreyaa Raghavan Tonio Buonassisi, Zhe Liu MIT Photovoltaic Research Lab Contact: shreyaar@mit.edu

Tackling Data Scarcity in Materials Research: Using Semi-supervised, Adversarial Training to Improve classification of X-ray Diffraction Patterns

slide-2
SLIDE 2

X-ray Diffraction (XRD) Pattern Classification

▪ A typical machine learning problem ▪ Classification of crystals by space groups and dimensionality ▪ Currently, uses experimental data & computer- generated, synthetic data during training

Figure 1. Examples of perovskite XRD patterns with different dimensionalities1 (i.e. 0D, 2D, 3D)

1Sun, S. et al., Joule 3, 1437–1451 (2019).

XRD Pattern Example

slide-3
SLIDE 3

(1) Data Scarcity regarding generating

labeled experimental data1

(2) Simulated data in the training can be

detrimental to the classifier

Challenges with Current Classifier (autoXRD)

Goal: Mimic experimental data and improve efficacy of non- experimental data (with generative adversarial network – GAN2)

1Oviedo, F., Ren, Z., Sun, S. et al. npj Comput Mater 5, 60 (2019).

  • 2A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang, and R. Webb. Learning from simulated and unsupervised images through adversarial training.

arXiv:1612.07828, 2016.

slide-4
SLIDE 4

Proposed Method 1: Using Generative Adversarial Training

Simulated XRD input Binary Classification Real vs Fake Refiner Model Refined XRD Example Unlabeled Experimental XRD example Discriminator Model Model Update Model Update

Main Advantage: Training with Unlabeled Experimental Data

slide-5
SLIDE 5

Proposed Method 2: Gaussian Filter

Effect of Gaussian Filter on simulated XRD data (i.e. widening peaks) Effect of Refiner Model VS

slide-6
SLIDE 6

Results

Augmented Data 500 Data Accuracy (%) 1000 Data Accuracy (%) 2000 Data Accuracy (%) 4000 Data Accuracy (%) Simulated 12.7 23.9 26.2 34.6 Refiner Model A (20 to 1) 39.8 51.2 53.2 62.9 Refiner Model B (30 to 1) 11.0 17.3 44.2 49.5 Gaussian Filter 11.8 21.6 38.2 49.8

Table 1. Accuracies after 5-Fold Cross Validation of Space Group Classification using Proposed Methods and no Experimental Data

slide-7
SLIDE 7

▪ Accelerating characterization tasks with machine/deep learning ▪ Generalizable to tackle data scarcity in materials research and other

fields (where there’s lack of large, labeled dataset)

Future Work

slide-8
SLIDE 8

References

[1] Christopher Bowles et al. Gan augmentation: Augmenting training data using generative adversarial networks. ArXiv, abs/1810.10863, 2018. [2] Oviedo, F., Ren, Z., Sun, S. et al. Fast and interpretable classification of small X-ray diffraction datasets using data augmentation and deep neural networks. npj Comput Mater 5, 60 (2019). https://doi.org/10.1038/s41524-019-0196-x [3] A. Shrivastava, T. Pfister, O. Tuzel, J. Susskind, W. Wang, and R. Webb. Learning from simulated and unsupervised images through adversarial training. arXiv preprint arXiv:1612.07828, 2016. [4] Sun, S. et al. Accelerated development of perovskite-inspired materials via high-throughput synthesis and machine-learning diagnosis. Joule3, 1437–1451 (2019). [5] https://github.com/mjdietzx/SimGAN

slide-9
SLIDE 9

Thank You! ☺

Contact: shreyaar@mit.edu