Machine learning for analysis of experimental scattering data in materials chemistry

Published:

Using generative models for structure solution from pair distribution function data
Crystallographic methods, such as single crystal and powder diffraction, allow establishing the links between material structure and properties that are at the heart of materials development. However, other approaches for atomic-scale structure determination are needed for nanostructured materials that have limited long-range order. Over the past decades, Pair Distribution Function (PDF) analysis has become a core tool for analysis of nanomaterial structure. Currently, PDF analysis is mainly done by fitting a known starting model to an experimental PDF to extract quantitative structural information. However, identifying a model or solving a structure de novo, from a PDF, is still an enormous challenge. I develop ML models that can identify the structural model from a PDF, extract structural motifs from the PDF or in some cases solve a simple nanoparticle structure directly from a PDF.
Collaborators: I collaborated with the Nanostructure Group UPCH, University of Copenhagen, lead by Assoc. Prof. Kirsten Marie Ørnsbjerg Jensen, with Professor Simon J. L. Billinge from Columbia University, with Assistant Professor Raghavendra Selvan from Department of Computer Science, UCPH and with Senior Lecturer Keith Tobias Butler from University College London.

Papers - Identifying the structural model from a PDF:

  1. POMFinder: identifying polyoxometallate cluster structures from pair distribution function data using explainable machine learning

Papers - Solving the structure from a PDF:

  1. DeepStruc: Towards structure solution from pair distribution function data using deep generative models
  2. Characterising the atomic structure of mono-metallic nanoparticles from x-ray scattering data using conditional generative models

Papers - Extracting structural motifs from a PDF:

  1. Extracting structural motifs from pair distribution function data of nanostructures using explainable machine learning highlighted in MAX IV annual report 2022
  2. Characterisation of intergrowth in metal oxide materials using structure-mining: the case of γ-MnO2
  3. Atomic structural changes in the formation of transition metal tungstates: the role of polyoxometalate structures in material crystallization
  4. ClusterFinder: a fast tool to find cluster structures from pair distribution function data

Papers - Other related papers:

  1. Machine learning for analysis of experimental scattering and spectroscopy data in materials chemistry
  2. A GPU-Accelerated Open-Source Python Package for Calculating Powder Diffraction, Small-Angle-, and Total Scattering with the Debye Scattering Equation
  3. CHILI: Chemically-Informed Large-scale Inorganic Nanomaterials Dataset for Advancing Graph Machine Learning