Data-efficient and interpretable inverse materials design using a disentangled variational autoencoder

Cheng Zeng; Zulqarnain Khan; Nathan Post

doi:10.55092/aimat20250002

Article

Open Access


Cheng Zeng¹,²,*, Zulqarnain Khan¹,*, Nathan Post²

¹ Institute for Experiential AI, Northeastern University, Boston, MA 02115, USA

² The Roux Institute, Northeastern University, Portland, ME 04101, USA

* c.zeng@northeastern.edu; z.khan@northeastern.edu

Volume
Volume 1 Issue 1, 2025
Citation
Zeng C, Khan Z, Post N. Data-efficient and interpretable inverse materials design using a disentangled variational autoencoder. AI Mater. 2025(1):0002, https://doi.org/10.55092/aimat20250002.
DOI
10.55092/aimat20250002
Copyright
Copyright 2024 by the authors. Published by ELSP.

Abstract

Inverse materials design has proven successful in accelerating the discovery of novel materials. Many inverse design methods rely on unsupervised learning, in which a latent space is learned to provide a compact description of materials representations. A latent space learned this way is likely to be entangled with respect to the target property and other properties of the materials, which makes the inverse design process ambiguous. Here, we present a semi-supervised learning approach based on a disentangled variational autoencoder that learns a probabilistic relationship between features, latent variables and target properties. The approach is data-efficient because it combines all labelled and unlabelled data in a coherent manner and uses expert-informed prior distributions to improve model robustness even when labelled data are limited. It is inherently interpretable because the learnable target property is disentangled from the other properties of the materials, and a post-hoc analysis of the classification head of the model provides an additional layer of interpretability. We demonstrate the approach on an experimental high-entropy alloy dataset with chemical compositions as input and single-phase formation as the single target property. High-entropy alloys were chosen as example materials because of the vast chemical space spanned by their possible compositions and atomic configurations. Although a single property is used in this work, the disentangled model can be extended to the inverse design of materials with multiple target properties.
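
To make the idea of combining labelled and unlabelled data in one objective more concrete, the following is a minimal sketch of a classic semi-supervised variational autoencoder loss, in which a discrete label y (here standing in for single-phase formation) is kept separate from a continuous latent z. It is an illustration under stated assumptions, not the model used in this work: the linear encoder, decoder and classifier, the dimensions, the uniform label prior `prior_y`, the weight `alpha`, and all function names are hypothetical, and the data are random compositions generated only so the code runs.

```python
import numpy as np

# Hypothetical sizes: 5-element compositions, 2 latent dims, binary label.
N_ELEMENTS, LATENT_DIM, N_CLASSES = 5, 2, 2

def softmax(a):
    a = a - a.max(axis=-1, keepdims=True)
    e = np.exp(a)
    return e / e.sum(axis=-1, keepdims=True)

def init_params(rng, scale=0.1):
    # Linear maps stand in for the encoder, decoder and classification head.
    shapes = {
        "W_cls": (N_ELEMENTS, N_CLASSES),
        "W_mu": (N_ELEMENTS + N_CLASSES, LATENT_DIM),
        "W_logvar": (N_ELEMENTS + N_CLASSES, LATENT_DIM),
        "W_dec": (LATENT_DIM + N_CLASSES, N_ELEMENTS),
    }
    return {k: scale * rng.standard_normal(s) for k, s in shapes.items()}

def kl_to_standard_normal(mu, logvar):
    # KL( N(mu, diag(exp(logvar))) || N(0, I) ), one value per sample.
    return 0.5 * np.sum(np.exp(logvar) + mu**2 - 1.0 - logvar, axis=1)

def neg_elbo(params, x, y_onehot, prior_y, eps):
    # Negative evidence lower bound for (x, y) pairs, one value per sample.
    h = np.concatenate([x, y_onehot], axis=1)
    mu, logvar = h @ params["W_mu"], h @ params["W_logvar"]
    z = mu + np.exp(0.5 * logvar) * eps  # reparameterisation trick
    x_hat = softmax(np.concatenate([z, y_onehot], axis=1) @ params["W_dec"])
    recon = -np.sum(x * np.log(x_hat + 1e-12), axis=1)  # cross-entropy
    return recon + kl_to_standard_normal(mu, logvar) - np.log(y_onehot @ prior_y)

def labelled_loss(params, x, y, prior_y, eps, alpha=1.0):
    # Negative ELBO with the observed label, plus a classification term.
    y_onehot = np.eye(N_CLASSES)[y]
    q_y = softmax(x @ params["W_cls"])
    clf = -np.log(np.sum(q_y * y_onehot, axis=1) + 1e-12)
    return neg_elbo(params, x, y_onehot, prior_y, eps) + alpha * clf

def unlabelled_loss(params, x, prior_y, eps):
    # Marginalise the unknown label under q(y|x); subtract the entropy of q.
    q_y = softmax(x @ params["W_cls"])
    loss = np.sum(q_y * np.log(q_y + 1e-12), axis=1)  # negative entropy
    for k in range(N_CLASSES):
        y_onehot = np.tile(np.eye(N_CLASSES)[k], (x.shape[0], 1))
        loss = loss + q_y[:, k] * neg_elbo(params, x, y_onehot, prior_y, eps)
    return loss

rng = np.random.default_rng(0)
params = init_params(rng)
prior_y = np.array([0.5, 0.5])  # hypothetical stand-in for an expert prior
x_lab = rng.dirichlet(np.ones(N_ELEMENTS), size=4)  # synthetic compositions
y_lab = np.array([0, 1, 1, 0])                      # synthetic labels
x_unl = rng.dirichlet(np.ones(N_ELEMENTS), size=8)
eps_lab = rng.standard_normal((4, LATENT_DIM))
eps_unl = rng.standard_normal((8, LATENT_DIM))

total = (labelled_loss(params, x_lab, y_lab, prior_y, eps_lab).mean()
         + unlabelled_loss(params, x_unl, prior_y, eps_unl).mean())
print(total)
```

In this sketch the label enters the encoder and decoder as its own variable, so z only has to account for variation not explained by y. A non-uniform `prior_y` would be one simple way to encode prior knowledge about how often the target property occurs.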

Keywords

inverse materials design; high-entropy alloys; disentangled variational autoencoder; interpretable methods
