Predicting Indium Phosphide Quantum Dot Properties from Synthetic Procedures Using Machine Learning
Publication Date:
July 6 2022
Article Citation:
Chem. Mater. 2022, 34, 14, 6296–6311
Article Authors
Hao A. Nguyen, Florence Y. Dou, Nayon Park, Shenwei Wu, Harrison Sarsito, Benedicte Diakubama, Helen Larson, Emily Nishiwaki, Micaela Homer, Melanie Cash, and Brandi M. Cossairt
Article DOI:
https://doi.org/10.1021/acs.chemmater.2c00640

Prediction of chemical reaction outcomes using machine learning (ML) has emerged as a powerful tool for advancing materials synthesis. However, this approach requires large and diverse datasets, which are extremely limited in the field of nanomaterials synthesis due to inconsistent and nonstandardized reporting in the literature and a lack of understanding of synthetic mechanisms. In this study, we extracted parameters of InP quantum dot (QD) syntheses as our inputs and resultant properties (absorption, emission, diameter) as our outputs from 72 publications. We “filled in” missing outputs using a data imputation method to prepare a complete dataset containing 216 entries for training and testing predictive ML models. We defined the descriptor space in two ways (condensed and extended) based on either chemical identity or the role of reagents to explore the best approach for categorizing input features. We achieved mean absolute errors (MAEs) as low as 20.29, 11.46, and 0.33 nm for absorption, emission, and diameter, respectively, with our best ML model.

We used these models to deploy an accessible and interactive web app for designing syntheses of InP. Using this web app, we investigated chemical trends in InP syntheses, such as the effects of common additives, like zinc salts and trioctylphosphine. We also designed and conducted new experiments based on extensions of literature procedures and compared our experimentally measured properties to predictions, thus evaluating the “real-life” accuracy of our models. Conversely, we used inverse design to obtain InP QDs with specific properties. Finally, we applied the same approach to train, test, and launch predictive models for CdSe QDs by expanding a previously published dataset. Altogether, our data preprocessing method and ML implementations demonstrate the ability to design materials with targeted properties and explore underlying reaction mechanisms even when faced with limited data resources.

 

Check out the App

 

Check out the code