TY - JOUR
T1 - Global Property Prediction
T2 - A Benchmark Study on Open-Source, Perovskite-like Datasets
AU - Mayr, Felix
AU - Gagliardi, Alessio
N1 - Publisher Copyright:
©
PY - 2021/5/18
Y1 - 2021/5/18
N2 - Screening combinatorial space for novel materials, such as perovskite-like ones for photovoltaics, has resulted in a high amount of simulated high-throughput data and analysis thereof. This study proposes a comprehensive comparison of structural fingerprint-based machine learning models on seven open-source databases of perovskite-like materials to predict band gaps and energies. It shows that none of the given methods, including graph neural networks, are able to capture arbitrary databases evenly, while underlining that commonly used metrics are highly database-dependent in typical workflows. In addition, the applicability of variance selection and autoencoders to significantly reduce fingerprint size indicates that models built with common fingerprints only rely on a submanifold of the available fingerprint space.
AB - Screening combinatorial space for novel materials, such as perovskite-like ones for photovoltaics, has resulted in a high amount of simulated high-throughput data and analysis thereof. This study proposes a comprehensive comparison of structural fingerprint-based machine learning models on seven open-source databases of perovskite-like materials to predict band gaps and energies. It shows that none of the given methods, including graph neural networks, are able to capture arbitrary databases evenly, while underlining that commonly used metrics are highly database-dependent in typical workflows. In addition, the applicability of variance selection and autoencoders to significantly reduce fingerprint size indicates that models built with common fingerprints only rely on a submanifold of the available fingerprint space.
UR - http://www.scopus.com/inward/record.url?scp=85106422277&partnerID=8YFLogxK
U2 - 10.1021/acsomega.1c00991
DO - 10.1021/acsomega.1c00991
M3 - Article
AN - SCOPUS:85106422277
SN - 2470-1343
VL - 6
SP - 12722
EP - 12732
JO - ACS Omega
JF - ACS Omega
IS - 19
ER -