TY - JOUR
T1 - A Data-Science Approach to Experimental Catalyst Discovery
T2 - Integrating Exploration, Exploitation, and Serendipity
AU - Nakanowatari, Sunao
AU - Takahashi, Keisuke
AU - Dam, Hieu Chi
AU - Taniike, Toshiaki
N1 - Publisher Copyright:
© 2025 American Chemical Society.
PY - 2025/6/6
Y1 - 2025/6/6
N2 - Predicting the performance of heterogeneous catalysts is difficult because it involves complex interactions and unknown elementary reactions; hence, traditional catalyst development relies on trial and error. Machine learning offers a structured approach to address these issues. However, this approach is limited by challenges such as descriptor design, sparse data, and context-dependent interactions. In this study, two machine learning systems were developed to address these challenges in catalyst discovery: a recommender system that balances exploration and exploitation, and a serendipiter that identifies unexpected discoveries for the recommender─catalysts expected to exhibit high performance despite being predicted as most likely non-high-performing. These systems were tested on the oxidative coupling of methane, and the results demonstrated a promising improvement in the efficiency of catalyst discovery. The recommender, based on evidence theory, uses binary combinations of catalyst components as descriptors to predict performance. It handles incomplete data by quantifying contradictions and uncertainty, facilitating a balance between exploration (testing unevidenced catalysts) and exploitation (refining known high-performing ones). The recommender efficiently identified a diverse range of high-performing catalysts through adaptive sampling with 160 catalysts. The serendipiter, a meta-learner, identifies unexpected high-performing catalysts by leveraging different machine learning models. It increased the occurrence of serendipitous discoveries to 50%, compared to 3% with the recommender alone. In summary, these systems improve the efficiency and reproducibility of catalyst discovery by balancing exploitation, exploration, and serendipity.
AB - Predicting the performance of heterogeneous catalysts is difficult because it involves complex interactions and unknown elementary reactions; hence, traditional catalyst development relies on trial and error. Machine learning offers a structured approach to address these issues. However, this approach is limited by challenges such as descriptor design, sparse data, and context-dependent interactions. In this study, two machine learning systems were developed to address these challenges in catalyst discovery: a recommender system that balances exploration and exploitation, and a serendipiter that identifies unexpected discoveries for the recommender─catalysts expected to exhibit high performance despite being predicted as most likely non-high-performing. These systems were tested on the oxidative coupling of methane, and the results demonstrated a promising improvement in the efficiency of catalyst discovery. The recommender, based on evidence theory, uses binary combinations of catalyst components as descriptors to predict performance. It handles incomplete data by quantifying contradictions and uncertainty, facilitating a balance between exploration (testing unevidenced catalysts) and exploitation (refining known high-performing ones). The recommender efficiently identified a diverse range of high-performing catalysts through adaptive sampling with 160 catalysts. The serendipiter, a meta-learner, identifies unexpected high-performing catalysts by leveraging different machine learning models. It increased the occurrence of serendipitous discoveries to 50%, compared to 3% with the recommender alone. In summary, these systems improve the efficiency and reproducibility of catalyst discovery by balancing exploitation, exploration, and serendipity.
KW - adaptive sampling
KW - evidence theory
KW - high-throughput experimentation
KW - oxidative coupling of methane
KW - serendipity
UR - https://www.scopus.com/pages/publications/105004590122
U2 - 10.1021/acscatal.5c00100
DO - 10.1021/acscatal.5c00100
M3 - Article
AN - SCOPUS:105004590122
SN - 2155-5435
VL - 15
SP - 8691
EP - 8705
JO - ACS Catalysis
JF - ACS Catalysis
IS - 11
ER -