The overarching objective of our program is to define general data science driven workflows that incorporate physical organic precepts and can be deployed directly within the reaction optimization process. Successfully developing such a workflow would have three key impacts on the chemical synthesis enterprise: 1) significantly streamline the empirical, costly process of reaction optimization, 2) algorithms would be applied to predict how new substrates, catalysts, and reagents (as well as reaction conditions) perform in the reaction of interest as extrapolations of this sort are poorly intuited. The ability to know quantitatively the generalizability of a reaction will rapidly accelerate the uptake of new methods in chem ical synthesis. And 3) as the data driven tools described herein utilize physical organic methods to describe molecules mathematically, the resulting correlations derived from empirical data can be interpreted to provide mechanistic insights into how catalysts/substrates interact. This provides one with the foundation to ?transfer? knowledge to new reactions and develop general catalyst design principles. We plan to continue to deliver to the community a compelling reason to change the culture of reaction development from empirical optimization and observations to an insightful, efficient, and high quality data producing process. This work will be accomplished in the context of asymmetric catalysis and focus on the following question: can we develop tools to predict reaction outcomes for completely new examples not represented within the training dataset required for the initial correlation, while simultaneously having interpretable/explainable statistical models? This will be accomplished by exploring various enantioselective processes catalyzed by a multitude of catalysts and interrogating the processes using modern computational chemistry and statistical methods. We will validate these new approaches by exploring if data-mining and new data collection can be used to build correlations with structural features of molecules for the prediction of altogether new examples. Within this we will ask fundamental questions about how catalyst dynamics coupled with non-covalent interactions impact catalyst performance and how to compile this information for new catalyst design strategies. Ultimately, we plan to deliver to the community a platform and pathway to facilitate reaction optimism holistically using easy to apply data science methods.

Public Health Relevance

We wish to continue our development of new reactions and modern data science tools for applications in the synthesis of bioactive molecules. Progress in these fields will have a significant impact in how molecules are prepared for biomedical applications. A particular emphasis involves building new data-driven workflows, which empower a wide-range of enterprises related to medicine.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Unknown (R35)
Project #
1R35GM136271-01
Application #
9930428
Study Section
Special Emphasis Panel (ZRG1)
Program Officer
Yang, Jiong
Project Start
2020-04-01
Project End
2025-03-31
Budget Start
2020-04-01
Budget End
2021-03-31
Support Year
1
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of Utah
Department
Chemistry
Type
Schools of Arts and Sciences
DUNS #
009095365
City
Salt Lake City
State
UT
Country
United States
Zip Code
84112