Synergistic integration of topology and machine learning for the predictions of protein-ligand binding affinities and mutation impacts

Wei, Guowei

Abstract

Fundamental challenges that hinder the current understanding of biomolecular systems are their tremendous complexity, high dimensionality and excessively large data sets associated with their geometric modeling and simulations. These challenges call for innovative strategies for handling massive biomolecular datasets. Topology, in contrast to geometry, provides a unique tool for dimensionality reduction and data simplification. However, traditional topology typically incurs with excessive reduction in geometric information. Persistent homology is a new branch of topology that is able to bridge traditional topology and geometry, but suffers from neglecting biological information. Built upon PI?s recent work in the topological data analysis of biomolecules, this project will explore how to integrate topological data analysis and machine learning to significantly improve the current state-of-the-art predictions of protein-ligand binding and mutation impact established in the PI?s preliminary studies. These improvements will be achieved through developing physics-embedded topological methodologies and advanced deep learning architectures for tackling heterogeneous biomolecular data sets arising from a variety of physical and biological considerations. Finally, the PI will establish robust databases and online servers for the proposed predictions.

Public Health Relevance

The project concerns the integration of topological data analysis and machine learning architectures for the predictions of protein-ligand binding affinities and mutation induced protein stability changes from massive data sets. This new data approach has considerable impact for future generation methods in computational biophysics and drug design.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 1R01GM126189-01A1
Application #: 9591863
Study Section: Macromolecular Structure and Function D Study Section (MSFD)
Program Officer: Lyster, Peter

Project Start: 2018-08-01
Project End: 2022-07-31
Budget Start: 2018-08-01
Budget End: 2019-07-31
Support Year: 1
Fiscal Year: 2018
Total Cost
Indirect Cost

Institution

Name: Michigan State University
Department: Biostatistics & Other Math Sci
Type: Schools of Arts and Sciences
DUNS #: 193247145

City: East Lansing
State: MI
Country: United States
Zip Code: 48824

Related projects


NIH 2020 R01 GM	Synergistic integration of topology and machine learning for the predictions of protein-ligand binding affinities and mutation impacts Wei, Guowei / Michigan State University
NIH 2020 R01 GM	Synergistic integration of topology and machine learning for the predictions of protein-ligand binding affinities and mutation impacts Wei, Guowei / Michigan State University
NIH 2019 R01 GM	Synergistic integration of topology and machine learning for the predictions of protein-ligand binding affinities and mutation impacts Wei, Guowei / Michigan State University
NIH 2018 R01 GM	Synergistic integration of topology and machine learning for the predictions of protein-ligand binding affinities and mutation impacts Wei, Guowei / Michigan State University

Publications

Bramer, David; Wei, Guo-Wei (2018) Blind prediction of protein B-factor and flexibility. J Chem Phys 149:134107

Comments

Be the first to comment on this grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: