Computational methods for structural-functional studies of proteins

Grishin, Nick

Abstract

Accurate multiple sequence alignment (MSA) is the major unsolved problem in protein bioinformatics. Alignments and similarity searches are essential first steps in experimental design for all studies involving proteins, and the accuracy of these methods is crucial for the success in biomedical research. In the course of this project, we plan to significantly improve the accuracy of alignments and the precision of sequence similarity detection between protein families. During the last few years, our group proposed 4 methods for MSA construction and 2 methods for remote homology inference. Presently, our latest program PROMALS3D is judged to be the most accurate aligner for weakly similar sequences. We also performed a comprehensive survey of kinase sequences and structures that revealed 25 homologous groups (superfamilies) in 10 structural folds. Building on these results, we propose to: 1) Improve sensitivity of sequence profile similarity search, mainly by using known relationships between database sequences. 2) Develop software for accurate MSA of sequences with low similarity. The emphasis is being made on employment of structural features and predictions to improve MSA quality. 3) Design an easy to use web server for exploration of protein families. The server could be queried with a single sequence to find, align and analyze its homologs, or with a set of sequences. The central component of the server is MSA using the software developed in this project. 4) Assemble a database of high quality MSAs for kinases and their relatives, and make testable structure-functional predictions for groups without experimental annotations. Since kinases attract considerable attention due to their medical relevance (e.g. cancer studies), this database should be a valuable asset to researchers.

Public Health Relevance

Accurate multiple sequence alignment is the major unsolved problem in protein bioinformatics. Alignments and sequence similarity searches are essential first steps in experimental design for all studies involving proteins, and the accuracy of these methods is crucial for the success of biomedical research. We will improve alignment accuracy and using the new method will analyze kinases, which are a medically important group of enzymes attracting high interest because of their relevance to many diseases, cancer in particular.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 5R01GM094575-04
Application #: 8490396
Study Section: Macromolecular Structure and Function D Study Section (MSFD)
Program Officer: Wehrle, Janna P

Project Start: 2010-07-01
Project End: 2014-06-30
Budget Start: 2013-07-01
Budget End: 2014-06-30
Support Year: 4
Fiscal Year: 2013
Total Cost: $296,207
Indirect Cost: $109,914

Institution

Name: University of Texas Sw Medical Center Dallas
Department: Biochemistry
Type: Schools of Medicine
DUNS #: 800771545

City: Dallas
State: TX
Country: United States
Zip Code: 75390

Related projects


NIH 2017 R01 GM	Computational methods for structural-functional studies of protein Grishin, Nick V. / University of Texas Sw Medical Center Dallas
NIH 2016 R01 GM	Computational methods for structural-functional studies of protein Grishin, Nick V. / University of Texas Sw Medical Center Dallas
NIH 2015 R01 GM	Computational methods for structural-functional studies of protein Grishin, Nick V. / University of Texas Sw Medical Center Dallas	$287,975
NIH 2014 R01 GM	Computational methods for structural-functional studies of protein Grishin, Nick V. / University of Texas Sw Medical Center Dallas
NIH 2013 R01 GM	Computational methods for structural-functional studies of proteins Grishin, Nick V. / University of Texas Sw Medical Center Dallas	$296,207
NIH 2012 R01 GM	Computational methods for structural-functional studies of proteins Grishin, Nick V. / University of Texas Sw Medical Center Dallas	$306,789
NIH 2011 R01 GM	Computational methods for structural-functional studies of proteins Grishin, Nick V. / University of Texas Sw Medical Center Dallas	$305,984
NIH 2010 R01 GM	Computational methods for structural-functional studies of proteins Grishin, Nick V. / University of Texas Sw Medical Center Dallas	$309,075

Publications

Medvedev, Kirill E; Kinch, Lisa N; Grishin, Nick V (2018) Functional and evolutionary analysis of viral proteins containing a Rossmann-like fold. Protein Sci 27:1450-1463

Schaeffer, R Dustin; Liao, Yuxing; Cheng, Hua et al. (2017) ECOD: new developments in the evolutionary classification of domains. Nucleic Acids Res 45:D296-D302

Zhang, Jing; Kinch, Lisa N; Cong, Qian et al. (2017) Assessing predictions of fitness effects of missense mutations in SUMO-conjugating enzyme UBE2I. Hum Mutat 38:1051-1063

Zhang, Jing; Cong, Qian; Fan, Xiao-Ling et al. (2017) Mitogenomes of Giant-Skipper Butterflies reveal an ancient split between deep and shallow root feeders. F1000Res 6:222

Pei, Jimin; Grishin, Nick V (2017) Expansion of divergent SEA domains in cell surface proteins and nucleoporin 54. Protein Sci 26:617-630

Janzen, Daniel H; Burns, John M; Cong, Qian et al. (2017) Nuclear genomes distinguish cryptic species suggested by their DNA barcodes and ecology. Proc Natl Acad Sci U S A 114:8313-8318

Fédry, Juliette; Liu, Yanjie; Péhau-Arnaudet, Gérard et al. (2017) The Ancient Gamete Fusogen HAP2 Is a Eukaryotic Class II Fusion Protein. Cell 168:904-915.e10

Cong, Qian; Shen, Jinhui; Li, Wenlin et al. (2017) The first complete genomes of Metalmarks and the classification of butterfly families. Genomics 109:485-493

Gao, Qiang; Binns, Derk D; Kinch, Lisa N et al. (2017) Pet10p is a yeast perilipin that stabilizes lipid droplets and promotes their assembly. J Cell Biol 216:3199-3217

Li, Peng; Kinch, Lisa N; Ray, Ann et al. (2017) Acute Hepatopancreatic Necrosis Disease-Causing Vibrio parahaemolyticus Strains Maintain an Antibacterial Type VI Secretion System with Versatile Effector Repertoires. Appl Environ Microbiol 83:

Showing the most recent 10 out of 89 publications

Comments

Be the first to comment on Nick Grishin's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: