Protein sequence, structure, and computational analysis

Pollock, David

Abstract

: The proposed research will increase understanding of the relationship between protein sequence and function through development of innovative computational and statistical technologies. The approach is designed to maximize information extracted from datasets that include dense sampling of sequences from diverse taxa. The development of new and fast phylogeny-based likelihood methods will allow researchers to take advantage of large multi-protein datasets sampled over a range and density of biodiversity that is currently uncommon, but will increase rapidly in the near future. In the first phase, the project will develop novel computational methods to analyze patterns of protein evolution and coevolution, create a fast method for analyzing large, taxonomically diverse datasets, and evaluate the utility and accuracy of model approximations using this method, and begin to develop methods to manage and visualize sequence, structure, function, and phylogenetic information from large, taxonomically diverse datasets. In the second phase, it will further develop novel computational methods to analyze patterns of protein evolution and coevolution, apply analytical tools to a broad range of proteins and protein complexes, implement computer programs employing these methods that are accessible to the general community, and provide filtered access to protein sequence biodiversity data for easy analysis and visualization. The long-term goal of this project is to understand the relationship between sequence diversity and structure such that more accurate predictions of the effect of substitution can be made. It will determine the value of taxonomic diversity in predicting functional and structural information. By focusing on the near-human evolutionary environment (the vertebrates), results will be directly applicable towards understanding the structural context of human proteins and the effect of substitutions in human proteins that may lead to both single locus and quantitative disease.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Exploratory/Developmental Grants Phase II (R33)
Project #: 4R33GM065612-03
Application #: 6834875
Study Section: Special Emphasis Panel (ZRG1-SSS-H (01))
Program Officer: Wehrle, Janna P

Project Start: 2002-08-01
Project End: 2006-07-31
Budget Start: 2004-08-01
Budget End: 2005-07-31
Support Year: 3
Fiscal Year: 2004
Total Cost: $222,165
Indirect Cost

Institution

Name: Louisiana State University A&M Col Baton Rouge
Department: Biology
Type: Schools of Arts and Sciences
DUNS #: 075050765

City: Baton Rouge
State: LA
Country: United States
Zip Code: 70803

Related projects


NIH 2005 R33 GM	Protein sequence, structure, and computational analysis Pollock, David D. / Louisiana State University A&M Col Baton Rouge	$21,076
NIH 2005 R33 GM	Protein sequence, structure, and computational analysis Pollock, David D. / University of Colorado Denver	$231,140
NIH 2004 R33 GM	Protein sequence, structure, and computational analysis Pollock, David D. / Louisiana State University A&M Col Baton Rouge	$222,165

Publications

Castoe, Todd A; Poole, Alexander W; Gu, Wanjun et al. (2010) Rapid identification of thousands of copperhead snake (Agkistrodon contortrix) microsatellite loci from modest amounts of 454 shotgun genome sequence. Mol Ecol Resour 10:341-7

de Koning, A P Jason; Gu, Wanjun; Pollock, David D (2010) Rapid likelihood analysis on large phylogenies using partial sampling of substitution histories. Mol Biol Evol 27:249-65

Castoe, T A; Gu, W; de Koning, A P J et al. (2009) Dynamic nucleotide mutation gradients and control region usage in squamate reptile mitochondrial genomes. Cytogenet Genome Res 127:112-27

Castoe, Todd A; de Koning, A P Jason; Kim, Hyun-Min et al. (2009) Evidence for an ancient adaptive episode of convergent molecular evolution. Proc Natl Acad Sci U S A 106:8986-91

Thai, Vu; Renesto, Patricia; Fowler, C Andrew et al. (2008) Structural, biochemical, and in vivo characterization of the first virally encoded cyclophilin from the Mimivirus. J Mol Biol 378:71-86

Gu, Wanjun; Castoe, Todd A; Hedges, Dale J et al. (2008) Identification of repeat structure in large genomes using repeat probability clouds. Anal Biochem 380:77-83

Mikkelsen, Tarjei S; Wakefield, Matthew J; Aken, Bronwen et al. (2007) Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature 447:167-77

Gu, Wanjun; Ray, David A; Walker, Jerilyn A et al. (2007) SINEs, evolution and genome structure in the opossum. Gene 396:46-58

Gentles, Andrew J; Wakefield, Matthew J; Kohany, Oleksiy et al. (2007) Evolutionary dynamics of transposable elements in the short-tailed opossum Monodelphis domestica. Genome Res 17:992-1004

Krishnan, Neeraja M; Seligmann, Herve; Stewart, Caro-Beth et al. (2004) Ancestral sequence reconstruction in primate mitochondrial DNA: compositional bias and effect on functional inference. Mol Biol Evol 21:1871-83

Comments

Be the first to comment on David Pollock's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: