This project extends and applies the hidden Markov model methods, which were developed under previous NSF support, to the problems of finding remote homologies between proteins, producing multiple alignments of related protein domains, and classifying protein domains into families and subfamilies based on common structure, function and phylogenetic relationship. New methods of combining mRNA expression data along with sequence similarity to do functional analysis and discover pathways of interactions between proteins are also being developed. More sophisticated models of protein structure will provide better recognition of active sites and better detection of remote homologies. In developing more powerful and comprehensive computational methods for protein classification and analysis, and making them available on the world wide web, this work will contribute to the basic study of a wide variety of fundamental protein families and may also lead to the discovery of important new protein families.

Agency
National Science Foundation (NSF)
Institute
Division of Biological Infrastructure (DBI)
Application #
9808007
Program Officer
Sylvia J. Spengler
Project Start
Project End
Budget Start
1998-10-01
Budget End
2003-03-31
Support Year
Fiscal Year
1998
Total Cost
$560,773
Indirect Cost
Name
University of California Santa Cruz
Department
Type
DUNS #
City
Santa Cruz
State
CA
Country
United States
Zip Code
95064