A major objective of bioinformatics is to predict protein structure/function from sequence; the most successful current methods use sequence homology. These current homology-based methods explicitly or implicitly assume the paradigm sequence yields structure yields function. Since protein function depends on flexibility, movement, or even lack of structure (disorder) in some cases, combining various motion predictions with structure predictions should improve prediction of function. Algorithms for predicting flexibility from sequence and local disorder have been developed, as have algorithms to predict sequences that snitch between two structured states. Sequence complexity might also be an indirect measure of mobility and still other measures could be discovered by the study of known examples. We propose to determine the interplay of homology-based predictions that use sequence information only with explicit representations of both structure information and motion information as determined from amino acid sequence. Structural information will be represented by helix, sheet, etc., predictions, and by hydrophobic moment calculations. Motion information will be represented by flexibility prediction, switch sequence prediction, order / disorder prediction, sequence complexity, and new measures if any are discovered in the course of this work. Since different functions would involve motional parameters to different extents, the plan is to apply this combined approach on a sequence family basis. Novel comparisons, called Attribute Profiles, are proposed for the representation of structure and motion information. Sets of Gribskov/Eisenberg 1D Profiles and associated Attribute Profiles will be combined into single predictions using neural network data models. Prediction outcomes will be compared with experimental observations and evaluated using the jackknife method. If successful, this work will improve prediction of protein function from amino acid sequence, which is important given the significant numbers of protein sequences with undetermined functions coming out of the various genome projects. Motion and disorder are important pieces that need to be added to the characterization of protein structure/function. For example, taxol has been shown recently to bind to a disordered loop in Bcl-2. Alzheimer disease, transmissible spongiform encephalopathies, Parkinson disease and infectious agents such as Staphylococcal aureus, foot-and-mouth disease virus, and (perhaps) HIV depend critically on disordered regions of protein. Thus, understanding the interplay of sequence, flexibility, order / disorder, complexity, structure and function clearly relates to human health.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
5R01LM006916-02
Application #
6391288
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Florance, Valerie
Project Start
2000-05-01
Project End
2003-04-30
Budget Start
2001-05-01
Budget End
2002-04-30
Support Year
2
Fiscal Year
2001
Total Cost
$332,118
Indirect Cost
Name
Washington State University
Department
Biochemistry
Type
Schools of Arts and Sciences
DUNS #
041485301
City
Pullman
State
WA
Country
United States
Zip Code
99164
Xue, Bin; Romero, Pedro R; Noutsou, Maria et al. (2013) Stochastic machines as a colocalization mechanism for scaffold protein function. FEBS Lett 587:1587-91
Radivojac, Predrag; Obradovic, Zoran; Smith, David K et al. (2004) Protein flexibility and intrinsic disorder. Protein Sci 13:71-80
Smith, David K; Radivojac, Predrag; Obradovic, Zoran et al. (2003) Improved amino acid flexibility parameters. Protein Sci 12:1060-72
Vucetic, Slobodan; Brown, Celeste J; Dunker, A Keith et al. (2003) Flavors of protein disorder. Proteins 52:573-84
Radivojac, Predrag; Obradovic, Zoran; Brown, Celeste J et al. (2003) Prediction of boundaries between intrinsically ordered and disordered protein regions. Pac Symp Biocomput :216-27
Obradovic, Zoran; Peng, Kang; Vucetic, Slobodan et al. (2003) Predicting intrinsic disorder from amino acid sequence. Proteins 53 Suppl 6:566-72
Brown, Celeste J; Takayama, Sachiko; Campen, Andrew M et al. (2002) Evolutionary rate heterogeneity in proteins with long disordered regions. J Mol Evol 55:104-10
Dunker, A Keith; Brown, Celeste J; Lawson, J David et al. (2002) Intrinsic disorder and protein function. Biochemistry 41:6573-82
Radivojac, Predrag; Obradovic, Zoran; Brown, Celeste J et al. (2002) Improving sequence alignments for intrinsically disordered proteins. Pac Symp Biocomput :589-600
Iakoucheva, Lilia M; Brown, Celeste J; Lawson, J David et al. (2002) Intrinsic disorder in cell-signaling and cancer-associated proteins. J Mol Biol 323:573-84

Showing the most recent 10 out of 15 publications