We investigate and develop statistical methods for analysis and prediction of protein structure from sequence information. Lately, we have sought to incorporate multiple-alignments of homologous sequences into our prediction methods. Such alignments can be utilized in many ways, and have yielded improvements in secondary structure prediction of up to 10%. We are seeking to explain these improvements in terms of sequence profiles, residue mutability, and the propensity to permit insertions and deletions in the sequence. Although it is possible to determine mutation-permissive locations in such alignments, it appears that the major improvements in prediction accuracy come from determining the residue profile and the gap-propensity, and not from the apparent mutability of particular residues. We have developed a novel condensed representatiion for protein secondary structure which permits summarization of the entire PDB (Protein Data Bank) on only a few sheets of paper. This has facilitated our attempts to organize and classify the PDB so that improved class-prediction may result in better structure prediction. We have attempted to predict reverse-turn classification from sequence, as this is hypothesized to be an important determinant of protein secondary structure. Using molecular-dynamics simulations, we have sought energy minimal structures for short residue fragments, and are in the process of correlating them with the relative frequency of occurence of such turns in the PDB. Our program MUSEQAL for multiple alignment of protein sequences, was implemented on a parallel architecture, using """"""""speculative computation"""""""" techniques. The properties of this new program have been extensively investigated, and reveal that in typical situations, as few as 70% of residue pairs may be aligned the same way by two different alignment programs, although both produce acceptable alignments. This finding highlights the unreliability of parts of many alignments. A new program KINFIT II was fully developed and tested, resulting in a new tool for ligand-receptor kinetic analysis. Advice and consultation were given to several groups at NIH for using this and the LIGAND and ALLFIT programs. Several hundred copies of these programs and documentation have been distributed so far.

Agency
National Institute of Health (NIH)
Institute
Center for Information Technology (CIT)
Type
Intramural Research (Z01)
Project #
1Z01CT000227-05
Application #
5201617
Study Section
Project Start
Project End
Budget Start
Budget End
Support Year
5
Fiscal Year
1995
Total Cost
Indirect Cost
Name
Center for Information Technology
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Pajevic, Sinisa; Plenz, Dietmar (2009) Efficient network reconstruction from dynamical cascades identifies small-world topology of neuronal avalanches. PLoS Comput Biol 5:e1000271
McQueen, Philip G; McKenzie, F Ellis (2008) Host control of malaria infections: constraints on immune and erythropoeitic response kinetics. PLoS Comput Biol 4:e1000149
Coppey, Mathieu; Boettiger, Alistair N; Berezhkovskii, Alexander M et al. (2008) Nuclear trapping shapes the terminal gradient in the Drosophila embryo. Curr Biol 18:915-9
Sam, Vichetra; Tai, Chin-Hsien; Garnier, Jean et al. (2008) Towards an automatic classification of protein structural domains based on structural similarity. BMC Bioinformatics 9:74
Hendler, Richard W; Shrager, Richard I; Meuse, Curtis W (2008) The ability of actinic light to modify the bacteriorhodopsin photocycle revisited: heterogeneity vs photocooperativity. Biochemistry 47:5406-16
Williams, Ruth R E; Azuara, Veronique; Perry, Pascale et al. (2006) Neural induction promotes large-scale chromatin reorganisation of the Mash1 locus. J Cell Sci 119:132-40
Nishizuka, Satoshi; Washburn, Newell R; Munson, Peter J (2006) Evaluation method of ordinary flatbed scanners for quantitative density analysis. Biotechniques 40:442, 444, 446 passim
McQueen, Philip G; McKenzie, F Ellis (2006) Competition for red blood cells can enhance Plasmodium vivax parasitemia in mixed-species malaria infections. Am J Trop Med Hyg 75:112-25
Sam, Vichetra; Tai, Chin-Hsien; Garnier, Jean et al. (2006) ROC and confusion analysis of structure comparison methods identify the main causes of divergence from manual protein classification. BMC Bioinformatics 7:206
Knodler, Leigh A; Steele-Mortimer, Olivia (2005) The Salmonella effector PipB2 affects late endosome/lysosome distribution to mediate Sif extension. Mol Biol Cell 16:4108-23

Showing the most recent 10 out of 15 publications