PredictProtein (PP) was the first Internet server for protein structure prediction when it went online in 1992 at EMBL. Ever since it has also been the most widely used structure prediction server. Since 1999, PredictProtein runs without financial public support at Columbia University. Limited CPU resources, prevent us from applying the best current methods; limited human resources prevent us from making the results more readily available to molecular biologists. PP differs from most other resources in two ways. Firstly, it tries merging a variety of tools into one single report. Secondly, a number of methods are unique to PP, e.g. the PHD and PROF methods for predictions of secondary structure, solvent accessibility, and transmembrane helices. Here, we propose a variety of technical and scientific solutions improving the functionality of PredictProtein. (1) The technical solutions address job and data handling, database update, user interface, web page layout, presentation of results, and directly linking original resources. (2) The systematic combination of methods requires evaluating these in parallel on identical tasks, e.g., at which level of probability should a signal peptide prediction override the membrane prediction. Our major focus will be on improving predictions for membrane helical proteins, developing methods predicting beta-membrane proteins, and on using structure predictions to more accurately infer functional information. Improving membrane predictions has become particularly urgent, since the recently solved high-resolution structures revealed that all existing methods were over-estimated. We hope that a combination of existing and new methods and a refinement of the respective alignments used will considerably improve prediction accuracy. To predict beta-membrane proteins, we want to explore a combination of novel prediction methods based on neural networks and similar systems with a Markovian-like model that implements the observed grammar in these proteins. As a particular example for using structural information to improve the reliability of inferring function, we propose to investigate the conservation of enzymatic activity.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Research Project (R01)
Project #
1R01LM007329-01A1
Application #
6610635
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2003-05-01
Project End
2007-04-30
Budget Start
2003-05-01
Budget End
2004-04-30
Support Year
1
Fiscal Year
2003
Total Cost
$342,656
Indirect Cost
Name
Columbia University (N.Y.)
Department
Biochemistry
Type
Schools of Medicine
DUNS #
621889815
City
New York
State
NY
Country
United States
Zip Code
10032
Kaján, László; Yachdav, Guy; Vicedo, Esmeralda et al. (2013) Cloud prediction of protein structure and function with PredictProtein for Debian. Biomed Res Int 2013:398968
Schlessinger, Avner; Schaefer, Christian; Vicedo, Esmeralda et al. (2011) Protein disorder--a breakthrough invention of evolution? Curr Opin Struct Biol 21:412-8
Bigelow, Henry; Rost, Burkhard (2009) Online tools for predicting integral membrane proteins. Methods Mol Biol 528:3-23
Bromberg, Yana; Rost, Burkhard (2009) Correlating protein function and stability through the analysis of single amino acid substitutions. BMC Bioinformatics 10 Suppl 8:S8
Wrzeszczynski, Kazimierz O; Rost, Burkhard (2009) Cell cycle kinases predicted from conserved biophysical properties. Proteins 74:655-68
Kernytsky, Andrew; Rost, Burkhard (2009) Using genetic algorithms to select most predictive protein features. Proteins 75:75-88
Jiang, Guoqian; Chute, Christopher G (2009) Auditing the semantic completeness of SNOMED CT using formal concept analysis. J Am Med Inform Assoc 16:89-102
Bromberg, Yana; Yachdav, Guy; Ofran, Yanay et al. (2009) New in protein structure and function annotation: hotspots, single nucleotide polymorphisms and the 'Deep Web'. Curr Opin Drug Discov Devel 12:408-19
Bromberg, Yana; Overton, John; Vaisse, Christian et al. (2009) In silico mutagenesis: a case study of the melanocortin 4 receptor. FASEB J 23:3059-69
Meszaros, Balint; Simon, Istvan; Dosztanyi, Zsuzsanna (2009) Prediction of protein binding regions in disordered proteins. PLoS Comput Biol 5:e1000376

Showing the most recent 10 out of 41 publications