The project is divided into 3 subprojects: 1.Iterative automatic methods for refining multiple sequence alignment. Currently starting point of constructing CDD database is a multiple alignment imported form protein family models as SMART and PFAM. Since all known multiple sequence alignment algorithms are less than perfect the imported alignment have errors that are now removed by hand. The goal of the project is to provide reduce and simplify this manual work. 2.New approaches for detecting structural similarities. The goal of this subproject is to develop new approaches for fast detection of structural similarities that would complement currently used method (VAST). Thus particular attention is given to loop structure and developing of indexing methods. 3.Applications of graph theory similarity relations between domains. The similarity relation between proteins sequences or structures can be conveniently represented as a graph. The goal of this subproject is to study the properties of such graphs and develop tools for their analysis.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM779892-01
Application #
6828685
Study Section
(CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
2003
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Jeong, Jieun; Berman, Piotr; Przytycka, Teresa M (2008) Improving strand pairing prediction through exploring folding cooperativity. IEEE/ACM Trans Comput Biol Bioinform 5:484-91
Zheng, Jie; Rogozin, Igor B; Koonin, Eugene V et al. (2007) Support for the Coelomata clade of animals from a rigorous analysis of the pattern of intron conservation. Mol Biol Evol 24:2583-92
Zotenko, Elena; Dogan, Rezarta Islamaj; Wilbur, W John et al. (2007) Structural footprinting in protein structure comparison: the impact of structural fragments. BMC Struct Biol 7:53
Jothi, Raja; Przytycka, Teresa M; Aravind, L (2007) Discovering functional linkages and uncharacterized cellular pathways using phylogenetic profile comparisons: a comprehensive assessment. BMC Bioinformatics 8:173
Kann, Maricel G; Jothi, Raja; Cherukuri, Praveen F et al. (2007) Predicting protein domain interactions from coevolution of conserved regions. Proteins 67:811-20
Jothi, Raja; Zotenko, Elena; Tasneem, Asba et al. (2006) COCO-CL: hierarchical clustering of homology relations based on evolutionary correlations. Bioinformatics 22:779-88
Jeong, Jieun; Berman, Piotr; Przytycka, Teresa (2006) Fold classification based on secondary structure--how much is gained by including loop topology? BMC Struct Biol 6:3
Chakrabarti, Saikat; Lanczycki, Christopher J; Panchenko, Anna R et al. (2006) State of the art: refinement of multiple sequence alignments. BMC Bioinformatics 7:499
Jothi, Raja; Cherukuri, Praveen F; Tasneem, Asba et al. (2006) Co-evolutionary analysis of domains in interacting proteins reveals insights into domain-domain interactions mediating protein-protein interactions. J Mol Biol 362:861-75
Chakrabarti, Saikat; Lanczycki, Christopher J; Panchenko, Anna R et al. (2006) Refining multiple sequence alignments with conserved core regions. Nucleic Acids Res 34:2598-606

Showing the most recent 10 out of 13 publications