With the complete sequencing of numerous genomes and the annotation of proteomes, one of the next major challenges in biology is to understand the functions and integration of the encoded proteins (a NIH Roadmap area of emphasis). Deciphering protein function is a very time consuming, expensive process, as reflected by the disproportionately low percentage of proteins with well-established functions. One approach for extrapolating established functions to new proteins is to predict short motifs. Short motifs target proteins for post-translational modification, trafficking to cellular compartments, and binding to other proteins or molecules. Our cross-disciplinary team has built Minimotif Miner (MnM), a short motif database and platform- independent web-tool that identifies motif consensus sequences in protein queries and thus potential new protein functions (http://mnm.engr.uconn.edu/). MnM can also be used to develop new hypotheses of how specific mutations cause human disease and to identify putative targets for the development of therapeutic drugs, antibiotics, insecticides, and antiviral agents. Despite the utility of MnM and other motif resources, prediction of functional motifs still has two major limitations, which we address in this proposal. 1) To reduce the false-positive prediction of motifs, we have created a new language that allows us to consider the 3- dimensional structural conservation of motifs. For each motif, we will build specific motif definitions by combining experimental data with data from motif structures in the Protein Data Bank. We will also determine the sequence permutations that can form the observed motif structure by using molecular dynamic simulations. 2) To build a more comprehensive motif database we will use artificial intelligence to mine PubMed. The expert system will use automated literature screening, document summarization, and motif identification efficiency score to extract the majority of known motifs from PubMed. Addressing these limitations will vastly increase the utility of short motif prediction.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM079689-03
Application #
7617090
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Remington, Karin A
Project Start
2007-05-01
Project End
2009-07-31
Budget Start
2009-05-01
Budget End
2009-07-31
Support Year
3
Fiscal Year
2009
Total Cost
$60,937
Indirect Cost
Name
University of Connecticut
Department
Neurosciences
Type
Schools of Medicine
DUNS #
022254226
City
Farmington
State
CT
Country
United States
Zip Code
06030
Sharma, Surbhi; Young, Richard J; Chen, Jingchun et al. (2018) Minimotifs dysfunction is pervasive in neurodegenerative disorders. Alzheimers Dement (N Y) 4:414-432
Sharma, Surbhi; Toledo, Oniel; Hedden, Michael et al. (2016) The Functional Human C-Terminome. PLoS One 11:e0152731
Schiller, Martin R (2016) The minimotif synthesis hypothesis for the origin of life. J Transl Sci 2:289-296
Lyon, Kenneth F; Strong, Christy L; Schooler, Steve G et al. (2015) Natural variability of minimotifs in 1092 people indicates that minimotifs are targets of evolution. Nucleic Acids Res 43:6399-412
Kundeti, Vamsi; Rajasekaran, Sanguthevar; Dinh, Hieu (2014) Border length minimization problem on a square array. J Comput Biol 21:446-55
Kundeti, Vamsi; Rajasekaran, Sanguthevar (2012) Self assembly of rectangular shapes on concentration programming and probabilistic tile assembly models. Nat Comput 11:
Mi, Tian; Rajasekaran, Sanguthevar; Merlin, Jerlin Camilus et al. (2012) Achieving high accuracy prediction of minimotifs. PLoS One 7:e45589
Merlin, Jerlin C; Rajasekaran, Sanguthevar; Mi, Tian et al. (2012) Reducing false-positive prediction of minimotifs with a genetic interaction filter. PLoS One 7:e32630
Mi, Tian; Merlin, Jerlin Camilus; Deverasetty, Sandeep et al. (2012) Minimotif Miner 3.0: database expansion and significantly improved reduction of false-positive predictions from consensus sequences. Nucleic Acids Res 40:D252-60
Sargeant, David P; Gryk, Michael R; Maciejewski, Mark W et al. (2012) Secondary structure, a missing component of sequence-based minimotif definitions. PLoS One 7:e49957

Showing the most recent 10 out of 23 publications