Protein Sequence Motif Discovery & Homology Search

Elkan, Charles

Abstract

Motifs--short, gapless regions of similar sequence--have been shown to be useful for understanding the evolutionary and functional relationships among biopolymers. We are making good progress on automating the process of extracting motif descriptions from groups of related protein or DNA sequences and using those descriptions to search sequence databases and to analyze newly sequenced molecules. Recent results show that our methods are able to detect distant and subtle relationships among proteins not apparent using other sequence-based methods. We have made progress on three fronts. We have improved the MEME [I] algorithm for motif discovery--Multiple Expectation maximization for Motif Elicitation; we have ported MEME to the Intel Paragon massively parallel computer and made it available to the world as WWW server; we have developed MAST--Motif Alignment Search Tool--based on a new method for assessing the statistical significance of multiple motif scores. The MEME algorithm has been improved to enable it to discover motif patterns in situations where little is known about the number or arrangements of motifs within the training sequences. This was accomplished using background information about protein motifs--encoded as a mixture of Dirichlet priors--in a novel way [2]. This technique dramatically improves the ability of MEME to discover motif patterns when the pattern only occurs in a few of the sequences in the training set and when the pattern is very weak but occurs multiple times in some sequences in the training set. The biological community is now served by a parallel implementation of MEME running on SDSC's Intel Paragon computer. We have made this service available via a world-wide web site [3] and are proceeding to advertise it. We expect it to be a valuable addition to single sequence search tools (e.g., BLAST, FAST) and multiple alignment tools because MEME patterns are able to detect more distant relationships than single sequence searches and because MEME can be used in situations where the sequences are too distantly related to be multiply aligned reliably. We have developed a new method for searching sequence databases with one or more motifs that characterize a protein family and implemented it in the MAST algorithm (Bailey and Gribskov, in preparation). One novel feature of this program is a method for calculating the p-value for multiple motif scores. This allows biologists to evaluate the statistical significance of apparent sequence similarities. We are planning to make MAST available on-line through a web site to complement the usefulness of the MEME web site. [l] T.L. Bailey and C. Elkan """"""""Fitting a mixture model by expectation maximization to discover motifs in biopolymers"""""""" , Proc. Second Int. Conf. Intelligent Sys. Molec. Biol., (28-36), AAAI Press, 1994. [2] T.L. Bailey and M. Gribskov """"""""The megaprior heuristic for discovering sequence patterns"""""""", To appear: Proc. Fourth Int. Conf. Intelligent Sys. Molec. Biol., AAAI Press, 1996. [3] W. Grundy, T.L. Bailey, C. Elkan """"""""ParaMEME: Discovering DNA and protein motifs with a scalable parallel Computer--RESEARCH ABSTRACT"""""""", To appear: Proc. Fourth Int. Conf. Intelligent Sys. Molec.Biol. AAAI Press, 1996.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Center for Research Resources (NCRR)
Type: Biotechnology Resource Grants (P41)
Project #: 7P41RR008605-03
Application #: 5225726
Study Section

Project Start
Project End
Budget Start
Budget End
Support Year: 3
Fiscal Year: 1996
Total Cost
Indirect Cost

Related projects

Publications

Pantoja, Joe Luis; Morgan, Ashley E; Grossi, Eugene A et al. (2017) Undersized Mitral Annuloplasty Increases Strain in the Proximal Lateral Left Ventricular Wall. Ann Thorac Surg 103:820-827

Morgan, Ashley E; Wozniak, Curtis J; Gulati, Sarthak et al. (2017) Association of Uneven MitraClip Application and Leaflet Stress in a Finite Element Model. JAMA Surg 152:111-114

Ge, Liang; Wu, Yife; Soleimani, Mehrdad et al. (2016) Moderate Ischemic Mitral Regurgitation After Posterolateral Myocardial Infarction in Sheep Alters Left Ventricular Shear but Not Normal Strain in the Infarct and Infarct Borderzone. Ann Thorac Surg 101:1691-9

Morgan, Ashley E; Pantoja, Joe Luis; Weinsaft, Jonathan et al. (2016) Finite Element Modeling of Mitral Valve Repair. J Biomech Eng 138:021009

Morgan, Ashley E; Pantoja, Joe L; Grossi, Eugene A et al. (2016) Neochord placement versus triangular resection in mitral valve repair: A finite element model. J Surg Res 206:98-105

Purvine, Emilie; Monson, Kyle; Jurrus, Elizabeth et al. (2016) Energy Minimization of Discrete Protein Titration State Models Using Graph Theory. J Phys Chem B 120:8354-60

Bucero, Marta Abril; Bajaj, Chandrajit; Mourrain, Bernard (2016) On the construction of general cubature formula by flat extensions. Linear Algebra Appl 502:104-125

Ebeida, Mohamed S; Rushdi, Ahmad A; Awad, Muhammad A et al. (2016) Disk Density Tuning of a Maximal Random Packing. Comput Graph Forum 35:259-269

Yang, Pei-Chi; Boras, Britton W; Jeng, Mao-Tsuen et al. (2016) A Computational Modeling and Simulation Approach to Investigate Mechanisms of Subcellular cAMP Compartmentation. PLoS Comput Biol 12:e1005005

Watson, Shana R; Liu, Piaomu; Peña, Edsel A et al. (2016) Comparison of Aortic Collagen Fiber Angle Distribution in Mouse Models of Atherosclerosis Using Second-Harmonic Generation (SHG) Microscopy. Microsc Microanal 22:55-62

Showing the most recent 10 out of 270 publications

Comments

Be the first to comment on Charles Elkan's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants:

Abstract

Funding Agency

Related projects

Publications

Comments