The most highly conserved regions of proteins can be represented as blocks of aligned sequence segments, typically with multiple blocks for a given protein family. The investigators have developed and validated improved methodologies for multiple alignment comparison: position-based sequence weights efficiently neutralize sequence redundancy in a simple and intuitive way, position-based pseudo-counts greatly improve searching performance over current widely used methods for column representation, blocks-versus-blocks searching allows sensitive detection of subtle relationships, and embedding methods effectively deal with problems caused by non-conserved regions between blocks. In addition, the investigators expanded the Blocks WWW and e-mail sites by adding Block Maker for making blocks from user-defined families, LAMA for searching the Blocks Database with user-generated blocks, COBBLER for automated sequence database searching of consensus-embedded sequences, sequence logo displays of blocks, and automated tree-making from multi-block alignments. Furthermore, the Blocks Database has been expanded by merging automatically generated Prosite-based blocks with manually generated Print blocks for purposes of searching and documentation retrieval. The investigators have begun to exploit these powerful new tools to address biological questions of interest in the laboratory, including characterization of novel DNA methyltransferases and inteins. They propose to continue expansion of their Internet site by improving methodologies and adding new capabilities, and to exploit these new biological discoveries experimentally. One new capability that they propose is full implementation of a new strategy for degenerate PCR primer design and its experimental verification using C(5) DNA methyltransferases. This effort could yield new eukaryotic family members of great importance for understanding epigenetic phenomena, some of which, such as parental imprinting, are relevant to human disease.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM029009-19
Application #
2900541
Study Section
Genome Study Section (GNM)
Project Start
1981-04-01
Project End
2002-03-31
Budget Start
1999-04-01
Budget End
2000-03-31
Support Year
19
Fiscal Year
1999
Total Cost
Indirect Cost
Name
Fred Hutchinson Cancer Research Center
Department
Type
DUNS #
075524595
City
Seattle
State
WA
Country
United States
Zip Code
98109
Ng, Pauline C; Henikoff, Steven (2003) SIFT: Predicting amino acid changes that affect protein function. Nucleic Acids Res 31:3812-4
Tompa, Rachel; McCallum, Claire M; Delrow, Jeffrey et al. (2002) Genome-wide profiling of DNA methylation reveals transposon targets of CHROMOMETHYLASE3. Curr Biol 12:65-8
Lindroth, A M; Cao, X; Jackson, J P et al. (2001) Requirement of CHROMOMETHYLASE3 for maintenance of CpXpG methylation. Science 292:2077-80
Colbert, T; Till, B J; Tompa, R et al. (2001) High-throughput screening for induced point mutations. Plant Physiol 126:480-4
Kunin, V; Chan, B; Sitbon, E et al. (2001) Consistency analysis of similarity between multiple alignments: prediction of protein function and fold structure from analysis of local sequence motifs. J Mol Biol 307:939-49
Malik, H S; Eickbush, T H (2001) Phylogenetic analysis of ribonuclease H domains suggests a late, chimeric origin of LTR retrotransposable elements and retroviruses. Genome Res 11:1187-97
Malik, H S; Henikoff, S (2000) Dual recognition-incision enzymes might be involved in mismatch repair and meiosis. Trends Biochem Sci 25:414-8
Henikoff, J G; Pietrokovski, S; McCallum, C M et al. (2000) Blocks-based methods for detecting protein homology. Electrophoresis 21:1700-6
McCallum, C M; Comai, L; Greene, E A et al. (2000) Targeted screening for induced mutations. Nat Biotechnol 18:455-7
Malik, H S; Burke, W D; Eickbush, T H (2000) Putative telomerase catalytic subunits from Giardia lamblia and Caenorhabditis elegans. Gene 251:101-8

Showing the most recent 10 out of 44 publications