Completion of the DNA sequence of the yeast genome has opened a large number of questions about the organization and expression of eukaryotic genomes to investigation. Important among these is the definition of the complete minimum protein set necessary for eukaryotic cell growth and regulation, which is key to understanding human cancer. A hallmark of the eukaryotes is the abundant presence of introns, internal gene sequences not found in the mature messenger RNAs (mRNAs) that specify the protein coding capacity of the genome. The presence of introns clouds our ability to see open reading frames in the genomic sequence. To understand the complete coding capacity of the yeast genome, and of other eukaryotic genomes, we must first be able to recognize introns in the genomic sequence. With the complete sequence of the yeast genome in hand, we have the opportunity to map the positions of all the nuclear pre-mRNA introns in the yeast genome, and thus reveal its protein coding capacity. At this writing 220 yeast introns are known or predicted, but these have been identified in a biased, ad hoc fashion. We have developed a powerful molecular approach that detects introns directly, in a manner not biased by the contents of the genes in which they are embedded. Oligonucleotides complementary to the unique lariat sequence formed during splicing ("branchmers") specifically prime reverse transcription of lariat intron RNA. Mutations that inactivate the lariat debranching enzyme cause dramatic accumulation of intron RNA in yeast. Thus branchmer oligonucleotides will be used to generate expressed intron probes.
Our aims are (1) to create and screen libraries of "expressed intron tag" clones derived from strains of yeast that accumulate large amounts of intron RNA, and to sequence these clones to generate a database of expressed intron sequences; (2) to identify genomic sequences similar to known introns using informatic approaches and test these for splicing potential in vivo; and (3) to refine each approach through repeated application until a complete set of confirmed introns is mapped to the sequence of the genome. Finding all the introns will be essential to the complete understanding of the coding capacity of the genome.
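As a rough illustration of the informatic approach in aim (2), the sketch below scans a DNA sequence for spans that resemble a yeast intron. It is not the project's method. The motifs used (5' splice site GTATGT, branchpoint TACTAAC, 3' splice site YAG) are the commonly cited S. cerevisiae consensus sequences and are an assumption here, since the abstract does not name them. The function name `find_candidate_introns` and the length thresholds are hypothetical, chosen only for the example.

```python
import re

# Assumed consensus motifs for S. cerevisiae introns (not given in the abstract).
FIVE_PRIME = "GTATGT"                 # 5' splice site
BRANCHPOINT = "TACTAAC"               # branchpoint sequence
THREE_PRIME = re.compile(r"[CT]AG")   # 3' splice site, YAG

def find_candidate_introns(seq, min_len=50, max_len=1100, max_bp_to_3ss=60):
    """Return (start, end) spans, 0-based and half-open, of candidate introns.

    Only the forward strand and exact motif matches are considered; the
    length limits are illustrative defaults, not measured values.
    """
    seq = seq.upper()
    candidates = []
    start = seq.find(FIVE_PRIME)
    while start != -1:
        # Look for each branchpoint downstream of this 5' splice site.
        bp = seq.find(BRANCHPOINT, start + len(FIVE_PRIME))
        while bp != -1 and bp - start <= max_len:
            bp_end = bp + len(BRANCHPOINT)
            # Take the first YAG after the branchpoint as the 3' splice site.
            match = THREE_PRIME.search(seq, bp_end)
            if match and match.start() - bp_end <= max_bp_to_3ss:
                end = match.end()
                if min_len <= end - start <= max_len:
                    candidates.append((start, end))
            bp = seq.find(BRANCHPOINT, bp + 1)
        start = seq.find(FIVE_PRIME, start + 1)
    return candidates

# Toy sequence: short exon, synthetic intron, short exon.
toy = "ATGAAA" + "GTATGT" + "A" * 40 + "TACTAAC" + "A" * 10 + "TAG" + "GGGCCC"
print(find_candidate_introns(toy))
```

Exact matching of this kind would miss introns with variant splice signals, which is one reason the abstract pairs sequence-based prediction with in vivo tests of splicing and with the unbiased branchmer approach.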

Agency: National Institutes of Health (NIH)
Institute: National Cancer Institute (NCI)
Type: Exploratory/Developmental Grants (R21)
Project #: 5R21CA077813-02
Application #: 2749001
Study Section: Special Emphasis Panel (ZHG1-HGR-N (J1))
Program Officer: Marks, Cheryl L
Project Start: 1997-08-15
Project End: 2000-07-31
Budget Start: 1998-08-01
Budget End: 2000-07-31
Support Year: 2
Fiscal Year: 1998
Total Cost: (not listed)
Indirect Cost: (not listed)
Name: University of California Santa Cruz
Department: Biochemistry
Type: Schools of Arts and Sciences
DUNS #: (not listed)
City: Santa Cruz
State: CA
Country: United States
Zip Code: 95064
Shvetsov, Yurii B; Hernandez, Brenda Y; Wong, Sze H et al. (2009) Intraindividual variability in serum micronutrients: effects on reliability of estimated parameters. Epidemiology 20:36-43
Grate, Leslie; Ares Jr, Manuel (2002) Searching yeast intron data at Ares lab Web site. Methods Enzymol 350:380-92
Clark, Tyson A; Sugnet, Charles W; Ares Jr, Manuel (2002) Genomewide analysis of mRNA processing in yeast using splicing-specific microarrays. Science 296:907-10
Davis, C A; Grate, L; Spingola, M et al. (2000) Test of intron predictions reveals novel splice sites, alternatively spliced mRNAs and new introns in meiotically regulated genes of yeast. Nucleic Acids Res 28:1700-6
Brown, M P; Grundy, W N; Lin, D et al. (2000) Knowledge-based analysis of microarray gene expression data by using support vector machines. Proc Natl Acad Sci U S A 97:262-7
Ares Jr, M; Grate, L; Pauling, M H (1999) A handful of intron-containing genes produces the lion's share of yeast mRNA. RNA 5:1138-9
Spingola, M; Grate, L; Haussler, D et al. (1999) Genome-wide bioinformatic and molecular analysis of introns in Saccharomyces cerevisiae. RNA 5:221-34