Advances in technology over the last decade have resulted in a golden age of genomics in life sciences. Sequencing technology has yielded 169 completed and published genomes, including human, and there are 432 prokaryotic and 367 eukaryotic genome-sequencing projects ongoing. This explosion in sequence information has been coupled with the development of many high throughput technologies, such as microarrays, which have given us an unprecedented capacity to assay almost every aspect of genomes and the biological systems they direct. A critical gap in our framework for comprehensive understanding of biological systems is that despite having finished genome sequences for 21 eukaryotes, there is probably no single eukaryote for which we can say with confidence that we have a full list of all the genes and their products. The budding yeast, Saccharomyces cerevisiae, was the first eukaryotic genome to be fully sequenced, and is arguably the pre-eminent organism for genomic studies, and many key eukaryotic pathways and mechanisms have been elucidated in yeast. In the eight years since S. cerevisiae was sequenced, numerous studies have resulted in the addition or removal of genes from the Saccharomyces Genome Database, yet hundreds of ambiguities remain. This application seeks to use a novel iterative custom microarray approach to comprehensively define the genic potential of the yeast genome, by identifying and sizing all transcripts within the genome, and determining whether they are protein coding or not. Initial characterization of novel transcripts, to define their functions, will be carried out. Preliminary tudies suggest that there is a wealth of undiscovered genes in yeast, and that this study will form a prototypic methodology for rapidly determining the transcriptomes of a wide variety of organisms, and for providing an interpretive framework for initial functional characterization of these transcriptomes.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG003468-02
Application #
7022956
Study Section
Genome Study Section (GNM)
Program Officer
Feingold, Elise A
Project Start
2005-02-25
Project End
2008-01-31
Budget Start
2006-02-01
Budget End
2007-01-31
Support Year
2
Fiscal Year
2006
Total Cost
$430,829
Indirect Cost
Name
Stanford University
Department
Genetics
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94305
Ropars, Jeanne; Maufrais, Corinne; Diogo, Dorothée et al. (2018) Gene flow contributes to diversification of the major fungal pathogen Candida albicans. Nat Commun 9:2253
Muzzey, Dale; Sherlock, Gavin; Weissman, Jonathan S (2014) Extensive and coordinated control of allele-specific expression by both transcription and translation in Candida albicans. Genome Res 24:963-73
Muzzey, Dale; Schwartz, Katja; Weissman, Jonathan S et al. (2013) Assembly of a phased diploid Candida albicans genome facilitates allele-specific measurements and provides a simple model for repeat and indel structure. Genome Biol 14:R97
Risso, Davide; Schwartz, Katja; Sherlock, Gavin et al. (2011) GC-content normalization for RNA-Seq data. BMC Bioinformatics 12:480
Lee, Albert; Hansen, Kasper Daniel; Bullard, James et al. (2008) Novel low abundance and transient RNAs in yeast revealed by tiling microarrays and ultra high-throughput sequencing are not conserved across closely related yeast species. PLoS Genet 4:e1000299