Comprehensive Mapping and Annotation of the E. coli Transcriptome

Wanner, Barry

Abstract

An interdisciplinary team of experimental biologists (Drs. Tyrrell Conway, Barry L. Wanner, and Daoguo Zhou), computational biologists (Drs. Michael R. Gribskov, Daisuke Kihara, and David W. Ussery), and mathematical modelers (Drs. Julio Collado-Vides and Bernhard O. Palsson) will tackle the challenge of creating whole transcriptome maps at the single nucleotide level of the model cell E. coli K-12. Results from high-throughput deep sequencing of cDNAs of total cellular RNA (RNA_Seq) will be used to generate comprehensive maps of all transcribed regions across the entire genome, to define computationally all large and small protein-encoding and non-encoding RNAs, and to quantify expression levels under a variety of growth conditions in wild-type cells and selected transcription factor mutants. Comprehensive maps of transcription start sites will be created by use of a protocol recently developed by our consultant Joerg Vogel to identify primary transcripts. These measurements will be used together with mathematical modeling to decode the first comprehensive transcriptional network of a living cell, thereby providing the framework for integration of measurements of different data types, from results for genetic interactions, protein-DNA interactions (ChIPchip and ChIP_Seq), protein-protein interactions, metabolomics, phenotyping, proteomics, cellular localization of E. coli proteins (e. g., imaging data for fluorescently tagged E. coli ASKA ORFeome clones at www.EcoliHub.org/GenoBase), three-dimensional imaging (electron tomography) of E. coli cells, and for other data sets generated elsewhere. These studies will be extended to other E. coli by development of whole transcriptome maps of pathogenic E. coli EDL933, the prototype terohemorrhagic E. coli O157:H7 (EHEC) during growth in vitro and in the mouse intestine, leading to creation of the first comprehensive extracellular in vivo expression transcriptome. These studies will be carried out with methods developed by our consultant Jay C. Hinton for isolation of bacterial RNA from mice (and infected cell cultures) for preparation of cDNAs for deep sequencing. Similar procedures will be used to generate whole transcriptome maps of Salmonella enterica serovar Typhimurium during growth in vitro and following infection of cultured macrophages and epithelial cells, thereby creating the first comprehensive intracellular in vivo expression transcriptome. Results obtained throughout the course of this project will be made public in accordance with NIH data sharing guidelines, for analysis, visualization, comparison, and downloading at www.EcoliHub.org/GenExpDB. Likewise, all computational tools implemented or developed in this project will be freely provided to users at www.EcoliHub.org. No organism can rival E. coli in the amount of baseline information and experimental tractability for all the measurements required for whole cell systems biology. The development of whole transcriptome maps of E. coli will lay the foundation for development of robust mathematical models of E. coli biochemistry and physiology and thereby the creation of a computerized, interactive """"""""virtual cell."""""""" Solving the E. coli cell will provide critical new insights into the fundamental nature of life.

Public Health Relevance

No other organism comes close to E. coli in the sheer depth or breadth of existing knowledge of its component parts or cellular processes. Understanding how these processes interact to form a living cell will require their characterization, quantification, integration, and mathematical modeling - that is, Systems Biology. A comprehensive whole transcriptome map of E. coli K-12 will provide the groundwork for predicting the behavior of other cells, including disease-causing microbes.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: NIH Challenge Grants and Partnerships Program (RC1)
Project #: 5RC1GM092047-02
Application #: 7945311
Study Section: Special Emphasis Panel (ZRG1-IMM-E (58))
Program Officer: Anderson, James J

Project Start: 2009-09-30
Project End: 2012-08-31
Budget Start: 2010-09-01
Budget End: 2012-08-31
Support Year: 2
Fiscal Year: 2010
Total Cost: $473,748
Indirect Cost

Institution

Name: Purdue University
Department: Biology
Type: Schools of Arts and Sciences
DUNS #: 072051394

City: West Lafayette
State: IN
Country: United States
Zip Code: 47907

Related projects


NIH 2010 RC1 GM	Comprehensive Mapping and Annotation of the E. coli Transcriptome Wanner, Barry L. / Purdue University	$473,748
NIH 2009 RC1 GM	Comprehensive Mapping and Annotation of the E. coli Transcriptome Wanner, Barry L. / Purdue University	$500,000

Publications

Otsuka, Yuta; Muto, Ai; Takeuchi, Rikiya et al. (2015) GenoBase: comprehensive resource database of Escherichia coli K-12. Nucleic Acids Res 43:D606-17

Nakayashiki, Toru; Saito, Natsumi; Takeuchi, Rikiya et al. (2013) The tRNA thiolation pathway modulates the intracellular redox state in Escherichia coli. J Bacteriol 195:2039-49

Nakayashiki, Toru; Mori, Hirotada (2013) Genome-wide screening with hydroxyurea reveals a link between nonessential ribosomal proteins and reactive oxygen species production. J Bacteriol 195:1226-35

Tohsato, Yukako; Baba, Tomoya; Mazaki, Yusaku et al. (2010) Environmental dependency of gene knockouts on phenotype microarray analysis in Escherichia coli. J Bioinform Comput Biol 8 Suppl 1:83-99

Rajagopala, Seesandra V; Yamamoto, Natsuko; Zweifel, Adrienne E et al. (2010) The Escherichia coli K-12 ORFeome: a resource for comparative molecular microbiology. BMC Genomics 11:470

Hsieh, Yi-Ju; Wanner, Barry L (2010) Global regulation by the seven-component Pi signaling system. Curr Opin Microbiol 13:198-203

Comments

Be the first to comment on Barry Wanner's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: