Understanding the complexity of the transcriptomes in E. coli K12

Su, Zhengchang

Abstract

Studies using high-density tiling microarrays and/or directional RNA-seq techniques have recently found that alternative and varying-level (dynamic) transcription within operons is highly prevalent, and that non-coding RNA (ncRNA) and antisense RNA (asRNA) transcription is also pervasive. Thus prokaryotic transcriptomes seem to be more complex than previously thought. However, little is known about the patterns and rules of this transcriptomic complexity, its biological implications, or its underlying molecular mechanisms. Moreover, the generality of such complexity remains unknown because inconsistent and even contradictory results have been reported, even for the same strains. Furthermore, investigation of transcriptomic complexity in prokaryotes has been hindered by the highly biased sequence reads produced by current directional RNA-seq techniques. This bias is compounded in prokaryotes by the highly labile nature and extremely low cellular concentrations of their RNAs and by the lack of an effective method for their enrichment, leading to highly non-uniform read coverage and even numerous uncovered gaps in transcribed regions. Such non-uniform coverage and prevalent gaps make it very difficult to assemble full-length transcripts, let alone to detect dynamic transcription along operons. Consequently, little is known about transcriptomic complexity in many medically important prokaryotes, and even in the most widely studied model bacterium, E. coli K12. This project plans to address these problems using E. coli K12 as the model system.
The specific aims are: 1) to develop an algorithm and tool for sufficiently correcting the read biases in current RNA-seq techniques; 2) to develop an accurate and efficient algorithm and tool for simultaneously assembling full-length prokaryotic transcripts and detecting possible dynamic transcription along the assembled transcripts using RNA-seq short reads; 3) to characterize the patterns and biological roles of alternative and dynamic operon utilization, as well as asRNA and ncRNA transcription, in E. coli K12. Accomplishing this project will not only further our understanding of the global architecture and complexity of the transcriptomes in E. coli K12, but will also provide the research community with computational tools and experimental methods to address similar questions in other prokaryotes, thereby facilitating community efforts to decipher gene regulatory networks in all sequenced prokaryotic genomes. A better understanding of the gene regulatory networks of medically, agriculturally and industrially important prokaryotes will enhance our ability to prevent and cure infectious diseases, and to produce foods and other important products.

Public Health Relevance

The discoveries from this project will lead to an unprecedentedly detailed and holistic understanding of complex gene transcription in E. coli K12 and the roles it plays in bacterial responses to environmental changes. This project will also provide the research community with accurate and effective tools for studying complex gene transcription in numerous medically important bacteria. Because gene transcription plays crucial roles in bacterial infections, a better understanding of complex gene transcription in bacteria will help in designing new strategies to prevent and cure many infectious diseases.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Institute of General Medical Sciences (NIGMS)
Type: Research Project (R01)
Project #: 1R01GM106013-01A1
Application #: 8697623
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Brazhnik, Paul

Project Start: 2014-06-01
Project End: 2018-03-31
Budget Start: 2014-06-01
Budget End: 2015-03-31
Support Year: 1
Fiscal Year: 2014
Total Cost: $276,521
Indirect Cost: $76,521

Institution

Name: University of North Carolina Charlotte
Department: Biostatistics & Other Math Sci
Type: Other Domestic Higher Education
DUNS #: 066300096

City: Charlotte
State: NC
Country: United States
Zip Code: 28223

Related projects


NIH 2017 R01 GM	Understanding the complexity of the transcriptomes in E. coli K12 Su, Zhengchang / University of North Carolina Charlotte	$258,646
NIH 2016 R01 GM	Understanding the complexity of the transcriptomes in E. coli K12 Su, Zhengchang / University of North Carolina Charlotte
NIH 2015 R01 GM	Understanding the complexity of the transcriptomes in E. coli K12 Su, Zhengchang / University of North Carolina Charlotte
NIH 2014 R01 GM	Understanding the complexity of the transcriptomes in E. coli K12 Su, Zhengchang / University of North Carolina Charlotte	$276,521