Recent studies show that large double stranded DNA (dsDNA) virus proteomes are much more complex than previously appreciated. Ribosome profiling and mass spectrometry analyses have identified hundreds of novel coding regions in multiple dsDNA viruses, raising the fundamental question of how do dsDNA viruses encode and regulate such expansive proteomes? We have a limited understanding of the mechanisms by which dsDNA viruses generate such diverse proteomes, in large part due to our poor understanding of viral transcript structure. Defining the sequence of full-length viral transcripts (the full-length transcriptome) identifies the mRNAs that encode viral proteins, and potentially novel transcripts encoding currently unrecognized viral proteins. Defining full-length transcriptomes also provides a global view of viral promoters, and 5' untranslated regions (UTRs) that regulate mRNA stability and translation efficiency. Standard RNA-Seq approaches are insufficient to identify full-length dsDNA virus transcriptomes as widespread transcription of the viral genomes creates over-lapping transcriptional units that can prevent the definition of transcript termini, and complicate assignment of splice junctions to specific viral transcripts. Asa result, the full-length transcriptome has not been defined for any large dsDNA virus. New, high throughput approaches are needed to define full-length dsDNA virus transcriptomes, and thereby understand the regulation of viral gene expression and the mechanisms driving viral proteome complexity. Using human cytomegalovirus (HCMV) as a model large dsDNA virus, we propose an innovative combination of techniques and analyses to define full-length dsDNA virus transcriptomes. Our multi-disciplinary approach uses novel bioinformatics methods to merge long and short read length sequencing data, known as hybrid sequencing, with high definition transcription start site analysis to provide an end-to-end view of full-length viral transcripts n a genome-wide scale. Our preliminary results show that our approach identifies novel HCMV coding regions resulting from alterative transcription start usage and splicing, and new promoters controlling the expression of viral transcripts. Successful completion of this project will provide a new paradigm for defining full-length dsDNA virus transcriptomes that will be useful for determining the mechanism driving dsDNA virus proteome complexity, and the regulatory mechanisms controlling viral gene expression.

Public Health Relevance

Defining full length viral transcriptomes is important to understand how viruses encode and regulate their proteomes. In this proposal we use a multi-disciplinary approach to develop a new high throughput method to define full-length transcriptomes of dsDNA viruses.

Agency
National Institute of Health (NIH)
Institute
National Institute of Allergy and Infectious Diseases (NIAID)
Type
Exploratory/Developmental Grants (R21)
Project #
1R21AI123811-01
Application #
9092162
Study Section
Virology - B Study Section (VIRB)
Program Officer
Beisel, Christopher E
Project Start
2016-02-01
Project End
2018-01-31
Budget Start
2016-02-01
Budget End
2017-01-31
Support Year
1
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of North Carolina Chapel Hill
Department
Microbiology/Immun/Virology
Type
Schools of Medicine
DUNS #
608195277
City
Chapel Hill
State
NC
Country
United States
Zip Code
27599
Arend, Kyle C; Lenarcic, Erik M; Moorman, Nathaniel J (2018) The 5' Untranslated Region of the Major Immediate Early mRNA Is Necessary for Efficient Human Cytomegalovirus Replication. J Virol 92:
Kutchko, Katrina M; Madden, Emily A; Morrison, Clayton et al. (2018) Structural divergence creates new functional features in alphavirus genomes. Nucleic Acids Res 46:3657-3670
Hawman, David W; Carpentier, Kathryn S; Fox, Julie M et al. (2017) MUTATIONS IN THE E2 GLYCOPROTEIN AND THE 3' UNTRANSLATED REGION ENHANCE CHIKUNGUNYA VIRUS VIRULENCE IN MICE. J Virol :
Arend, Kyle C; Lenarcic, Erik M; Vincent, Heather A et al. (2017) Kinome Profiling Identifies Druggable Targets for Novel Human Cytomegalovirus (HCMV) Antivirals. Mol Cell Proteomics 16:S263-S276
Arend, Kyle C; Ziehr, Benjamin; Vincent, Heather A et al. (2016) Multiple Transcripts Encode Full-Length Human Cytomegalovirus IE1 and IE2 Proteins during Lytic Infection. J Virol 90:8855-65