Next-generation sequencing technologies and their applications (e.g., ChIP-seq, Hi-C) are generating an astonishing amount of genomic, epigenomic and transcriptomic data, providing the scientific community with a rich, growing resource and producing new types of reference (i.e., the reference epigenomes) in the "post-genome" era. However, a serious bottleneck prevents investigators from taking full advantage of these data for biomedical research. Conventional genome browsers limit biologists to examining one gene or genomic region at a time and to visually comparing at most a couple dozen datasets. Additional bioinformatics expertise is required to manipulate and analyze these data, and to compare investigators' own data with public data produced by consortia. In this proposal, we introduce the Wash U Epigenome Browser and its associated visualization and analysis tools to give investigators a next-generation experience in exploring, manipulating and analyzing large genomic datasets. Our strategy is to combine state-of-the-art web technologies, programming practices and user interface design to deliver intuitive, easy-to-use, and comprehensive bioinformatics tools in the format of a next-generation genome browser.
In Specific Aim 1 we will develop and extend the Wash U Epigenome Browser as a visual bioinformatics engine that enables biologists to visualize hundreds of genome-wide datasets, annotate genomic data with metadata, visually navigate and manipulate the data, and easily generate testable hypotheses. If successful, this aim will produce an effective tool to accelerate scientific interpretation of large genomic datasets.
In Specific Aim 2 we will develop the Epigenome Browser into a versatile visualization system that can rapidly evolve and adapt to new data types and new analyses. We will demonstrate the potential of rapid development to meet new visualization and analysis needs by solving two difficult problems: visualizing long-range chromatin interaction data and visualizing data on repeats and transposable elements. The Wash U Epigenome Browser and its associated visualization and analysis tools promise to revolutionize how biologists engage with large genomic and epigenomic datasets produced by next-generation technologies and will serve as a novel bioinformatics platform for diagnosis and treatment of disease.

Public Health Relevance

New tools are needed to help investigators navigate and manipulate the enormous volume of genomic and epigenomic data produced by modern sequencing-based technologies. We propose to develop a next-generation Epigenome Browser that works as a visual bioinformatics engine. Not only will this new Browser greatly enhance how investigators explore and take advantage of public consortium data, it will eventually help investigators make use of next-generation data for disease diagnosis and therapy.

Agency: National Institutes of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 1R01HG007354-01
Application #: 8560622
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Pazin, Michael J
Project Start: 2013-07-16
Project End: 2017-06-30
Budget Start: 2013-07-16
Budget End: 2014-06-30
Support Year: 1
Fiscal Year: 2013
Total Cost: $396,750
Indirect Cost: $104,000

Name: Washington University
Department: Genetics
Type: Schools of Medicine
DUNS #: 068552207
City: Saint Louis
State: MO
Country: United States
Zip Code: 63130

Li, Daofeng; Zhang, Bo; Xing, Xiaoyun et al. (2015) Combining MeDIP-seq and MRE-seq to investigate genome-wide CpG methylation. Methods 72:29-40
Nagarajan, Raman P; Zhang, Bo; Bell, Robert J A et al. (2014) Recurrent epimutations activate gene body promoters in primary glioblastoma. Genome Res 24:761-74
Zhou, Xin; Li, Daofeng; Lowdon, Rebecca F et al. (2014) methylC Track: visual integration of single-base resolution DNA methylation data on the WashU EpiGenome Browser. Bioinformatics 30:2206-7
Zhang, Bo; Xing, XiaoYun; Li, Jing et al. (2014) Comparative DNA methylome analysis of endometrial carcinoma reveals complex and distinct deregulation of cancer promoters and enhancers. BMC Genomics 15:868
Lacin, Haluk; Rusch, Jannette; Yeh, Raymond T et al. (2014) Genome-wide identification of Drosophila Hb9 targets reveals a pivotal role in directing the transcriptome within eight neuronal lineages, including activation of nitric oxide synthase and Fd59a/Fox-D. Dev Biol 388:117-33
Raney, Brian J; Dreszer, Timothy R; Barber, Galt P et al. (2014) Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics 30:1003-5
Sundaram, Vasavi; Cheng, Yong; Ma, Zhihai et al. (2014) Widespread contribution of transposable elements to the innovation of gene regulatory networks. Genome Res 24:1963-76
Stevens, Michael; Cheng, Jeffrey B; Li, Daofeng et al. (2013) Estimating absolute methylation levels at single-CpG resolution from methylation enrichment and restriction enzyme sequencing methods. Genome Res 23:1541-53