Cancer begins with mutations of the genome that can include substitutions, indels, duplications, gene fusions, and genome rearrangements. These mutational signatures vary widely between tissues and, often, between individual tumors. To understand this mutational landscape, cancer researchers need tools for visualizing the structural variation in cancer DNA, and its impact on gene transcription and regulation. For greater insight, these data should be augmented with gene annotations, results from functional genomics project like ENCODE, and population-level data. The classic tool for visualizing human gene annotations is the web-based genome browser. However, genome browsers are not generally well-suited to visualizing large-scale structural variation: for that, circular visualization tools (such as Circos) are often used, but those tools are not very interactive (or web-based). Additionally, few genome browsers allow analysis to be performed within the application itself. JBrowse is the most popular genome browser built using JavaScript and dynamic web technology. Originally aimed at model organism genome projects, it is the web-based browser predominantly used by such projects, and yet its single largest user is now the UK-based Cancer Genome Project. We have developed prototypes of an interactive circular view for JBrowse, a data loading pipeline to import human genome annotations, a framework for running interactive analyses from within the browser. We plan to extend these prototypes to a fully-fledged system for visualizing structural variants and other mutations in cancer genomes, and running analyses on those data. We will integrate this software with the systems of the Cancer Genome Project, the International Cancer Genome Consortium, and the Genomic Data Commons.

Public Health Relevance

The proposed software will help cancer scientists visualize and interpret the mutation and/or scrambling of genome structure that precedes carcinogenesis. It will combine the most popular JavaScript genome browser (JBrowse) with one of the most powerful visualization techniques for genome rearrangements (Circos-style views), allowing cancer genomes to be interpreted in the context of the substantial public data on human genes and their function. We will integrate the resulting software with key cancer data portals including the NCI-funded Genomic Data Commons.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Resource-Related Research Projects--Cooperative Agreements (U24)
Project #
5U24CA220441-04
Application #
9986722
Study Section
Special Emphasis Panel (ZCA1)
Program Officer
Li, Jerry
Project Start
2017-08-01
Project End
2021-07-31
Budget Start
2020-08-01
Budget End
2021-07-31
Support Year
4
Fiscal Year
2020
Total Cost
Indirect Cost
Name
University of California Berkeley
Department
Biomedical Engineering
Type
Biomed Engr/Col Engr/Engr Sta
DUNS #
124726725
City
Berkeley
State
CA
Country
United States
Zip Code
94710