Our aim is to advance genome analysis by providing interactive exploratory visualization tools enabling researchers to refine, correct and augment initial automated results. The cost of sequencing a genome has been dramatically reduced by several orders of magnitude in the last decade, and the natural consequence is that more and more researchers are sequencing more and more new genomes, both within populations and across every species. Each new exome or genome sequenced requires visualization because computational genome analysis remains an imperfect art, but with effective visualization, genome interpretation can take advantage of human perceptual capabilities to integrate information. Web Apollo will provide an easy to use, web-based environment offering multiple, distributed users high-performance visualization for interactive exploration of genome data, and the tools needed for producing highly accurate annotations that are maximally informative for downstream analysis. This project will enable every member of the burgeoning genomic community to fully mobilize molecular research data, whether collected by individual researchers or made available from public aggregation sites to benefit investigations into genetics and human diseases. This project will enable researchers to (i) Annotate genomic variants and haplotypes using: summarization tracks including background population frequencies and quality scores; projections of variants onto protein displays to see intersection with protein features such as active sites and secondary structure to assess biological impact; and individualized haplotype tracks to annotate unique disease associated transcripts; (ii) Improve exon-intron accuracy for protein coding transcripts via direct comparison to related proteins in an integrated multiple sequence alignment (MSA) view; (iii) Annotate biological function with GO terms (iv) Explore the genome using bookmarking of key sites for rapid navigation, and a novel cylindrical view to examine large-scale chromosomal rearrangements across multiple haplotypes simultaneously; (v) Analyze the data interactively using graph thresholding to define discrete features, dynamic visual filtering of data tracks and their contents, and visual folding of the genome to bring genes with shared attributes such as paralogy or shared phenotypes and the variants within them in the same visual field; (vi) Collaborate securely in real-time with selected team members over the Internet using hybrid clouds for private data and freely-hosted servers for public data.

Public Health Relevance

Apollo will provide a suite of collaborative and interactive investigation tools to understand the relationship between DNA variation and human disease. Its real- time collaborative environment mobilizes more researchers, who can easily connect with one another and working together as a concerted team more rapidly gain insight into the growing volume of genomic information. Apollo's visual exploratory tools gives maximum leverage to human cognitive abilities to pinpoint unique differences and recognize correlations across the genome in their investigations into the genetic orig ins of complex diseases.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM080203-08
Application #
8843004
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Ravichandran, Veerasamy
Project Start
2007-08-01
Project End
2018-04-30
Budget Start
2015-05-01
Budget End
2016-04-30
Support Year
8
Fiscal Year
2015
Total Cost
Indirect Cost
Name
Lawrence Berkeley National Laboratory
Department
Type
DUNS #
078576738
City
Berkeley
State
CA
Country
United States
Zip Code
94720
Poynton, Helen C; Hasenbein, Simone; Benoit, Joshua B et al. (2018) The Toxicogenome of Hyalella azteca: A Model for Sediment Ecotoxicology and Evolutionary Toxicology. Environ Sci Technol 52:6009-6022
Schoville, Sean D; Chen, Yolanda H; Andersson, Martin N et al. (2018) A model species for agricultural pest genomics: the genome of the Colorado potato beetle, Leptinotarsa decemlineata (Coleoptera: Chrysomelidae). Sci Rep 8:1931
Harper, Lisa; Campbell, Jacqueline; Cannon, Ethalinda K S et al. (2018) AgBioData consortium recommendations for sustainable genomics and genetics databases for agriculture. Database (Oxford) 2018:
Papanicolaou, Alexie; Schetelig, Marc F; Arensburger, Peter et al. (2017) Erratum to: The whole genome sequence of the Mediterranean fruit fly, Ceratitis capitata (Wiedemann), reveals insights into the biology and adaptive evolution of a highly invasive pest species. Genome Biol 18:11
Putman, Tim E; Lelong, Sebastien; Burgstaller-Muehlbacher, Sebastian et al. (2017) WikiGenomes: an open web application for community consumption and curation of gene annotation data in Wikidata. Database (Oxford) 2017:
Buels, Robert; Yao, Eric; Diesh, Colin M et al. (2016) JBrowse: a dynamic web platform for genome visualization and analysis. Genome Biol 17:66
Lee, Eduardo; Helt, Gregg A; Reese, Justin T et al. (2013) Web Apollo: a web-based genomic annotation editing platform. Genome Biol 14:R93
Lee, Ed; Harris, Nomi; Gibson, Mark et al. (2009) Apollo: a community resource for genome annotation editing. Bioinformatics 25:1836-7