The UCSC Genome Browser (genome.ucsc.edu) is a vital resource for the biomedical community, providing timely, convenient access to sequence and annotations for the human and all other vertebrate reference species genomes, along with selected model invertebrates. However, the enormous capacity of next-generation sequencing platforms is rapidly outstripping all of the resources we can provide. Additionally, biomedical datasets are increasingly deemed to potentially contain identifying data, which raises substantial privacy protection issues. In this proposal, we outline a project to expand the Genome Browser architecture to address these growing data volume and privacy requirements. Our preliminary studies indicate that cloud computing and a web services model (such as is supported by Amazon) may provide a good solution to our data storage requirements and offer a high-performance solution for mapping, visualizing, and analyzing next-generation sequencing data, as well as providing a personalized browsing solution for biomedical research scientists who require an individual instance of the Genome Browser for confidentiality or configurability reasons, but wish to avoid the overhead of installing a local mirror of our browser. Through this project we plan to complete a feasibility study of the cloud computing solution and investigate (and potentially prototype) a web services architecture for the Genome Browser. We will also streamline the Genome Browser mirror site installation package for those labs that still require a local copy of the browser. Through this work, we will enhance biomedical research through an enhanced Genome Browser toolset that provides high-performance access to large datasets and controlled access for the visualization and analysis of confidential datasets.

Public Health Relevance

This project aims to provide biomedical research scientists with improved capabilities for mapping, visualizing, and analyzing next-generation sequence data, a secure platform for analyzing confidential data, and increased flexibility for applying individual visualization and analysis tools to Genome Browser datasets. This will directly benefit the approximately 100,000 biomedical scientists who use our resources.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Biotechnology Resource Grants (P41)
Project #
3P41HG002371-10S1
Application #
7921323
Study Section
Ethical, Legal, Social Implications Review Committee (GNOM)
Program Officer
Felsenfeld, Adam
Project Start
2009-09-30
Project End
2012-08-31
Budget Start
2009-09-30
Budget End
2012-08-31
Support Year
10
Fiscal Year
2009
Total Cost
$690,000
Indirect Cost
Name
University of California Santa Cruz
Department
Biostatistics & Other Math Sci
Type
Schools of Engineering
DUNS #
125084723
City
Santa Cruz
State
CA
Country
United States
Zip Code
95064
Lincoln, Stephen E; Yang, Shan; Cline, Melissa S et al. (2017) Consistency of BRCA1 and BRCA2 Variant Classifications Among Clinical Diagnostic Laboratories. JCO Precis Oncol 1:
Haeussler, Maximilian; Raney, Brian J; Hinrichs, Angie S et al. (2015) Navigating protected genomics data with UCSC Genome Browser in a Box. Bioinformatics 31:764-6
Nguyen, Ngan; Hickey, Glenn; Zerbino, Daniel R et al. (2015) Building a pan-genome reference for a population. J Comput Biol 22:387-401
Rosenbloom, Kate R; Armstrong, Joel; Barber, Galt P et al. (2015) The UCSC Genome Browser database: 2015 update. Nucleic Acids Res 43:D670-81
Karolchik, Donna; Barber, Galt P; Casper, Jonathan et al. (2014) The UCSC Genome Browser database: 2014 update. Nucleic Acids Res 42:D764-70
Raney, Brian J; Dreszer, Timothy R; Barber, Galt P et al. (2014) Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics 30:1003-5
Paten, Benedict; Zerbino, Daniel R; Hickey, Glenn et al. (2014) A unifying model of genome evolution under parsimony. BMC Bioinformatics 15:206
Rosenbloom, Kate R; Sloan, Cricket A; Malladi, Venkat S et al. (2013) ENCODE data in the UCSC Genome Browser: year 5 update. Nucleic Acids Res 41:D56-63
Ewing, Adam D; Ballinger, Tracy J; Earl, Dent et al. (2013) Retrotransposition of gene transcripts leads to structural variation in mammalian genomes. Genome Biol 14:R22
Kuhn, Robert M; Haussler, David; Kent, W James (2013) The UCSC genome browser and associated tools. Brief Bioinform 14:144-61

Showing the most recent 10 out of 71 publications