The UCSC Genome Browser (genome.ucsc.edu) is a vital resource for the biomedical community, providing timely, convenient access to sequence and annotations for the human and all other vertebrate reference species genomes, along with selected model invertebrates. However, the enormous capacity of next-generation sequencing platforms is rapidly outstripping all of the resources we can provide. Additionally, biomedical datasets are increasingly deemed to potentially contain identifying data, which raises substantial privacy protection issues. In this proposal, we outline a project to expand the Genome Browser architecture to address these growing data volume and privacy requirements. Our preliminary studies indicate that cloud computing and a web services model (such as is supported by Amazon) may provide a good solution to our data storage requirements and offer a high-performance solution for mapping, visualizing, and analyzing next-generation sequencing data, as well as providing a personalized browsing solution for biomedical research scientists who require an individual instance of the Genome Browser for confidentiality or configurability reasons, but wish to avoid the overhead of installing a local mirror of our browser. Through this project we plan to complete a feasibility study of the cloud computing solution and investigate (and potentially prototype) a web services architecture for the Genome Browser. We will also streamline the Genome Browser mirror site installation package for those labs that still require a local copy of the browser. Through this work, we will enhance biomedical research through an enhanced Genome Browser toolset that provides high-performance access to large datasets and controlled access for the visualization and analysis of confidential datasets.
This project aims to provide biomedical research scientists with improved capabilities for mapping, visualizing, and analyzing next-generation sequence data, a secure platform for analyzing confidential data, and increased flexibility for applying individual visualization and analysis tools to Genome Browser datasets. This will directly benefit the approximately 100,000 biomedical scientists who use our resources.
Lincoln, Stephen E; Yang, Shan; Cline, Melissa S et al. (2017) Consistency of BRCA1 and BRCA2 Variant Classifications Among Clinical Diagnostic Laboratories. JCO Precis Oncol 1: |
Haeussler, Maximilian; Raney, Brian J; Hinrichs, Angie S et al. (2015) Navigating protected genomics data with UCSC Genome Browser in a Box. Bioinformatics 31:764-6 |
Nguyen, Ngan; Hickey, Glenn; Zerbino, Daniel R et al. (2015) Building a pan-genome reference for a population. J Comput Biol 22:387-401 |
Rosenbloom, Kate R; Armstrong, Joel; Barber, Galt P et al. (2015) The UCSC Genome Browser database: 2015 update. Nucleic Acids Res 43:D670-81 |
Karolchik, Donna; Barber, Galt P; Casper, Jonathan et al. (2014) The UCSC Genome Browser database: 2014 update. Nucleic Acids Res 42:D764-70 |
Raney, Brian J; Dreszer, Timothy R; Barber, Galt P et al. (2014) Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics 30:1003-5 |
Paten, Benedict; Zerbino, Daniel R; Hickey, Glenn et al. (2014) A unifying model of genome evolution under parsimony. BMC Bioinformatics 15:206 |
Rosenbloom, Kate R; Sloan, Cricket A; Malladi, Venkat S et al. (2013) ENCODE data in the UCSC Genome Browser: year 5 update. Nucleic Acids Res 41:D56-63 |
Ewing, Adam D; Ballinger, Tracy J; Earl, Dent et al. (2013) Retrotransposition of gene transcripts leads to structural variation in mammalian genomes. Genome Biol 14:R22 |
Kuhn, Robert M; Haussler, David; Kent, W James (2013) The UCSC genome browser and associated tools. Brief Bioinform 14:144-61 |
Showing the most recent 10 out of 71 publications