In the last 10 years the UCSC Genome Browser has become a standard resource in the field. The number of web hits and users continues to increase, more than doubling in the last four years. The browser is also widely featured in scientific papers and presentations. As genomics penetrates ever more deeply into science and medicine, the value of the browser's definitive collection of data sets mapped to reference genome, coordinated, and made interactively accessible, continues to grow. In the next five years we will tackle several key challenges: (1) Update the UCSC gene set to reflect recent dramatic progress in genomics methods with the widespread use of technologies such as next-generation genome sequencing, RNA-seq, and ChlP-seq;these have deepened our view of human genes, both coding and non-coding, including alternative splice variants, regulatory elements, and haplotype variants. (2) Enhance and unify in a common framework our linkages to functional elements of the genome that have been associated with known human variation by projects such as 1000 Genomes and dbSNP, with homologous elements in other species by projects such as Genome 10K, with experimental information by projects such as ENCODE and Roadmap Epigenomics, and with disease phenotypes by projects such as OMIM and COSMIC. The value to research of these linkages and the advanced integration we create increase sharply as they become more comprehensive and more accurate. (3) Continue the fundamental transition to a more distributed database that started with the development of browser data hubs. Broaden our reach with more remote mirror sites and increased training, and increase the security of data uploaded to the browser by users. These developments are essential for the transition from the era of reference genomics to the era of personal genomics, where the data sets are too large, too distributed, and too sensitive to be handled like reference genome data. Since these data still need to map to a reference genome, these innovations will make the browser more relevant than ever in the era of personal genomes.
Brozovic, Matija; Dantec, Christelle; Dardaillon, Justine et al. (2018) ANISEED 2017: extending the integrated ascidian database to the exploration and evolutionary comparison of genome-scale datasets. Nucleic Acids Res 46:D718-D725 |
Casper, Jonathan; Zweig, Ann S; Villarreal, Chris et al. (2018) The UCSC Genome Browser database: 2018 update. Nucleic Acids Res 46:D762-D769 |
Canver, Matthew C; Haeussler, Maximilian; Bauer, Daniel E et al. (2018) Integrated design, execution, and analysis of arrayed and pooled CRISPR genome-editing experiments. Nat Protoc 13:946-986 |
GTEx Consortium (2018) Erratum: Genetic effects on gene expression across human tissues. Nature 553:530 |
Dyke, Stephanie O M; Linden, Mikael; Lappalainen, Ilkka et al. (2018) Registered access: authorizing data access. Eur J Hum Genet 26:1721-1731 |
Howard, Jonathan M; Lin, Hai; Wallace, Andrew J et al. (2018) HNRNPA1 promotes recognition of splice site decoys by U2AF2 in vivo. Genome Res 28:689-698 |
GTEx Consortium; Laboratory, Data Analysis &Coordinating Center (LDACC)—Analysis Working Group; Statistical Methods groups—Analysis Working Group et al. (2017) Genetic effects on gene expression across human tissues. Nature 550:204-213 |
Saha, Ashis; Kim, Yungil; Gewirtz, Ariel D H et al. (2017) Co-expression networks reveal the tissue-specific regulation of transcription and splicing. Genome Res 27:1843-1858 |
Tyner, Cath; Barber, Galt P; Casper, Jonathan et al. (2017) The UCSC Genome Browser database: 2017 update. Nucleic Acids Res 45:D626-D634 |
Vivian, John; Rao, Arjun Arkal; Nothaft, Frank Austin et al. (2017) Toil enables reproducible, open source, big biomedical data analyses. Nat Biotechnol 35:314-316 |
Showing the most recent 10 out of 41 publications