A large number of APIs, software, and web interfaces will be made available to the informatics and research communities through the engineering efforts of individual teams spread across several institutions as part of the proposed Center for Big Data in Translational Genomics. A well-designed training and outreach strategy will provide both software users and developers with the knowledge they need to effectively integrate and employ these resources in their work. The Center will provide online documentation, help pages, FAQs, text and video tutorials, and usage scenarios for the tools and APIs developed. Through in-person training seminars, hands-on workshops, and short courses, biomedical researchers and data scientists will learn about the tools developed by the Center. Developers will be provided online assistance to the Center's APIs and tools through an interactive mailing list and via source code management tools that allow the open tracking of issues and developer queries. Finally, within the Center itself, training activities for pre-doctoral and post-doctoral students will be provided to a new and diverse generation of data scientists so that they may understand, use and enhance the Center's tools.

Public Health Relevance

The Center's training and outreach strategy will provide a broad spectrum of software users and developers with the knowledge they need to effectively integrate and employ these resources in their work. This will train data scientists to bring genomics into the big data era to use genomics to understand disease and develop new precision treatments.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
1U54HG007990-01
Application #
8932071
Study Section
Special Emphasis Panel (ZRG1-BST-R (52))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
2014-07-01
Budget End
2015-06-30
Support Year
1
Fiscal Year
2014
Total Cost
$69,842
Indirect Cost
$26,139
Name
University of California Santa Cruz
Department
Type
DUNS #
125084723
City
Santa Cruz
State
CA
Country
United States
Zip Code
95064
Kozanitis, Christos; Patterson, David A (2016) GenAp: a distributed SQL interface for genomic data. BMC Bioinformatics 17:63
Gordon, David; Huddleston, John; Chaisson, Mark J P et al. (2016) Long-read sequence assembly of the gorilla genome. Science 352:aae0344
Haeussler, Maximilian; Schönig, Kai; Eckert, Hélène et al. (2016) Evaluation of off-target and on-target scoring algorithms and integration into the guide RNA selection tool CRISPOR. Genome Biol 17:148
Speir, Matthew L; Zweig, Ann S; Rosenbloom, Kate R et al. (2016) The UCSC Genome Browser database: 2016 update. Nucleic Acids Res 44:D717-25
Jain, Miten; Olsen, Hugh E; Paten, Benedict et al. (2016) The Oxford Nanopore MinION: delivery of nanopore sequencing to the genomics community. Genome Biol 17:239
Yang, Shan; Cline, Melissa; Zhang, Can et al. (2016) DATA SHARING AND REPRODUCIBLE CLINICAL GENETIC TESTING: SUCCESSES AND CHALLENGES. Pac Symp Biocomput 22:166-176
Ip, Camilla L C; Loose, Matthew; Tyson, John R et al. (2015) MinION Analysis and Reference Consortium: Phase 1 data release and analysis. F1000Res 4:1075
Novak, Adam M; Rosen, Yohei; Haussler, David et al. (2015) Canonical, stable, general mapping using context schemes. Bioinformatics 31:3569-76
Philippakis, Anthony A; Azzariti, Danielle R; Beltran, Sergi et al. (2015) The Matchmaker Exchange: a platform for rare disease gene discovery. Hum Mutat 36:915-21
Paten, Benedict; Diekhans, Mark; Druker, Brian J et al. (2015) The NIH BD2K center for big data in translational genomics. J Am Med Inform Assoc 22:1143-7

Showing the most recent 10 out of 11 publications