The goal of Computational Analysis Core B is to manage, analyze, and extract maximum scientific value from the data generated by the investigators of this Program Project. The primary emphasis of Core B will be to assist Program investigators in the analysis of RNA-Seq data and to connect the results of these measurements with prior knowledge to unveil new biology. Importantly, these data will provide a rich source for discovering previously unappreciated shifts in mRNA isoform usage, novel microRNAs and biologically relevant noncoding transcripts. The Core reflects a novel collaboration between researchers spanning two institutions that collectively possess immense expertise and resources in computation and data analysis: The University of Washington (UW) Department of Computer Science & Engineering and Sage Bionetworks, a non-profit biomedical research organization created to revolutionize how researchers approach the complexity of human biological information and the treatment of disease. The vast data sets generated by next-generation sequence technologies create enormous opportunities but also significant challenges. Core B will assure optimal management and interpretation of data obtained from the RNA-Seq studies proposed by Projects 1, 3 and 4 and Core A. These studies will generate RNA-Seq data for both short (e.g., microRNA) and long (e.g., messenger RNA and long noncoding RNA) protocols. As described below. Core Director Dr. Ruzzo (UW), and co-investigators Dr. Brig Mecham and Dr. Adam Margolin (Sage) are exceptionally well qualified to carry out the proposed work, and an existing collaboration with them has already generated interesting findings as described in Project 1 (Blau). In support of the Program, Core B's activities will vary in accordance with the needs of the individual Projects. Core B will provide an essential resource to P01 investigators who have not previously had access to formalized bioinformatics support. For investigators with existing bioinformatics collaborations Core B will assist in bringing uniform best practices to quality assessment, analysis and interpretation of RNA-Seq data while minimizing duplication of effort. This includes developing procedures and simple automated workflows to integrate existing specialized tools to provide a unified framework for storage, comparison, analysis and visualization of these data sets. Interaction with Sage Bionetworks will be particularly valuable, exploiting their tools for integration and visualization of diverse biological data, and their ongoing effort to standardize and distribute all publicly available microarray and sequencing data. This work required developing automated workflows that reliably processed 15,000 distinct microarray data sets, yielding standardized information in a usable format for the community. Similar work is underway for RNASeq data. These workflows will serve as a template for the proposed analyses. Additionally, Dr. Ruzzo's experience with next-generation sequence technologies (both RNA-Seq and ChlP-Seq analyses) and expertise in prediction of conserved noncoding RNAs offers the prospect of identifying important, novel players in the unique biological systems addressed by this POI.

Public Health Relevance

Sophisticated computational approaches are required to optimally analyze results from next generation sequencing technologies, and this Core provides these capabilities. By extending our analyses into noncoding regions of the genome and into appropriately curated publicly available datasets we can maximize return on investment.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Program Projects (P01)
Project #
4P01GM081619-09
Application #
8976268
Study Section
Special Emphasis Panel (ZRG1)
Project Start
Project End
Budget Start
2015-12-01
Budget End
2016-11-30
Support Year
9
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of Washington
Department
Type
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Hofsteen, Peter; Robitaille, Aaron Mark; Strash, Nicholas et al. (2018) ALPK2 Promotes Cardiogenesis in Zebrafish and Human Pluripotent Stem Cells. iScience 2:88-100
Moody, James D; Levy, Shiri; Mathieu, Julie et al. (2017) First critical repressive H3K27me3 marks in embryonic stem cells identified using designed protein inhibitor. Proc Natl Acad Sci U S A 114:10125-10130
Mathieu, Julie; Ruohola-Baker, Hannele (2017) Metabolic remodeling during the loss and acquisition of pluripotency. Development 144:541-551
Rabinowitz, Jeremy S; Robitaille, Aaron M; Wang, Yuliang et al. (2017) Transcriptomic, proteomic, and metabolomic landscape of positional memory in the caudal fin of zebrafish. Proc Natl Acad Sci U S A 114:E717-E726
Eschenhagen, Thomas; Bolli, Roberto; Braun, Thomas et al. (2017) Cardiomyocyte Regeneration: A Consensus Statement. Circulation 136:680-686
Ware, Carol B (2017) Concise Review: Lessons from Naïve Human Pluripotent Cells. Stem Cells 35:35-41
Palpant, Nathan J; Wang, Yuliang; Hadland, Brandon et al. (2017) Chromatin and Transcriptional Analysis of Mesoderm Progenitor Cells Identifies HOPX as a Regulator of Primitive Hematopoiesis. Cell Rep 20:1597-1608
Hoshino, Akina; Ratnapriya, Rinki; Brooks, Matthew J et al. (2017) Molecular Anatomy of the Developing Human Retina. Dev Cell 43:763-779.e4
Kim, Yong Kyun; Refaeli, Ido; Brooks, Craig R et al. (2017) Gene-Edited Human Kidney Organoids Reveal Mechanisms of Disease in Podocyte Development. Stem Cells 35:2366-2378
Artoni, Filippo; Kreipke, Rebecca E; Palmeira, Ondina et al. (2017) Loss of foxo rescues stem cell aging in Drosophila germ line. Elife 6:

Showing the most recent 10 out of 114 publications