RNA-Seq studies indicate that the cancer transcriptome are shaped by genetic changes, variation in gene transcription, mRNA processing, editing and stability, and the cancer microbiome. Deciphering this variation and understanding its implications on tumorigenesis requires sophisticated computational analyses. Most RNA-Seq analyses rely on methods that first map short reads to a reference genome, and then compare them to annotated transcripts or assemble them. However, this strategy can be limited when the cancer genome is substantially different than the reference or for detecting sequences from the cancer microbiome. 'Assembly first' (de novo) methods that combine reads into transcripts without any mapping are a compelling alternative. The assembled transcriptome can then be used to identify mutations, splicing patterns, expression levels, tumor-associated microbes, and - if collected from single cells - characterize tumor heterogeneity. There is thus an enormous need for computationally efficient, accurate and user friendly tools for transcriptome reconstruction and analysis in cancer. Trinity, first released in mid-2011 and freely available as Open Source, is the leading software for de novo RNA-Seq assembly, with over 16,000 downloads, 177 literature citations, and a host of modules for downstream analyses, contributed by 3rd party developers. While widely-adopted in the general research community, Trinity (and any de novo RNA-Seq assembly) is only now emerging in the cancer domain. Here, we will enhance and maintain Trinity as a leading tool for cancer transcriptomics. We will tailor analytic modules for critical tasks in cancer biology, working with a network of cancer researchers on Driving Cancer Projects (Aim 1). We will continue to update the Trinity software to enhance the core algorithm, leverage new sequencing technologies as they arise, and incorporate additional 3rd party tools (Aim 2). We will enhance the Trinity software for different computational environments, including user-friendly interfaces to high performance computing infrastructure freely available to any NCI-funded researcher (Aim 3). We will grow the Trinity cancer user community, using online and in- person training and support (Aim 4), to allow any cancer researcher to leverage it.

Public Health Relevance

The RNA in cancer cells can be altered in its sequence, level and structure, and understanding these alterations can help us diagnose disease and find new drug targets. To find these variations, we need to concatenate together short sequences measured experimentally. Here, we enhance and maintain the Trinity software for this purpose, and build computational resources and training materials to help cancer researchers use it.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Resource-Related Research Projects--Cooperative Agreements (U24)
Project #
5U24CA180922-05
Application #
9312775
Study Section
Special Emphasis Panel (ZCA1)
Program Officer
Li, Jerry
Project Start
2013-09-17
Project End
2018-08-31
Budget Start
2017-09-01
Budget End
2018-08-31
Support Year
5
Fiscal Year
2017
Total Cost
Indirect Cost
Name
Broad Institute, Inc.
Department
Type
DUNS #
623544785
City
Cambridge
State
MA
Country
United States
Zip Code
02142
Filbin, Mariella G; Tirosh, Itay; Hovestadt, Volker et al. (2018) Developmental and oncogenic programs in H3K27M gliomas dissected by single-cell RNA-seq. Science 360:331-335
Bryant, Donald M; Johnson, Kimberly; DiTommaso, Tia et al. (2017) A Tissue-Mapped Axolotl De Novo Transcriptome Enables Identification of Limb Regeneration Factors. Cell Rep 18:762-776
Tanay, Amos; Regev, Aviv (2017) Scaling single-cell genomics from phenomenology to mechanism. Nature 541:331-338
Venteicher, Andrew S; Tirosh, Itay; Hebert, Christine et al. (2017) Decoupling genetics, lineages, and microenvironment in IDH-mutant gliomas by single-cell RNA-seq. Science 355:
Lee, Joo-Hyeon; Tammela, Tuomas; Hofree, Matan et al. (2017) Anatomically and Functionally Distinct Lung Mesenchymal Populations Marked by Lgr5 and Lgr6. Cell 170:1149-1163.e12
Puram, Sidharth V; Tirosh, Itay; Parikh, Anuraag S et al. (2017) Single-Cell Transcriptomic Analysis of Primary and Metastatic Tumor Ecosystems in Head and Neck Cancer. Cell 171:1611-1624.e24
Singer, Meromit; Wang, Chao; Cong, Le et al. (2016) A Distinct Gene Module for Dysfunction Uncoupled from Activation in Tumor-Infiltrating T Cells. Cell 166:1500-1511.e9
Wagner, Allon; Regev, Aviv; Yosef, Nir (2016) Revealing the vectors of cellular identity with single-cell genomics. Nat Biotechnol 34:1145-1160
Chen, Jenny; Shishkin, Alexander A; Zhu, Xiaopeng et al. (2016) Evolutionary analysis across mammals reveals distinct classes of long non-coding RNAs. Genome Biol 17:19
Tirosh, Itay; Izar, Benjamin; Prakadan, Sanjay M et al. (2016) Dissecting the multicellular ecosystem of metastatic melanoma by single-cell RNA-seq. Science 352:189-96

Showing the most recent 10 out of 15 publications