The long-term objective of this proposal is to gain an enhanced understanding of the structural genomics of solid tumors through development of a novel, sequence-based method capable of identifying all types of structural rearrangements that occur in tumor genomes. Genome rearrangements can promote cancer development, progression and/or resistance to therapy by altering gene regulation and/or function, and the involved genes are potential therapeutic targets. This is well established in leukemia and lymphoma, but less so in solid tumors, in part because of the difficulty of identifying the genes involved in complex structural rearrangements. We describe here a powerful and high resolution, sequence-based analytical approach called End Sequence Profiling (ESP). ESP maps copy number aberrations and directly identifies and clones en masse genome breakpoints associated with genome rearrangements such as inversions, translocations, deletions and amplifications. ESP is accomplished by constructing a BAC library of a tumor genome, end sequencing a larger number of BAC clones, and mapping the BAC end sequences (BES) onto the normal genome sequence. Paired BES that map to different parts of the normal genome span structural rearrangements. Sequencing these clones will reveal exact breakpoints and involved genes.
In Specific Aim 1 we will: Implement ESP as a cost effective sequence-based technology for determining the structural organization of tumor genomes and clone rearrangement breakpoints en masse. Determine the minimum sequencing depth needed to yield the maximum structural information. Determine if ESP can reproducibly identify recurrent rearrangements between tumors, and if so, whether specific sequence elements are associated with these rearrangements.
In Specific Aim 2 we will: Develop robust computational methods for the analysis, visual representation, and integration of ESP data with the human reference sequence, making possible comparison of ESP data from independent tumors. Knowledge of how genome rearrangements such as inversions and translocations impact local gene expression is critical. Thus, we will integrate ESP-based structure data with expression microarray data and co-localize aberrantly expressed genes with genome rearrangement breakpoints.
In Specific Aim 3 : We will biologically and clinically validate key ESP findings. We believe ESP provides a rational framework for sequencing tumor genomes. In fact, ( 100 tumor genomes can be analyzed at ( 10 kb resolution for less than sequencing a single 3000 Mb genome yielding hundreds of novel biomarkers and therapeutic targets associated with translocations, inversions, and complex rearrangements. This is important because, just as a comprehensive systems-based knowledge of human biology is predicated on the structural organization and sequence of the human genome, a structure-based view of tumor genomes is essential for a comprehensive understanding of tumor biology.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Exploratory/Developmental Grants Phase II (R33)
Project #
1R33CA103068-01
Application #
6686622
Study Section
Special Emphasis Panel (ZCA1-SRRB-C (M1))
Program Officer
Couch, Jennifer A
Project Start
2003-08-01
Project End
2006-07-31
Budget Start
2003-08-01
Budget End
2004-07-31
Support Year
1
Fiscal Year
2003
Total Cost
$614,707
Indirect Cost
Name
University of California San Francisco
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
094878337
City
San Francisco
State
CA
Country
United States
Zip Code
94143
Raphael, Benjamin J; Volik, Stanislav; Yu, Peng et al. (2008) A sequence-based survey of the complex structural organization of tumor genomes. Genome Biol 9:R59
Bashir, Ali; Volik, Stanislav; Collins, Colin et al. (2008) Evaluation of paired-end sequencing strategies for detection of genome rearrangements in cancer. PLoS Comput Biol 4:e1000051
Volik, Stanislav; Raphael, Benjamin J; Huang, Guiqing et al. (2006) Decoding the fine-scale structure of a breast cancer genome and transcriptome. Genome Res 16:394-404