Deep resequencing of human genes has led to the discovery of rare, nonsynonymous sequence variants that are robustly associated with complex human phenotypes. Such studies have historically been rate-limited by the cost of DNA sequencing. Although a new generation of sequencing platforms is reducing costs by over two orders of magnitude, the routine sequencing of complete human genomes continues to be prohibitively expensive. Recently, methods have been developed to enable the efficient capture of specific subsets of the genome. With these methods, the cost of sequencing all of the protein-coding sequences (i.e. ~1% of the human genome split across ~180,000 discontiguous subsequences) may soon be on par with that of dense genotyping arrays. The goals of this proposal are to further the development of these targeting methods, and to integrate them into a scalable resequencing pipeline that relies on second-generation sequencing technology. Specifically, we will: (1) optimize and evaluate candidate strategies for multiplex capture, including array hybridization and gap-fill molecular inversion probes, while extending their application to the full protein-coding genome;(2) integrate optimized capture methods, second-generation sequencing technology, and sequence analysis software into a scalable resequencing pipeline;(3) develop the requisite computational tools for translating raw sequence data generated by new sequencing platforms into quality-tagged, consensus predictions of sequence variants;(4) make our data and methods broadly available, and facilitate the goals of this program through open communication with other investigators and the NIH.

Public Health Relevance

- As we enter an era of """"""""personalized medicine"""""""", DNA sequencing technology will be increasingly important to public health, contributing towards the unraveling of the genetic basis of human disease, as well as serving an increasing role in clinical diagnostics. Next-generation sequencing technologies have the potential to markedly accelerate genetics research, but are hindered by the lack of equivalently powerful methods to target specific subsets of the human genome. We propose here to develop technologies that meet this critical need, focusing specifically on the development of a scalable resequencing pipeline that targets the ~1% of the genome that is protein-coding. The principal investigator (PI) proposes to evaluate two different methods that will capture approximately 1% of the human genome that represents the protein coding genome (PCG).

Agency
National Institute of Health (NIH)
Institute
National Heart, Lung, and Blood Institute (NHLBI)
Type
Research Project (R01)
Project #
5R01HL094976-02
Application #
7691365
Study Section
Special Emphasis Panel (ZHG1-HGR-N (O1))
Program Officer
Gan, Weiniu
Project Start
2008-09-30
Project End
2012-06-30
Budget Start
2009-07-01
Budget End
2012-06-30
Support Year
2
Fiscal Year
2009
Total Cost
$2,056,829
Indirect Cost
Name
University of Washington
Department
Genetics
Type
Schools of Medicine
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Knowles, Michael R; Ostrowski, Lawrence E; Loges, Niki T et al. (2013) Mutations in SPAG1 cause primary ciliary dyskinesia associated with defective outer and inner dynein arms. Am J Hum Genet 93:711-20
Knowles, Michael R; Leigh, Margaret W; Ostrowski, Lawrence E et al. (2013) Exome sequencing identifies mutations in CCDC114 as a cause of primary ciliary dyskinesia. Am J Hum Genet 92:99-106
Norton, Nadine; Li, Duanxiang; Rampersaud, Evadnie et al. (2013) Exome sequencing and genome-wide linkage analysis in 17 families illustrate the complex contribution of TTN truncating variants to dilated cardiomyopathy. Circ Cardiovasc Genet 6:144-53
Rosenthal, Elisabeth A; Ranchalis, Jane; Crosslin, David R et al. (2013) Joint linkage and association analysis with exome sequence data implicates SLC25A40 in hypertriglyceridemia. Am J Hum Genet 93:1035-45
O'Roak, Brian J; Vives, Laura; Girirajan, Santhosh et al. (2012) Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485:246-50
Horani, Amjad; Druley, Todd E; Zariwala, Maimoona A et al. (2012) Whole-exome capture and sequencing identifies HEATR2 mutation as a cause of primary ciliary dyskinesia. Am J Hum Genet 91:685-93
Bamshad, Michael J; Ng, Sarah B; Bigham, Abigail W et al. (2011) Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet 12:745-55
Zariwala, Maimoona A; Omran, Heymut; Ferkol, Thomas W (2011) The emerging genetics of primary ciliary dyskinesia. Proc Am Thorac Soc 8:430-3
O'Roak, Brian J; Deriziotis, Pelagia; Lee, Choli et al. (2011) Exome sequencing in sporadic autism spectrum disorders identifies severe de novo mutations. Nat Genet 43:585-9
Kumar, Akash; White, Thomas A; MacKenzie, Alexandra P et al. (2011) Exome sequencing identifies a spectrum of mutation frequencies in advanced and lethal prostate cancers. Proc Natl Acad Sci U S A 108:17087-92

Showing the most recent 10 out of 26 publications