With the reference human genome sequence now completed, the next wave of large-scale sequencing will be aimed at genomes that can further inform the human sequence or otherwise provide significant value for biological discovery. These sequences must be of high quality, yet must be generated efficiently and at a substantially lower cost. In this proposal, we describe technical developments that will allow us to produce longer sequence read lengths, decrease sequencing costs, improve physical map construction, streamline genome assembly, and automate sequence finishing. To support these advances, we will develop enhanced informatics tools and infrastructure to effectively integrate and improve management of the entire range of our laboratory processes. On the basis of these technical developments, we will produce genome sequence data at a rate of 3.3M reads/month in Year 1, scaling moderately to 3.8M reads/month in Year 3. Over the same time period, we aim to increase average read length by at least 300 bp, and to cut our per-read cost from $1.35 to $0.75 or less. Refined methods and tools to more efficiently finish genome sequences to high quality and continuity standards, as well as methods and tools for detection and annotation of genes and other elements encoded within those genomes, will further enhance the output data from our Center. Coupled with advances in strategy, these improvements will substantially improve the efficiency and the economics of genome sequencing, making it much more feasible to consider the analysis of additional human and animal genomes. ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
5U54HG003079-02
Application #
6821336
Study Section
Special Emphasis Panel (ZHG1-HGR-P (O1))
Program Officer
Felsenfeld, Adam
Project Start
2003-11-10
Project End
2006-10-31
Budget Start
2004-12-10
Budget End
2005-10-31
Support Year
2
Fiscal Year
2005
Total Cost
$46,141,967
Indirect Cost
Name
Washington University
Department
Genetics
Type
Schools of Medicine
DUNS #
068552207
City
Saint Louis
State
MO
Country
United States
Zip Code
63130
Peng, Xinxin; Chen, Zhongyuan; Farshidfar, Farshad et al. (2018) Molecular Characterization and Clinical Relevance of Metabolic Expression Subtypes in Human Cancers. Cell Rep 23:255-269.e4
Huang, Kuan-Lin; Mashl, R Jay; Wu, Yige et al. (2018) Pathogenic Germline Variants in 10,389 Adult Cancers. Cell 173:355-370.e14
Martin, Alicia R; Karczewski, Konrad J; Kerminen, Sini et al. (2018) Haplotype Sharing Provides Insights into Fine-Scale Population History and Disease in Finland. Am J Hum Genet 102:760-775
Kronenberg, Zev N; Fiddes, Ian T; Gordon, David et al. (2018) High-resolution comparative analysis of great ape genomes. Science 360:
Ding, Li; Bailey, Matthew H; Porta-Pardo, Eduard et al. (2018) Perspective on Oncogenic Processes at the End of the Beginning of Cancer Genomics. Cell 173:305-320.e10
Seiler, Michael; Peng, Shouyong; Agrawal, Anant A et al. (2018) Somatic Mutational Landscape of Splicing Factor Genes and Their Functional Consequences across 33 Cancer Types. Cell Rep 23:282-296.e4
Liu, Yang; Sethi, Nilay S; Hinoue, Toshinori et al. (2018) Comparative Molecular Analysis of Gastrointestinal Adenocarcinomas. Cancer Cell 33:721-735.e8
Raghavan, Neha S; Brickman, Adam M; Andrews, Howard et al. (2018) Whole-exome sequencing in 20,197 persons for rare variants in Alzheimer's disease. Ann Clin Transl Neurol 5:832-842
Jayasinghe, Reyka G; Cao, Song; Gao, Qingsong et al. (2018) Systematic Analysis of Splice-Site-Creating Mutations in Cancer. Cell Rep 23:270-281.e3
Saltz, Joel; Gupta, Rajarsi; Hou, Le et al. (2018) Spatial Organization and Molecular Correlation of Tumor-Infiltrating Lymphocytes Using Deep Learning on Pathology Images. Cell Rep 23:181-193.e7

Showing the most recent 10 out of 234 publications