With the reference human genome sequence now completed, the next wave of large-scale sequencing will be aimed at genomes that can further inform the human sequence or otherwise provide significant value for biological discovery. These sequences must be of high quality, yet must be generated efficiently and at a substantially lower cost. In this proposal, we describe technical developments that will allow us to produce longer sequence read lengths, decrease sequencing costs, improve physical map construction, streamline genome assembly, and automate sequence finishing. To support these advances, we will develop enhanced informatics tools and infrastructure to effectively integrate and improve management of the entire range of our laboratory processes. On the basis of these technical developments, we will produce genome sequence data at a rate of 3.3M reads/month in Year 1, scaling moderately to 3.8M reads/month in Year 3. Over the same time period, we aim to increase average read length by at least 300 bp, and to cut our per-read cost from $1.35 to $0.75 or less. Refined methods and tools to more efficiently finish genome sequences to high quality and continuity standards, as well as methods and tools for detection and annotation of genes and other elements encoded within those genomes, will further enhance the output data from our Center. Coupled with advances in strategy, these improvements will substantially improve the efficiency and the economics of genome sequencing, making it much more feasible to consider the analysis of additional human and animal genomes. ? ?
Showing the most recent 10 out of 234 publications