The International 1000 Genomes Project (1000 Genomes) aims to leverage the emergence of next-generation sequencing technologies to catalogue common human genetic variability. The ambitious goals and timeline of 1000 Genomes will require highly coordinated collaboration among multiple research groups. While aspects of our development efforts will involve all three platforms, a major proportion of our proposal will focus on optimal integration of SOLiD data into the 1000 Genomes data production pipeline. Our goal is to ensure that the unique capabilities of the platform are maximized during data processing, while adhering to common data and analytical standards established across 1000 Genomes.
In Aim 1, we will develop tools for monitoring data quality, focusing partly on tools for detecting experimental biases and partly on developing better quality metrics.
In Aim 2, we will develop tools for detecting genetic variation through recursive use of alignment and variant discovery.
In Aim 3, we will further develop client/server software capable of simultaneous viewing of sequence data across multiple sites for the purpose of quality control and variant inspection. The deliverable of our proposal is a series of stand-alone software utilities that can be integrated into the software pipelines developed by the 1000 Genomes DCCs and that fit within the collective analytical framework of 1000 Genomes participants. This collaborative proposal includes teams from academia (UCLA), industry (Applied Biosystems) and non-profit research institutes (TGen).

Public Health Relevance

The purpose of this proposal is to develop tools for monitoring and interpreting data from the 1000 Genomes Project.

Agency: National Institutes of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project--Cooperative Agreements (U01)
Project #: 5U01HG005210-02
Application #: 7929662
Study Section: Special Emphasis Panel (ZHG1-HGR-M (M2))
Program Officer: Brooks, Lisa
Project Start: 2009-09-11
Project End: 2012-06-30
Budget Start: 2010-07-01
Budget End: 2012-06-30
Support Year: 2
Fiscal Year: 2010
Total Cost: $710,820
Indirect Cost:

Name: Translational Genomics Research Institute
Department:
Type:
DUNS #: 118069611
City: Phoenix
State: AZ
Country: United States
Zip Code: 85004

Homer, Nils; Nelson, Stanley F (2010) Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA. Genome Biol 11:R99
Li, Heng; Homer, Nils (2010) A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform 11:473-83
1000 Genomes Project Consortium; Abecasis, Gonçalo R; Altshuler, David et al. (2010) A map of human genome variation from population-scale sequencing. Nature 467:1061-73