Data Processing and Visualization for 1000 Genomes

Craig, David

Abstract

The International 1000 Genomes Project (1000 Genomes) aims to leverage the emergence of next generation sequencing technologies to catalogue common human genetic variability. The ambitious goals and timeline of 1000 Genomes will require highly coordinated collaboration by multiple research groups. While aspects of our development efforts will involve all three sequencing platforms, a major proportion of our proposal will focus on optimal integration of SOLiD data into the 1000 Genomes data production pipeline. Our goal is to ensure that the unique capabilities of the platform are maximized during data processing, while adhering to common data and analytical standards established across 1000 Genomes.
In Aim 1, we will develop tools for monitoring data quality, focusing partly on tools for detecting experimental biases and partly on developing better quality metrics.
In Aim 2, we will develop tools for detecting genetic variation through recursive use of alignment and variant discovery.
In Aim 3, we will further develop client/server software capable of simultaneous viewing of sequence data across multiple sites for the purpose of quality control and variant inspection. The deliverable of our proposal is a series of stand-alone software utilities that can be integrated into the software pipelines developed by the 1000 Genomes DCCs and that fit within the collective analytical framework of 1000 Genomes participants. This collaborative proposal includes teams from academia (UCLA), industry (Applied Biosystems) and non-profit research institutes (TGen).

Public Health Relevance

The purpose of this proposal is to develop tools for monitoring and interpreting data from the 1000 Genomes Project.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project--Cooperative Agreements (U01)
Project #: 5U01HG005210-02
Application #: 7929662
Study Section: Special Emphasis Panel (ZHG1-HGR-M (M2))
Program Officer: Brooks, Lisa

Project Start: 2009-09-11
Project End: 2012-06-30
Budget Start: 2010-07-01
Budget End: 2012-06-30
Support Year: 2
Fiscal Year: 2010
Total Cost: $710,820
Indirect Cost

Institution

Name: Translational Genomics Research Institute
Department
Type
DUNS #: 118069611

City: Phoenix
State: AZ
Country: United States
Zip Code: 85004

Related projects


NIH 2010 U01 HG	Data Processing and Visualization for 1000 Genomes Craig, David W. / Translational Genomics Research Institute	$710,820
NIH 2009 U01 HG	Data Processing and Visualization for 1000 Genomes Craig, David W. / Translational Genomics Research Institute	$718,000

Publications

Li, Heng; Homer, Nils (2010) A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform 11:473-83

1000 Genomes Project Consortium; Abecasis, Gonçalo R; Altshuler, David et al. (2010) A map of human genome variation from population-scale sequencing. Nature 467:1061-73

Homer, Nils; Nelson, Stanley F (2010) Improved variant discovery through local re-alignment of short-read next-generation sequencing data using SRMA. Genome Biol 11:R99
