Structural variation (SV), involving deletions, duplications, insertions and inversions of DNA segments, accounts for a large proportion of human genetic diversity. Comprehensive identification and analysis of these variants will help us more fully elucidate their functional effects on human health and demography. Despite recent advances, the tools and data needed to comprehensively identify all types of SVs, genotype each variant, and integrate and phase these variants remain lacking. Indeed, the data released from the early phases of the 1000 Genomes Project (1000GP) (1000 Genomes Project Consortium, 2010; 1000 Genomes Project Consortium, 2012) are biased primarily towards the detection of deletions within relatively unique regions of the genome. As a consortium, we propose to pool expertise from various research groups to provide an integrative analysis of SVs by combining rigorous computational algorithm development with extensive experimental validation. The new algorithms we develop and the high-confidence lists of SVs obtained will be rapidly made available as a public resource.
Fan, Xian; Abbott, Travis E; Larson, David et al. (2014) BreakDancer: Identification of Genomic Structural Variation from Paired-End Read Mapping. Curr Protoc Bioinformatics 2014:
Chen, Ken; Chen, Lei; Fan, Xian et al. (2014) TIGRA: a targeted iterative graph routing assembler for breakpoint assembly. Genome Res 24:310-7
Carbone, Lucia; Harris, R Alan; Gnerre, Sante et al. (2014) Gibbon genome and the fast karyotype evolution of small apes. Nature 513:195-201
Fan, Xian; Zhou, Wanding; Chong, Zechen et al. (2014) Towards accurate characterization of clonal heterogeneity based on structural variation. BMC Bioinformatics 15:299