As we enter the era of 'personalized genomics', current technologies offer enormous capacity for data generation, and new methods offer even greater potential. The Baylor College of Medicine-Human Genome Sequencing Center (BCM-HGSC) will lead this next phase with technical innovations and high-throughput data production. These efforts will continue a history of outstanding production efficiencies, premier data quality, major achievements in cost reduction, a stellar track record in creating and integrating technologies, and experience in driving community engagement. A full menu of project options will be offered to the NHGRI; however, the BCM-HGSC is primarily motivated to advance human genetics. We will therefore prioritize sequencing of primates and other close human relatives, and use novel sequencing platforms for the identification of inherited 'functional' mutations in human populations. The primate sequencing will benefit from all the advances in sequencing and assembly strategy, such as new software and BAC pooling, to ensure high-quality and complete reconstruction of the genomes. By conservative projections, >200 Gb of raw sequence data will be generated in the first year of this proposal. A Functional Mutation Discovery (FMD) project will identify the majority of putative functional mutations in all 20,000 human genes in 1,800 people. The FMD project can proceed at this scale (nearly one billion 'haplicons') because the technical challenges are ideally suited to take advantage of new sequencing technologies and platforms, particularly those offered by 454 Life Sciences. By focusing on samples drawn from diverse populations, this project will populate a database for construction of libraries of genotyping probes that will be used by the wider range of individual investigators who are attempting disease gene discovery. The HGSC's unique position in the Texas Medical Center ensures that the mutation discovery will be geared to disease gene studies.
Genome sequencing targets will also include insects and metagenomes, cancer cells, specific diseases, and wide screens to discover genetic variants in humans and other species. A total of 960 Mb of targeted finishing and long-segment 'genome refinement' will be undertaken. Each project will emphasize 'complete packages' so that cDNAs and SNPs will be identified and research communities will be engaged. The program will rely heavily on the Genboree tool for integrated genomics and will use its unique architecture to enable proper management of sensitive DNA sequence data from patient sources.

National Institutes of Health (NIH)
National Human Genome Research Institute (NHGRI)
Specialized Center--Cooperative Agreements (U54)
Study Section: Special Emphasis Panel (ZHG1-HGR-P (A1))
Program Officer: Wang, Lu
Baylor College of Medicine
Schools of Medicine
United States
Benoit, Joshua B; Adelman, Zach N; Reinhardt, Klaus et al. (2016) Unique features of a global human ectoparasite identified through sequencing of the bed bug genome. Nat Commun 7:10165
Xue, Cheng; Raveendran, Muthuswamy; Harris, R Alan et al. (2016) The population genomics of rhesus macaques (Macaca mulatta) based on whole-genome sequences. Genome Res 26:1651-1662
Liu, Xiaoming; White, Simon; Peng, Bo et al. (2016) WGSA: an annotation pipeline for human genome sequencing studies. J Med Genet 53:111-2
Zheng, Siyuan; Cherniack, Andrew D; Dewal, Ninad et al. (2016) Comprehensive Pan-Genomic Characterization of Adrenocortical Carcinoma. Cancer Cell 29:723-36
Yu, Bing; Pulit, Sara L; Hwang, Shih-Jen et al. (2016) Rare Exome Sequence Variants in CLCN6 Reduce Blood Pressure Levels and Hypertension Risk. Circ Cardiovasc Genet 9:64-70
Bechsgaard, J; Vanthournout, B; Funch, P et al. (2016) Comparative genomic study of arachnid immune systems indicates loss of beta-1,3-glucanase-related proteins and the immune deficiency pathway. J Evol Biol 29:277-91
Campbell, Ian M; Gambin, Tomasz; Jhangiani, Shalini N et al. (2016) Multiallelic Positions in the Human Genome: Challenges for Genetic Analyses. Hum Mutat 37:231-4
Green, Robert C; Goddard, Katrina A B; Jarvik, Gail P et al. (2016) Clinical Sequencing Exploratory Research Consortium: Accelerating Evidence-Based Practice of Genomic Medicine. Am J Hum Genet 98:1051-66
Rustagi, Navin; Hampton, Oliver A; Li, Jie et al. (2016) ITD assembler: an algorithm for internal tandem duplication discovery from short-read sequencing data. BMC Bioinformatics 17:188
Chakraborty, Rikhia; Burke, Thomas M; Hampton, Oliver A et al. (2016) Alternative genetic mechanisms of BRAF activation in Langerhans cell histiocytosis. Blood 128:2533-2537

Showing the most recent 10 out of 336 publications