We propose to develop the Genboree system to enable collaborative translational studies of genome variation on any scale. By employing an innovative combination of information technologies such as Life Science Identifiers, XML, Distributed Annotation System and web services, Genboree will minimize technical barriers to collaboration by interdisciplinary research groups including clinicians, biologists, and genome scientists. Two independently NIH-funded projects, """"""""Identification of Nonsyndromic Hearing Impairment Genes"""""""" (""""""""NSHI"""""""") and """"""""Parallel Sequence Profiling of Ion Channels in Epilepsy"""""""" (""""""""Epilepsy"""""""") will be employed to drive development and enhance usability of Genboree software. Genboree system will enable the study of genetic susceptibility factors to complex yet prevalent diseases such as NSHI and Epilepsy. An improved understanding of causative patterns of genome variation that is anticipated to emerge from these efforts will produce predictive and diagnostic tests and will help elucidate pathological mechanisms. The understanding of mechanisms will in turn point to means of disease prevention and to targets for therapeutic intervention. This program of research has the potential to propel the 21st century medicine from a diagnose-and-treat to a predict-and-prevent paradigm. One of the main informatic challenges facing this program of research is the integration of clinical and genomic information. Another challenge is the integration of collaborative efforts involving clinicians, biologists, and genome scientists. Yet another challenge is extreme scalability required to handle the torrent of genome variation information that will surpass by orders of magnitude the amounts generated by genome projects in the past. None of these three challenges are adequately addressed at this time. The general aim of this proposal is to address these challenges through the development of the Genboree system. Genboree prototype was initially developed to support collaborative genome annotation and comparative genomic projects at the Human Genome Sequencing Center at Baylor College of Medicine. One of the key features of Genboree is the ability to integrate data using assembled genome sequences as a reference by projecting any experimentally or computationally derived data elements onto the reference genomic sequence in the form of annotations. Due to built-in .access control features, Genboree is within reach of HIPAA compliance and thus capable of integrating both patient information and the genomic information obtained from patient samples. Building on the current Genboree prototype, a system will be developed to integrate data, tools, and discovery pipelines to support collaborative translational studies of genome variation. ? ? ?

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project (R01)
Project #
5R01HG004009-02
Application #
7275266
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Good, Peter J
Project Start
2006-08-15
Project End
2009-03-31
Budget Start
2007-04-01
Budget End
2008-03-31
Support Year
2
Fiscal Year
2007
Total Cost
$568,110
Indirect Cost
Name
Baylor College of Medicine
Department
Genetics
Type
Schools of Medicine
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Riehle, Kevin; Coarfa, Cristian; Jackson, Andrew et al. (2012) The Genboree Microbiome Toolset and the analysis of 16S rRNA microbial sequences. BMC Bioinformatics 13 Suppl 13:S11
Li, Jian; Harris, R Alan; Cheung, Sau Wai et al. (2012) Genomic hypomethylation in the human germline associates with selective structural mutability in the human genome. PLoS Genet 8:e1002692
Evani, Uday S; Challis, Danny; Yu, Jin et al. (2012) Atlas2 Cloud: a framework for personal genome analysis in the cloud. BMC Genomics 13 Suppl 6:S19
Challis, Danny; Yu, Jin; Evani, Uday S et al. (2012) An integrative variant analysis suite for whole exome next-generation sequencing data. BMC Bioinformatics 13:8
Fawcett, Gloria L; Raveendran, Muthuswamy; Deiros, David Rio et al. (2011) Characterization of single-nucleotide variation in Indian-origin rhesus macaques (Macaca mulatta). BMC Genomics 12:311
Miller, Christopher A; Settle, Stephen H; Sulman, Erik P et al. (2011) Discovering functional modules by identifying recurrent and mutually exclusive mutational patterns in tumors. BMC Med Genomics 4:34
Celestino-Soper, Patricia B S; Shaw, Chad A; Sanders, Stephan J et al. (2011) Use of array CGH to detect exonic copy number variants throughout the genome in autism families detects a novel deletion in TMLHE. Hum Mol Genet 20:4360-70
Miller, Christopher A; Hampton, Oliver; Coarfa, Cristian et al. (2011) ReadDepth: a parallel R package for detecting copy number alterations from short sequencing reads. PLoS One 6:e16327
Coarfa, Cristian; Yu, Fuli; Miller, Christopher A et al. (2010) Pash 3.0: A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing. BMC Bioinformatics 11:572
Milosavljevic, Aleksandar (2010) Putting epigenome comparison into practice. Nat Biotechnol 28:1053-6

Showing the most recent 10 out of 11 publications