Craniofacial (CF) abnormalities constitute more than a third of all human structural birth defects. To define their genetic etiology, detailed molecular understanding is required of coordinated movement and fusion of embryonic facial prominences - as disruption of these morphogenetic events cause defects such as orofacial clefts (OFC). The NIH FaceBase initiative is an important step to address this need, as it aims to generate comprehensive whole-genome expression datasets using microarrays or Next-Gen RNA-sequencing (RNA-seq) on mouse embryonic CF tissue. However, genome-wide profiling identifies several thousand expressed genes and it is a formidable challenge to predict and prioritize the select few genes that are critical to tissue development or pathogenesis. We posit that although there is a wealth of genomic-level data available, this deficit remains because an adequate strategy has not yet been applied to identify these important candidate CF genes. We recently developed an innovative approach - termed in silico whole embryo body (WB) subtraction - to identify such important genes based on developmentally-enriched expression. We have applied this novel approach to ~15% of FaceBase data and assembled this knowledge as a user-friendly web-based interactive tool SysFACE (Systems tool for craniofacial expression-based gene discovery, http://bioinformatics.udel.edu/Research/SysFACE). Even with limited datasets, the beta version of SysFACE is significantly more effective, compared with unprocessed FaceBase datasets, in identification of known genes associated with OFCs from both linkage and GWAS studies. To process all existing FaceBase datasets, we will generate additional platform-specific WB reference datasets and evaluate these further with machine learning strategies to identify genes important to CF development (Aim 1). Subsequently, we aim to experimentally validate these tissue-enriched gene expression profiles, and to assemble this knowledge - along with a new evidence-based functional gene regulatory network (GRN) that will allow all molecular data from the CF published literature to be represented on systems level - as a user-friendly web-based interactive resource (Aim 2), which will also be made available through FaceBase. Development of SysFACE, as outlined in this application, will greatly improve prediction of candidate CF genes, provide an excellent resource for CF-network construction, and will facilitate CF gene discovery efforts by developmental biologists and clinicians.
Craniofacial malformations are common among structural birth defects among which orofacial clefts alone occur in 1/800 live-births and carry a lifetime cost for medical treatment, rehabilitation services and lost productivity of more than $100,000 per affected person. This application seeks to analyze FaceBase gene expression data using an integrated approach to develop a web-based user-friendly tool SysFACE - for both clinicians and scientists - that predicts and prioritizes craniofacial genes. SysFACE, available through FaceBase, will accelerate craniofacial disease gene discovery, which in turn will facilitate identification of new therapeutic approaches.
Cox, Liza L; Cox, Timothy C; Moreno Uribe, Lina M et al. (2018) Mutations in the Epithelial Cadherin-p120-Catenin Complex Cause Mendelian Non-Syndromic Cleft Lip with or without Cleft Palate. Am J Hum Genet 102:1143-1157 |
Liu, Huan; Busch, Tamara; Eliason, Steven et al. (2017) Exome sequencing provides additional evidence for the involvement of ARHGAP29 in Mendelian orofacial clefting and extends the phenotypic spectrum to isolated cleft palate. Birth Defects Res 109:27-37 |