Craniofacial (CF) abnormalities constitute more than a third of all human structural birth defects. To define their genetic etiology, detailed molecular understanding is required of coordinated movement and fusion of embryonic facial prominences - as disruption of these morphogenetic events cause defects such as orofacial clefts (OFC). The NIH FaceBase initiative is an important step to address this need, as it aims to generate comprehensive whole-genome expression datasets using microarrays or Next-Gen RNA-sequencing (RNA-seq) on mouse embryonic CF tissue. However, genome-wide profiling identifies several thousand expressed genes and it is a formidable challenge to predict and prioritize the select few genes that are critical to tissue development or pathogenesis. We posit that although there is a wealth of genomic-level data available, this deficit remains because an adequate strategy has not yet been applied to identify these important candidate CF genes. We recently developed an innovative approach - termed in silico whole embryo body (WB) subtraction - to identify such important genes based on developmentally-enriched expression. We have applied this novel approach to ~15% of FaceBase data and assembled this knowledge as a user-friendly web-based interactive tool SysFACE (Systems tool for craniofacial expression-based gene discovery, http://bioinformatics.udel.edu/Research/SysFACE). Even with limited datasets, the beta version of SysFACE is significantly more effective, compared with unprocessed FaceBase datasets, in identification of known genes associated with OFCs from both linkage and GWAS studies. To process all existing FaceBase datasets, we will generate additional platform-specific WB reference datasets and evaluate these further with machine learning strategies to identify genes important to CF development (Aim 1). Subsequently, we aim to experimentally validate these tissue-enriched gene expression profiles, and to assemble this knowledge - along with a new evidence-based functional gene regulatory network (GRN) that will allow all molecular data from the CF published literature to be represented on systems level - as a user-friendly web-based interactive resource (Aim 2), which will also be made available through FaceBase. Development of SysFACE, as outlined in this application, will greatly improve prediction of candidate CF genes, provide an excellent resource for CF-network construction, and will facilitate CF gene discovery efforts by developmental biologists and clinicians.

Public Health Relevance

Craniofacial malformations are common among structural birth defects among which orofacial clefts alone occur in 1/800 live-births and carry a lifetime cost for medical treatment, rehabilitation services and lost productivity of more than $100,000 per affected person. This application seeks to analyze FaceBase gene expression data using an integrated approach to develop a web-based user-friendly tool SysFACE - for both clinicians and scientists - that predicts and prioritizes craniofacial genes. SysFACE, available through FaceBase, will accelerate craniofacial disease gene discovery, which in turn will facilitate identification of new therapeutic approaches.

Agency
National Institute of Health (NIH)
Institute
National Institute of Dental & Craniofacial Research (NIDCR)
Type
Small Research Grants (R03)
Project #
5R03DE024776-02
Application #
9107846
Study Section
Special Emphasis Panel (ZDE1)
Program Officer
Scholnick, Steven
Project Start
2015-08-01
Project End
2017-05-31
Budget Start
2016-06-01
Budget End
2017-05-31
Support Year
2
Fiscal Year
2016
Total Cost
Indirect Cost
Name
University of Delaware
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
059007500
City
Newark
State
DE
Country
United States
Zip Code
19716