Methods for retrospective multi-site research will be developed in this project. Integrating data from multiple existing sources has the potential to substantially advance the study of behavior, particularly with respect to understanding complex behavioral disorders. Dyslexia is used as a model to develop methods for complex disorders because of the significant behavioral heterogeneity across people with dyslexia that could be used to develop a richer understanding of reading disability and other complex disorders with equally varied behavioral profiles such as autism or attention deficit hyperactivity disorder. Methods for three critical phases of retrospective multi-site research will be developed.
Aim 1 is focused on subject privacy, including the development of an automated data de-identification software tool that can be used for multi-dimensional data sets.
Aim 2 is focused on characterizing the behavioral heterogeneity within and across samples of complex disorders, [including the use of multiple imputation to address missing-ness in multi-site data sets.] Aim 3 is focused on how to appropriately analyze data from different samples that have included matched and unmatched case-control study designs. Our goal is to enhance the quality control and scientific power for behavioral and biologic studies of complex disorders, as well as advance the behavioral and neurobiological understanding of dyslexia. By developing standards for integrating behavioral and biological data, retrospective databases could be used to reveal etiologies and establish a consensus for the effective treatment of complex behavioral disorders.

Public Health Relevance

Integrating data from existing sources has the great potential to speed the identification of endophenotypes and their neurogenetic origins for complex behavioral disorders, develop biomarkers for tracking disease and treatment efficacy, and evaluate treatment effects. This project focuses on the development of methods for retrospective multi-site behavioral and biological studies, using dyslexia as a model. The results will establish standards for analysis of multi-site data that will enhance quality control and statistical power.

National Institute of Health (NIH)
Eunice Kennedy Shriver National Institute of Child Health & Human Development (NICHD)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-HDM-Q (54))
Program Officer
Miller, Brett
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
Medical University of South Carolina
Schools of Medicine
United States
Zip Code
Eckert, Mark A; Vaden Jr, Kenneth I; Maxwell, Amanda B et al. (2017) Common Brain Structure Findings Across Children with Varied Reading Disability Profiles. Sci Rep 7:6009
Eckert, Mark A; Berninger, Virginia W; Vaden Jr, Kenneth I et al. (2016) Gray Matter Features of Reading Disability: A Combined Meta-Analytic and Direct Analysis Approach(1,2,3,4). eNeuro 3:
Eckert, Mark A; Berninger, Virginia W; Hoeft, Fumiko et al. (2016) A case of Bilateral Perisylvian Syndrome with reading disability. Cortex 76:121-4
Song, Xuebo; Wang, James; Wang, Anlin et al. (2015) DeID - a data sharing tool for neuroimaging studies. Front Neurosci 9:325
Wang, James Z; Zhang, Yuanyuan; Dong, Liang et al. (2014) G-Bean: an ontology-graph based web tool for biomedical literature retrieval. BMC Bioinformatics 15 Suppl 12:S1
Li, Lin; Zhang, Qizhi; Ding, Yihua et al. (2014) Automatic diagnosis of melanoma using machine learning methods on a spectroscopic system. BMC Med Imaging 14:36
Song, Xuebo; Li, Lin; Srimani, Pradip K et al. (2014) Measure the Semantic Similarity of GO Terms Using Aggregate Information Content. IEEE/ACM Trans Comput Biol Bioinform 11:468-76
Raskind, Wendy H; Peter, Beate; Richards, Todd et al. (2012) The genetics of reading disabilities: from phenotypes to candidate genes. Front Psychol 3:601