There is an acute and increasing need to adapt standard statistical methods and to develop new approaches for the analysis of very large data sets. A data set is very large if it raises very difficult or insurmountable computational problems for standard data analysis using available computing systems. The accelerated increase in size and complexity of data sets is due in part to increased computational and storage capabilities, new measurement technologies, study designs, and an increasing number of study """"""""units."""""""" This proposal is concerned with statistical methods for the analysis of an emerging type of very large data set, where very high dimensional outcomes and predictors, such as images or densely sampled biosignals, are recorded at multiple visits on hundreds or thousands of subjects. The methods proposed will describe the cross-sectional, longitudinal and measurement error variability in longitudinal studies where observed data are functions or images. Methods for scalar on function/image regression analysis will also be addressed for the case of very highly dimensional predictors. The proposed methodology is inspired by and applied to very large studies of sleep and Diffusion Tensor Imaging (DTI) brain tractography.
The project provides statistical analysis methods for very large data sets where images or densely sampled biological signals are measured at multiple visits. Methods are applied to longitudinal sleep electroencephalogram (EEG) data and brain tractography obtained from Diffusion Tensor Imaging (DTI) in Multiple Sclerosis (MS) and healthy subjects.
Caldito, Natalia Gonzalez; Saidha, Shiv; Sotirchos, Elias S et al. (2018) Brain and retinal atrophy in African-Americans versus Caucasian-Americans with multiple sclerosis: a longitudinal study. Brain 141:3115-3129 |
Smirnova, Ekaterina; Ivanescu, Andrada; Bai, Jiawei et al. (2018) A practical guide to big data. Stat Probab Lett 136:25-29 |
Dworkin, J D; Linn, K A; Oguz, I et al. (2018) An Automated Statistical Technique for Counting Distinct Multiple Sclerosis Lesions. AJNR Am J Neuroradiol 39:626-633 |
Chén, Oliver Y; Crainiceanu, Ciprian; Ogburn, Elizabeth L et al. (2018) High-dimensional multivariate mediation with application to neuroimaging data. Biostatistics 19:121-136 |
Dworkin, J D; Sati, P; Solomon, A et al. (2018) Automated Integration of Multimodal MRI for the Probabilistic Detection of the Central Vein Sign in White Matter Lesions. AJNR Am J Neuroradiol 39:1806-1813 |
Valcarcel, Alessandra M; Linn, Kristin A; Vandekar, Simon N et al. (2018) MIMoSA: An Automated Method for Intermodal Segmentation Analysis of Multiple Sclerosis Brain Lesions. J Neuroimaging 28:389-398 |
Valcarcel, Alessandra M; Linn, Kristin A; Khalid, Fariha et al. (2018) A dual modeling approach to automatic segmentation of cerebral T2 hyperintensities and T1 black holes in multiple sclerosis. Neuroimage Clin 20:1211-1221 |
Dworkin, Jordan D; Shinohara, Russell T; Bassett, Danielle S (2018) The landscape of NeuroImage-ing research. Neuroimage 183:872-883 |
Reardon, P K; Seidlitz, Jakob; Vandekar, Simon et al. (2018) Normative brain size variation and brain shape diversity in humans. Science 360:1222-1227 |
Bai, Jiawei; Sun, Yifei; Schrack, Jennifer A et al. (2018) A two-stage model for wearable device data. Biometrics 74:744-752 |
Showing the most recent 10 out of 102 publications