The continuous evolution in our ability to measure and record complex biomedical data has opened new opportunities, as well as new challenges in the development of evidence based medical care and health management. This application is concerned with complex patient level information, acquired in the form of high frequency functional data (ECG signals, cerebral environment monitors, images, etc.) recorded over several visits or in the setting of medium to long periods of intensive physiological monitoring (for example, Intensive Care Unit settings). We conceptually characterize this information framework as longitudinal functional data. This involves representation of these data classes, in relation to two time scales: historical time, indexing long term changes in the dynamic of processes under investigation, and clock time, indexing short term dynamics. This characterization achieves the goals of identifying sources of variation in the data that are readil interpretable for scientific investigation. We propose a comprehensive holistic development of the theory and methodology for the analysis of longitudinal functional data that span the subjects of regression, clustering and classification and dynamic computation. These developments will provide interpretation and rigorous inference to help guide health care decisions based on complex biomedical data. Even though longitudinal and functional data analysis have established solid bodies of theory and methods, current literature does not yet address analysis of longitudinal functional data with multiple covariates under flexible assumptions. In addition, most applications in the functional data analysis literature involve analysis of data in relatively short periods of time and methods are not directly applicable to medium to long periods of intensive physiological monitoring settings. We propose to analyze these larger scale data sets by the proposed novel longitudinal functional data framework involving chunking longer periods of follow-up into longitudinal units. This is a novel idea in thi literature which utilizes both longitudinal and functional data analysis tools to achieve data analysis in a new level of data complexity. A second element of innovation in our application will consist in the development of fast and accurate algorithms for statistical inference in real time, which will make our methodological contribution ever more useful for clinical and public health applications. We propose three Specific Aims: 1) To develop statistical methods for regression analysis and prediction in the setting of longitudinal functional data; 2) To develop clustering an classification methodology for longitudinal functional data; 3) To develop fast and feasible estimation techniques aimed at online learning in high dimensional settings. These three Aims are accompanied by a fourth exploratory Aim, where we propose to develop statistical methods for time to event analysis using longitudinal functional predictors. Applications for the proposed methodology will include Intensive Care Unit data on traumatic brain injury and cardiopulmonary arrest patients and ERP data in autism spectrum disorder studies.

Public Health Relevance

The continuous evolution in our ability to measure and record complex patient level information, acquired in the form of high frequency functional data recorded over several visits or in the setting of medium to long periods of intensive physiological monitoring has opened new opportunities, as well as new challenges in the development of evidence based medical care and health management. We conceptually characterize this information framework as longitudinal functional data and propose a comprehensive holistic development of the statistical theory and methodology for the analysis of longitudinal functional data involving regression, clustering and dynamic computation. The proposed methodological framework will provide interpretation and rigorous inference from these data structures, which will help guide medical and health care decisions.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
1R01GM111378-01A1
Application #
8963397
Study Section
Biostatistical Methods and Research Design Study Section (BMRD)
Program Officer
Marcus, Stephen
Project Start
2015-08-01
Project End
2019-05-31
Budget Start
2015-08-01
Budget End
2016-06-30
Support Year
1
Fiscal Year
2015
Total Cost
$303,010
Indirect Cost
$101,879
Name
University of California Los Angeles
Department
Biostatistics & Other Math Sci
Type
Schools of Public Health
DUNS #
092530369
City
Los Angeles
State
CA
Country
United States
Zip Code
90095
Xiao, Ran; Xu, Yuan; Pelter, Michele M et al. (2018) A Deep Learning Approach to Examine Ischemic ST Changes in Ambulatory ECG Recordings. AMIA Jt Summits Transl Sci Proc 2017:256-262
Dickinson, Abigail; DiStefano, Charlotte; Senturk, Damla et al. (2018) Peak alpha frequency is a neural marker of cognitive function across the autism spectrum. Eur J Neurosci 47:643-651
Gadhoumi, Kais; Do, Duc; Badilini, Fabio et al. (2018) Wavelet leader multifractal analysis of heart rate variability in atrial fibrillation. J Electrocardiol 51:S83-S87
Dickinson, Abigail; DiStefano, Charlotte; Lin, Yin-Ying et al. (2018) Interhemispheric alpha-band hypoconnectivity in children with autism spectrum disorder. Behav Brain Res 348:227-234
Hasenstab, Kyle; Scheffler, Aaron; Telesca, Donatello et al. (2017) A multi-dimensional functional principal components analysis of EEG data. Biometrics 73:999-1009
Hasenstab, Kyle; Sugar, Catherine; Telesca, Donatello et al. (2016) Robust functional clustering of ERP data with application to a study of implicit learning in autism. Biostatistics 17:484-98
Frohlich, Joel; Senturk, Damla; Saravanapandian, Vidya et al. (2016) A Quantitative Electrophysiological Biomarker of Duplication 15q11.2-q13.1 Syndrome. PLoS One 11:e0167179