Advances in wearable electronics, personal mobile devices, and sensor technology are opening the door to many promising applications in medical care and biomedical research. However, the resulting datasets are often challenging to process due to variability caused by extraneous effects unrelated to the tasks of interest, such as changes in environmental conditions, heteroscedasticity in measurement noise, or patient idiosyncrasies. These effects produce systematic differences between the data used to train machine-learning algorithms and the data on which they are applied in practice, impairing real-world performance. The proposed research will address the fundamental problem of factoring out extraneous effects associated with known nuisance variables. We will develop a novel methodology for extracting features that are invariant to nuisance variables, and hence also to the associated extraneous effects, but that are still useful for classification or regression. The methodology is based on nonparametric deep-network models that perform automatic normalization of the data, and further enforce invariance via adversarial learning. We will apply the approach to an important problem in stroke rehabilitation, the quantitated dosing of motor training. Using a dataset of sensor-based motion data, we will train the model to identify and count functional movements in stroke patients performing rehabilitation activities. We expect to show that our approach can surmount patient variability to enable rigorous movement classification and quantitation. The proposed work is significant, because it will empower investigators to undertake the dosing trials critically needed in stroke rehabilitation. The proposed work is innovative, because it departs from traditional data preprocessing techniques by combining advanced data normalization and model calibration procedures.
Our work is likely to have a positive impact on stroke rehabilitation by facilitating the research required to change clinical practice and improve stroke outcomes. Our quantitative approach is broadly generalizable to applications hindered by nuisance variables, such as medical diagnostics and genomics.

Public Health Relevance

The proposed research is relevant to public health, because it will enable the precise measurement of rehabilitation after stroke. This will facilitate important recovery research that is expected to inform stroke rehabilitation practices and reduce the burden of stroke disability. The quantitative methodology will advance data-driven discovery in the presence of nuisance variables, which is relevant to biomedical research applications such as diagnostics, genetics, and imaging.

Agency: National Institutes of Health (NIH)
Institute: National Library of Medicine (NLM)
Type: Research Project (R01)
Project #: 1R01LM013316-01
Application #: 9916457
Study Section: Special Emphasis Panel (ZLM1)
Program Officer: Ye, Jane
Project Start: 2019-07-16
Project End: 2022-06-30
Budget Start: 2019-07-16
Budget End: 2020-06-30
Support Year: 1
Fiscal Year: 2019
Total Cost:
Indirect Cost:

Name: New York University
Department: Biostatistics & Other Math Sci
Type: Organized Research Units
DUNS #: 041968306
City: New York
State: NY
Country: United States
Zip Code: 10012