We propose to create a massively scalable toolkit to enable large, multi-center Patient-centered Information Commons (PIC) at local, regional and, national scale, where the focus is the alignment of all available biomedical data per individual. Such a Commons is a prerequisite for conducting the large-N, Big Data, longitudinal studies essential for understanding causation in the Precision Medicine (1) framework while simultaneously addressing key complexities of Patient Centric Outcome Research studies required under ACA (Affordable Care Act). Our proposal is solidly grounded in our experience over the last 25 in harnessing clinical care data to the research enterprise. In creating PIC we propose to focus on: 1. Enable the identification and retrieval of all data that pertain to individal health by creating a data sharing architecture that is capacious enough for all relevant data types and that enables patient and institutional autonomy to be respected. 2. Test fully-scaled implementations of the proposed architecture early in the development process, with the active involvement of a committed user community that seeks to use allowed us to refine our designs to facilitate subsequent robust dissemination and adoption. 3. Provide commodity workflows that can be used to 'clean'and complete the often noisy and sparse data gathered in the course of observational studies. 4. Embrace decentralization while enabling the construction of a nationally or regionally-scaled patient-centered information commons. 5. Encourage the selection of standards through the tools that enable the construction of patient-centered information commons. 6. Employ diagnostic classification and prognostication as figures of merit to measure how well a patient-centered information commons adds the understanding of patient populations. In addition to the research and development agenda we have also taken on the development of educational opportunities for end user community to become more familiar with the methods and challenges of data science.

Public Health Relevance

Large populations of individuals characterized by many different and complementary types of data, for example genetic, environmental, imaging, behavioral and clinical findings will allow significant progress in our ability to accurate classif individuals as to their disease or disease risk and provide more precise predictions of their disease course. The proposed toolkit enables such chacterization at the local and national scale.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Specialized Center--Cooperative Agreements (U54)
Project #
1U54HG007963-01
Application #
8775065
Study Section
Special Emphasis Panel ()
Program Officer
Brooks, Lisa
Project Start
2014-09-29
Project End
2018-08-31
Budget Start
2014-09-29
Budget End
2015-04-30
Support Year
1
Fiscal Year
2014
Total Cost
$2,082,841
Indirect Cost
$418,530
Name
Harvard Medical School
Department
Miscellaneous
Type
Schools of Medicine
DUNS #
047006379
City
Boston
State
MA
Country
United States
Zip Code
02115
Kerpedjiev, Peter; Abdennur, Nezar; Lekschas, Fritz et al. (2018) HiGlass: web-based visual exploration and analysis of genome interaction maps. Genome Biol 19:125
Kothari, Cartik; Wack, Maxime; Hassen-Khodja, Claire et al. (2018) Phelan-McDermid syndrome data network: Integrating patient reported outcomes with clinical notes and curated genetic reports. Am J Med Genet B Neuropsychiatr Genet 177:613-624
Kartoun, Uri; Aggarwal, Rahul; Beam, Andrew L et al. (2018) Development of an Algorithm to Identify Patients with Physician-Documented Insomnia. Sci Rep 8:7862
Diao, James A; Kohane, Isaac S; Manrai, Arjun K (2018) Biomedical informatics and machine learning for clinical genomics. Hum Mol Genet 27:R29-R34
Hejblum, Boris P; Cui, Jing; Lahey, Lauren J et al. (2018) Association Between Anti-Citrullinated Fibrinogen Antibodies and Coronary Artery Disease in Rheumatoid Arthritis. Arthritis Care Res (Hoboken) 70:1113-1117
Can, Anil; Castro, Victor M; Dligach, Dmitriy et al. (2018) Elevated International Normalized Ratio Is Associated With Ruptured Aneurysms. Stroke 49:2046-2052
Yu, Sheng; Ma, Yumeng; Gronsbell, Jessica et al. (2018) Enabling phenotypic big data with PheNorm. J Am Med Inform Assoc 25:54-60
Can, Anil; Rudy, Robert F; Castro, Victor M et al. (2018) Low Serum Calcium and Magnesium Levels and Rupture of Intracranial Aneurysms. Stroke 49:1747-1750
Wilson, Ander; Zigler, Corwin M; Patel, Chirag J et al. (2018) Model-averaged confounder adjustment for estimating multivariate exposure effects with linear regression. Biometrics 74:1034-1044
Gutiérrez-Sacristán, Alba; Guedj, Romain; Korodi, Gabor et al. (2018) Rcupcake: an R package for querying and analyzing biomedical data through the BD2K PIC-SURE RESTful API. Bioinformatics 34:1431-1432

Showing the most recent 10 out of 60 publications