The proposed project is in response to the U01 Funding Opportunity Announcement (FOA) for the Big Data to Knowledge (BD2K) Development of Software Tools and Methods for Biomedical Big Data in the topic area of applying metadata. The overall goal here is to design, develop and evaluate an integrated platform for clinical research metadata standardization leveraging both standards-based representation and scalable Semantic Web technologies. The ultimate goal is to advance clinical research data discovery and analytic capabilities for clinical and translational centers and investigators. Clinical and translational research studies increasingly involve the manipulation of large datasets (e.g., patient records and genomic profiles) and the application of complex methods. To derive clinically relevant conclusions from such large datasets, the clinical and translational research community faces significant data integration challenges related to scalability, interactivity, representation standards, sustainability, and robustness. Failure to deal with these challenges will have a significant negative impact on downstream data reuse, sharing and analysis in the broader scientific communities. Detailed Clinical Models (DCMs) have been regarded as the basis for retaining computable meaning when data are exchanged between heterogeneous clinical systems. Amongst the emerging national and international initiatives on the standardization of DCM modeling are the Clinical Informatics Modeling Initiative (CIMI) and the HL7 Fast Healthcare Interoperability Resources (FHIR). FHIR is an emerging HL7 standard; it leverages existing logical and theoretical models to provide a consistent, easy to implement, and rigorous mechanism for exchanging data between healthcare applications. However, currently the toolbox that enables HL7 FHIR as a global data model to standardize clinical research metadata is very limited. Such metadata include data dictionaries associated with clinical research datasets and a variety of underlying data models in the existing integrated data repositories (IDRs) such as the Informatics for Integrating Biology and the Bedside (i2b2). The proposed project leverages emerging Semantic Web technologies to provide a scalable standards-based framework that enables effective and efficient big data integration and semantic sharing. The proposed project builds on semantic metadata software and infrastructure developed in our previous projects, including an NIH U24 bioCADDIE (biomedical and healthCAre Data Discovery Index Ecosystem) pilot project (PI: Jiang) that investigates the feasibility of indexing clinical research datasets using HL7 FHIR, and an NCI U01 supplement (PI: Jiang) that creates an open-source IDR (e.g., i2b2) with FHIR- based cancer data services for cancer research. The objective of the proposed project is to consolidate, develop, and evaluate methods and tools for standardizing clinical research metadata and data models using HL7 FHIR.
Our specific aims are: 1) Consolidate our bioCADDIE tools for indexing clinical research metadata using HL7 FHIR; 2) Create methods and tools for integrating i2b2 clinical data repository with HL7 FHIR; 3) Deploy an integrated web-portal for community-based metadata harmonization and tool dissemination. The proposed project will produce a suite of methods and tools for clinical research metadata standardization using HL7 FHIR and effectively facilitate secondary use of clinical research data and applications, ultimately advancing clinical and translational data discovery and analytics.

Public Health Relevance

The project is to consolidate, develop, and evaluate methods and tools for standardizing clinical research metadata and data models using a next generation health data standard known as HL7 FHIR. If successful, the project will effectively facilitate secondary use of clinical research data and applications, ultimately advancing clinical and translational data discovery and analytics. 1

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Project--Cooperative Agreements (U01)
Project #
1U01HG009450-01
Application #
9152392
Study Section
Special Emphasis Panel (ZRG1-BST-U (50)R)
Program Officer
Sofia, Heidi J
Project Start
2016-09-15
Project End
2019-08-31
Budget Start
2016-09-15
Budget End
2017-08-31
Support Year
1
Fiscal Year
2016
Total Cost
$477,000
Indirect Cost
$177,000
Name
Mayo Clinic, Rochester
Department
Type
DUNS #
006471700
City
Rochester
State
MN
Country
United States
Zip Code
55905
Hong, Na; Wen, Andrew; Shen, Feichen et al. (2018) Integrating Structured and Unstructured EHR Data Using an FHIR-based Type System: A Case Study with Medication Data. AMIA Jt Summits Transl Sci Proc 2017:74-83
Hong, Na; Prodduturi, Naresh; Wang, Chen et al. (2017) Shiny FHIR: An Integrated Framework Leveraging Shiny R and HL7 FHIR to Empower Standards-Based Clinical Data Applications. Stud Health Technol Inform 245:868-872
Solbrig, Harold R; Prud'hommeaux, Eric; Grieve, Grahame et al. (2017) Modeling and validating HL7 FHIR profiles using semantic web Shape Expressions (ShEx). J Biomed Inform 67:90-100
Jiang, Guoqian; Kiefer, Richard; Prud'hommeaux, Eric et al. (2017) Building Interoperable FHIR-Based Vocabulary Mapping Services: A Case Study of OHDSI Vocabularies and Mappings. Stud Health Technol Inform 245:1327
Sharma, Deepak K; Solbrig, Harold R; Prud'hommeaux, Eric et al. (2017) D2Refine: A Platform for Clinical Research Study Data Element Harmonization and Standardization. AMIA Jt Summits Transl Sci Proc 2017:259-267
Jiang, Guoqian; Kiefer, Richard C; Sharma, Deepak K et al. (2017) A Consensus-Based Approach for Harmonizing the OHDSI Common Data Model with HL7 FHIR. Stud Health Technol Inform 245:887-891
Peterson, Kevin J; Jiang, Guoqian; Brue, Scott M et al. (2017) Mining Hierarchies and Similarity Clusters from Value Set Repositories. AMIA Annu Symp Proc 2017:1372-1381