In working to achieve the long-term goal of the AnVIL of providing a unified platform for ingestion and organization for a multitude of current and future genomic and genome-related datasets, it is clear that enabling the cross-talk of multiple platforms and datasets is a critical barrier. To overcome this challenge, we propose to improve, adopt, and implement shared standards for DRS and RAS within the Terra platform. To do that, we propose the following aims and subaims: 1. Enhance cross-system interoperability through development and implementation of standards for authentication and for sharing data and applications 1.1 Interoperability standards and technical requirements: Improve interoperability standards, scope technical work needed to support these standards, and develop conventions across systems to facilitate the realization of these use cases 1.2 Terra DRS and RAS support: ? Ensure GA4GH DRS 1.1 support. Data should be accessible over the protocol supporting the systems described in the interoperability use cases. DRS 1.1 is the latest version of the GA4GH standard and includes GUID support ? Include RAS account linking and token management ? Ensure PFB search handoff support improvements as the convention is refined ? Ensure Leonardo accommodates DRS 1.1 and RAS support 1.3 Galaxy and R / Bioconductor DRS and RAS support: ? Ensure GA4GH DRS 1.1 support in Galaxy and R / Bioconductor, so data can be accessed using that standard ? Include RAS token sharing with Galaxy and RStudio instances in Terra, allowing Galaxy and R / Bioconductor to access DRS accessible data using RAS credentials 2. Complete the scoping required to integrate RAS support into the Data Use Oversight System (DUOS) 2.1 Complete scoping to add support to DUOS to enable data access committees to issue visas to researchers

Public Health Relevance

Currently, analyzing data from various datasets, platforms, and ICs is intensely arduous because of time, cost, authentication, and technical expertise limitations. Through the implementation of standards that are focused on making authentication, data harmonization, and search functions interoperable within and external to the AnVIL ecosystem, the researcher burden to find, manage, and compute on large-scale data is greatly reduced. In enabling more timely and targeted analysis, the work in making platforms more interoperable stands to benefit researchers and patients.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Resource-Related Research Projects--Cooperative Agreements (U24)
Project #
3U24HG010262-03S1
Application #
10222996
Study Section
Special Emphasis Panel (ZHG1)
Program Officer
Wiley, Kenneth L
Project Start
2018-09-19
Project End
2023-06-30
Budget Start
2020-08-13
Budget End
2021-06-30
Support Year
3
Fiscal Year
2020
Total Cost
Indirect Cost
Name
Broad Institute, Inc.
Department
Type
DUNS #
623544785
City
Cambridge
State
MA
Country
United States
Zip Code
02142