We propose to create IDASH, a national center for biomedical computing that will develop new algorithms, open-source tools, computational infrastructure and services that will enable biomedical and behavioral researchers nationwide to integrate Data for Analysis, Anonymization, and Sharing, IDASH will address fundamental challenges to research progress by providing a secure, privacy-preserving environment in which researchers can analyze genomic, transcriptomic and highly annotated phenotypical data. Leveraging the high performance capabilities of the San Diego Supercomputer Center (SDSC), and scalable cyberinfrastructure developed by the California Institute for Telecommunications and Information Technology, iDASH will provide synergistic application of tools and systems to advance research and improve human health. iDASH will focus on privacy protection through anonymization, data simulation, and an informed consent management system. It will focus on data analysis through the development of new tools for data annotation and integration across temporal and spatial dimensions, and develop algorithms for rare event detection and risk adjustment. To enable efficient analysis of short-reads from massively parallel sequencing, compression algorithms and a new genomic query system will be developed. Three Driving Biological Projects that span the molecular-individual-population spectrum will motivate, inform and support tool development: (1) Molecular Phenotyping of Kawasaki Disease;(2) Post-Marketing Pharmacosurveillance of Anticoagulation Agents;(3) Individualized Intervention to Enhance Physical Activity. iDASH trainees will complete core biomedical informatics courses and will have options for short- and long-term graduate training at San Diego State University and UCSD. We will collaborate with other NCBCs and disseminate tools via annual workshops for users and developers, presentations at major conferences, and scientific publications. We will develop a comprehensive web portal to download tools, upload data, and obtain documentation and user-friendly training materials. An experienced leadership team will use effective project management practices to support collaboration as well as monitor and ensure progress toward iDASH goals.

Public Health Relevance

Contemporary biomedical and behavioral research requires significant computational resources. There is an increasing divide between researchers who have these resources and those who do not. iDASH will decrease this gap and accelerate discoveries by providing innovative services, algorithms, open-source software, infrastructure, and training to facilitate data analysis and sharing by biomedical researchers.

National Institute of Health (NIH)
National Heart, Lung, and Blood Institute (NHLBI)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-K (52))
Program Officer
Larkin, Jennie E
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Internal Medicine/Medicine
Schools of Medicine
La Jolla
United States
Zip Code
Wang, Lichang; Fang, Yong; Aref, Dima et al. (2016) PALME: PAtients Like My gEnome. AMIA Jt Summits Transl Sci Proc 2016:219-24
Levy, Eric; Marty, Rachel; Gárate Calderón, Valentina et al. (2016) Immune DNA signature of T-cell infiltration in breast tumor exomes. Sci Rep 6:30064
Wang, Shuang; Zhang, Yuchen; Dai, Wenrui et al. (2016) HEALER: homomorphic computation of ExAct Logistic rEgRession for secure rare disease variants analysis in GWAS. Bioinformatics 32:211-8
Wang, Shuang; Jiang, Xiaoqian; Singh, Siddharth et al. (2016) Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States. Ann N Y Acad Sci :
Shi, Eileen; Chmielecki, Juliann; Tang, Chih-Min et al. (2016) FGFR1 and NTRK3 actionable alterations in ""Wild-Type"" gastrointestinal stromal tumors. J Transl Med 14:339
Farhan, Wael; Wang, Zhimu; Huang, Yingxiang et al. (2016) A Predictive Model for Medical Events Based on Contextual Embedding of Temporal Sequences. JMIR Med Inform 4:e39
Tang, Haixu; Jiang, Xiaoqian; Wang, Xiaofeng et al. (2016) Protecting genomic data analytics in the cloud: state of the art and opportunities. BMC Med Genomics 9:63
Azad, Priti; Zhao, Huiwen W; Cabrales, Pedro J et al. (2016) Senp1 drives hypoxia-induced polycythemia via GATA1 and Bcl-xL in subjects with Monge's disease. J Exp Med 213:2729-2744
Doan, Son; Maehara, Cleo K; Chaparro, Juan D et al. (2016) Building a Natural Language Processing Tool to Identify Patients With High Clinical Suspicion for Kawasaki Disease from Emergency Department Notes. Acad Emerg Med 23:628-36
Shi, Haoyi; Jiang, Chao; Dai, Wenrui et al. (2016) Secure Multi-pArty Computation Grid LOgistic REgression (SMAC-GLORE). BMC Med Inform Decis Mak 16 Suppl 3:89

Showing the most recent 10 out of 155 publications