We propose to create IDASH, a national center for biomedical computing that will develop new algorithms, open-source tools, computational infrastructure and services that will enable biomedical and behavioral researchers nationwide to integrate Data for Analysis, Anonymization, and Sharing, IDASH will address fundamental challenges to research progress by providing a secure, privacy-preserving environment in which researchers can analyze genomic, transcriptomic and highly annotated phenotypical data. Leveraging the high performance capabilities of the San Diego Supercomputer Center (SDSC), and scalable cyberinfrastructure developed by the California Institute for Telecommunications and Information Technology, iDASH will provide synergistic application of tools and systems to advance research and improve human health. iDASH will focus on privacy protection through anonymization, data simulation, and an informed consent management system. It will focus on data analysis through the development of new tools for data annotation and integration across temporal and spatial dimensions, and develop algorithms for rare event detection and risk adjustment. To enable efficient analysis of short-reads from massively parallel sequencing, compression algorithms and a new genomic query system will be developed. Three Driving Biological Projects that span the molecular-individual-population spectrum will motivate, inform and support tool development: (1) Molecular Phenotyping of Kawasaki Disease;(2) Post-Marketing Pharmacosurveillance of Anticoagulation Agents;(3) Individualized Intervention to Enhance Physical Activity. iDASH trainees will complete core biomedical informatics courses and will have options for short- and long-term graduate training at San Diego State University and UCSD. We will collaborate with other NCBCs and disseminate tools via annual workshops for users and developers, presentations at major conferences, and scientific publications. We will develop a comprehensive web portal to download tools, upload data, and obtain documentation and user-friendly training materials. An experienced leadership team will use effective project management practices to support collaboration as well as monitor and ensure progress toward iDASH goals.

Public Health Relevance

Contemporary biomedical and behavioral research requires significant computational resources. There is an increasing divide between researchers who have these resources and those who do not. iDASH will decrease this gap and accelerate discoveries by providing innovative services, algorithms, open-source software, infrastructure, and training to facilitate data analysis and sharing by biomedical researchers.

National Institute of Health (NIH)
National Heart, Lung, and Blood Institute (NHLBI)
Specialized Center--Cooperative Agreements (U54)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-K (52))
Program Officer
Wells, Barbara L
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of California San Diego
Internal Medicine/Medicine
Schools of Medicine
La Jolla
United States
Zip Code
Ji, Zhanglong; Jiang, Xiaoqian; Wang, Shuang et al. (2014) Differentially private distributed logistic regression using private and public data. BMC Med Genomics 7 Suppl 1:S14
Li, Zhonghan; Chao, Ti-Chun; Chang, Kung-Yen et al. (2014) The long noncoding RNA THRIL regulates TNF? expression through its interaction with hnRNPL. Proc Natl Acad Sci U S A 111:1002-7
Hinske, Ludwig Christian; Fran├ža, Gustavo S; Torres, Hugo A M et al. (2014) miRIAD-integrating microRNA inter- and intragenic data. Database (Oxford) 2014:
Kinsella, Marcus; Patel, Anand; Bafna, Vineet (2014) The elusive evidence for chromothripsis. Nucleic Acids Res 42:8231-42
Patel, Anand; Schwab, Richard; Liu, Yu-Tsueng et al. (2014) Amplification and thrifty single-molecule sequencing of recurrent somatic structural variations. Genome Res 24:318-28
Hepler, N Lance; Scheffler, Konrad; Weaver, Steven et al. (2014) IDEPI: rapid prediction of HIV-1 antibody epitopes and other phenotypic features from sequence data using a flexible machine learning platform. PLoS Comput Biol 10:e1003842
Ronen, Roy; Zhou, Dan; Bafna, Vineet et al. (2014) The genetic basis of chronic mountain sickness. Physiology (Bethesda) 29:403-12
Gordon, C T; Jimenez-Fernandez, S; Daniels, L B et al. (2014) Pregnancy in women with a history of Kawasaki disease: management and outcomes. BJOG 121:1431-8
Gan, Zhuohui; Wang, Jianwu; Salomonis, Nathan et al. (2014) MAAMD: a workflow to standardize meta-analyses and comparison of affymetrix microarray data. BMC Bioinformatics 15:69
Doan, Son; Lin, Ko-Wei; Conway, Mike et al. (2014) PhenDisco: phenotype discovery system for the database of genotypes and phenotypes. J Am Med Inform Assoc 21:31-6

Showing the most recent 10 out of 58 publications