EDAC: ENCODE Data Analysis Center

Weng, Zhiping

Abstract

The objective of the Encyclopedia of DNA Elements (ENCODE) Project is to provide a complete inventory of all functional elements in the human genome using high-throughput experiments as well as computational methods. This proposal aims to create the ENCODE Data Analysis Center (EDAC, or the DAC), consisting of a multi-disciplinary group of leading scientists who will respond to directions from the Analysis Working Group (AWG) of ENCODE and thus integrate data generated by all groups in the ENCODE Consortium in an unbiased manner. These analyses will substantially augment the value of the ENCODE data by integrating diverse data types. The DAC members are leaders in their respective fields of bioinformatics, computational machine learning, algorithm development, and statistical theory and application to genomic data (Zhiping Weng, Manolis Kellis, Mark Gerstein, Mark Daly, Roderic Guigo, Shirley Liu, Rafael Irizarry, and William Noble). They have a strong track record of delivering collaborative analysis in the context of the ENCODE and modENCODE Projects, in which this group of researchers was responsible for much of the analyses and the majority of the figures and tables in the ENCODE and modENCODE papers. The proposed DAC will pursue goals summarized as the following seven aims:
Aim 1. To work with the AWG to define and prioritize integrative analyses of ENCODE data;
Aim 2. To provide shared computational guidelines and infrastructure for data processing, common analysis tasks, and data exchange;
Aim 3. To facilitate and carry out data integration for element-specific analyses;
Aim 4. To facilitate and carry out exploratory data analyses across elements;
Aim 5. To facilitate and carry out comparative analyses across human, mouse, fly, and worm;
Aim 6. To facilitate integration with the genome-wide association studies community and disease datasets;
Aim 7. To facilitate writing Consortium papers and assist in evaluating ENCODE data.

Public Health Relevance

The Encyclopedia of DNA Elements (ENCODE) Project is a coordinated effort to apply high-throughput, cost-efficient approaches to generate a comprehensive catalog of functional elements in the human genome. This proposal establishes a data analysis center to support, facilitate, and enhance integrative analyses of the ENCODE Consortium, with the ultimate goal of helping the scientific and medical communities interpret the human genome and use it to understand human biology and improve human health.

Funding Agency

Agency: National Institutes of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Biotechnology Resource Cooperative Agreements (U41)
Project #: 5U41HG007000-02
Application #: 8548395
Study Section: Special Emphasis Panel (ZHG1-HGR-M (M3))
Program Officer: Pazin, Michael J

Project Start: 2012-09-21
Project End: 2016-07-31
Budget Start: 2013-08-01
Budget End: 2014-07-31
Support Year: 2
Fiscal Year: 2013
Total Cost: $1,871,839
Indirect Cost: $269,195

Institution

Name: University of Massachusetts Medical School Worcester
Department: Biostatistics & Other Math Sci
Type: Schools of Medicine
DUNS #: 603847393

City: Worcester
State: MA
Country: United States
Zip Code: 01655

Related projects


NIH 2016 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester	$1,378,926
NIH 2015 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester	$2,005,492
NIH 2014 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester
NIH 2013 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester	$1,871,839
NIH 2013 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester	$115,680
NIH 2012 U41 HG	EDAC: ENCODE Data Analysis Center Weng, Zhiping / University of Massachusetts Medical School Worcester	$2,460,045

Publications

Chan, Rachel C W; Libbrecht, Maxwell W; Roberts, Eric G et al. (2018) Segway 2.0: Gaussian mixture models and minibatch training. Bioinformatics 34:669-671

Fu, Shaliu; Wang, Qin; Moore, Jill E et al. (2018) Differential analysis of chromatin accessibility and histone modifications for predicting mouse developmental enhancers. Nucleic Acids Res 46:11184-11201

Miyamoto, Kei; Nguyen, Khoi T; Allen, George E et al. (2018) Chromatin Accessibility Impacts Transcriptional Reprogramming in Oocytes. Cell Rep 24:304-311

Libbrecht, Maxwell W; Bilmes, Jeffrey A; Noble, William Stafford (2018) Choosing non-redundant representative subsets of protein sequence data sets using submodular optimization. Proteins 86:454-466

Dixon, Jesse R; Xu, Jie; Dileep, Vishnu et al. (2018) Integrative detection and analysis of structural variation in cancer genomes. Nat Genet 50:1388-1398

Ursu, Oana; Boley, Nathan; Taranova, Maryna et al. (2018) GenomeDISCO: a concordance score for chromosome conformation capture experiments using random walks on contact map graphs. Bioinformatics 34:2701-2707

Lagarde, Julien; Uszczynska-Ratajczak, Barbara; Carbonell, Silvia et al. (2017) High-throughput annotation of full-length long noncoding RNAs with capture long-read sequencing. Nat Genet 49:1731-1740

Yardımcı, Galip Gürkan; Noble, William Stafford (2017) Software tools for visualizing Hi-C data. Genome Biol 18:26

Yang, Tao; Zhang, Feipeng; Yardımcı, Galip Gürkan et al. (2017) HiCRep: assessing the reproducibility of Hi-C data using a stratum-adjusted correlation coefficient. Genome Res 27:1939-1949

Yan, Koon-Kiu; Yardimci, Galip Gürkan; Yan, Chengfei et al. (2017) HiC-spector: a matrix library for spectral and reproducibility analysis of Hi-C contact maps. Bioinformatics 33:2199-2201

Showing the most recent 10 out of 51 publications
