Software for the analysis of large-scale genotyping and sequencing studies

Purcell, Shaun

Abstract

The objective of this project is to develop software for the analysis of data from large-scale genotyping and sequencing studies, building on the existing software package PLINK and its companion package PLINK/Seq. Designed to manipulate and analyze whole-genome SNP datasets, PLINK has been actively developed for over six years and has a wide base of users, with over 5000 citations in peer-reviewed journals. Over the past years, we have added considerable support for the analysis of large rare variant datasets, primarily focused on whole-exome sequencing studies in PLINK/Seq. In this renewal application, we seek to 1) provide tighter integration between PLINK and PLINK/Seq, aiming to provide a single interface for both genome-wide association and sequencing studies, particularly in the context of large statistically-imputed datasets;2) enhance the data-integration facilities already present, across different classes of genetic variation as well as large, diverse datasets;3) provide improved handling of family-based datasets, focused on de novo and inherited variation in (nuclear) family-based association studies;4) to work on improving performance on very large datasets. Particular attention will be paid to ensure interoperability with other major software, file-formats and resources that are generated by the broader genetics community.

Public Health Relevance

This Project is to develop software for the analysis of large datasets from modern genetic studies. New high-throughput genotyping and sequencing technologies are capable of producing vast amounts of data, but there is a need for analytic tools that biomedical researchers can use. These studies have the potential to uncover genetic determinants for a large number of diseases and traits, which can be relevant for prediction of risk, and give insight into novel targets for treatments.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 2R01HG005827-05A1
Application #: 8762148
Study Section: Biodata Management and Analysis Study Section (BDMA)
Program Officer: Brooks, Lisa

Project Start: 2010-09-27
Project End: 2017-06-30
Budget Start: 2014-08-01
Budget End: 2015-06-30
Support Year: 5
Fiscal Year: 2014
Total Cost: $370,000
Indirect Cost: $136,385

Institution

Name: Icahn School of Medicine at Mount Sinai
Department: Psychiatry
Type: Schools of Medicine
DUNS #: 078861598

City: New York
State: NY
Country: United States
Zip Code: 10029

Related projects


NIH 2016 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Brigham and Women's Hospital	$370,000
NIH 2015 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Icahn School of Medicine at Mount Sinai
NIH 2014 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Icahn School of Medicine at Mount Sinai	$370,000
NIH 2012 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Icahn School of Medicine at Mount Sinai	$335,610
NIH 2011 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Massachusetts General Hospital	$95,568
NIH 2011 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Icahn School of Medicine at Mount Sinai	$244,092
NIH 2010 R01 HG	Software for the analysis of large-scale genotyping and sequencing studies Purcell, Shaun M. / Massachusetts General Hospital	$319,350

Publications

Grinde, Kelsey E; Qi, Qibin; Thornton, Timothy A et al. (2018) Generalizing polygenic risk scores from Europeans to Hispanics/Latinos. Genet Epidemiol :

Ruderfer, Douglas M; Charney, Alexander W; Readhead, Ben et al. (2016) Polygenic overlap between schizophrenia risk and antipsychotic response: a genomic medicine approach. Lancet Psychiatry 3:350-7

Ruderfer, Douglas M; Hamamsy, Tymor; Lek, Monkol et al. (2016) Patterns of genic intolerance of rare copy number variation in 59,898 human exomes. Nat Genet 48:1107-11

Rees, E; Kirov, G; Walters, J T et al. (2015) Analysis of exome sequence in 604 trios for recessive genotypes in schizophrenia. Transl Psychiatry 5:e607

Sham, Pak C; Purcell, Shaun M (2014) Statistical power and significance testing in large-scale genetic studies. Nat Rev Genet 15:335-46

Purcell, Shaun M; Moran, Jennifer L; Fromer, Menachem et al. (2014) A polygenic burden of rare disruptive mutations in schizophrenia. Nature 506:185-90

Fromer, Menachem; Pocklington, Andrew J; Kavanagh, David H et al. (2014) De novo mutations in schizophrenia implicate synaptic networks. Nature 506:179-84

Fromer, Menachem; Purcell, Shaun M (2014) Using XHMM Software to Detect Copy Number Variation in Whole-Exome Sequencing Data. Curr Protoc Hum Genet 81:7.23.1-21

Fromer, Menachem; Moran, Jennifer L; Chambert, Kimberly et al. (2012) Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. Am J Hum Genet 91:597-607

Ruderfer, D M; Kirov, G; Chambert, K et al. (2011) A family-based study of common polygenic variation and risk of schizophrenia. Mol Psychiatry 16:887-8

Comments

Be the first to comment on Shaun Purcell's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: