Every possible missense variant that is compatible with life is likely present in the germline of a living human. Some of these variants alter protein activity or abundance, and, consequently, may impact disease risk. However, only ~2% of all presently reported missense variants have clinical interpretations. Most of the remaining variants, as well as nearly all missense variants not yet observed, are rare and cannot be interpreted using traditional approaches, creating a major challenge for the clinical use of genomic information. Our goal is to address this challenge by measuring the functional consequences of nearly every possible missense variant in clinically relevant proteins using deep mutational scanning. In a deep mutational scan, a library of protein variants is subjected to selection for the function of the protein, and high-throughput DNA sequencing is used to read out the enrichment or depletion of each variant, revealing the variant's function. Despite recent progress, deep mutational scanning suffers from two major limitations. The first lies in the requirement to handcraft a specific assay for the function of each protein. With over 4,000 disease-associated genes in the human genome, this one-at-a-time approach is impractical. Thus, we propose Variant Abundance by Massively Parallel Sequencing (VAMP-seq), a functional assay that is both informative of variant effect and generalizable to many proteins. The assay is based on the fact that, despite their diversity, most proteins share a key requirement: they must be abundant enough to perform their molecular function. We will generate VAMP-seq abundance data for nearly all possible missense variants in a set of ten clinically important proteins, refining VAMP-seq as a tool for assessing missense variation in many, if not most, disease-relevant genes. We will also combine VAMP-seq with chemical perturbations to reveal fundamental features of protein synthesis, folding and degradation, as well as to identify variants whose low abundance could be ameliorated pharmacologically. The second major limitation is that deep mutational scans typically quantify the effect of variants on a protein's activity or on cell growth. These simple measurements sometimes fail to capture the complexity of the relationship between genotype and human phenotype. Thus, we propose Microscope- Assisted Visuospatial Sorting (MAViS), which will enable multiplex assessment of variant effects on more complex phenotypes like a cell's internal organization, shape or behavior. We will apply MAViS to several disease-related genes, generating rich phenotypic data for nearly all possible missense variants. The data we gather from both VAMP-seq and MAViS will be used to generate comprehensive ?look-up tables? describing the effects of nearly every missense variant in each gene. We will also analyze these variant effects in the context of known pathogenic and benign variants, using a learning-based approach to make comprehensive predictions of missense variant pathogenicity.

Public Health Relevance

Every possible mutation that is compatible with life is likely present in a human alive today. However, we know the consequences of only a tiny fraction of these mutations, creating a major challenge for the clinical use of genomic information. We propose to measure the effect of hundreds of thousands of mutations simultaneously, using this information to predict the effect of these mutations on human health.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM109110-06
Application #
9744796
Study Section
Genetic Variation and Evolution Study Section (GVE)
Program Officer
Krasnewich, Donna M
Project Start
2014-09-01
Project End
2022-08-31
Budget Start
2019-09-01
Budget End
2020-08-31
Support Year
6
Fiscal Year
2019
Total Cost
Indirect Cost
Name
University of Washington
Department
Genetics
Type
Schools of Medicine
DUNS #
605799469
City
Seattle
State
WA
Country
United States
Zip Code
98195
Gray, Vanessa E; Hause, Ronald J; Luebeck, Jens et al. (2018) Quantitative Missense Variant Effect Prediction Using Large-Scale Mutagenesis Data. Cell Syst 6:116-124.e3
Matreyek, Kenneth A; Starita, Lea M; Stephany, Jason J et al. (2018) Multiplex assessment of protein variant abundance by massively parallel sequencing. Nat Genet 50:874-882
Rose, John C; Stephany, Jason J; Wei, Cindy T et al. (2018) Rheostatic Control of Cas9-Mediated DNA Double Strand Break (DSB) Generation and Genome Editing. ACS Chem Biol 13:438-442
McDonald, Matthew G; Ray, Sutapa; Amorosi, Clara J et al. (2017) Expression and Functional Characterization of Breast Cancer-Associated Cytochrome P450 4Z1 in Saccharomyces cerevisiae. Drug Metab Dispos 45:1364-1371
Taskinen, Barbara; Ferrada, Evandro; Fowler, Douglas M (2017) Early emergence of negative regulation of the tyrosine kinase Src by the C-terminal Src kinase. J Biol Chem 292:18518-18529
Starita, Lea M; Ahituv, Nadav; Dunham, Maitreya J et al. (2017) Variant Interpretation: Functional Assays to the Rescue. Am J Hum Genet 101:315-325
Rubin, Alan F; Gelman, Hannah; Lucas, Nathan et al. (2017) A statistical framework for analyzing deep mutational scanning data. Genome Biol 18:150
Manolio, Teri A; Fowler, Douglas M; Starita, Lea M et al. (2017) Bedside Back to Bench: Building Bridges between Basic and Clinical Genomic Research. Cell 169:6-12
Weile, Jochen; Sun, Song; Cote, Atina G et al. (2017) A framework for exhaustively mapping functional missense variants. Mol Syst Biol 13:957
Relling, M V; Krauss, R M; Roden, D M et al. (2017) New Pharmacogenomics Research Network: An Open Community Catalyzing Research and Translation in Precision Medicine. Clin Pharmacol Ther 102:897-902

Showing the most recent 10 out of 15 publications