The nematode C. elegans is one of the most thoroughly studied organisms in biology. It is used by thousands of scientists to gain insight into basic biological processes, many of which are shared with humans. Its genome is fully sequenced and is now the subject of several systematic studies to identify the functional elements of the genome and to integrate this knowledge into an understanding of the larger question of how a genome specifies an organism. Although disruption of DNA sequence is one of the most powerful ways to probe its function, mutations have been described for fewer than 8,000 of its >20,000 protein coding genes and almost none of the other functional elements, including regulatory sequences. RNAi, discovered in C. elegans, provides an alternative means of altering gene function, but it has several important limitations, including variable effectiveness, a requirement for an RNA intermediate, variable tissue sensitivity and a lack of heritability. The available mutants are not readily studied in large systematic projects;they exist in a variety of different backgrounds, and their sheer number complicates handling. The present project aims to create a set of 2,000 sequenced strains derived from mutagenesis, each with 500-750 mutations, in a standard background that will be readily used both by investigators who are pursuing the function of particular DNA elements and by investigators looking broadly across the genome for elements affecting particular phenotypes. In addition, through the creation of a large set of mutations that can be associated with phenotype the collection will provide a platform for understanding the broader question of how different amino acid changes in varying contexts may affect function. This latter question is of key importance, as large numbers of human variants will be discovered in whole genome and exome resequencing projects in the coming years, which will require a strong framework to facilitate interpretation. Such projects include the 1,000 genomes project already underway and the NHLBI project currently soliciting proposals (RFA-OD-09-004).

Public Health Relevance

For genomics to realize its full potential in improving public health and health care delivery, it will be essential for scientists and physicians to be able to predict the effect of the many rare mutations that each of us harbor. By generating a data set of more than 1,000,000 mutations across the C. elegans genome, we will provide a platform that will facilitate the discovery of both direct precedents and general principles that will enable improved predictions.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
High Impact Research and Research Infrastructure Programs (RC2)
Project #
Application #
Study Section
Special Emphasis Panel (ZGM1-GDB-7 (CR))
Program Officer
Brooks, Lisa
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Washington
Schools of Medicine
United States
Zip Code
Mok, Calvin A; Au, Vinci; Thompson, Owen A et al. (2017) MIP-MAP: High-Throughput Mapping of Caenorhabditis elegans Temperature-Sensitive Mutants via Molecular Inversion Probes. Genetics 207:447-463
Thompson, Owen A; Snoek, L Basten; Nijveen, Harm et al. (2015) Remarkably Divergent Regions Punctuate the Genome Assembly of the Caenorhabditis elegans Hawaiian Strain CB4856. Genetics 200:975-89
Thompson, Owen; Edgley, Mark; Strasbourger, Pnina et al. (2013) The million mutation project: a new approach to genetics in Caenorhabditis elegans. Genome Res 23:1749-62
Huang, Lijia; Szymanska, Katarzyna; Jensen, Victor L et al. (2011) TMEM237 is mutated in individuals with a Joubert syndrome related disorder and expands the role of the TMEM family at the ciliary transition zone. Am J Hum Genet 89:713-30