Conducting experiments on model organisms is fundamental to biomedical research. Three of the most important are budding yeast (fundamental studies), rat (pharmacological, behavioral and neurological studies) and zebrafish (developmental, neurological and toxicological studies). Databases to capture and curate the wealth of data on these model organisms have been established and are known collectively as Model Organism Databases (MODs). Modern biology has resulted in the complete DNA sequence (`genome') of the human as well as these model organisms. In turn this has led to a new era of research in which experiments are carried out at the whole genome scale. The success of genomics has fuelled a challenge to integrate genomic datasets within the MODs in such a way that querying them and extracting data in a flexible fashion is possible for all scientists;not just specialists known as bioinformaticians, although it is also important to provide bioinformaticians with powerful tools. As part of previous work in support of another model organism, the fruitfly, InterMine software was developed to greatly increase the power and flexibility with which scientists can utilize genomic data. InterMine was designed to be applied easily to other areas of biology and organisms. Indeed it is currently being used to manage data from the NIH-funded modENCODE project which is experimentally characterizing the entire genomes of the fruitfly and nematode model organisms.
The aim of this project is to apply the InterMine software to three MODs: budding yeast, rat and zebrafish. This provides a number of advantages to each database: functionality that their user communities demand but that are not yet available;a standard interface and set of functionality between MODs;an opportunity for the different MODs to inter-operate providing a tool to compare and contrast the behavior of genes and proteins between this set of organisms, a feature that is not generally available today. This project will be carried out as a collaboration between the team that developed InterMine, based in Cambridge UK, and the teams that develop and maintain the three MODs, based at Stanford University (yeast, SGD), the Medical College of Wisconsin (rat, RGD) and the University of Oregon (zebrafish, ZFIN). This proposal provides one staff member per site, and the resulting team will work together to transfer data into, and add analysis tools to, InterMine databases that will be integrated at each MOD site and within their user interfaces. A benefit of working together in this way is that developments at one site can immediately benefit the others. By the end of the project the MODs will be able to provide far greater functionality to their research communities, and improvements to the underpinning InterMine software will be freely available to the broader community. The proposed project is unique in its integration of experimental results across the major model organisms. This integration is essential for our advanced understanding of molecular genetics, cell biology, developmental biology, physiology, and most importantly, human health and disease.

Public Health Relevance

The recent decoding of the human genome sequence has unprecedented implications for the future of human healthcare through improved understanding of human development, functioning, aging and disease. However, much of the experimental work that has to be done to fully understand these events cannot be done in humans and must therefore be carried out in so-called model organisms. The proposed project will address a pressing need to improve the efficiency with which the huge amounts of Model Organism data being generated can be integrated, analysed and compared, which will lead to improved understanding of humans and thus to better disease diagnosis, prognosis, prevention and cure.

National Institute of Health (NIH)
National Human Genome Research Institute (NHGRI)
Research Project (R01)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-BST-Q (01))
Program Officer
Good, Peter J
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Cambridge
United Kingdom
Zip Code
CB2 1-TN
Shaw, David R (2016) Searching the Mouse Genome Informatics (MGI) Resources for Information on Mouse Biology from Genotype to Phenotype. Curr Protoc Bioinformatics 56:1.7.1-1.7.16
Ruzicka, Leyla; Bradford, Yvonne M; Frazer, Ken et al. (2015) ZFIN, The zebrafish model organism database: Updates and new directions. Genesis 53:498-509
Eppig, Janan T; Richardson, Joel E; Kadin, James A et al. (2015) Mouse Genome Informatics (MGI): reflecting on 25 years. Mamm Genome 26:272-84
Lyne, Rachel; Sullivan, Julie; Butano, Daniela et al. (2015) Cross-organism analysis using InterMine. Genesis 53:547-60
Motenko, H; Neuhauser, S B; O'Keefe, M et al. (2015) MouseMine: a new data warehouse for MGI. Mamm Genome 26:325-30
Desvignes, T; Batzel, P; Berezikov, E et al. (2015) miRNA Nomenclature: A View Incorporating Genetic Origins, Biosynthetic Pathways, and Sequence Variants. Trends Genet 31:613-626
Kalderimis, Alex; Lyne, Rachel; Butano, Daniela et al. (2014) InterMine: extensive web services for modern biology. Nucleic Acids Res 42:W468-72
Sullivan, Julie; Karra, Kalpana; Moxon, Sierra A T et al. (2013) InterMOD: integrated data and tools for the unification of model organism research. Sci Rep 3:1802
Howe, Douglas G; Bradford, Yvonne M; Conlin, Tom et al. (2013) ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics. Nucleic Acids Res 41:D854-60
Wong, Edith D; Karra, Kalpana; Hitz, Benjamin C et al. (2013) The YeastGenome app: the Saccharomyces Genome Database at your fingertips. Database (Oxford) 2013:bat004

Showing the most recent 10 out of 15 publications