Extending InterMine to yeast rat and zebrafish model organism databases

Micklem, Gos; Cherry, Joe; Twigger, Simon; Westerfield, Monte

Abstract

Conducting experiments on model organisms is fundamental to biomedical research. Three of the most important are budding yeast (fundamental studies), rat (pharmacological, behavioral and neurological studies) and zebrafish (developmental, neurological and toxicological studies). Databases to capture and curate the wealth of data on these model organisms have been established and are known collectively as Model Organism Databases (MODs). Modern biology has resulted in the complete DNA sequence (`genome') of the human as well as these model organisms. In turn this has led to a new era of research in which experiments are carried out at the whole genome scale. The success of genomics has fuelled a challenge to integrate genomic datasets within the MODs in such a way that querying them and extracting data in a flexible fashion is possible for all scientists;not just specialists known as bioinformaticians, although it is also important to provide bioinformaticians with powerful tools. As part of previous work in support of another model organism, the fruitfly, InterMine software was developed to greatly increase the power and flexibility with which scientists can utilize genomic data. InterMine was designed to be applied easily to other areas of biology and organisms. Indeed it is currently being used to manage data from the NIH-funded modENCODE project which is experimentally characterizing the entire genomes of the fruitfly and nematode model organisms.
The aim of this project is to apply the InterMine software to three MODs: budding yeast, rat and zebrafish. This provides a number of advantages to each database: functionality that their user communities demand but that are not yet available;a standard interface and set of functionality between MODs;an opportunity for the different MODs to inter-operate providing a tool to compare and contrast the behavior of genes and proteins between this set of organisms, a feature that is not generally available today. This project will be carried out as a collaboration between the team that developed InterMine, based in Cambridge UK, and the teams that develop and maintain the three MODs, based at Stanford University (yeast, SGD), the Medical College of Wisconsin (rat, RGD) and the University of Oregon (zebrafish, ZFIN). This proposal provides one staff member per site, and the resulting team will work together to transfer data into, and add analysis tools to, InterMine databases that will be integrated at each MOD site and within their user interfaces. A benefit of working together in this way is that developments at one site can immediately benefit the others. By the end of the project the MODs will be able to provide far greater functionality to their research communities, and improvements to the underpinning InterMine software will be freely available to the broader community. The proposed project is unique in its integration of experimental results across the major model organisms. This integration is essential for our advanced understanding of molecular genetics, cell biology, developmental biology, physiology, and most importantly, human health and disease.

Public Health Relevance

The recent decoding of the human genome sequence has unprecedented implications for the future of human healthcare through improved understanding of human development, functioning, aging and disease. However, much of the experimental work that has to be done to fully understand these events cannot be done in humans and must therefore be carried out in so-called model organisms. The proposed project will address a pressing need to improve the efficiency with which the huge amounts of Model Organism data being generated can be integrated, analysed and compared, which will lead to improved understanding of humans and thus to better disease diagnosis, prognosis, prevention and cure.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Human Genome Research Institute (NHGRI)
Type: Research Project (R01)
Project #: 5R01HG004834-02
Application #: 7793467
Study Section: Special Emphasis Panel (ZRG1-BST-Q (01))
Program Officer: Good, Peter J

Project Start: 2009-03-26
Project End: 2011-07-17
Budget Start: 2010-03-01
Budget End: 2011-07-17
Support Year: 2
Fiscal Year: 2010
Total Cost: $567,842
Indirect Cost

Institution

Name: University of Cambridge
Department
Type
DUNS #: 226552610

City: Cambridge
State
Country: United Kingdom
Zip Code: CB2 1-TN

Related projects


NIH 2013 R01 HG	InterMOD: integrated data and tools to support model organism research Micklem, Gos; Cherry, Joe Michael; Richardson, Joel E.; Stein, Lincoln D.; Westerfield, Monte; Worthey, Elizabeth A. / University of Cambridge	$541,742
NIH 2012 R01 HG	InterMOD: integrated data and tools to support model organism research Micklem, Gos; Cherry, Joe Michael; Richardson, Joel E.; Stein, Lincoln D.; Westerfield, Monte; Worthey, Elizabeth A. / University of Cambridge	$567,269
NIH 2011 R01 HG	InterMOD: integrated data and tools to support model organism research Micklem, Gos; Cherry, Joe Michael; Richardson, Joel E.; Stein, Lincoln D.; Twigger, Simon N.; Westerfield, Monte / University of Cambridge	$567,842
NIH 2010 R01 HG	Extending InterMine to yeast rat and zebrafish model organism databases Micklem, Gos; Cherry, Joe Michael; Twigger, Simon N.; Westerfield, Monte / University of Cambridge	$567,842
NIH 2009 R01 HG	Extending InterMine to yeast rat and zebrafish model organism databases Micklem, Gos; Cherry, Joe Michael; Twigger, Simon N.; Westerfield, Monte / University of Cambridge	$425,000
NIH 2009 R01 HG	Extending InterMine to yeast rat and zebrafish model organism databases Micklem, Gos; Cherry, Joe Michael; Twigger, Simon N.; Westerfield, Monte / University of Cambridge	$135,359

Publications

Shaw, David R (2016) Searching the Mouse Genome Informatics (MGI) Resources for Information on Mouse Biology from Genotype to Phenotype. Curr Protoc Bioinformatics 56:1.7.1-1.7.16

Eppig, Janan T; Richardson, Joel E; Kadin, James A et al. (2015) Mouse Genome Informatics (MGI): reflecting on 25 years. Mamm Genome 26:272-84

Lyne, Rachel; Sullivan, Julie; Butano, Daniela et al. (2015) Cross-organism analysis using InterMine. Genesis 53:547-60

Motenko, H; Neuhauser, S B; O'Keefe, M et al. (2015) MouseMine: a new data warehouse for MGI. Mamm Genome 26:325-30

Desvignes, T; Batzel, P; Berezikov, E et al. (2015) miRNA Nomenclature: A View Incorporating Genetic Origins, Biosynthetic Pathways, and Sequence Variants. Trends Genet 31:613-626

Ruzicka, Leyla; Bradford, Yvonne M; Frazer, Ken et al. (2015) ZFIN, The zebrafish model organism database: Updates and new directions. Genesis 53:498-509

Kalderimis, Alex; Lyne, Rachel; Butano, Daniela et al. (2014) InterMine: extensive web services for modern biology. Nucleic Acids Res 42:W468-72

Wong, Edith D; Karra, Kalpana; Hitz, Benjamin C et al. (2013) The YeastGenome app: the Saccharomyces Genome Database at your fingertips. Database (Oxford) 2013:bat004

Sullivan, Julie; Karra, Kalpana; Moxon, Sierra A T et al. (2013) InterMOD: integrated data and tools for the unification of model organism research. Sci Rep 3:1802

Howe, Douglas G; Bradford, Yvonne M; Conlin, Tom et al. (2013) ZFIN, the Zebrafish Model Organism Database: increased support for mutants and transgenics. Nucleic Acids Res 41:D854-60

Showing the most recent 10 out of 15 publications

Comments

Be the first to comment on Gos Micklem's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: