Our previous funding round of this project was devoted to creating the underlying data standards and infrastructure to support sharing microarray data. The availability of the MGED (Microarray and Gene Expression Data Society;www.mged.org) standards and infrastructure and our accumulated experience using and evaluating them puts us in an excellent position to deliver tools and resources directly to the bench biologists who are generating high-throughput gene expression data: We now propose to shift our main efforts away from building computational infrastructure - the over-arching purpose of this new proposal is to facilitate scientific discovery by providing bench biologists with the tools they need to effectively share gene expression data and to take advantage of well-annotated gene expression data in their research. Our work to create and promote data sharing standards and resources will have greatest impact when those standards and resources are in common use by biomedical researchers. The need for and potential benefits of standards for microarray and other high-throughput technologies is clear, yet the positive impact of the standards thus far developed has not yet been fully realized. This is in large part because the standards and tools we have developed still require expert knowledge, yet the targeted users are bench biologists who are not experts in this domain, and should not be expected to become so. In this application, in addition to building the next generation of data exchange standards, we are proposing to use the infrastructure we have built in the previous round of funding as the basis for data exchange resources that are useful and usable by bench biologists. Therefore, our aims are to: 1. Develop tools to help researchers easily annotate microarray experiments. 2. Extend popular data analysis and visualization tools (BioConductor, MeV, GenePattern, Java TreeView) to use MAGE-TAB-encoded experimental annotations. 3. Generalize the MAGE-TAB data exchange standard to work with other high-throughput biomedical data, such as ultra-high-throughput sequencing. 4. Provide biologist-friendly ontology terms that can be used to annotate microarray data as well as serve as meaningful terms in computational analyses. 5. Participate in outreach, education and information gathering efforts to engage the community in the development of standards and to ensure widespread use and critiques.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Biotechnology Resource Grants (P41)
Project #
5P41HG003619-05
Application #
7692337
Study Section
Ethical, Legal, Social Implications Review Committee (GNOM)
Program Officer
Good, Peter J
Project Start
2005-08-23
Project End
2011-07-31
Budget Start
2009-08-01
Budget End
2010-07-31
Support Year
5
Fiscal Year
2009
Total Cost
$745,000
Indirect Cost
Name
Stanford University
Department
Biochemistry
Type
Schools of Medicine
DUNS #
009214214
City
Stanford
State
CA
Country
United States
Zip Code
94305
Bandrowski, Anita; Brinkman, Ryan; Brochhausen, Mathias et al. (2016) The Ontology for Biomedical Investigations. PLoS One 11:e0154556
Kolesnikov, Nikolay; Hastings, Emma; Keays, Maria et al. (2015) ArrayExpress update--simplifying data submissions. Nucleic Acids Res 43:D1113-6
Parkinson, Helen; Sarkans, Ugis; Kolesnikov, Nikolay et al. (2011) ArrayExpress update--an archive of microarray and high-throughput sequencing-based functional genomics experiments. Nucleic Acids Res 39:D1002-4
Lukk, Margus; Kapushesky, Misha; Nikkilä, Janne et al. (2010) A global map of human gene expression. Nat Biotechnol 28:322-4
Shankar, Ravi; Parkinson, Helen; Burdett, Tony et al. (2010) Annotare--a tool for annotating high-throughput biomedical investigations and resulting data. Bioinformatics 26:2470-1
Parkinson, Helen; Kapushesky, Misha; Kolesnikov, Nikolay et al. (2009) ArrayExpress update--from an archive of functional genomics experiments to the atlas of gene expression. Nucleic Acids Res 37:D868-72
Kauffmann, Audrey; Rayner, Tim F; Parkinson, Helen et al. (2009) Importing ArrayExpress datasets into R/Bioconductor. Bioinformatics 25:2092-4
Maier, Don; Wymore, Farrell; Sherlock, Gavin et al. (2008) The XBabelPhish MAGE-ML and XML translator. BMC Bioinformatics 9:28
Jones, Andrew R; Miller, Michael; Aebersold, Ruedi et al. (2007) The Functional Genomics Experiment model (FuGE): an extensible framework for standards in functional genomics. Nat Biotechnol 25:1127-33
Whetzel, Patricia L; Parkinson, Helen; Causton, Helen C et al. (2006) The MGED Ontology: a resource for semantics-based description of microarray experiments. Bioinformatics 22:866-73

Showing the most recent 10 out of 14 publications