One of the critical tools for probing gene function is imaging spatial expression with confocal or higher resolution microscopy, yielding a pattern of when and where a particular gene is expressed. As a result of the increased availability of such spatial expression patterns and the increased importance of high throughput imaging, there is a need for general purpose tools for managing and analyzing spatial gene expression datasets for research and systems biology. We propose to develop and make available to the general community a suite of tools for managing and analyzing large-scale spatial gene expression datasets. For our large Drosophila spatial expression dataset, we have developed an image based virtual representation that has been practical for visualization, data mining and analysis, while being compact enough to distribute and store large expression datasets. Moreover, this representation allows for analyzing image based data without specialized image processing knowledge. We propose to enhance these tools by adding an integrated data management and analysis platform to a popular scientific image processing program, ImageJ. This ImageJ workbench will be complementary to existing web-interfaces and will enable biologists to perform additional analyses and bioinformatics researchers to extend the software by writing their own custom modules. We will expand the applicability of our virtual embryo representations to complex patterns found at later stages of development and extend microscope automation software to generate virtual representations from automatically acquired images. Finally, we plan to extend the capabilities of the software to process images from other model organisms and develop methods for cross species comparisons. The proposed toolkit will provide a strong foundation for integration of gene expression data with regulatory and gene sequences to promote research for discovering networks of regulatory interactions.

Public Health Relevance

Our understanding of the fundamental processes in molecular biology is limited by our incomplete understanding of the mechanism of gene regulation and expression. Previous studies have shown fundamental roles for regulatory regions and gene expression changes in human diseases. Elucidating the mechanisms responsible for genome-wide gene expression and regulation in the development of model organisms such as Drosophila will aid in understanding normal growth and differentiation of tissues in humans, prerequisites for understanding human disease.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Research Project (R01)
Project #
5R01GM097231-03
Application #
8539040
Study Section
Biodata Management and Analysis Study Section (BDMA)
Program Officer
Lyster, Peter
Project Start
2011-09-15
Project End
2015-08-31
Budget Start
2013-09-01
Budget End
2014-08-31
Support Year
3
Fiscal Year
2013
Total Cost
$374,244
Indirect Cost
$173,135
Name
Lawrence Berkeley National Laboratory
Department
Genetics
Type
Organized Research Units
DUNS #
078576738
City
Berkeley
State
CA
Country
United States
Zip Code
94720
Booth, Benjamin W; McParland, Charles; Beattie, Keith et al. (2018) OpenHiCAMM: High-Content Screening Software for Complex Microscope Imaging Workflows. iScience 2:136-140
Wu, Siqi; Joseph, Antony; Hammonds, Ann S et al. (2016) Stability-driven nonnegative matrix factorization to interpret spatial gene expression and build local gene networks. Proc Natl Acad Sci U S A 113:4290-5
Brown, James B; Boley, Nathan; Eisman, Robert et al. (2014) Diversity and dynamics of the Drosophila transcriptome. Nature 512:393-9
Hammonds, Ann S; Bristow, Christopher A; Fisher, William W et al. (2013) Spatial expression of transcription factors in Drosophila embryonic organ development. Genome Biol 14:R140
Schindelin, Johannes; Arganda-Carreras, Ignacio; Frise, Erwin et al. (2012) Fiji: an open-source platform for biological-image analysis. Nat Methods 9:676-82