Gene expression measurement technology has evolved rapidly over the last two years, with the use of large arrays of cDNA taken from EST libraries receiving considerable attention. Fluorescent-tagged probe DNA hybridized to microarrays printed on glass is one promising technology. Another approach which hybridizes P33-labeled probe DNA to arrays on nylon membranes has the advantage of readily available reagents and instrumentation, greater sensitivity and ability to use smaller samples. Images of arrays of either sort must be quantified to produce a list of numerical intensities proportional to the expression levels corresponding to the gene fragments placed at each spot in the array. Using bioinformatics tools, spots on these arrays must be associated with sequence information for the corresponding clones, Unigene clusters, genes and protein products. Links to structural and functional information are also required.Numerous statistical, image processing and bioinformatics problems confront users of these technologies. As arrays can be constructed to contain thousands of spots, manual analysis of the resulting images is not feasible. Further, as investigators seek to couple this technology with laser capture micro dissection (LCM) in the analysis of pathological tissue, the technology itself must be refined and improved. Accordingly, this projects seeks to address problems in this area at the statistical, numerical, computational, and informatics levels. Progress in FY98: Working with several laboratories in NCI, NICHD, NIHGR, and NIDR, we have analyzed over 100 array images to estimate intensity levels representing over 500,000 DNA hybridization measurements. To accomplish this, a new program PSCAN was developed, which facilitates the image-processing steps of the analysis and produces optimal estimates of spot intensities. The program is written in MATLAB, and the code is being made publicly available, and a Web-site distribution site is being established. Initially, the image analysis problem requires parsing the image, recognizing and identifying the """"""""lattice"""""""" of spots, determining local or global background intensity and estimating the """"""""spread"""""""" of intensity due to the non-locality of radioactive nuclide detection. Appropriate corrections for these factors are attempted.Our analysis method relies on a number of data visualization tools, and allows users to identify significantly over- or under-expressed genes in a comparative study. Importantly, these techniques also allow users to identify experimental artifacts, outliers and other data anomalies which are present and a large percentage of hybrization studies, such as non-constant background hybridization, image defects, dropouts, printing artifacts, spot bleeds, etc. A set of rules has evolved which allow for calibration of two images, comparison of more than two images, determination of the correlation of spot intensities between experiments, and use of principle components and clustering to identify potential patterns of expression within a moderate to large set of experiments.Our statistical analysis has quantified the contributions to measurement error due to a variety of factors including spot-to-spot, filter-to-filter, hybridization, mRNA preparation, cDNA preparation, quantification, image acquisition, and sample-to-sample variability. Our findings emphasize the need for careful control of each of these sources before this technique becomes deployed in large-scale clinical studies. Our findings will also be valuable to developers of arrays in that they suggest improved spotting configurations and array designs. Moreover, as experience with each array format over several different labs increases, the bioinformatic value of each spot or clone increases. Those spots which are highly sensitive to experimental conditions can be identified while other spots whose intensity is essentially unchanged, possibly identified as """"""""control"""""""" spots, may become useful for calibration purposes. By applying our software tools to many experiments, we have gained practical experience with this assay technique, and now collaborate in the refinement of this technology and the design of future experiments.

Agency
National Institute of Health (NIH)
Institute
Center for Information Technology (CIT)
Type
Intramural Research (Z01)
Project #
1Z01CT000266-01
Application #
6103847
Study Section
Special Emphasis Panel (MSCL)
Project Start
Project End
Budget Start
Budget End
Support Year
1
Fiscal Year
1998
Total Cost
Indirect Cost
Name
Center for Information Technology
Department
Type
DUNS #
City
State
Country
United States
Zip Code
Deans, Katherine J; Minneci, Peter C; Chen, Hao et al. (2009) Impact of animal strain on gene expression in a rat model of acute cardiac rejection. BMC Genomics 10:280
Raghavachari, Nalini; Xu, Xiuli; Munson, Peter J et al. (2009) Characterization of whole blood gene expression profiles as a sequel to globin mRNA reduction in patients with sickle cell disease. PLoS One 4:e6484
Greenwell-Wild, Teresa; Vazquez, Nancy; Jin, Wenwen et al. (2009) Interleukin-27 inhibition of HIV-1 involves an intermediate induction of type I interferon. Blood 114:1864-74
Nares, Salvador; Moutsopoulos, Niki M; Angelov, Nikola et al. (2009) Rapid myeloid cell transcriptional and proteomic responses to periodontopathogenic Porphyromonas gingivalis. Am J Pathol 174:1400-14
Raat, Nicolaas J H; Noguchi, Audrey C; Liu, Virginia B et al. (2009) Dietary nitrate and nitrite modulate blood and organ nitrite and the cellular ischemic stress response. Free Radic Biol Med 47:510-7
Woszczek, Grzegorz; Chen, Li-Yuan; Nagineni, Sahrudaya et al. (2008) Leukotriene D(4) induces gene expression in human monocytes through cysteinyl leukotriene type I receptor. J Allergy Clin Immunol 121:215-221.e1
Hernandez-Novoa, Beatriz; Bishop, Lisa; Logun, Carolea et al. (2008) Immune responses to Pneumocystis murina are robust in healthy mice but largely absent in CD40 ligand-deficient mice. J Leukoc Biol 84:420-30
Coppey, Mathieu; Boettiger, Alistair N; Berezhkovskii, Alexander M et al. (2008) Nuclear trapping shapes the terminal gradient in the Drosophila embryo. Curr Biol 18:915-9
Raghavachari, Nalini; Xu, Xiuli; Harris, Amy et al. (2007) Amplified expression profiling of platelet transcriptome reveals changes in arginine metabolic pathways in patients with sickle cell disease. Circulation 115:1551-62
Elshal, Mohamed F; Khan, Sameena S; Raghavachari, Nalini et al. (2007) A unique population of effector memory lymphocytes identified by CD146 having a distinct immunophenotypic and genomic profile. BMC Immunol 8:29

Showing the most recent 10 out of 46 publications