Efforts to provide centralized access in public databases for storing and disseminating large amounts of gene expression data have proved successful. The next challenge in data distribution will be providing normalized data across experiments to enable scientists to interrogate data in a larger scale across wider conditions. To achieve this goal, a standardized normalization algorithm suitable for both intra- and inter-experiment datasets is needed to ensure comparability. We developed a nonparametric normalization method called MDA (Maximal density approximation) by employing classification and approximation theory. The nonparametric feature imposes minimal requirement on data distributions, suggesting its applicability on cross-experiment data that tend to have higher degree of variability.
Our aim i s to extend the MDA method from intra- to inter-experiment data application and to derive an optimal standardized approach to meet the additional difficulties encountered in the analysis of inter-experiment data. Specifically, the proposed research aims first to establish a methodology for compatible inter-experiment data normalization. Extension of classification theory to image analysis will be addressed as well for improved performance. Finally, we propose to develop an algorithm enabling comparative analysis across incompatible datasets and technology platforms.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Career Transition Award (K22)
Project #
1K22LM008243-01A1
Application #
6869323
Study Section
Biomedical Library and Informatics Review Committee (BLR)
Program Officer
Ye, Jane
Project Start
2005-02-01
Project End
2008-01-31
Budget Start
2005-02-01
Budget End
2006-01-31
Support Year
1
Fiscal Year
2005
Total Cost
$151,534
Indirect Cost
Name
Vanderbilt University Medical Center
Department
Internal Medicine/Medicine
Type
Schools of Medicine
DUNS #
004413456
City
Nashville
State
TN
Country
United States
Zip Code
37212
Ni, Terri T; Lemon, William J; Shyr, Yu et al. (2008) Use of normalization methods for analysis of microarrays containing a high degree of gene effects. BMC Bioinformatics 9:505