Statistical Methods For Analysis Of Microarray Data

Weinberg, Clarice

Abstract

We have begun to work on the problem of analysis of gene expression array data. This new genomic technology offers enormous potential, with both clinical and scientific application, and has already spawned an extensive literature. It enables investigators to classify tissues (e.g. normal versus tumor), identify etiologically or prognostically distinct subsyndromes of disease, characterize responses to toxicologic exposures, identify families of genes that are functionally related, and gain insight into the functions of specific genes in governing normal physiologic processes. Because a cDNA chip provides expression levels for thousands of genes, most of which are not relevant to the tissue distinction under study, one faces the daunting problem of locating an informative subspace (set of genes) embedded within a high-dimensional noisy background. Our approach for doing this is based on a supervised search strategy called the 'genetic algorithm' (with k nearest neighbors classification). This method, unlike the one-gene-at-a-time competitors currently in widespread use, takes advantage of the correlations among genes in their expression patterns, and when used to classify tissues (e.g. tumor versus normal) also allows for the existence (and discovery) of subtypes with distinct expression profiles. We have been developing the methods by applying them to existing public data sets and toxicologic data generated in-house, and will soon begin to carry out simulations to compare our approach with others in use.

Funding Agency

Agency: National Institute of Health (NIH)
Institute: National Institute of Environmental Health Sciences (NIEHS)
Type: Intramural Research (Z01)
Project #: 1Z01ES040014-01
Application #: 6546716
Study Section: (BB)

Project Start
Project End
Budget Start
Budget End
Support Year: 1
Fiscal Year: 2001
Total Cost
Indirect Cost

Institution

Name: U.S. National Inst of Environ Hlth Scis
Department
Type
DUNS #

City
State
Country: United States
Zip Code

Related projects


NIH 2003 Z01 ES	Statistical Methods For Analysis Of Microarray Data Weinberg, Clarice / U.S. National Inst of Environ Hlth Scis
NIH 2002 Z01 ES	Statistical Methods For Analysis Of Microarray Data Weinberg, Clarice / U.S. National Inst of Environ Hlth Scis
NIH 2001 Z01 ES	Statistical Methods For Analysis Of Microarray Data Weinberg, Clarice / U.S. National Inst of Environ Hlth Scis

Publications

Peddada, Shyamal D; Lobenhofer, Edward K; Li, Leping et al. (2003) Gene selection and clustering for time-course and dose-response microarray experiments using order-restricted inference. Bioinformatics 19:834-41

Li, L; Darden, T A; Weinberg, C R et al. (2001) Gene assessment and sample classification for gene expression data using a genetic algorithm/k-nearest neighbor method. Comb Chem High Throughput Screen 4:727-39

Li, L; Weinberg, C R; Darden, T A et al. (2001) Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 17:1131-42

Comments

Be the first to comment on Clarice Weinberg's grant

Recent in Grantomics:

Recently viewed grants:

Recently added grants: