Many problems in the health and medical sciences have at their core the task of finding cohesive groups of observations in data. Examples include a group of voxels in an MRI image that correspond to a tumor, genes whose mRNA expression levels track one another, and tissues whose gene expression patterns are similar. The statistical method for solving this problem is cluster analysis. Most cluster analysis methods used in practice have been ad hoc, but recently the development of more formal model-based clustering methods has provided a principled framework for answering central questions such as: How many clusters are there? Which clustering method should be used? How should one deal with outliers? Our main goal is to develop new methods for problems in model-based clustering that arise in medical image segementation and gene expression data. The three major thrusts will be the development of: (A) model- based clustering methods for large numbers of variables; (B) automated medical image segementation methods appropriate for dynamic MRI breast images; and (C) model-based clustering methods for microarray gene expression data aimed at finding groups of genes that function together, and group of tissues or tissue types that have similar gene expression patterns.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
1R01CA094212-01
Application #
6415542
Study Section
Special Emphasis Panel (ZRG1-SNEM-5 (01))
Program Officer
Torres-Anjel, Manuel J
Project Start
2002-06-01
Project End
2005-05-31
Budget Start
2002-06-01
Budget End
2003-05-31
Support Year
1
Fiscal Year
2002
Total Cost
$269,907
Indirect Cost
Name
University of Washington
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
135646524
City
Seattle
State
WA
Country
United States
Zip Code
98195
Erosheva, Elena; Fienberg, Stephen; Lafferty, John (2004) Mixed-membership models of scientific publications. Proc Natl Acad Sci U S A 101 Suppl 1:5220-7