Medical and biological data often come in the form of digitized signals and images; for example, magnetic resonance images, electrocardiogram traces and even the folding paths of proteins. As instrumental data acquisition becomes routine, sequences of such images, signals or paths are collected, often along with other covariate measurements, resulting in datasets where the basic unit of measurement, or response, is a high-dimensional object. The project continues to focus on developing techniques for modelling and understanding such data that explicitly take into account, and indeed exploit inherent spatial or temporal correlation, and when appropriate, relate it to covariate or class label information. To study covariance structure, the project proposes """"""""sparse"""""""" forms of principal components and discriminant analysis that may be more sensitive to either local phenomena of not necessarily smooth form or that are more adapted to irregularly observed data. Corresponding quadratically regularized methods in appropriate bases form a natural foil for comparison, and will also be developed in certain applications. For estimation of means, the project will examine sparse empirical Bayes methods for estimating non smooth local phenomena. Much of this work will be carried out in existing and new collaborations with researchers in medical imaging, cardiology and other specialties, working for example on cancer, heart disease and brain mapping.

Agency
National Institute of Health (NIH)
Institute
National Cancer Institute (NCI)
Type
Research Project (R01)
Project #
5R01CA072028-07
Application #
6513032
Study Section
Special Emphasis Panel (ZRG1-STA (02))
Program Officer
Erickson, Burdette (BUD) W
Project Start
1996-09-10
Project End
2003-06-30
Budget Start
2002-07-01
Budget End
2003-06-30
Support Year
7
Fiscal Year
2002
Total Cost
$238,376
Indirect Cost
Name
Stanford University
Department
Biostatistics & Other Math Sci
Type
Schools of Arts and Sciences
DUNS #
800771545
City
Stanford
State
CA
Country
United States
Zip Code
94305
Shen-Orr, Shai S; Tibshirani, Robert; Khatri, Purvesh et al. (2010) Cell type-specific gene expression differences in complex tissues. Nat Methods 7:287-9
Witten, Daniela M; Tibshirani, Robert; Hastie, Trevor (2009) A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10:515-34
Friedman, Jerome; Hastie, Trevor; Tibshirani, Robert (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9:432-41
Johnstone, Iain M (2008) MULTIVARIATE ANALYSIS AND JACOBI ENSEMBLES: LARGEST EIGENVALUE, TRACY-WIDOM LIMITS AND RATES OF CONVERGENCE. Ann Stat 36:2638
Park, Mee Young; Hastie, Trevor (2008) Penalized logistic regression for detecting gene interactions. Biostatistics 9:30-50
Park, Mee Young; Hastie, Trevor; Tibshirani, Robert (2007) Averaged gene expressions for regression. Biostatistics 8:212-27
Chipman, Hugh; Tibshirani, Robert (2006) Hybrid hierarchical clustering with applications to microarray data. Biostatistics 7:286-301
Zhu, Ji; Hastie, Trevor (2004) Classification of gene microarrays by penalized logistic regression. Biostatistics 5:427-43
Tibshirani, Robert; Hastie, Trevor; Narasimhan, Balasubramanian et al. (2002) Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U S A 99:6567-72
Troyanskaya, O; Cantor, M; Sherlock, G et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17:520-5

Showing the most recent 10 out of 12 publications