Medical and biological data often come in the form of digitized signals and images; for example, magnetic resonance images, electrocardiogram traces and even the folding paths of proteins. As instrumental data acquisition becomes routine, sequences of such images, signals or paths are collected, often along with other covariate measurements, resulting in datasets where the basic unit of measurement, or response, is a high-dimensional object. The project continues to focus on developing techniques for modelling and understanding such data that explicitly take into account, and indeed exploit inherent spatial or temporal correlation, and when appropriate, relate it to covariate or class label information. To study covariance structure, the project proposes """"""""sparse"""""""" forms of principal components and discriminant analysis that may be more sensitive to either local phenomena of not necessarily smooth form or that are more adapted to irregularly observed data. Corresponding quadratically regularized methods in appropriate bases form a natural foil for comparison, and will also be developed in certain applications. For estimation of means, the project will examine sparse empirical Bayes methods for estimating non smooth local phenomena. Much of this work will be carried out in existing and new collaborations with researchers in medical imaging, cardiology and other specialties, working for example on cancer, heart disease and brain mapping.
Shen-Orr, Shai S; Tibshirani, Robert; Khatri, Purvesh et al. (2010) Cell type-specific gene expression differences in complex tissues. Nat Methods 7:287-9 |
Witten, Daniela M; Tibshirani, Robert; Hastie, Trevor (2009) A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10:515-34 |
Friedman, Jerome; Hastie, Trevor; Tibshirani, Robert (2008) Sparse inverse covariance estimation with the graphical lasso. Biostatistics 9:432-41 |
Johnstone, Iain M (2008) MULTIVARIATE ANALYSIS AND JACOBI ENSEMBLES: LARGEST EIGENVALUE, TRACY-WIDOM LIMITS AND RATES OF CONVERGENCE. Ann Stat 36:2638 |
Park, Mee Young; Hastie, Trevor (2008) Penalized logistic regression for detecting gene interactions. Biostatistics 9:30-50 |
Park, Mee Young; Hastie, Trevor; Tibshirani, Robert (2007) Averaged gene expressions for regression. Biostatistics 8:212-27 |
Chipman, Hugh; Tibshirani, Robert (2006) Hybrid hierarchical clustering with applications to microarray data. Biostatistics 7:286-301 |
Zhu, Ji; Hastie, Trevor (2004) Classification of gene microarrays by penalized logistic regression. Biostatistics 5:427-43 |
Tibshirani, Robert; Hastie, Trevor; Narasimhan, Balasubramanian et al. (2002) Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U S A 99:6567-72 |
Troyanskaya, O; Cantor, M; Sherlock, G et al. (2001) Missing value estimation methods for DNA microarrays. Bioinformatics 17:520-5 |
Showing the most recent 10 out of 12 publications