DNA micro-array technology and the sequencing of the human genome have advanced to the point that it is now possible to monitor gene expression levels on a genomic scale. These data hold the key to fundamental understanding of biological processes on the molecular level, which will revolutionize medical diagnosis and treatment of diseases and cancer, and the design of effective drugs. Despite the progress in gene expression technology, its promised benefits for the future of molecular genetics and medicine will not materialize without appropriate tools for data analysis and information extraction. Without means of detecting macroscopic patterns in the gene expression data that correspond to specific cellular processes or phenotype characteristics, these data are, and will remain, incomprehensible. Little has been done so far toward the analysis of these new data, which requires the development of new analytical and computational tools or the adaptation of existing tools from other scientific disciplines. These tools should be independent of any model for the description of gene expression, since no such predictive model is available. They should be suitable for making use of the large quantities of data and at the same time reducing the complexity of the data to make them comprehensible. In this project, these much-needed tools for genome-wide gene expression data analysis will be developed. Analytical and computational methods which are already successful in describing the physical world, from quantum mechanics to image processing, will be used. Applying these tools to gene expression data obtained in the labs of Profs. Botstein and Brown, patterns that correspond to specific molecular mechanisms and phenotype characteristics will be identified. The goal of this project is to create predictive models for diagnosis and treatment of diseases and cancer. These models may eventually help elucidate the first principles of biological processes on the molecular level.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Scientist Development Award - Research & Training (K01)
Project #
7K01HG000038-05
Application #
6838859
Study Section
Ethical, Legal, Social Implications Review Committee (GNOM)
Program Officer
Feingold, Elise A
Project Start
2000-04-01
Project End
2005-03-31
Budget Start
2004-01-16
Budget End
2004-03-31
Support Year
5
Fiscal Year
2003
Total Cost
$29,619
Indirect Cost
Name
University of Texas Austin
Department
Biomedical Engineering
Type
Schools of Engineering
DUNS #
170230239
City
Austin
State
TX
Country
United States
Zip Code
78712
Alter, Orly (2007) Genomic signal processing: from matrix algebra to genetic networks. Methods Mol Biol 377:17-60
Alter, Orly; Golub, Gene H (2006) Singular value decomposition of genome-scale mRNA lengths distribution reveals asymmetry in RNA gel electrophoresis band broadening. Proc Natl Acad Sci U S A 103:11828-33
Alter, Orly; Golub, Gene H (2005) Reconstructing the pathways of a cellular system from genome-scale signals by using matrix and tensor computations. Proc Natl Acad Sci U S A 102:17559-64
Alter, Orly; Golub, Gene H (2004) Integrative analysis of genome-scale data by using pseudoinverse projection predicts novel correlation between DNA replication and RNA transcription. Proc Natl Acad Sci U S A 101:16577-82
Alter, Orly; Brown, Patrick O; Botstein, David (2003) Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms. Proc Natl Acad Sci U S A 100:3351-6
Bohen, Sean P; Troyanskaya, Olga G; Alter, Orly et al. (2003) Variation in gene expression patterns in follicular lymphoma and the response to rituximab. Proc Natl Acad Sci U S A 100:1926-30
Nielsen, Torsten O; West, Rob B; Linn, Sabine C et al. (2002) Molecular characterisation of soft tissue tumours: a gene expression study. Lancet 359:1301-7
Alter, O; Brown, P O; Botstein, D (2000) Singular value decomposition for genome-wide expression data processing and modeling. Proc Natl Acad Sci U S A 97:10101-6