Microarrays have emerged as a new tool in biological and clinical research, giving a global view of a biological process in an unprecedented scale by simultaneous measurements of expression levels for thousands of genes. However, while their use is becoming widespread, many important issues remain unresolved and their potential for revealing important insights has not been fully realized. The initial part of this work will be on a more accurate estimation of microarray expression values. For example, performance of different probes on oligonucleotide arrays appears to vary widely depending on the melting temperature of the probe sequence, and this will be incorporated in a new algorithm. The main part of the work will be on developing new techniques for the discovery and understanding of complex interactions among genes as well as between genes and phenotypes. Moving beyond pairwise linear correlations, nonlinear and higher-order interactions among multiple genes will be explored with novel metrics. Density estimation techniques from multivariate statistics and other sophisticated computational tools will be employed to sift through billions of possible combinatorial arrangements. Those combinations found to be significant will be examined in depth and biologically validated when possible. Finally, a statistical framework will be developed in the generalized linear model setting in order to understand the relationship between genotypic and phenotypic data. To handle the large number of highly collinear genes in expression data, new computational techniques based on partial least squares will be developed. Preliminary results in finding correlations between genes and censored patient survival times have been promising, and similar methods will be developed and applied to identify predictive genes in the context of various types of phenotypic data. The candidate has been trained in applied and computational mathematics, and he now aims to apply his skills to problems in bioinformatics and functional genomics. The proposed award will allow him to receive a thorough training in molecular biology and genomics at Harvard Medical School and Children's Hospital in Boston. Through this transitional period, the candidate would like to become an independent investigator, able to lead a multidisciplinary team in an integrated approach to studying complex biological systems.

Agency
National Institute of Health (NIH)
Institute
National Institute of General Medical Sciences (NIGMS)
Type
Mentored Quantitative Research Career Development Award (K25)
Project #
5K25GM067825-05
Application #
7243386
Study Section
Special Emphasis Panel (ZRG1-SSS-H (90))
Program Officer
Anderson, James J
Project Start
2003-06-01
Project End
2009-05-31
Budget Start
2007-06-01
Budget End
2009-05-31
Support Year
5
Fiscal Year
2007
Total Cost
$141,620
Indirect Cost
Name
Children's Hospital Boston
Department
Type
DUNS #
076593722
City
Boston
State
MA
Country
United States
Zip Code
02115
Mazor, Rafi; Schmid-Schönbein, Geert W (2015) Proteolytic receptor cleavage in the pathogenesis of blood rheology and co-morbidities in metabolic syndrome. Early forms of autodigestion. Biorheology 52:337-52
Gelbart, Marnie E; Larschan, Erica; Peng, Shouyong et al. (2009) Drosophila MSL complex globally acetylates H4K16 on the male X chromosome for dosage compensation. Nat Struct Mol Biol 16:825-32
Gorchakov, Andrey A; Alekseyenko, Artyom A; Kharchenko, Peter et al. (2009) Long-range spreading of dosage compensation in Drosophila captures transcribed autosomal genes inserted on X. Genes Dev 23:2266-71
Alekseyenko, Artyom A; Peng, Shouyong; Larschan, Erica et al. (2008) A sequence motif within chromatin entry sites directs MSL establishment on the Drosophila X chromosome. Cell 134:599-609
Sural, Tuba H; Peng, Shouyong; Li, Bing et al. (2008) The MSL3 chromodomain directs a key targeting step for dosage compensation of the Drosophila melanogaster X chromosome. Nat Struct Mol Biol 15:1318-25
Baskerville, Karen A; Kent, Caroline; Personett, David et al. (2008) Aging elevates metabolic gene expression in brain cholinergic neurons. Neurobiol Aging 29:1874-93
Liu, Manway; Liberzon, Arthur; Kong, Sek Won et al. (2007) Network-based analysis of affected biological processes in type 2 diabetes models. PLoS Genet 3:e96
Peng, Shouyong; Alekseyenko, Artyom A; Larschan, Erica et al. (2007) Normalization and experimental design for ChIP-chip data. BMC Bioinformatics 8:219
Larschan, Erica; Alekseyenko, Artyom A; Gortchakov, Andrey A et al. (2007) MSL complex is attracted to genes marked by H3K36 trimethylation using a sequence-independent mechanism. Mol Cell 28:121-33
Kong, Sek Won; Pu, William T; Park, Peter J (2006) A multivariate approach for integrating genome-wide expression data and biological knowledge. Bioinformatics 22:2373-80

Showing the most recent 10 out of 19 publications