The development of genomics, proteomics and advancedimaging technology has resulted in the accumulation of vast amounts of biological data. As large scale data sets become predominant in biomedical research, we are approaching a paradigm shift in which the process of discovery isdata-driven, and in which data are the source of hypotheses as well as the means for testing them. These masses ofdata are rich sources of information;however, extracting meaningful information can be a daunting challenge, and often presents a bottleneck for the discovery process. Thus, there is a pressing need for interdisciplinary training of scientists who understand the data, how they are generated, and what they are used for. In addition, these scientists must become developers and highly skilled users of the new computational tools necessary to analyze large data sets. The goal of this program is to train students to become proficient in the following areas: 1. Data acquisition. This will include knowledge of the methods of genomics, proteomicsand imaging. 2. Computation. This will include knowledge of mathematical and statistical algorithms, implementation of effective computer codes as well as an emphasis on methods of data warehousing in relational, deductive and other databases. 3. Data integration. This is a critical area that involves extracting useful information from the heterogeneous data sets at various spatial and temporal scales. It will include knowledge of methods of modeling and simulation of systemsfrom the molecular to the organismal level. There will also be an emphasis on computational data mining methods. The core of the program will be research-based training in interdisciplinary teams under the guidance of at least two mentors from disparate disciplines (i.e., computational/mathematical and biomedical sciences). Training activities will consist of specialized didactic coursework as well as seminars, journal clubs and a student-faculty retreat. This will be a cross-institutional training program with faculty drawn from departments ranging from computer science and statistics to genetics and medicine, in five participating institutions in the Gulf Coast Consortia in the Houston Area.The training of scientists equipped to manage and extract information from large data sets will greatly facilitate biological discovery in areas such as infectious disease and cancer and therefore this training program will have a direct, positive impact on public health.

Agency
National Institute of Health (NIH)
Institute
National Institute on Drug Abuse (NIDA)
Type
Interdisciplinary Regular Research Training Award (R90)
Project #
5R90DA023418-04
Application #
7667711
Study Section
Special Emphasis Panel (ZDE1-NB (89))
Program Officer
Babecki, Beth
Project Start
2006-09-30
Project End
2011-07-31
Budget Start
2009-08-01
Budget End
2011-07-31
Support Year
4
Fiscal Year
2009
Total Cost
$135,433
Indirect Cost
Name
Baylor College of Medicine
Department
Microbiology/Immun/Virology
Type
Schools of Medicine
DUNS #
051113330
City
Houston
State
TX
Country
United States
Zip Code
77030
Adamski, Carolyn J; Cardenas, Ana Maria; Brown, Nicholas G et al. (2015) Molecular basis for the catalytic specificity of the CTX-M extended-spectrum ?-lactamases. Biochemistry 54:447-57
van der Hoeven, Dharini; Cho, Kwang-jin; Ma, Xiaoping et al. (2013) Fendiline inhibits K-Ras plasma membrane localization and blocks K-Ras signal transmission. Mol Cell Biol 33:237-51
Raol, Jay; Cox, Steven J (2013) Inverse problems in neuronal calcium signaling. J Math Biol 67:3-23
Wilkins, Angela; Erdin, Serkan; Lua, Rhonald et al. (2012) Evolutionary trace for prediction and redesign of protein functional sites. Methods Mol Biol 819:29-42
Sucgang, Richard; Kuo, Alan; Tian, Xiangjun et al. (2011) Comparative genomics of the social amoebae Dictyostelium discoideum and Dictyostelium purpureum. Genome Biol 12:R20
Brunner, Gerd; Chittajallu, Deepak R; Kurkure, Uday et al. (2010) Toward the automatic detection of coronary artery calcification in non-contrast computed tomography data. Int J Cardiovasc Imaging 26:829-38
Heath, Allison P; Bennett, George N; Kavraki, Lydia E (2010) Finding metabolic pathways using atom tracking. Bioinformatics 26:1548-55
Venner, Eric; Lisewski, Andreas Martin; Erdin, Serkan et al. (2010) Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities. PLoS One 5:e14286
Tomchik, Seth M; Davis, Ronald L (2009) Dynamics of learning-related cAMP signaling and stimulus integration in the Drosophila olfactory pathway. Neuron 64:510-21
Rot, Gregor; Parikh, Anup; Curk, Tomaz et al. (2009) dictyExpress: a Dictyostelium discoideum gene expression database with an explorative data analysis web-based interface. BMC Bioinformatics 10:265

Showing the most recent 10 out of 11 publications