The Human Genome Project faces many difficult technical challenges. Among these are the problems of automatically recognizing coding regions in uncharacterized DNA sequences, assembling contig maps from YAC/STS content information, and other problems of knowledge discovery and database organization. The goal of this Special Emphasis Research Career Award is to study these and other problems that pose informatics and computational challenges, taking advantage of the candidate's experience as a computer scientist. The goals of this training program are to: (1) obtain a stronger background in basic molecular biology and genetics, (2) learn the fundamental laboratory techniques used for genome sequencing experiments, so as to better understand the problems actually faced by biologists in the lab, (3) become familiar with the technical details of the major genomic databases, including GenBank and the Genome Data Base (GDB), and (4) develop a long term project focused on bioinformatics, using the candidate's new biological knowledge in combination with his background as a computer scientist. In particular, as part of (4) he will significantly extend his previous work on finding coding regions in uncharacterized human DNA, and build a general gene-finding system to be distributed to the genomic research community. The candidate also plans to develop new projects for the analysis and management of DNA mapping data, for which the training under goal (3) will be invaluable. The project will begin with formal courses and laboratory work under the guidance of the distinguished molecular biologist Hamilton Smith, who will serve as the candidate's advisor. This will be followed by immersion in the activities of the Genome Data Base group at Johns Hopkins, where the candidate will have access to the latest computational resources for genomics research. The training and experience during the NCHGR/SERCA grant period will provide an ideal basis upon which to build a new research program in computational biology.

Agency
National Institute of Health (NIH)
Institute
National Human Genome Research Institute (NHGRI)
Type
Research Scientist Development Award - Research & Training (K01)
Project #
5K01HG000022-04
Application #
2706791
Study Section
Genome Research Review Committee (GRRC)
Project Start
1996-07-01
Project End
2000-06-30
Budget Start
1998-07-01
Budget End
2000-06-30
Support Year
4
Fiscal Year
1998
Total Cost
Indirect Cost
Name
Institute for Genomic Research
Department
Type
DUNS #
City
Rockville
State
MD
Country
United States
Zip Code
20850
Delcher, A L; Harmon, D; Kasif, S et al. (1999) Improved microbial gene identification with GLIMMER. Nucleic Acids Res 27:4636-41
Delcher, A L; Kasif, S; Fleischmann, R D et al. (1999) Alignment of whole genomes. Nucleic Acids Res 27:2369-76
Salzberg, S L; Pertea, M; Delcher, A L et al. (1999) Interpolated Markov models for eukaryotic gene finding. Genomics 59:24-31
Tettelin, H; Radune, D; Kasif, S et al. (1999) Optimized multiplex PCR: efficiently closing a whole-genome shotgun sequencing project. Genomics 62:500-7
Salzberg, S L; Salzberg, A J; Kerlavage, A R et al. (1998) Skewed oligomers and origins of replication. Gene 217:57-67
Salzberg, S L; Delcher, A L; Kasif, S et al. (1998) Microbial gene identification using interpolated Markov models. Nucleic Acids Res 26:544-8
Salzberg, S; Delcher, A L; Fasman, K H et al. (1998) A decision tree system for finding genes in DNA. J Comput Biol 5:667-80
Salzberg, S L (1997) A method for identifying splice sites and translational start sites in eukaryotic mRNA. Comput Appl Biosci 13:365-76
Henderson, J; Salzberg, S; Fasman, K H (1997) Finding genes in DNA with a Hidden Markov Model. J Comput Biol 4:127-41