Generalization of the GeneMark family of gene recognition programs of gene prediction to analysis of closely related genomes using Bayesian segmentation approach. This research will be done primarily in Russia as an extension of NIH grant #5R01HG00783.
The aim of this project is to build a global genomie alignment of two evolutionarily close genomes and to use a modification of the segmentation algorithm to parse syntenic regions of genome sequences. To this end the chain alignment of the extended regions of genomes will be constructed. Pairs of aligned sequences will be parsed into segments with different sequence variation statistics. A number of statistical models of genomic alignments will be built and the system, which automatically chooses the model relevant for the alignment of the particular region, will be designed. The final objective of the project is to develop a new algorithms of similarity based gene finding in the pairwise alignments of DNA sequences of closely related species. All the software programs will be available for users via the WWW interface.

Agency
National Institute of Health (NIH)
Institute
Fogarty International Center (FIC)
Type
Small Research Grants (R03)
Project #
1R03TW005899-01A1
Application #
6581987
Study Section
International and Cooperative Projects 1 Study Section (ICP)
Program Officer
Katz, Flora N
Project Start
2002-12-01
Project End
2004-11-30
Budget Start
2002-12-01
Budget End
2003-11-30
Support Year
1
Fiscal Year
2003
Total Cost
$46,910
Indirect Cost
Name
Georgia Institute of Technology
Department
Biology
Type
Schools of Arts and Sciences
DUNS #
097394084
City
Atlanta
State
GA
Country
United States
Zip Code
30332
Boeva, Valentina; Regnier, Mireille; Papatsenko, Dmitri et al. (2006) Short fuzzy tandem repeats in genomic sequences, identification, and possible role in regulation of gene expression. Bioinformatics 22:676-84
Kattenhorn, Lisa M; Mills, Ryan; Wagner, Markus et al. (2004) Identification of proteins associated with murine cytomegalovirus virions. J Virol 78:11187-97