A complex high resolution restriction map for the entire genome is available. This information has been collected and organized into a cohesive information base, unifying the efforts of many laboratories into a single data resource. This makes the E. coli genome project the most advanced of all cellular genome projects. This information has been collected and organized into a cohesive information base, unifying the efforts of many laboratories into a single data resource. This project includes software development, database development, and data analysis. Two relational databases have been developed: Gonioscope, a Macintosh database of genomic map information that is essentially completed, and E-BASE, a Sybase database of E. coli map and DNA sequence information. Report generators for ECBASE now allow complete ASN.1 and GenBank flatfile reports of EcoSeq contigs, DNA sequence and genomic restriction map data has been analyzed to determine the information content of ribosome binding sites, number and distribution of genomic restriction sites, repeated patterns in DNA sequences, distribution and categorization of proteins encoded in the genome, assignment of genes to individual clones in the ordered clone set of the E. coli genome, and the detection of putative new genes in the DNA sequence flanking known genes. Several new protein structural motifs have been identified by comparing various E. Coli proteins to the protein and DNA sequence databases. Integrated genetic and physical maps have been created for E. Coli and its close relative, Salmonella typhimurium. The DNA sequences of Vibrio cholerae have also been organized into a non- redundant database, VchSeq, and ten new Vibrio genes were identified in the flanking DNA of Vibrio cholerae Genbank entries.

Agency
National Institute of Health (NIH)
Institute
National Library of Medicine (NLM)
Type
Intramural Research (Z01)
Project #
1Z01LM000041-05
Application #
2578628
Study Section
Special Emphasis Panel (CBB)
Project Start
Project End
Budget Start
Budget End
Support Year
5
Fiscal Year
1996
Total Cost
Indirect Cost
Name
National Library of Medicine
Department
Type
DUNS #
City
State
Country
United States
Zip Code