We propose to determine on a genome-wide and population-wide scale genetic variations across all major diarrheal and extra-intestinal E. coli pathotypes, with the analysis of positive selection footprints in genes shared by multiple strains. The main focus of the analysis will be to characterize in detail small nucleotide variations (point substitutions and indels) that occur under positive selection in the core genes of E. coli and are adaptive to pathogenic strains. Our plan is to assemble pathotype-representative collection of E. coli that will encompass approximately 2,500 isolates selected from over 20,000 E. coli strains from around the globe. Clonal diversity of these strains will be determined and clonally diverse isolates will be subjected to high-throughput sequencing using 454 technology. Genomes of 250-300 E. coli strains (obtained de novo or available in public databases) will be analyzed for footprints of positive selection by detecting hot-spot mutations - repeated (phylogenetically-unlinked) mutations affecting the same amino acid position. Hot-spot mutations are indicative of convergent evolution - one of the strongest indications of the adaptive significance of such changes in specific environments. We expect that there are several hundred core genes affected by positively-selected hot-spot mutations. Genetic association of specific hot-spot mutations with different pathotypes will then be validated on a population-wide level. Also, we will test the clonal resolution of novel genotyping markers of E. coli for potential application in clinical and environmental diagnostics. Finally, functional effects of positively-selected variations will be investigated for genes involved in the regulation of E. coli virulence factors.

Public Health Relevance

The E.coli Variome Project will compile all genetic variations across major diarrheal and extra-intestinal E.coli pathogens, with the analysis of positive selection footprints in genes shared by multiple strains. Genetics association of specific hot-spot mutations in core genes and novel genotyping markers will be validated on a population-wide level. Finally, function effects of pathogenicity-adaptive variations will be investigated for genes involved in the regulation of E.coli virulence factors.

National Institute of Health (NIH)
National Institute of Allergy and Infectious Diseases (NIAID)
High Impact Research and Research Infrastructure Programs—Multi-Yr Funding (RC4)
Project #
Application #
Study Section
Special Emphasis Panel (ZRG1-IDM-C (55))
Program Officer
Baqar, Shahida
Project Start
Project End
Budget Start
Budget End
Support Year
Fiscal Year
Total Cost
Indirect Cost
University of Washington
Schools of Medicine
United States
Zip Code
Johnson, Timothy J; Aziz, Maliha; Liu, Cindy M et al. (2016) Complete Genome Sequence of a CTX-M-15-Producing Escherichia coli Strain from the H30Rx Subclone of Sequence Type 131 from a Patient with Recurrent Urinary Tract Infections, Closely Related to a Lethal Urosepsis Isolate from the Patient's Sister. Genome Announc 4:
Chi, Peter B; Chattopadhyay, Sujay; Lemey, Philippe et al. (2015) Synonymous and nonsynonymous distances help untangle convergent evolution and recombination. Stat Appl Genet Mol Biol 14:375-89
Lasaro, Melissa; Liu, Zhi; Bishar, Rima et al. (2014) Escherichia coli isolate for studying colonization of the mouse intestine and its application to two-component signaling knockouts. J Bacteriol 196:1723-32
Johnson, James R; Price, Lance B; Sokurenko, Evgeni V (2014) Response to Giufre et al. J Infect Dis 209:630-1
Barillova, Petra; Tchesnokova, Veronika; Dübbers, Angelika et al. (2014) Prevalence and persistence of Escherichia coli in the airways of cystic fibrosis patients - an unrecognized CF pathogen? Int J Med Microbiol 304:415-21
Johnson, James R; Clermont, Olivier; Johnston, Brian et al. (2014) Rapid and specific detection, molecular epidemiology, and experimental virulence of the O16 subgroup within Escherichia coli sequence type 131. J Clin Microbiol 52:1358-65
Colpan, Aylin; Johnston, Brian; Porter, Stephen et al. (2013) Escherichia coli sequence type 131 (ST131) subclone H30 as an emergent multidrug-resistant pathogen among US veterans. Clin Infect Dis 57:1256-65
Chattopadhyay, Sujay; Taub, Fred; Paul, Sandip et al. (2013) Microbial variome database: point mutations, adaptive or not, in bacterial core genomes. Mol Biol Evol 30:1465-70
Price, Lance B; Johnson, James R; Aziz, Maliha et al. (2013) The epidemic of extended-spectrum-?-lactamase-producing Escherichia coli ST131 is driven by a single highly pathogenic subclone, H30-Rx. MBio 4:e00377-13
Subashchandrabose, Sargurunathan; Hazen, Tracy H; Rasko, David A et al. (2013) Draft genome sequences of five recent human uropathogenic Escherichia coli isolates. Pathog Dis 69:66-70

Showing the most recent 10 out of 24 publications