Table 3 The ten E. coli species-specific genomic regions identified in this study (based on a total sequence identity of 90 %), their locations in the K12 reference genome MG1655, the number of the 2324 E. coli genomes in which each region was found, and their putative functions based on the top-scoring BLASTx hit

From: SuperPhy: predictive genomics for the bacterial pathogen Escherichia coli

Region ID   Start (bp)   End (bp)   No. genomes   Putative function
3160548     347258       346259     2238          Propionate catabolism operon regulatory protein PrpR
3160296     537566       536567     2256          2-hydroxy-3-oxopropionate reductase
3160113     538566       537567     2248          Allantoin permease
3159571     541565       540567     2275          Purine permease ybbY
3159389     542566       541567     2268          Glycerate kinase
3158844     545665       544666     2261          Allantoate amidohydrolase
3158667     546665       545666     2272          Ureidoglycolate dehydrogenase
3159808     1588200      1587201    2171          FimH protein
3160196     4411062      4410063    2261          Hypothetical protein
3158082     4456632      4457631    2074          Mur ligase family, glutamate ligase domain protein
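
The coordinates and genome counts in the table can be turned into region lengths and prevalence fractions with a few lines of Python. The sketch below is illustrative only: it assumes the start and end positions are inclusive coordinates (so the order of start and end does not affect the length), and the names `REGIONS`, `TOTAL_GENOMES`, `region_length` and `prevalence` are hypothetical helpers, not part of SuperPhy.

```python
# Rows copied from Table 3: (region_id, start_bp, end_bp, n_genomes).
REGIONS = [
    (3160548, 347258, 346259, 2238),
    (3160296, 537566, 536567, 2256),
    (3160113, 538566, 537567, 2248),
    (3159571, 541565, 540567, 2275),
    (3159389, 542566, 541567, 2268),
    (3158844, 545665, 544666, 2261),
    (3158667, 546665, 545666, 2272),
    (3159808, 1588200, 1587201, 2171),
    (3160196, 4411062, 4410063, 2261),
    (3158082, 4456632, 4457631, 2074),
]

# Total number of E. coli genomes stated in the table caption.
TOTAL_GENOMES = 2324


def region_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive coordinates (assumption, not stated in the source).

    abs() is used because most rows list a start larger than the end.
    """
    return abs(start_bp - end_bp) + 1


def prevalence(n_genomes: int, total: int = TOTAL_GENOMES) -> float:
    """Fraction of the analysed genomes in which the region was found."""
    return n_genomes / total


if __name__ == "__main__":
    for region_id, start, end, n in REGIONS:
        print(f"{region_id}\t{region_length(start, end)} bp\t{prevalence(n):.1%}")
```

Using `abs()` keeps the helper independent of strand or listing order, which matters here because only the last row lists its start before its end.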