Skip to main content

Table 3 The ten E. coli species-specific genomic regions identified in this study based on a total sequence identity of 90 %, their location in the K12 reference genome MG1655, the number out of 2324 E. coli genomes each region was found in, and their putative function based on the top scoring BLASTx hit

From: SuperPhy: predictive genomics for the bacterial pathogen Escherichia coli

Region ID

Start bp

End bp

No. genomes

Putative function

3160548

347258

346259

2238

Propionate catabolism operon regulatory protein PrpR

3160296

537566

536567

2256

2-hydroxy-3-oxopropionate reductase

3160113

538566

537567

2248

Allantoin permease

3159571

541565

540567

2275

Purine permease ybbY

3159389

542566

541567

2268

Glycerate kinase

3158844

545665

544666

2261

Allantoate amidohydrolase

3158667

546665

545666

2272

Ureidoglycolate dehydrogenase

3159808

1588200

1587201

2171

FimH protein

3160196

4411062

4410063

2261

Hypothetical protein

3158082

4456632

4457631

2074

Mur ligase family, glutamate ligase domain protein