Table 2 Descriptions of the metagenomic datasets

| Project name | Source description | Dominating phylum/phyla | Reference |
|---|---|---|---|
| AntarcticaAquatic (AA) | Antarctica Aquatic Microbial Metagenome | Bacteroidetes, Proteobacteria | [36] |
| AcidMine (AM) | Acid Mine Drainage Metagenome | Nitrospirae | [30] |
| BisonMetagenome (BM) | Metagenome from Yellowstone Bison Hot Spring | Aquificae | [37] |
| GOS | Global Ocean Sampling Expedition | Proteobacteria | [38] |
| GutlessWorm (GW) | Mediterranean Gutless Worm Metagenome | Proteobacteria | [39] |
| HumanGut (HG) | Human Distal Gut Biome project | Firmicutes, Actinobacteria | [40] |
| HOT | Microbial Community Genomics at the Hawaii Ocean Time-series (HOT) station ALOHA | Proteobacteria, Cyanobacteria | [41] |
  1. The metagenomic datasets used in this paper are from the CAMERA website. Dominating phyla are those whose sequences amount to more than 20% of the total in the dataset.
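
The 20% criterion in the footnote can be illustrated with a minimal sketch. The function name `dominating_phyla`, the input format (a mapping from phylum to sequence count), and the example counts are all assumptions made for illustration; they are not taken from the paper or from any dataset in Table 2.

```python
def dominating_phyla(counts, threshold=0.20):
    """Return the phyla whose share of all sequences exceeds `threshold`.

    `counts` maps a phylum name to its number of sequences (assumed format).
    The comparison is strict, matching "more than 20%" in the footnote.
    """
    total = sum(counts.values())
    if total == 0:
        return []
    return sorted(p for p, n in counts.items() if n / total > threshold)


# Hypothetical counts, for illustration only (not real data).
example_counts = {
    "Proteobacteria": 550,
    "Cyanobacteria": 250,
    "Bacteroidetes": 150,
    "Other": 50,
}
print(dominating_phyla(example_counts))
```

With these made-up counts, only the two phyla above the 20% share are reported; a phylum at exactly 20% would not be counted as dominating under the strict comparison.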