| ThauP25 | ThauP41 | ThauR71 | N4 | SAT1 | CN25 |
---|---|---|---|---|---|---|
Genome size, bp | 934,797 | 1,256,699 | 1,047,532 | 1,636,125 | 1,620,156 | 1,232,128 |
DNA coding region, bp | 878,769 | 1,159,316 | 970,632 | 1,500,929 | 1,499,274 | 1,164,105 |
G + C content, mol% | 33.15 | 37.82 | 37.99 | 42.25 | 41.00 | 33.16 |
Total RNA genes, n | 47 | 50 | 30 | 46 | 48 | 47 |
tRNA genes, n | 44 | 43 | 27 | 41 | 43 | 42 |
rRNA genes, n | 0 | 3 | 1 | 3 | 3 | 3 |
Other RNA genes, n | 3 | 4 | 2 | 2 | 2 | 2 |
Total number of genes, n | 1453 | 1771 | 1454 | 1955 | 1924 | 1516 |
Total CDSs, n (%) | 1406 (96.77) | 1721 (97.18) | 1424 (97.94) | 1909 (97.65) | 1876 (97.51) | 1469 (96.90) |
With predicted function, n (%) | 920 (63.32) | 1119 (63.18) | 944 (64.92) | 1233 (63.07) | 1212 (62.99) | 985 (64.97) |
Predicted with COG database, n (%) | 583 (40.12) | 791 (44.66) | 660 (45.39) | 1006 (51.46) | 998 (51.87) | 850 (56.07) |
Predicted with TIGRFAMs database, n (%) | 333 (22.92) | 402 (22.70) | 343 (23.59) | 456 (23.32) | 463 (24.06) | 413 (27.24) |
Encoding signal peptides, n (%) | 18 (1.24) | 51 (2.88) | 36 (2.48) | 81 (4.14) | 89 (4.63) | 40 (2.64) |
Encoding transmembrane proteins, n (%) | 231 (15.90) | 319 (18.01) | 255 (17.54) | 428 (21.89) | 424 (22.04) | 289 (19.06) |
Without predicted function, n (%) | 486 (33.45) | 602 (33.99) | 480 (33.01) | 676 (34.58) | 644 (34.51) | 484 (31.93) |
In internal clusters, n (%) | 381 (26.22) | 457 (25.80) | 234 (16.09) | 149 (7.62) | 137 (7.12) | 66 (4.35) |
Metadata | ||||||
 Isolation source, habitat | Seawater | Seawater | Freshwater | Geothermal hot spring | Wastewater | Seawater |
 | Atlantic Ocean | Atlantic Ocean | Brazil | Russian | China | Pacific Ocean |
 Thaumarchaeota group | 1.1a | 1.1a | 1.1a | 1.1a | 1.1a | 1.1a |
 Species | Nitrosopelagicus sp. | Nitrosotenuis sp. | Nitrosotenuis sp. | Nitrosotenuis uzonensis | Nitrosotenuis cloacae | Nitrosopelagicus brevis |