A. A distribution of the two copy genes based on general Clusters of Orthologous Groups of proteins (COG) functions. The genes are classified in 5 generalized groups: Not in COGs (Group 0); Information storage and processing (Group 1); Cellular processes (Group 2); Metabolism (Group 3); Poorly characterized (Group 4). B. A distribution of the two copy genes based on specific Clusters of Orthologous Groups (COGs) of protein functions. A more detailed breakdown of the distribution of the genes is given based on different cellular functions represented in 25 COG sub-groups. Of these classifiable COG groups, duplicated genes are present in 20 subgroups: J. Translation, ribosomal structure and biogenesis; K. Transcription; L. DNA replication, recombination and repair; D. Cell division and chromosome partitioning; V. Defense mechanisms; T. Signal transduction mechanisms; M. Cell envelope biogenesis, outer membrane; N. Cell motility and secretion; U. Intracellular trafficking and secretion; O. Posttranslational modification, protein turnover, chaperones. C. Energy production and conversion; G. Carbohydrate transport and metabolism; E. Amino acid transport and metabolism; F. Nucleotide transport and metabolism; H. Coenzyme metabolism; I. Lipid metabolism; P. Inorganic ion transport and metabolism; Q. Secondary metabolites biosynthesis, transport and catabolism; R. General function prediction only; S. Function unknown.