Intragenomic diversity of Rhizobium leguminosarum bv. trifolii clover nodule isolates
© Mazur et al; licensee BioMed Central Ltd. 2011
Received: 18 February 2011
Accepted: 30 May 2011
Published: 30 May 2011
Soil bacteria from the genus Rhizobium are characterized by a complex genomic architecture comprising chromosome and large plasmids. Genes responsible for symbiotic interactions with legumes are usually located on one of the plasmids, named the symbiotic plasmid (pSym). The plasmids have a great impact not only on the metabolic potential of rhizobia but also underlie genome rearrangements and plasticity.
Here, we analyzed the distribution and sequence variability of markers located on chromosomes and extrachromosomal replicons of Rhizobium leguminosarum bv. trifolii strains originating from nodules of clover grown in the same site in cultivated soil. First, on the basis of sequence similarity of repA and repC replication genes to the respective counterparts of chromids reported in R. leguminosarum bv. viciae 3841 and R. etli CFN42, chromid-like replicons were distinguished from the pool of plasmids of the nodule isolates studied. Next, variability of the gene content was analyzed in the different genome compartments, i.e., the chromosome, chromid-like and 'other plasmids'. The stable and unstable chromosomal and plasmid genes were detected on the basis of hybridization data. Displacement of a few unstable genes between the chromosome, chromid-like and 'other plasmids', as well as loss of some markers was observed in the sampled strains. Analyses of chosen gene sequences allowed estimation of the degree of their adaptation to the three genome compartments as well as to the host.
Our results showed that differences in distribution and sequence divergence of plasmid and chromosomal genes can be detected even within a small group of clover nodule isolates recovered from clovers grown at the same site. Substantial divergence of genome organization could be detected especially taking into account the content of extrachromosomal DNA. Despite the high variability concerning the number and size of plasmids among the studied strains, conservation of the location as well as dynamic distribution of the individual genes (especially replication genes) of a particular genome compartment were demonstrated. The sequence divergence of particular genes may be affected by their location in the given genome compartment. The 'other plasmid' genes are less adapted to the host genome than the chromosome and chromid-like genes.
Rhizobia are widely occurring soil bacteria that are able to establish nitrogen-fixing symbioses with legumes. Bacterium-plant interaction is a complex process in which specific plant and bacterial signals are exchanged resulting in formation of nodules, where rhizobia in the form of bacteroids fix nitrogen [1–3].
Rhizobial genomes are large and multipartite, composed of a single circular chromosome and a set of large plasmids [4–6]. The genes responsible for nodulation (nod) and nitrogen-fixation (nif-fix) are either carried by large plasmids (pSym) or are incorporated in the chromosome as symbiotic islands [7, 8]. Large genomes of Rhizobiaceae and Bradyrhizobiaceae (above 6-9 Mb) are considered more ecologically advantageous in an environment that is scarce in nutrients but diverse as regards carbon and energy sources. These genomes are disproportionately enriched in regulation and transport genes and in genes involved in secondary metabolism in comparison with medium-and small-size genome containing bacteria .
"Core" and "accessory" components of Rhizobium genomes can be distinguished. Chromosomes with conserved gene content and order (synteny) are considered as core. Accordingly, plasmids constitute the accessory genome. Plasmids are more flexible than the chromosomes, as defined by more frequent gene gains and losses, even in the same species. They are heterogeneous in size and gene content and lack synteny even in closely related species, except for genes involved in plasmid replication and symbiotic properties [6, 10, 11]. In some species, such as Rhizobium leguminosarum, plasmids may comprise up to 35% of the total genome [6, 7].
Rhizobial plasmids are maintained in the cells via repABC cassettes, comprising genes required for active segregation (repAB) and initiation of replication (repC) . The presence of several repABC operons within a single genome, which are subjected to individual selection pressure and divergence, could be the key element of the existence of different plasmid incompatibility groups in cells and could drive the rearrangement of gene organization and of their functions [11, 13–15]. It was proposed that repABC plasmids coexisting in the same strain most probably emerged by separate events of lateral transfer, which required evolution of different incompatibility groups allowing simultaneous residence of plasmids equipped with a similar replication/partition system in a single bacterial species . Thus, the degree of divergence of the plasmid replication apparatus, whose sequence is subject to strong evolutionary pressure and determines the ability to evade incompatibility between plasmids , and horizontal gene transfers are potential forces that shaped rhizobial genomes.
Recently, some (not only rhizobial) extrachromosomal replicons that have properties distinct from both chromosome and plasmids were reported and named "chromids" . Chromids are characterized by presence of some important genes essential for growth under all conditions, with nucleotide composition and codon usage similar to the chromosome of the parental strain, and, by contrast, plasmid replication and partition systems .
Furthermore, recent analyses of Rhizobium etli strains  showed that this species has a pangenomic structure. By definition, a pangenome "determines the core genome, which consists of genes shared by all the strains studied and probably encoding functions related to the basic biology and phenotypes of the species" . The basis of the pangenome concept emerged from an observation that each newly sequenced genome enriched the pool of species-specific genes with new ones [17, 18]. This makes it possible to detect, besides the core genomes, the dispensable genomes composed of both chromosomal and plasmid genes, present only in some of the strains, which contribute to the species diversity and allow adaptation to new ecological niches and a specific environment. Despite the overall genomic divergence, R. etli pangenome comprises a core genome composed of both chromosomal and plasmid sequences, as well as highly conserved symbiosis-related genes on the pSym plasmid. The unusual variability observed in rhizobial genomes may further result from several types of alterations, such as point mutations, deletions, amplification of DNA, and from intragenome re-assortment of sequences [19–21].
The aim of this study was to evaluate the divergence of genomes of a small population of R. leguminosarum bv. trifolii (Rlt) nodule isolates from clover plants grown in the same site in cultivated soil. Like the other members of the genus Rhizobium, the Rlt genomes were partitioned into the chromosome and several large plasmids, one of which carried symbiosis-related genes. The variability of the genome architecture involved not only the number and size of the plasmids, but also the location of specific genes on the particular replicons. Distribution of repABC operon markers and other genes in the three genome compartments: the chromosome, chromid-like and 'other plasmids' was assessed. We found "stable" genes that were permanently located in a specific genome compartment, as well as "unstable" ones, which were detected in different replicons of the sampled strains. Sequences of selected chromosome and plasmid genes were subjected to an assessment of adaptation to a particular genome compartment by analyses of codon usage and codon adaptation index. A potential evolutionary pathway of Rlt strains was proposed on the basis of gene sequences and their distribution.
R. leguminosarum bv. trifolii (Rlt)strains
129 R. leguminosarum isolates were obtained from nodules of red clover (Trifolium pratense L. cv. Dajana) growing in sandy loam (N:P:K 0.157:0.014:0.013%). Plants were grown on 1 m2 plot for six weeks between May and June 2008. Afterwards, ten randomly chosen clover plants growing in each other's vicinity were harvested, the nodules were collected, surface-sterilized, crushed and their content plated on 79CA medium . Strains isolated from the nodules were purified by successive streaking of single colonies and pure cultures were used in further experiments.
Primers and probes used in this study
RtTA1 replicon name
GenBank accession no
1300 bp of Pss-I region encoding part of putative flippase PssL
2956 bp of Pss-V region encoding lipopolysaccharide biosynthesis proteins
445 bp fragment encoding adenosylmethionine-8-amino-7-oxononanoate aminotransferase
487 bp fragment encoding RNA polymerase sigma factor
624 bp fragment encoding DNA helicase
645 bp fragment encoding heat shock protein 70 family
416 bp fragment encoding negative regulator of exopolysaccharide synthesis
1135 bp fragment comprising rRNA genes rrl and rrs-rrl intergenic spacer
850 bp fragment encoding lipid A oxidase
423 bp fragment encoding plasmid stability protein
539 bp fragment encoding nitrogen fixation cation transport proteins
442 bp fragment encoding C-terminal tail-specific protease precursor
620 bp fragment encoding type I secretion membrane fusion
1467 bp fragment encoding putative replication/partition protein of pRleTA1d
578 bp fragment encoding putative replication protein of pRleTA1d
740 bp fragment encoding dTDP-glucose 4,6-dehydratase, O-antigen biosynthesis protein
orf16, orf17, otsB
2191 bp fragment encoding component of ABC transporter Orf16 of AraC family, transcriptional regulator Orf17, trehalose-phosphatase OtsB
5 kb fragment encoding taurine uptake protein TauA and flavin monooxygenase/reductase Orf14 protein
433 bp fragment encoding replication/partition protein of pRleTA1c
1417 bp fragment encoding replication protein of pRleTA1c
440 bp fragment encoding surface polysaccharide biosynthesis protein of Pss-III region
582 bp fragment encoding quinolinate synthetase
589 bp fragment encoding septum site-determining protein
577 bp fragment encoding imidazolonepropionase protein
344 bp fragment encoding protocatechuate 3,4-dioxygenase protein
1309 bp fragment encoding replication/partition protein of pRleTA1b
932 bp fragment encoding putative replication protein of pRleTA1b
662 bp encoding fragment of acyltransferase nodulation protein
649 bp fragment encoding nitrogenase MoFe cofactor biosynthesis proteins
478 bp encoding fragment of thiamine biosynthesis protein
890 bp encoding fragment 1-aminocyclopropane-1-carboxylate deaminase
774 bp fragment encoding replication/partition protein of pRleTA1a
773 bp fragment encoding putative replication protein of pRleTA1a
Preparation of high molecular weight DNA and PFGE conditions
The plugs were formed with 5 ml 48 h culture of Rlt strains, which after centrifugation were resuspended in TE buffer and mixed with 2% LMP agarose (Sigma). Agarose embedded cells were incubated with TE and lysozyme (1.5 mg/ml) for 16 h at 37°C, and then in cell lysis buffer (1% sodium lauryl sarcosine, 50 mM EDTA, 50 mM Tris-HCl pH 8.0) supplemented with proteinase K (0.5 mg/ml) at 37°C for additional 48 h. The proteinase K was inactivated by PMSF (0.4 mg/ml) at 37°C for 1 h. Plugs were washed tree times (30 min) with TE buffer and finally stored in TE at 4°C. PFGE was performed with the contour-clamped homogenous electric field mode with the Bio-Rad system (model CHEF-DRIII). DNA samples were separated in 1% Megabase agarose gels (Bio-Rad) in 1 × TAE buffer, refrigerated at 12-14°C, with switch time 100-300 seconds, angle 106°, voltage gradient 3 V/cm for 48 h. Estimation of plasmid size was performed with BIO-PROFIL BioGene (Vilber-Lourmat, France), using R. leguminosarum bv. viciae strain 3841 , R. leguminosarum bv. trifolii TA1 [25, 26] and Sinorhizobium meliloti 1021 .
Computer assisted analyses
Sequence data were analyzed with Lasergene analysis software (DNASTAR, Inc). Data base searches were done with the BLAST and FASTA programs at the National Centre for Biotechnology Information (Bethesda, Md) and European Bioinformatic Institute (Hinxton, UK). For the DNA sequences multiple alignments Clustal-W algorithm was used . Codon usage of sequenced genes was calculated using ACUA . Codon adaptation index (CAI) was calculated with cai program . In codon usage discriminant analyses with two grouping methods were applied to studied sequences: (a) based on the localization of genes in defined part of the rhizobial genome (three groups: chromosome, chromid-like, and other plasmids), or (b) based on the origin of the genes (13 groups-each for one strain). The results of this multivariate analysis give us the information about separation of studied groups on the basis of discriminant functions i.e. linear combinations of studied variables maximizing distances between groups and orthogonal to each other .
For every grouping method set of variables included the relative frequency of alternative codons (for the same aminoacids), leading to the investigation of 59 variables (omitting stop codons and codons for methionine and tryptophan, which have no alternatives).
Complete discriminant analysis was performed but from among many obtained results we focused on Chi-squared test providing the number of statistically significant discriminant functions, squared Mahalanobis distances between the group centroids (taking into account the correlation between variables), scatterplots of discriminant scores i.e. cases located in the property space formed by first two discriminant functions  as well as the classification table containing information about the number and percent of correctly classified cases in each group.
The application of discriminant analysis was preceded by tolerance test, which enable us to remove redundant variables out of the model . The tolerance tests were performed using Classify/Discriminant unit of SPSS software (SPSS for Windows version 10.0, 1999, SPSS Inc., Chicago, IL, USA) while other results were obtained using Discriminant Function Analysis units of STATISTICA software system (Statistica version 6, 2001, StatSoft Inc., Tulsa, OK, USA).
Nucleotide sequence accession numbers
The following GenBank accession numbers were given to the nucleotide sequences determined in this study. For dnaC GQ374266-GQ374277, dnaK GQ374278-GQ374289, exoR GQ374290-GQ374301, fixGH GQ374302-GQ374313, hlyD GQ374314-GQ374325, lpsB GQ374326-GQ374337, nadA GQ374338-GQ374349, nifNE GQ374350-GQ374361, nodA GQ374362-GQ374373, prc GQ374374-GQ374385, rpoH2 GQ374386-GQ374397, thiC GQ374398-GQ374409, minD JF920043, hutI JF920044, pcaGJF920045
Strain selection based on variable genomic organization
Plasmid number and size of R. leguminosarum bv. trifolii strains determined by PFGE
Plasmid size (kb)
Average molecular weight (m.w.) of all the plasmids in each of the 23 isolates was calculated as 2.815 Mb (ranging from 1.89 to 3.25 Mb). With regard to the average genome size ~7.145 Mb of recently sequenced R. leguminosarum bv. trifolii WSM2304 (Rlt2304) and WSM1325 (Rlt1325) [33, 34], in which extrachromosomal replicons constitute 34% and 36%, respectively, the extrachromosomal DNA content in our strains was calculated to range from 26% to 45% (an average ~39%).
Similarity of replication-partition genes in the plasmid pool of selected strains
One of the methods to assess the phylogenetic relatedness among plasmids is to compare their replication systems. Thus, at the beginning of our study, similarity and/or diversity of replication regions between the plasmids of the nodule isolates were examined. Recently, the replication systems of four plasmids (pRleTA1a-pRleTA1d), each equipped with repABC genes, were analyzed in RtTA1 . An experimental approach comprising a series of Southern hybridizations with repA and repC genes derived from plasmids pRleTA1a-pRleTA1d of RtTA1 as molecular probes was used (Table 1). The repA and repC genes were PCR amplified from the RtTA1 genome and probed against PFGE-separated HMW DNA of the sampled strains. The choice of two different genes from each of the replication system identified in RtTA1 as molecular probes seemed to be justified by lack of single universal phylogenetic history within the repABC operon and by RepA and RepB evolution, partially independent from RepC .
RepABC of pRleTA1d and pRleTA1b display similarity with replication systems of the extrachromosomal replicons, which were recently described as chromids [16, 35]. Within the group of closely related strains RtTA1, R. leguminosarum bv. viciae 3841 (Rlv), R. etli CFN42 (Rhe), RltWSM2304 and RltWSM1325 clusters of replicons carrying the most similar replication systems can be distinguished. They comprise pRleTA1d-pRL12-p42f-pRLG201-pR132501 and pRleTA1b-pRL11-p42e-pRLG202-pR132502, respectively. Therefore, detection of positive hybridization signals with probes derived from rep genes of RtTA1 chromid-like replicons (i.e. pRleTA1b or pRleTA1d) to any of the replicons of the sampled strains allowed regarding those as a chromid-like. Based on the similarity of replication-partition genes detected in our assays, we divided the replicons of the studied strains into three genome compartments: chromosome, chromid-like and 'other plasmids' (i.e. those replicons which gave a hybridization signal with molecular probes originating from repA and repC genes of pRleTA1a or pRleTA1c, as well as those that gave no signal with any rep probes of RtTA1 replication genes). The compartment designated 'other plasmids' also comprised pSym. Such replicon division was taken into consideration in the subsequent analyses of distribution of other markers in the studied strains.
Variability of chromosomal and plasmid marker location
In further studies, the extent of gene content diversity in the sampled nodule isolates was examined. We aimed to estimate whether, besides repA and repC displacement events, we could demonstrate changes in the location of the chromosomal and plasmid genes. The same experimental approach was used, i.e. a series of Southern hybridizations with different genes with a well-defined chromosomal or plasmid location in RtTA1 (Table 1) .
For assays of chromosomal marker variability, essential bacterial genes were chosen: rpoH2, dnaK, dnaC, rrn, lpxQ as well as genes that are not essential or with unspecified essentiality but chromosomal in RtTA1, i.e. bioA, stbB, exoR, pssL (Pss-I) and rfbADBC (Pss-V) (Table 1). In addition, location of fixGH genes was assayed, even though they are known to be plasmid located on the sequenced RltWSM2304, RltWSM1325 [33, 34], Rlv  and Rhe  genomes, but chromosomal in RtTA1 .
Southern hybridizations with probes comprising markers previously identified on different RtTA1 replicons , such as prc and hlyD of pRleTA1d; lpsB2, orf16-orf17-otsB, tauA and orf14 genes cluster of pRleTA1c; nadA and pssM (surface polysaccharide synthesis region Pss-III) of pRleTA1b, were carried out. These analyses demonstrated that pRleTA1d markers were almost always jointly detected in the largest chromid-like replicons (only in K3.22 and K5.4 they are separated between distinct chromid-like replicons). pRleTA1c markers in almost all (21 out of 23) of the sampled strains were located in the genome compartment designated as 'other plasmids' (Figure 3). From among markers of pRleTA1b, nadA, minD, hutI and pcaG had always chromid-like location, while the pssM gene was located in the chromosome of 19 strains, in chromid-like replicons of four strains including RtTA1, and was absent in the genome of K3.22 strain, respectively (Figure 3).
Besides the symbiotic genes nodA and nifNE used for identification of pSym plasmids, stability of thiC and acdS (Table 1) of the pRleTA1a symbiotic plasmid (ipso facto described as markers of the 'other plasmids' pool) was examined (Figure 3). Only thiC was identified in all the strains, however, located in different genomic compartments: most frequently on the chromosome (18 of 23 strains), and in the 'other plasmids' (5 strains). The acdS gene was detected in 14 of 23 strains, in each case on pSym (Figure 3). The thiC gene, similarly to fixGHI, showed high variability in location; however, its putative mobile element location is unknown . thiC was reported as plasmid located in sequenced genomes of Rlv , Rlt2304  and Rhe .
Sequence divergence of chromosomal and plasmid genes
In general, a low number of nucleotide substitutions were found in the examined genes in most strains. Similar groups of clustered strains were obtained in dendrograms constructed both on the basis of concatenated chromosomal sequences (Figure 6A), as well as concatenated chromid-like replicon genes (Figure 6B). In both cases, a smaller group containing RtTA1, K4.15 and K3.6 strains, and a larger group consisting of the remaining strains was observed. Interestingly, K3.22 chromosomal genes split off from all remaining strains suggesting their considerable divergence (Figure 6B). Sequence similarity within the RtTA1, K4.15 and K3.6 group is also visible on a dendrogram exclusively based on plasmid gene sequences, derived from pSym (Figure 6C). When all the concatenated sequences (comprising genes with stable and unstable location in the genome) were used in dendrogram construction, the grouping of the strains was very similar to that obtained on the basis of stable chromosomal markers (Figure 6A, D). In conclusion, quite a similar phylogenetic history of the studied strains was demonstrated based on both stable and unstable chromosomal, chromid-like as well as 'other plasmid' genes (despite the small number of the markers analyzed).
The discrimination analysis of codon usage performed on individual strains harboring the set of the tested genes (13 groups of sequences) revealed only minor differences between the resultant groups and almost no accordance (31%) with the grouping performed on the basis of hybridization. However, some level of similarity between the strains can be demonstrated. As a consequence, one more discrimination analysis of codon usage was done, and the strains were divided into three groups: (i) K3.22, (ii) RtTA1, K3.6, K4.15 and (iii) all the remaining strains (Figure 7B). This resulted in 92% accordance between codon usage-based and strain-dependent grouping of sequences (Figure 7B and Figure 6D). It was concluded that codon usage was not significantly influenced by the individual strains but may be characteristic for the group of strains.
Finally, the Codon Adaptation Index (CAI) of the sequences studied was calculated. The CAI can be used to "evaluate the extent to which selection has been effective in molding the pattern of codon usage"  as well as to compare the codon usage of foreign genes versus that of highly expressed native genes . Here, we applied CAI analyses to assess the degree of adaptation of sequenced genes to the host by comparing the obtained CAI values with those of genes encoding ribosomal proteins in R. leguminosarum. The calculated CAI values for each sequence were arbitrarily grouped and subsequently submitted to ANOVA evaluation, which measures the significance of differences between groups. CAI values can range from 0 (reflecting use of synonymous codons) to 1 (reflecting the strongest bias where codon usage is equal to that in the ribosomal protein-encoding genes) .
The Codon Adaptation Index (CAI) of genes located in genome compartments in Rlt nodule isolates.
Number of sequences
0.767 ± 0.062 a
0.732 ± 0.065 b
0.645 ± 0.061 cd
Three genome compartments that differed genetically and functionally can be distinguished in the nodule population of R. leguminosarum bv. trifolii: the chromosome, chromid-like and 'other plasmids' including pSym. Chromid-like replicons were distinguished in Southern analyses on the basis of repA and repC sequence similarity to RtTA1 and to the respective replication genes of such replicons described in the sequenced genomes of R. leguminosarum bv. viciae, R. etli and R. leguminosarum bv. trifolii . The chosen name "chromid-like" (as opposed to simply "chromid") was the result of data scarcity concerning their gene content, insufficient to justify the name "chromid" . Moreover, it is known that genes of the repABC operon are peculiar genetic markers because of the complex phylogeny of particular genes within the operon, whose evolutionary history could not be strictly connected with other genes of particular replicons .
In the study of the distribution of several chromosomal and plasmid markers within a group of 23 nodule isolates, stable genes permanently located in a specific R. leguminosarum bv. trifolii genome compartment: chromosome, chromid-like and 'other plasmids' including pSym were distinguished. Unstable genes (fixGH, thiC, acdS, pssM and Pss-V region) that changed their location at various rates or were lost from the genome were also detected. Only two of the sampled 23 strains possessed all the studied markers. A majority of strains differed in the gene content and gene distribution, supporting the hypothesis of the pangenomic structure of R. leguminosarum, in which each strain of a given species contains, besides the core genome, additional genetic information specific for the strain [11, 17, 18, 39].
The distribution of the plasmid replication-partition genes was even more dynamic than that of genes not connected with replication. Independent transfer events of repA and repC genes of the putative repABC operon were frequently observed, especially in the 'other plasmids' compartment, which confirmed different evolutionary pathways for various elements of the repABC operon, recently evidenced in Alphaproteobacteria . Such considerable dynamics of replication/partition gene distribution in Rhizobium may account for changes in the plasmid number and, consequently, gene content observed in the sampled population. Beside the dynamics of replication/partition gene distribution, some level of conservation of replication genes, especially those of chromid-like replicons, was also observed. It was reflected in positive hybridizations with pRleTA1d and pRleTA1b derived rep probes, to the respective replicons of Rlt strains. One could speculate that the conservation of replication genes of chromid-like replicons may be related with their distinct properties e.g. stability. However, the gene content rather than the properties of the replication system, resulting e.g. from conservation of replication genes, seem to be crucial for replicon stability .
Redistribution of genes between the different genome compartments could further trigger their sequence divergence under different selective pressures [13, 15, 41]. Examination of sequence divergence of several stable and unstable chromosomal and plasmid genes showed a low level of substitutions in genes of all the compartments. Nearly identical nucleotide sequences of nifNE markers were found in different pSym plasmids of the studied population (Figure 6C), confirming the core character of symbiotic genes and their high conservation, despite the overall genome differentiation .
The extent of gene adaptation to a given compartment in the host genome was assessed by analyses of alternative codon usage. Three groups of well separated genes were obtained corresponding to the chromosome, chromid-like and 'other plasmids' genome compartments (Figure 7A) with 96% accordance with hybridization data. In conclusion, the sequence divergence of particular genes may be affected by their location in the given genome compartment. When all the sequences of the individual strains studied were subjected to a discrimination analysis, we obtained good separation of K3.22 and a group of strains related to RtTA1 (Figure 7B) that formed the outermost branch in the phylogenic tree. The remaining strains were randomly mixed with each other but apparently separated from K3.22 and TA1-related strains, which suggested no differences in codon usage within the main group.
The CAI analyses of the evaluated sequences confirmed good adaptation of chromosomal and chromid-like genes (high CAI values) to host genomes and lower CAI values for 'other plasmids' genes. The CAI values also reflect the level of transcriptional and translational activity of particular genes . While the activity of most of the chromosomal and chromid-like genes could be considered at least to some extent constitutive, the 'other plasmids' and especially symbiosis-related genes are expressed only transiently in the symbiotic stage . Therefore, in the Rhizobium model, the differences in codon usage in translation reflect the balance between the selection pressure and random mutations in the functionally differentiated genome compartments. The differences in codon usage and CAI values between the genome compartments are most likely a consequence of differential gene expression and adaptability to optimal codon usage in host genomes .
Our study showed that, even within a small rhizobial population of clover nodule isolates, substantial divergence of genome organization can be detected especially taking into account the content of extrachromosomal DNA. Despite the high variability with regard to the number and size of plasmids among the studied strains, conservation of the location as well as the dynamic distribution of the individual genes (especially replication genes) of a particular genome compartment was demonstrated. The sequence divergence of particular genes may be affected by their location in the given genome compartment. The 'other plasmid' genes are less adapted to the host genome than the chromosome and chromid-like genes.
Acknowledgements and Funding
This work was supported by Grant No. N N301 028734 from Ministry of Science and Higher Education of Poland.
- Jones KM, Kobayashi H, Davies BW, Taga ME, Walker GC: How symbionts invade plants: the Sinorhizobium-Medicago model. Nat Rev Microbiol. 2007, 5: 619-633. 10.1038/nrmicro1705.PubMedPubMed CentralView ArticleGoogle Scholar
- Masson-Boivin C, Giraud E, Perret X, Batut J: Establishing nitrogen-fixing symbiosis with legumes: how many rhizobium recipes?. Trends Microbiol. 2009, 17: 458-466. 10.1016/j.tim.2009.07.004.PubMedView ArticleGoogle Scholar
- Perret X, Staehelin C, Broughton W: Molecular basis of symbiotic promiscuity. Microbiol Mol Biol Rev. 2000, 64: 180-201. 10.1128/MMBR.64.1.180-201.2000.PubMedPubMed CentralView ArticleGoogle Scholar
- Galibert F, et al: The composite genome of the legume symbiont Sinorhizobium meliloti. Science. 2001, 293: 668-672. 10.1126/science.1060966.PubMedView ArticleGoogle Scholar
- González V, Santamaría RI, Bustos P, Hernández-González I, Medrano-Soto A, Moreno-Hagelsieb G, Janga SC, Ramírez MA, Jiménez-Jacinto V, Collado-Vides J, Dávila G: The partitioned Rhizobium etli genome: genetic and metabolic redundancy in seven interacting replicons. Proc Natl Acad Sci USA. 2006, 103: 3834-3839. 10.1073/pnas.0508502103.PubMedPubMed CentralView ArticleGoogle Scholar
- Young JPW, et al: The genome of Rhizobium leguminosarum has recognizable core and accessory components. Genome Biol. 2006, 7: R34-10.1186/gb-2006-7-4-r34.PubMedPubMed CentralView ArticleGoogle Scholar
- Palacios R, Newton WE: Genomes and genomics of nitrogen-fixing organisms. Edited by: Palacios R, Newton WE. 2005, Dordrecht, The Netherlands: SpringerView ArticleGoogle Scholar
- Sullivan JT, Trzebiatowski JR, Cruickshank RW, Gouzy J, Brown SD, Elliot RM, Fleetwood DJ, McCallum NG, Rossbach U, Stuart GS, Weaver JE, Webby RJ, De Bruijn FJ, Ronson CW: Comparative sequence analysis of the symbiosis island of Mesorhizobium loti strain R7A. J Bacteriol. 2002, 184: 3086-3095. 10.1128/JB.184.11.3086-3095.2002.PubMedPubMed CentralView ArticleGoogle Scholar
- Konstantinidis KT, Tiedje JM: Trends between gene content and genome size in prokaryotic species with larger genomes. Proc Natl Acad Sci USA. 2004, 101: 3160-3165. 10.1073/pnas.0308653100.PubMedPubMed CentralView ArticleGoogle Scholar
- Crossman LC, Castillo-Ramírez S, McAnnula C, Lozano L, Vernikos GS, Acosta JL, Ghazoui ZF, Hernández-González I, Meakin G, Walker AW, Hynes MF, Young JPW, Downie JA, Romero D, Johnston AWB, Dávila G, Parkhill J, González V: A common genetic framework for a diverse assembly of plasmids in the symbiotic nitrogen fixing bacteria. PLoS ONE. 2008, 7: e2567-View ArticleGoogle Scholar
- González V, Acosta JL, Santamaría RI, Bustos P, Fernández JL, Hernández González IL, Díaz R, Flores M, Palacios R, Mora J, Dávila G: Conserved symbiotic plasmid DNA sequences in the multireplicon pangenomic structure of Rhizobium etli. Appl Environ Microbiol. 2010, 76: 1604-1614. 10.1128/AEM.02039-09.PubMedPubMed CentralView ArticleGoogle Scholar
- Cevallos MA, Cervantes-Rivera R, Gutiérrez-Ríos RM: The repABC plasmid family. Plasmid. 2008, 60: 19-37. 10.1016/j.plasmid.2008.03.001.PubMedView ArticleGoogle Scholar
- Castillo-Ramírez S, Vázquez-Castellanos JF, González V, Cevallos MA: Horizontal gene transfer and diverse functional constrains within a common replication-partitioning system in Alphaproteobacteria: the repABC operon. BMC Genomics. 2009, 10: 536-10.1186/1471-2164-10-536.PubMedPubMed CentralView ArticleGoogle Scholar
- Cevallos MA, Porta H, Izquierdo J, Tun-Garrido C, García-de-los-Santos A, Dávila G, Brom S: Rhizobium etli CFN42 contains at least three plasmids of the repABC family: a structural and evolutionary analysis. Plasmid. 2002, 48: 104-116. 10.1016/S0147-619X(02)00119-1.PubMedView ArticleGoogle Scholar
- Fondi M, Bacci G, Brilli M, Papaleo MC, Mengoni A, Vaneechoutte M, Dijkshoorn L, Fani R: Exploring the evolutionary dynamics of plasmids: the Acinetobacter pan-plasmidome. BMC Evol Biol. 2010, 10: 59-10.1186/1471-2148-10-59.PubMedPubMed CentralView ArticleGoogle Scholar
- Harrison PW, Lower RPJ, Kim NKD, Young JPW: Introducing the bacterial 'chromid': not a chromosome, not a plasmid. Trends Microbiol. 2010, 18: 141-147. 10.1016/j.tim.2009.12.010.PubMedView ArticleGoogle Scholar
- Tettelin H, Riley D, Cattuto C, Medini D: Comparative genomics: the bacterial pan-genome. Curr Opin Microbiol. 2008, 12: 472-477.View ArticleGoogle Scholar
- Medini D, Donati C, Tettelin H, Masignani V, Rappuoli R: The microbial pangenome. Curr Opin Genet Dev. 2005, 15: 589-594. 10.1016/j.gde.2005.09.006.PubMedView ArticleGoogle Scholar
- Flores M, Morales L, Avila A, González V, Bustos P, García D, Mora Y, Guo X, Collado-Vides J, Piñero D, Dávila G, Mora J, Palacios R: Diversification of DNA sequences in the symbiotic genome of Rhizobium etli. J Bacteriol. 2005, 187: 7185-7192. 10.1128/JB.187.21.7185-7192.2005.PubMedPubMed CentralView ArticleGoogle Scholar
- Guerrero G, Peralta H, Aguilar A, Díaz R, Villalobos MA, Medrano-Soto A, Mora J: Evolutionary, structural and functional relationships revealed by comparative analysis of syntenic genes in Rhizobiales. BMC Evol Biol. 2005, 5: 55-10.1186/1471-2148-5-55.PubMedPubMed CentralView ArticleGoogle Scholar
- Rocha EPC: Evolutionary patterns in prokaryotic genomes. Curr Opin Microbiol. 2008, 11: 454-460. 10.1016/j.mib.2008.09.007.PubMedView ArticleGoogle Scholar
- Vincent JM: A manual for the practical study of root nodule bacteria. International biological program handbook no.15. 1970, Oxford, UK: Blackwell Scientific Publications LtdGoogle Scholar
- Sambrook J, Fritsch EF, Maniatis T: Molecular Cloning: A Laboratory Manual. 1989, Cold Spring Harbor, NY, USA: Cold Spring Harbor Laboratory Press, 2Google Scholar
- Eckhardt T: A rapid method for the identification of plasmid deoxyribonucleic acid in bacteria. Plasmid. 1978, 1: 584-588. 10.1016/0147-619X(78)90016-1.PubMedView ArticleGoogle Scholar
- Chakravorty AK, Żurkowski W, Shine J, Rolfe BG: Symbiotic nitrogen fixation: molecular cloning of Rhizobium genes involved in exopolysaccharide synthesis and effective nodulation. J Mol Appl Genet. 1982, 1: 585-596.PubMedGoogle Scholar
- Król J, Mazur A, Marczak M, Skorupska A: Syntenic arrangements of the surface polysaccharide biosynthesis genes in Rhizobium leguminosarum. Genomics. 2007, 89: 237-247. 10.1016/j.ygeno.2006.08.015.PubMedView ArticleGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucl Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.PubMedPubMed CentralView ArticleGoogle Scholar
- Vetrivel U, Arunkumar V, Dorairaj S: ACUA: A software tool for automated codon usage analysis. Bioinformation. 2007, 2: 62-63.PubMedPubMed CentralView ArticleGoogle Scholar
- Sharp PM, Li WH: The codon adaptation index-a measure of directional synonymous codon usage bias, and its potential applications. Nucl Acids Res. 1987, 15: 1281-1295. 10.1093/nar/15.3.1281.PubMedPubMed CentralView ArticleGoogle Scholar
- McLachlan GJ: Discriminant analysis and statistical pattern recognition. 1992, Hoboken, New Jersey: John Wiley & Sons IncView ArticleGoogle Scholar
- Dillon WR, Goldstein M: Multivariate analysis. Methods and applications. 1984, New York: John Wiley & Sons IncGoogle Scholar
- Klecka WR: Discriminant analysis. 1980, Thousand Oaks, CA: Sage Publications, Quantitative Applications in the Social Sciences Series, No. 19.Google Scholar
- Reeve W, O'Hara G, Chain P, Ardley J, Bräu L, Nandesena K, Tiwari R, Malfatti S, Kiss H, Lapidus A, Copeland A, Nolan M, Land M, Ivanova N, Mavromatis K, Markowitz V, Kyrpides N, Melino V, Denton M, Yates R, Howieson J: Complete genome sequence of Rhizobium leguminosarum bv. trifolii strain WSM2304, an effective microsymbiont of the South American clover Trifolium polymorphum. Stand Genomic Sci. 2010, 2: 66-76. 10.4056/sigs.44642.PubMedPubMed CentralView ArticleGoogle Scholar
- Reeve W, O'Hara G, Chain P, Ardley J, Bräu L, Nandesena K, Tiwari R, Copeland A, Nolan M, Han C, Brettin T, Land M, Ovchinikova G, Ivanova N, Mavromatis K, Markowitz V, Kyrpides N, Melino V, Denton M, Yates R, Howieson J: Complete genome sequence of Rhizobium leguminosarum bv. trifolii strain WSM1325, an effective microsymbiont of annual Mediterranean clovers. Stand Genomic Sci. 2010, 2: 347-356. 10.4056/sigs.852027.PubMedPubMed CentralView ArticleGoogle Scholar
- Mazur A, Majewska B, Stasiak G, Wielbo J, Skorupska A: repABC-based replication systems of Rhizobium leguminosarum bv. trifoliiTA1 plasmids: incompatibility and evolutionary analyses. Plasmid. Google Scholar
- Król J, Mazur A, Marczak M, Skorupska A: Physical and genetic map of Rhizobium leguminosarum bv. trifolii TA1 and its application in comparison of closely related rhizobial genomes. Mol Genet Genomics. 2008, 279: 107-121. 10.1007/s00438-007-0299-9.PubMedView ArticleGoogle Scholar
- González V, Bustos P, Ramírez-Romero MA, Medrano-Soto A, Salgado H, Hernández-González I, Hernández-Celis JC, Quintero V, Moreno-Hagelsieb G, Girard L, Rodríguez O, Flores M, Cevallos A, Collado-Vides J, Romero D, Dávila G: The mosaic structure of the symbiotic plasmid of Rhizobium etli CFN42 and its relation to other symbiotic genome compartments. Genome Biol. 2003, 4: R36-10.1186/gb-2003-4-6-r36.PubMedPubMed CentralView ArticleGoogle Scholar
- Miranda-Ríos J, Morera C, Taboada H, Dávalos A, Encarnación S, Mora J, Soberón M: Expression of thiamin biosynthetic genes (thiCOGE) and production of symbiotic terminal oxidase cbb3 in Rhizobium etli. J Bacteriol. 1997, 179: 6887-6893.PubMedPubMed CentralGoogle Scholar
- Brom S, Girard L, García-de los-Santos A, Sanjuan-Pinilla JM, Olivares J, Sanjuan J: Conservation of plasmid-encoded traits among bean-nodulating Rhizobium species. Appl Environ Microbiol. 2002, 68: 2555-2561. 10.1128/AEM.68.5.2555-2561.2002.PubMedPubMed CentralView ArticleGoogle Scholar
- Landeta C, Dávalos A, Cevallos MA, Geiger O, Brom S, Romero D: Plasmids with a chromosome-like role in Rhizobium. J Bacteriol. 2011, 193: 1317-1326. 10.1128/JB.01184-10.PubMedPubMed CentralView ArticleGoogle Scholar
- Slater SC, et al: Genome sequences of three Agrobacterium biovars help elucidate the evolution of multichromosome genomes in bacteria. J Bacteriol. 2009, 191: 2501-2511. 10.1128/JB.01779-08.PubMedPubMed CentralView ArticleGoogle Scholar
- Peixoto L, Zaval A, Romero H, Musto H: The strength of translational selection for codon usage varies in the three replicons of Sinorhizobium meliloti. Gene. 2003, 320: 109-116.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.