Skip to main content
  • Research article
  • Open access
  • Published:

Gene expression analysis indicates extensive genotype-specific crosstalk between the conjugative F-plasmid and the E. coli chromosome



Plasmids are an important component of the bacterial genome, but the crosstalk between genes encoded on the chromosome and on the plasmid is still poorly understood.


We performed a large-scale survey for genes on the E. coli chromosome that are affected by the presence of the conjugative F-plasmid (crosstalk). The expression pattern of about 4% (107 genes) of the genes encoded by the chromosome was affected by the presence of the F-plasmid. Comparing two different Escherichia coli strains, MG1655 and DH5α, we found a strong host genotype-specific crosstalk of the host chromosome with the F-plasmid. About 88% of the genes affected by the presence of the F-plasmid showed a significant plasmid by host genotype interaction, i.e. the presence of the F-plasmid resulted in a different gene expression in the two host genotypes. Less than 12% of the genes showed an additive effect of gene expression, i.e. host genotype independent crosstalk between plasmid and host chromosome.


We propose that epistatic effects also contribute to the maintenance of F-plasmids in natural populations.


Bacterial plasmids are extrachromosomal, non-essential DNA elements, which can replicate autonomously [1, 2]. Most plasmids are smaller than bacterial chromosomes and some plasmids have the capacity to move between different host species, leading to intra- and inter-specific gene transfer [3, 4].

The co-existence of the plasmid and the bacterial chromosome in the same cell raises interesting questions. Probably the best studied is how plasmids persist in natural populations. It has been proposed that plasmids are parasites [5] or mutualists [6], conferring a selective advantage to their host. Eberhard [2] suggested that plasmids carry genes, which are only required under certain rare circumstances. When needed they can be transferred to other cells, but when not required only a small number of cells carry the plasmid. According to this view plasmids are a kind of 'lending library' that deliver genes when required.

While the above mentioned hypotheses do not require crosstalk between plasmid and the host chromosome, some studies using experimental evolution showed that the host chromosome and the plasmid co-evolve to reduce the cost of F-plasmid carriage [7, 8]. Nevertheless, the extent of crosstalk between host chromosome and plasmids is not known.

In this study, we compare the gene expression pattern of the E. coli chromosome in the presence and absence of the conjugative F-plasmid to understand how the presence of the plasmid affects gene expression of the chromosome. Using two different E. coli strains, we find dramatic differences in the expression of host genes, indicating a highly genotype specific crosstalk between plasmid and chromosome.


4374 genes are represented on the Affymetrix chip (not counting the intergenic regions), 2698 of these were expressed in at least one of the four different genotypes (two hosts, each in the presence or absence of the plasmid). Our experimental set-up was designed to disentangle the contribution of genetic background of the host from changes in gene expression induced by F-plasmid.

We distinguish four groups of genes: "bacterial genotype effect only" (Fig. 1a), "F-plasmid effect only" (Fig. 1b), "bacterial and F-plasmid additive effects" (Fig. 1c), and "bacterial/F-plasmid interaction effect" genes (Fig. 1d).

Figure 1
figure 1

Possible effects of bacterial strain and F-plasmid in a 2 × 2 analysis of variance (ANOVA). a) -c): cases with no interaction effect. d): examples of some possible interaction effects.

a) Bacterial genotype effect only

We identified a total of 216 genes with a host-specific gene expression independent of the presence of the F-plasmid ("bacterial genotype effect only" genes, [see Additional file 1]). After accounting for genes located in the same operon we identified 141 unique "regulatory units" (i.e. sets of genes that are co-regulated due to their location in the same operon). 57 regulatory units were up-regulated in DH5α and 84 regulatory units were up-regulated in MG1655. Interestingly, the host-specific genes in the two strains fell into functionally different classes. All 11 amino acid synthesis regulatory units with host-specific gene expression were up-regulated (~6-fold) in MG1655 (Table 1). Genes affecting nucleotide biosynthesis, carbon utilization and fatty acid synthesis were also up-regulated (between ~4 and 10 fold) in MG1655. The opposite pattern was found for flagellum and chemotaxis genetic regulatory units, which were strongly up-regulated in DH5α (between ~24 and ~28-fold). Similarly, regulatory units with prophage- or transposon-related functions were more highly expressed in DH5α.

Table 1 Functional groups of genes differentially expressed between hosts.

b) F-plasmid effect only

An "F-plasmid effect only" gene is a gene that is expressed in response to the presence of the F-plasmid but without having any strain-specific effects. As expected, all 10 F-plasmid encoded genes that were represented on our microarray were strongly expressed in F-plasmid containing cells, but not expressed (i.e.: had signal intensities below the detection limit and were called "Absent" by the Affymetrix software) in F-plasmid free cells. Nine out of these 10 F-plasmid encoded genes were "F-plasmid effect only" genes (Table 2) while one F-plasmid encoded gene showed an interaction effect with the host genotype (see below). All "F-plasmid effect only" genes show an increase in expression level in the presence of the F-plasmid. The difference in expression level of F-plasmid encoded proteins between F-plasmid free cells (where expression level equals background signal intensity on the chip) and F-plasmid containing cells was on average 40-fold with a maximum change in expression level of the traD gene of 140-fold.

Table 2 "F-plasmid effect only" genes: similar effects of F-plasmid in both bacterial backgrounds

There are seven genes encoded on the bacterial chromosome that were at least two-fold differentially expressed in cells with and without F-plasmid. On average these are up-regulated by ~3.3-fold.

c) Bacterial and F-plasmid additive effects

We identified six genes which had different expression levels in the two hosts and which also had an effect induced by the presence of the F-plasmid, but without any interaction effect (Table 3). In contrast to "F-plasmid effect only" genes, an equal number of genes are up-regulated and down-regulated in the presence of the F-plasmid (Table 3). Furthermore, there was no net change in expression intensity (Figure 2).

Table 3 "Bacterial and F-plasmid additive effects": genes whose expression level differs depending on bacterial background and whether or not they harbor an F-plasmid
Figure 2
figure 2

Direction of expression differences. The mean (± 2 s.e.) of log2 transformed fold-change in expression level in response to the absence of the F-plasmid are plotted for two different E. coli strains, DH5α and MG1655. While "plasmid effect only" genes showed the expected change in one direction only (expression in the presence of the F-plasmid), the "bacterial and plasmid additive effects only" genes and "bacterial/plasmid interaction effect" genes were unbiased (i.e.: the mean change across genes was close to zero).

d) Bacterial/F-plasmid interaction effect

Genes falling into this category show either a significant effect of the F-plasmid in only one host, or have disproportional effects across the two hosts. In total, 95 interaction genes (82 regulatory units) were identified. 25 genes (20 regulatory units) had a significant difference in expression levels between MG1655 and MG1655F but not between DH5α and DH5αF. 49 genes (47 regulatory units) with a host-specific F-effect were detected in DH5α but not in MG1655. 20 genes (15 regulatory units) showed an F-effect in opposite direction in the two hosts [see Additional file 2]. One gene (the F-plasmid encoded Protein-D (resD)) was up-regulated in the presence of the F-plasmid in both genetic backgrounds but the magnitude of change was twice as strong in MG1655 than in DH5α. Similar to genes with "bacterial genotype effect only" or "F-plasmid effect only", interaction genes were also equally likely to be up- or down-regulated in the presence of the F-plasmid (Table 4) and no net change in expression intensity was noted (Fig. 2).

In contrast to the "bacterial genotype effect only" genes, a functional classification of the "bacterial/F-plasmid interaction effect" genes provided no clear pattern (Table 4). Among the genes with a host-specific response to the F-plasmid we found genes with functions in chaperoning and carbon utilization.

Table 4 Regulation of "bacterial/F-plasmid interaction effect" genes


In our experiments 323 genes (out of 2698 expressed genes) located on the E. coli chromosome show significant differences in gene expression. 216 (67%) of these 323 genes differed in expression between the two E. coli genotypes DH5α and MG1655 and this difference was not affected by the presence of the F-plasmid. Based on the significance level of 0.05 used in our study, only 135 genes would be expected to show 'significant' differences by chance alone. In our study, however, we observed significantly more differently expressed genes than expected by chance (P < 0.000001, binomial test), confirming a significant biological effect. Note that the binomial test is conservative, as we involved an additional filtering step to include only genes with at least 2-fold expression difference (see material and methods). Further support for the reliability of our expression analysis is provided by the recovery of the well-known difference between both strains in the activity of the lac operon, which is inactive in DH5α (see appendix). This high number of strain specific differences in gene expression pattern is consistent with previous reports that also found large differences among E. coli strains [9, 10].

About half of the strain-specifically expressed genes were not detectably expressed in the other strain (i.e. genes not expressed in DH5α were expressed in MG1655 and vice versa). We did not perform genomic hybridization to test the hypothesis that some of the changes in expression level are due to the absence of the respective gene, as the arrays had been constructed based on the sequence of the strain MG1655. As we also found genes expressed in DH5α but not in MG1655 we consider that gene deletions are not a major factor shaping the difference in gene expression between the two strains. This is supported by some recent work showing that gene content is very similar among related bacterial strains (for example strain W3110, a close relative of MG1655 lacks only 80 (1.9%) of MG1655's ORF's [11]). Nevertheless, even when only genes expressed in both strains were considered (i.e. 2243 genes), a statistically significant excess of differentially expressed genes (130) could be detected (P = 0.047, binomial test).

Of the 107 chromosomal genes that were affected by the presence of the F-plasmid we found less than 12% (13 genes) with purely additive effects across strains. The majority of genes showed a significant interaction between strain and plasmid in their expression. Hence, despite the fact that we used an F-plasmid with an identical genotype in our experiments, most of the genes showed a response to the presence of the F-plasmid that depended on the genotype of the host chromosome. The abundance of significant interaction effects between strain and plasmid gene expression implies that epistasis (i.e. genotype-specific crosstalk) is generally common. By contrast, of the 10 F-plasmid encoded genes that were represented on the chip, nine showed additive effects and were not significantly influenced by the host genotype. Even though the number of F-plasmid encoded genes is small, this difference is statistically highly significant (p < 0.001, Fisher's exact test), implying large differences between F-plasmid encoded and chromosomal genes. This result is not affected by the significance level used for the identification of significant genes (i.e. ANOVA P-value of 0.005 provided similar results, data not shown). The almost complete absence of interaction effects for F-plasmid encoded genes is not unexpected, as these genes are essential for the F-plasmid associated phenotype (e.g. formation of sex-pili). Thus, the F-plasmid, which could easily move among cells [4], requires a set of genes that remains functional in the genetic background of different host cells.


Our results of a strong host genotype dependent crosstalk could potentially have important evolutionary implications. As the same plasmid results in a contrasting gene expression in different host genotypes, the cost of carriage is likely to depend on both the host genotype and the environment. Hence, it is conceivable that this host genotype specific crosstalk also results in fitness differences, i.e. the cost of carriage may vary among host genotypes. More host genotypes, in particular more diverged ones, need to be analyzed in a range of environments to determine if the epistatic interactions observed in our study could contribute to the maintenance of F-plasmids in natural populations.


Strains used

The two laboratory strains of E. coli used were DH5α ([F-Φ80dlac ZΔM15Δ(lacZYA-argF) U169 end A1 recA 1 hsdR17(rk-mk+) deo R thi- 1 supE44 λ-gyr A96 rel A1]) and MG1655 (obtained from F. Blattner). The sequenced F-plasmid (GenBank Accession number: AP001918, obtained from Gen-ichi Sampei) was added to both strains by conjugation. Altogether four different genotypes were investigated: DH5α containing an F-plasmid, DH5α without an F-plasmid, MG1655 containing an F-plasmid and MG1655 without an F-plasmid. The identity of strains was confirmed on IPTG/X-Gal LB plates (MG1655 yields blue colonies while DH5α colonies are white) and by F-plasmid specific PCR using primers complementary to the F-plasmid encoded traU gene (5' gtc ttc ctg ggt gaa tga tg 3' and 5' gat gat gtc gta tcc ctg act g 3').

Culture conditions

Replicate cultures for each genotype were inoculated from single colonies in 5 ml LB medium (0.2 mg/ml Ampicillin) and grown over night at 37°C. 500 μl of each overnight culture was used to inoculate 50 ml fresh LB medium. These cultures were grown at 37°C and cells were harvested in early log phase corresponding to an OD600 of 0.4. The identical batch of broth was used throughout the entire experiment.

RNA extraction and hybridizations

1 ml early log phase culture was stabilized with 2 vol. RNAprotect Bacteria Reagent (Qiagen Cat# 76506) according to the manufacturers protocol. RNA was extracted with the MasterPure™ RNA Purification Kit obtained from Epicenter (Cat # MCR85102) following the manufacturer's protocol. To reduce background on the chip half of the recovered total RNA (12.5 μg) was subjected to a ribosomal RNA removal procedure using the MICROB Express™ Bacterial mRNA Purification Kit (Ambion. Cat# 1905). rRNA-free and rRNA-containing samples were then pooled and total, fragmented RNA was 3' end-labeled according to the Affymetrix protocol. Each sample was hybridized to an Affymetrix E. coli Antisense Array (Cat# 900381) following the manufacturer's protocol. Patterns of hybridization were detected with an Affymetrix scanner. Each genotype was replicated once so that two replicates per genotype could be analyzed.

Data analysis

Selection of genes with differences in expression level

Raw signal intensities were analyzed according to the standard implementation of the Affymetrix Microarray Suite (MAS) 5.0 software, which summarizes expression values based on perfect match and mismatch probes. Signal intensities generated by the Affymetrix software were log2 transformed. All genes not expressed in at least one of the genotypes were removed. Specifically, within one genotype signal intensities of both replicates had to be > 250 or alternatively had to be called "Present" by the Affymetrix software. To identify genes significantly affected by the presence of the F-plasmid in the two different host genotypes we applied an analysis of variance (or "ANOVA" model). The ANOVA model examines the association between nominal predictor variables and a continuous outcome variable (e.g., gene expression). Since in our case we have two nominal predictor variables (e.g., "strain" and "with and without F-plasmid"), we used a two-way ANOVA [12]. The two-way ANOVA was performed on the log2 transformed signal intensity of each gene with bacterial genotype (DH5α or MG1655) and presence of F-plasmid as predictor variables (also called "factors").

We scored a gene as having a main effect or interaction effect if that effect showed statistical support (P < 0.05) and if the magnitude of the effect was at least two-fold. In addition, the interaction effect was scored as present if P < 0.05 and the effects of the plasmid went in opposite directions in the different bacterial genotypes.

Genes were divided into four categories "bacterial genotype effect only", "F-plasmid effect only", "bacterial and F-plasmid additive effects" and "bacterial/F-plasmid interaction effects". In the "bacterial genotype effect only" category, only the main effect of the bacterial genotype was scored as being present (Figure 1a). If only the main effect of the plasmid was present, we labeled this gene as "F-plasmid effect only" (Figure 1b). If both main effects were present, then the genes were further divided by whether an interaction effect was present: "bacterial and F-plasmid additive effects" was the group without interaction effect (Figure 1c) and "bacteria/F-plasmid interaction effects" was chosen in the presence of an interaction effect (Figure 1d, i.–iii.).

The presence of the F-plasmid could have either an additive effect (Figure 1b, c) or a non-additive effect with a genotype by plasmid interaction (Figure 1d, i.–iii.). Significant bacterial genotype by F-plasmid interactions were identified by pairwise HSD (Tukey's Honest Significant Difference) tests. The HSD test conducts multiple comparisons at all possible pairwise comparisons of the 4 genotypes, simultaneously correcting for multiple testing. Comparisons for which the 95% confidence intervals do not overlap zero are here considered significant. Depending on the type of interaction, we either conditioned on genes, which showed a significant difference in signal intensities between DH5α and DH5αF but not between MG1655 and MG1655F (Figure 1d i.) or genes, which are significantly affected by the F-plasmid in both backgrounds but to a different extent (Figure 1d, ii.). The third group of interaction effects consisted of genes, which were affected in the opposite direction (Figure 1d, iii.).

All statistical analyses were performed using Perl scripts (available from the authors upon request) and the statistical language R. Affymetrix MAS5.0 normalized signal intensities were submitted to GEO [13] and are available under the following Series ID: GSE1154.

Functional grouping of candidate genes

Candidate genes were classified into functional groups according to the EcoCyc database (Encyclopedia of Escherichia coli Genes and Metabolism [14]) specifications. Candidate genes located in the same operon (as inferred from RegulonDB [15] were summarized as single "regulatory unit". If genes were members of operons and the genes within one operon had different functions each of them was weighted equally. For example, an operon with one gene functioning in carbon utilization and another one functioning as a chaperone was treated as 0.5 carbon utilization and 0.5 chaperoning. Single genes with multiple functions were disregarded, as their classification would have been ambiguous. Functional groups were only assigned if at least two different candidate genes or regulatory units could be assigned to that group.


  1. Sherley M, Gordon DM, Collignon PJ: Species differences in plasmid carriage in the Enterobacteriaceae. Plasmid. 2003, 49: 79-85. 10.1016/S0147-619X(02)00107-5.

    Article  CAS  PubMed  Google Scholar 

  2. Eberhard WG: Evolution in bacterial plasmids and levels of selection. Q Rev Biol. 1990, 65: 3-22. 10.1086/416582.

    Article  CAS  PubMed  Google Scholar 

  3. Shoemaker NB, Vlamakis H, Hayes K, Salyers AA: Evidence for extensive resistance gene transfer among Bacteroides spp. and among Bacteroides and other genera in the human colon. Appl Environ Microb. 2001, 67: 561-568. 10.1128/AEM.67.2.561-568.2001.

    Article  CAS  Google Scholar 

  4. Mulec J, Starcic M, Zgur-Bertok D: F-like plasmid sequences in enteric bacteria of diverse origin, with implication of horizontal transfer and plasmid host range. Curr Microbiol. 2002, 44: 231-235. 10.1007/s00284-001-0039-7.

    Article  CAS  PubMed  Google Scholar 

  5. Lundquist PD, Levin BR: Transitory derepression and the maintenance of conjugative plasmids. Genetics. 1986, 113: 483-497.

    PubMed Central  CAS  PubMed  Google Scholar 

  6. Levin BR, Stewart FM: The population biology of bacterial plasmids: a priori conditions for the existence of mobilizable nonconjugative factors. Genetics. 1980, 94: 425-443.

    PubMed Central  CAS  PubMed  Google Scholar 

  7. Dahlberg C, Chao L: Amelioration of the cost of conjugative plasmid carriage in Eschericha coli K12. Genetics. 2003, 165: 1641-1649.

    PubMed Central  CAS  PubMed  Google Scholar 

  8. Bouma JE, Lenski RE: Evolution of a bacteria/plasmid association. Nature. 1988, 335: 351-352. 10.1038/335351a0.

    Article  CAS  PubMed  Google Scholar 

  9. Soupene E, van Heeswijk WC, Plumbridge J, Stewart V, Bertenthal D, Lee H, Prasad G, Paliy O, Charernnoppakul P, Kustu S: Physiological studies of Escherichia coli strain MG1655: Growth defects and apparent cross-regulation of gene expression. J Bacteriol. 2003, 185: 5611-5626. 10.1128/JB.185.18.5611-5626.2003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  10. Dobrindt U, Agerer F, Michaelis K, Janka A, Buchrieser C, Samuelson M, Svanborg C, Gottschalk G, Karch H, Hacker J: Analysis of genome plasticity in pathogenic and commensal Escherichia coli isolates by use of DNA arrays. J Bacteriol. 2003, 185: 1831-1840. 10.1128/JB.185.6.1831-1840.2003.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  11. Ochman H, Jones IB: Evolutionary dynamics of full genome content in Escherichia coli. Embo J. 2000, 19: 6637-6643. 10.1093/emboj/19.24.6637.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

  12. Sokal RR, Rohlf FJ: Biometry. 1995, New York, W. H. Freeman and Company, third edition

    Google Scholar 

  13. Geo database.

  14. Karp PD, Arnaud M, Collado-Vides J, Ingraham J, Paulsen IT, Saier MHJ: The E. coli EcoCyc Database: No longer just a metabolic pathway database. ASM News. 2004, 70: 25-30.

    Google Scholar 

  15. Salgado H, Santos-Zavaleta A, Gama-Castro S, Millan-Zarate D, Blattner FR, Collado-Vides J: RegulonDB (version 3.0): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res. 2000, 28: 65-67. 10.1093/nar/28.1.65.

    Article  PubMed Central  CAS  PubMed  Google Scholar 

Download references


We are thankful to X. Li from the Functional Genomics Facility, the University of Chicago for providing expert support. C.-I. Wu and J. Bergelson provided lab space and support. M. Imhof, F. Blattner, Gen-ichi Sampei provided strains. We thank T. Price and H. Tang for statistical advice. Discussions with R. Bürger and C. Vogl about epistasis were essential for the manuscript. The research was supported by FWF grants to C. Schlötterer and an Emmy Noether fellowship by the DFG to B. Harr.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Christian Schlötterer.

Additional information

Competing interests

The author(s) declare that they have no competing interests.

Authors' contributions

BH and CS designed the experiment and wrote the manuscript. B. Harr performed the experiments and data analysis.

Electronic supplementary material


Additional File 1: Bacterial genotype only genes. List of genes that are differentially expressed between DH5α and MG1655. (DOC 230 KB)


Additional File 2: Bacterial/F plasmid interaction genes. List of genes that show an interaction effect between host genotype and F plasmid. (DOC 151 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Harr, B., Schlötterer, C. Gene expression analysis indicates extensive genotype-specific crosstalk between the conjugative F-plasmid and the E. coli chromosome. BMC Microbiol 6, 80 (2006).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: