Divergent methylation of CRISPR repeats and cas genes in a subtype I-D CRISPR-Cas-system

Background The presence and activity of CRISPR-Cas defense systems is a hallmark of many prokaryotic microorganisms. Here, the distribution of sequences related to the highly iterated palindrome 1 (HIP1) element and the DNA methylation of CGATCG motifs embedded within HIP1 as a vital part of the CRISPR1 repeat sequence was analyzed in the cyanobacterium Synechocystis sp. PCC 6803. Previously suggested functions of HIP1 include organization of chromosomal structure, DNA recombination or gene regulation, all of which could be relevant in CRISPR-Cas functionality. Results The CRISPR1 repeat-spacer array contains more than 50 CGATCG elements that are double-methylated (5mCG6mATCG) by the enzymes M.Ssp6803I and M.Ssp6803III. Hence, more than 200 possible methylation events cluster over a stretch of 3600 bp of double-stranded DNA. Bisulfite sequencing showed that these motifs were highly methylated at the m5CGATCG positions whereas specific motifs within the CRISPR1 cas genes were hypomethylated suggesting a lowered accessibility for the DNA methylase to these regions. Assays for conjugation and CRISPR1-mediated DNA interference revealed a 50% drop in conjugation efficiency in the mutant lacking the 5mC methylation of CGATCG motifs, while the highly efficient DNA interference activity was not affected by the lack of m5CGATCG DNA-methylation, nor was the capability to differentiate between self and non-self targets based on the protospacer adjacent motifs (PAMs) GTA and GTC versus the non-PAM AGC. A third DNA methylation mediated by M.Ssp6803II modifies the first cytosine in the motif GGCC yielding GGm4CC. We found a remarkable absence of GGCC motifs and hence the corresponding methylation over an 11 kb stretch encompassing all the cas genes involved in interference and crRNA maturation but not adaptation of the CRISPR1 system. Conclusions The lack of GGCC tetranucleotides along the CRISPR1 interference and maturation genes supports the reported hybrid character of subtype I-D CRISPR-Cas systems. We report tight and very high 5mC methylation of the CRISPR1 repeat sequences. Nevertheless, cells lacking the 5mC methylation activity were unaffected in their CRISPR1-mediated interference response but the efficiency of conjugation was reduced by 50%. These results point to an unknown role of m5CGATCG DNA-methylation marks in conjugation and DNA transformation.


Background
The highly iterated palindrome 1 (HIP1) element 5′-GCGATCGC-3′ is an octameric palindromic repeat that is overrepresented in several cyanobacteria [1,2]. In the chromosome of the cyanobacterial model Synechocystis sp. PCC 6803 (from here: Synechocystis 6803) HIP1 instances occur at the frequency of one copy in every 1131 bp [1,3]. Statistical analyses supported the hypothesis that HIP1 motifs are maintained by selection, suggesting that HIP1 motifs likely perform biological functions [4]. A relation between the presence of HIP1 motifs and DNA recombination and/or repair processes has been suggested [5]. In addition or alternatively, a potential HIP1 function associated with chromosomal structure or maintenance was suggested based on its distribution along the chromosome [4].
At its core, the HIP1 element contains the recognition sequence of Dam DNA methyltransferases. These N6adenine-specific enzymes modify the adenosine residue within the target sequence GATC and are often essential for viability [6]. Methylation at the position G m6 ATC in Synechocystis 6803 is carried out by the DNA methyltransferase M.Ssp6803III encoded by gene slr1803, which was found to be essential for viability of this cyanobacterium [7]. Moreover, the first cytosine within the HIP1 sequence is m5 C-methylated in Synechocystis 6803 by the DNA methyltransferase M.Ssp6803I encoded by slr0214 [7,8]. Hence, in this cyanobacterium, the hexanucleotide 5′-CGATCG-3′ within the HIP1 element can be methylated at four individual positions on the two DNA strands. Similar methylation patterns of HIP1 sequences have been reported for Anabaena sp. PCC 7120 [9]. In addition, the DNA methyltransferase M.Ssp6803II, encoded by sll0729, was recognized to methylate the first cytosine in the frequently occurring motif GGCC at the m4 C position yielding GG m4 CC [7]. GGCC is the most frequent methylation motif in Synechocystis 6803, on average providing one methylation site every 185 bp on the chromosome.
The crRNAs originate from the CRISPR repeatspacer arrays initially in the form of long precursor transcripts. After transcription, the CRISPR repeats are recognized by processing maturases. These frequently belong to the Cas6 class of endoribonucleases [16] whereas in subtype I-C systems the endoribonuclease is Cas5d [17,18]. In case of Synechocystis 6803, crRNA maturation proceeds by the Cas6-1 enzyme for the CRISPR1 system and by Cas6-2a for the CRISPR2 system [14,19,20], while for the CRISPR3 system RNase E was recognized as the major maturation endoribonuclease [15]. During interference, crRNAs guide the proteinaceous CRISPR effector complex to their targets, also known as protospacers, resulting in efficient immunity against potentially harmful invading nucleic acids [21][22][23]. CRISPR1 interference activity was shown to strictly depend on the presence of a DNA sequence element called protospacer adjacent motif (PAM). PAM sequences are functionally critical for CRISPR-based immune systems and are located adjacent to each protospacer consisting of a short signature sequence of 2-5 nt, depending on the CRISPR type and organism. The PAM sequences GTN were found to efficiently mediate CRISPR1 interference in Synechocystis 6803, while there also exist PAMs, e.g. the sequence motif AGC, that do not license interference [24].
While unexpected connections between bacterial natural competence, ubiquitin signaling and DNA modification was reported for type VI-B CRISPR-Cas systems [25], a possible relationship between DNA methylation and CRISPR-Cas-mediated interference responses has not been studied thus far. We noticed the CGATCG sequence to be overrepresented within the CRISPR1 system of Synechocystis 6803, with possibly more than 200 methylation events over a stretch of only 3600 nt. The recent availability of bisulfite sequencing data for this organism [7] enabled the detailed analysis of pSYSA cytosine methylation. Therefore, here we investigated the DNA methylation of the CRISPR1 system within the context of the pSYSA plasmid and whether there was a correlation between DNA methylation and DNA interference in a conjugation-based assay using a DNA methyltransferase mutant.

Strains and culture conditions
Synechocystis sp. 6803 was maintained on BG11 mineral medium [26] at 30°C under constant illumination. The slr0214 deletion mutant (Δslr0214-A1) and the slr0214 insertion mutant (Δslr0214-B1) were described previously [7,8]. Liquid cultures of Synechocystis 6803 wild type and slr0214 mutants were grown photoautotrophically in volumes of 50 mL in Erlenmeyer flasks in BG11 medium, with shaking at 50 μmol photons s − 1 m − 2 at 30°C. E. coli cultures were grown in LB medium at 37°C. Growth was followed by measurements of the optical density at 750 nm (OD 750 ) for Synechocystis 6803 and at 600 nm (OD 600 ) for E. coli.

Synechocystis 6803 conjugation and interference assay
Conjugation between E. coli and Synechocystis 6803 by triparental mating was essentially carried out as described previously [14]. In short, flasks containing LB medium without antibiotics were inoculated with overnight cultures of E. coli J53/RP4 (helper strain) and DH5α with the plasmid of interest (donor strain) to obtain an OD 600 of 0.1 and incubated at 37°C with shaking at 180 rpm for 2.5 h. Plasmid-bearing and helper strains equivalent to an OD 600 of 7.0 were harvested by centrifugation, resuspended in 1 mL LB, combined (1 mL plasmid-bearing and 1 mL helper strain), centrifuged again, resuspended in 100 μL LB and incubated at 30°C without shaking for 1 h. In parallel, cultures of the Synechocystis 6803 wild type and slr0214 deletion strains [7] equivalent to an OD 750 of 0.75 were harvested by centrifugation, resuspended in 800 μL BG11 medium and combined with the E. coli cells. The resulting mix was centrifuged, resuspended in 30 μL BG11 and placed on a sterile filter located on a BG11 agar plate supplemented with 5% LB medium at 30°C overnight (slightly covered with tissue). The next morning the filter was rinsed with 400 μL BG11 medium, 20 μL and 40 μL of the cell suspension were plated, respectively, on BG11 plates containing 5 μg mL − 1 gentamicin.
Interference assays essentially were performed as described [15,24] using tri-parental mating with the selfreplicating conjugative vector pVZ322 [27] containing the gentamicin resistance cassette for selection plus a protospacer sequence to spacer 1 of the CRISPR1 system with either of the PAMs GTA, GTC or the non-PAM AGC as a control. These plasmids were constructed during previous work [24].
The Synechocystis 6803 wild type, the slr0214 deletion strain (Δslr0214-A1) and the slr0214 insertion strain (Δslr0214-B1) [7] were used as recipients. Transconjugant colonies were counted after incubation at 30°C for 2 weeks. Experiments were performed in biological triplicates for the plasmid targets (pT) and in parallel with the control plasmids (pNT) either without protospacer sequence or containing the AGC non-PAM fused to a protospacer.

DNA manipulation and hybridization
For total DNA extraction from the Synechocystis 6803 wild type, Δslr0214-A1 and Δslr0214-B1 strains, cultures of 50 mL were harvested by centrifugation for 5 min. The pellet was resuspended in 1 mL SET buffer (50 mM Tris, pH 7.5; 1 mM EDTA, pH 8; 25% (w/v) sucrose) and shock frozen in liquid nitrogen. For lysis, cells were incubated overnight at 50°C in 100 mM EDTA (pH 8), 2% SDS (w/v) and 100 μg/mL proteinase K. After lysis, DNA was extracted twice by adding 1 vol phenol/ chloroform/isoamylalcohol (25:24:1) and once by adding 1 vol chloroform/isoamylalcohol (24:1). After each addition, the solution was mixed and phase separation was achieved by centrifugation (6 min, 6000 rpm) in a swing-out rotor. The upper aqueous phase was removed and DNA was precipitated by addition of 1 vol isopropanol and incubation at − 20°C overnight. DNA was collected by centrifugation (13,000 rpm) for 30 min at 4°C. The pellet was washed with 70% ethanol, air-dried and resuspended in 50 μL sterile Milli-Q water. DNA concentration was measured in a NanoDrop spectrophotometer (ND-1000, peQLab). The quality and quantity of nucleic acid extraction was verified optically in ethidium bromide stained 0.8% agarose gels.
For restriction analysis, 10 μg each of total DNA from Synechocystis 6803, Δslr0214-A1 and Δslr0214-B1 were digested with PvuI, DpnI or Sau3AI. Restriction endonucleases were used in a 10-fold excess and incubated overnight at 37°C to ensure complete digestion. Heat inactivation was performed at 80°C (PvuI and DpnI) and 65°C (Sau3AI) for 20 min. Digested total DNA was separated overnight on an ethidium bromide stained 1.2% agarose gel under a field strength of 1.8 V cm − 1 at 4°C. The gel was incubated in denaturation solution (1.5 M NaCl, 0.5 M NaOH) at 70 rpm at room temperature for 30 min and subsequently in neutralization solution (1.5 M NaCl, 0.5 M Tris pH 7.5) for the same time. DNA was transferred to Hybond-N+ membranes (Amersham Cat. No. RPN303B) with 10x saline sodium citrate (3 M NaCl, 300 mM sodium citrate pH 7) transfer buffer by capillary blotting overnight. After blotting the DNA was crosslinked to the membrane with 125 mJ using a UV-Stratalinker (Stratagene).
The following synthetic primers were used to generate the DNA probe for Southern hybridization by PCR: SS_ slr7010_fw, 5`-CCAAGAACGTCAGCAAACCCAAAC-3′ and SS_slr7010_rev, 5′-CCATCCCAAATCCCT-GACTGTAAAG-3′. PCR amplification was performed with Q5 high fidelity DNA polymerase (NEB Cat. No. M0491S) (0.02 U μL − 1 ) at 50 μL scale, containing 10 ng of template DNA, 200 μM dNTPs, 0.5 μM of each primer and 10 μL of 5x Q5 reaction buffer. The initial denaturation temperature was 98°C for 30 s. Thirty-two cycles were performed at 98°C for 10 s, 60°C for 30 s and 72°C for 18 s, followed by a 2 min final extension at 72°C. The PCR product was loaded on a Midori green (Nippon Genetics Cat. No. MG04) stained 1.8% agarose gel and the correct band was gel-eluted using the gel extraction kit from Macherey-Nagel (Cat. No. 740609.10).

Results
Distribution and methylation of GGCC motifs on the pSYSA plasmid We used the bisulfite sequencing data that were obtained in the frame of the global characterization of DNA methyltransferases in Synechocystis 6803 [7] to evaluate the degree of GG 4m CC methylation on plasmid pSYSA (Fig. 1). Bisulfite sequencing permits the direct and highly sensitive detection of 5-methylcytosines, but it can also be used to map 4-methylcytosines; although m4 C is partially resistant to bisulfite-mediated deamination and therefore the assay is less sensitive [31]. There are 246 GGCC motifs on plasmid pSYSA per DNA strand, one every 418 nt. The measured GG 4m CC methylation was 42% on average (Fig. 1c). This value matches the reported average sensitivity for the detection of this DNA modification by bisulfite sequencing [31]. The specificity of the detected GG 4m CC methylation was verified by analyzing a sll0729 deletion mutant lacking M.Ssp6803II activity in parallel, where we found zero methylation at these sites (not shown). The GGCC sites are relatively randomly distributed on pSYSA with one important exception, a region entirely free of this motif ranging from position 3537 to position 14,544 (Fig. 1b).
The GGCC-free region starts in the gene ssl7007 encoding the toxin component of a toxin-antitoxin system [32] and finishes within slr7015 (cas4) (Fig. 1d). Hence, this stretch encompasses all the genes belonging to the iCas and pCas functional modules according to the classification suggested by Roger Garrett [33] and separating them from the cas4-1-2 cassette making up the aCas module for adaptation. The GGCC-free sequence also contains slr7008 encoding an IS4 transposase and sll7009 encoding a transcriptional repressor [34] that possesses a WYL domain [35].
High methylation of CRISPR1 repeats and hypomethylation of cas genes at CGATCG sites We detected one CGATCG sequence that is recognized by the DNA methyltransferases M.Ssp6803I and M.Ssp6803III (recognizing the internal GATC [7]) within every single of the CRISPR1 repeats of Synechocystis 6803. All motif instances are preceded by a G residue, hence matching seven of the eight nucleotides of the HIP element (GCGATCGC) (Fig. 2). After transcription, the CRISPR repeats are recognized by processing maturases, which frequently belong to the Cas6 class of endoribonucleases [16]. This maturation step is obligatory for successful interference and often involves recognition of a repeat-internal stem-loop secondary structure [36]. In case of the Synechocystis 6803 CRISPR1 system this step is performed by the Cas6-1 enzyme [14,20]. Therefore, we judged the location of the repeated CGATCG sequences with regard to the repeat secondary structure after transcription. The repeat-internal CGATCG motifs after transcription form 4 of the 5 unpaired loop nucleotides and two nucleotides of the right arm of the RNA stem (Fig. 2). Hence, this motif is part of the sequence that has been found to be functionally relevant for subsequent recognition by the Cas6-1 maturation endonuclease.
The CRISPR1 system of Synechocystis 6803 consists of 50 repeats and 49 spacers in the reference strain [14]. The motif CGATCG contains two methylated bases, the first cytosine methylated by M.Ssp6803I and the adenosine methylated by M.Ssp6803III, together yielding 5m CG 6m ATCG. Because the motif is palindromic, there are four possible methylation events per CGATCG within the DNA double strand. Hence, over a stretch of only 3600 nt, there are 200 possible methylation events for the DNA double strand due to the CRISPR1 repeats alone. Moreover, the CRISPR1 repeat-spacer array contains within its spacer sequences four GGCC and eight GATC sites, which are not part of the CGATCG motif, possibly containing additional methylated bases. This very tight clustering of DNA methyltransferase recognition sites might lead to an undermethylation when the enzyme was locally limiting, but this was not observed. On average, the methylation level of 5m CGATCG Fig. 2 Four DNA methylation events can occur within the 5′-CGATCG − 3′ sequence that is at the core of the octameric HIP1 element. These methylations are carried out by the enzymes M.Ssp6803I (blue) and M.Ssp6803III (green) [7]. A 5′-CGATCG − 3′ recognition sequence is located within every single repeat of the Synechocystis 6803 CRISPR1 system. After transcription, this recognition sequence corresponds to four of the five nucleotides within the crRNA loop and to the two upper nucleotides of the right arm within the repeat stem-loop secondary structure that is essential for recognition by the Cas6-1 endoribonuclease, cleaving at the indicated site [20] measured by bisulfite sequencing for the entire pSYSA forward strand was 90.5 and 88.6% for the reverse strand and it was especially high in the repeat-spacer region (Fig. 3). We used two different strains for this analysis, the wild type and a Δsll0729 mutant that lacks M.Ssp6803II activity but not affecting 5m CGATCG methylation. The measured methylation level of 5m CGATCG in the repeat-spacer region reached 97.6 and 96% for the forward strand and 92.5 and 93% for the reverse strand each in the wild type and Δsll0729 mutant, respectively (n = 50). We conclude that the level of 5m CGATCG methylation in this region was close to saturation on the forward strand and on the reverse strand with the exception of repeats 22 and 44 as well (Fig. 3). The plasmid pSYSA exists in multiple copies, likely similar to the chromosome for which~20 copies per cell were reported during exponential growth phase [37]. Therefore, there was a high chance of freshly replicated plasmid copies explaining the average methylation below 100%.
However, we noticed striking imbalances of 5m CGATCG methylation levels with regard to the CRISPR1 cas genes. The region encompassing all cas genes (from nt position 5000 to 16,100) showed hypomethylation of this motif with only 63.4 and 66.3% methylation for the forward and reverse strand). Methylation was nearly absent at the motif positions 7392/7397 and 7998/8003 (Figs. 3 and 4). These positions reside within the slr7010 gene encoding the Cas3' protein (Fig. 4).

Verification of CGATCG hypomethylation by southern hybridization
The m5 C hypomethylation at the CGATCG sites detected for the CRISPR1 cas genes by bisulfite sequencing was independently tested by Southern hybridization. Total DNA was isolated from the wild type, the slr0214 deletion mutant (Δslr0214-A1) and the slr0214 insertion mutant (Δslr0214-B1). For restriction analysis, PvuI was chosen that cuts CGATCG but is sensitive to the methylated m5 CGATCG site. Hence, PvuI can only cleave when the cytosine in CGATCG is unmethylated, but it is not affected by methylation of the internal adenosine. For analysis, we chose the slr7010 gene encoding the Cas3' protein ( Fig. 1). According to bisulfite analysis, CGATCG at positions 7392/7397 and 7998/8003 were nearly not m5 C methylated, while the interspersed CGATCG at position 7839/7844 was methylated. Treatment by PvuI yielded two fragments, 447 bp and 159 bp in size with DNA from the two slr0214 mutants (Fig. 4), confirming the unmethylated status of all relevant sites. With DNA from the wild-type strain, these products were not obtained but a signal at~600 bp, which corresponds to the 606 bp fragment consisting of both parts. Hence, this result validates the bisulfite analysis with mainly cytosineunmethylated CGATCG sites at positions 7392/7397 and In addition, a band at~3.9 kb was detected, which corresponds to another restriction fragment from position 4053/4058 to 7998/8003 (Fig. 4a). According to bisulfite sequencing, the cytosines at 4053/ 4058 were methylated at 46 ± 12.7% (n = 4). For control, aliquots of the same DNA samples were digested by DpnI and Sau3AI. DpnI cleaves only when its recognition site is Dam-methylated, G m6 ATC, whereas Sau3AI is insensitive to any of these methylations. The resulting restriction fragments were of identical lengths in all three samples, pointing to full G m6 ATC methylation. The smaller fragment was somewhat shorter than the 159 bp fragment generated by PvuI, which is due to an internal GATC site, leading to a 30 bp shorter fragment.
Missing m5 CGATCG methylation affects conjugation rates but not CRISPR1-mediated DNA interference Within this study, we identified specific imbalances in the distribution and methylation level of GGCC and CGATCG sites within the CRISPR1 system of Synechocystis 6803. The tightly clustered appearance of many methylated bases within the repeat-spacer array could affect DNA interference. Therefore, we performed assays for conjugation and interference efficiency in wild type and the two different slr0214 mutant lines that lack the methylation activity of M.Ssp6803I encoded by this gene.
Due to the lack of identified bacteriophages infecting Synechocystis 6803, we used a conjugation-based assay. To trigger interference, we inserted a protospacer into the conjugative vector matching spacer 1 of the CRISPR1 system. In addition, we added different PAM sequences that should facilitate the differentiation between "self" and "non-self" targets [24]. The CRISPR1mediated DNA interference was as efficiently in the two different Δslr0214 mutants as in the wild type control and no difference was observed regarding the different PAMs (Fig. 5a-c). In all combinations, the GTC and GTA PAMs facilitated DNA interference, while the AGC non-PAM did not and served as a control. We conclude that the absence of CRISPR1 m5 C methylation had no effect on the efficiency of DNA interference. However, a difference was noticed in the conjugation efficiencies, which were in all instances below 50% compared to the wild type (Fig. 5d).

Discussion
In this work, we focused on the differential methylation on the plasmid pSYSA, but DNA methyltransferases can target any of their recognition sequences on the chromosome as well as on the other six plasmids. Synechocystis 6803 encodes five DNA methyltransferases. Two of these are encoded on plasmid pSYSX (M.Ssp6803IV and M.Ssp6803V) and three on the chromosome [7]. Among these DNA methyltransferases, M.Ssp6803I executes m5 CGATCG methylation, M.Ssp6803II performs GG 4m CC and M.Ssp6803III the dam-like G m6 ATC methylation [7]. Missing cytosine N4-methylation of GGCC motifs in the Δsll0729 mutant that lacks M.Ssp6803II activity resulted in strong phenotypical alterations, which were associated with the regulation of transcription, DNA replication and A B C D Fig. 5 Assays for conjugation and interference efficiency in wild type (WT) and slr0214 mutant cell lines. The conjugation efficiency was tested for plasmids carrying a protospacer to spacer 1 and a functional protospacer adjacent motif (PAM) of the CRISPR1 system and therefore were potential targets (labelled pT) or that lacked the protospacer or maintained a non-PAM fused to the protospacer and served as non-target controls (labelled pNT). The protospacer in the pT plasmids was linked to a functional GTC or GTA PAM that facilitates recognition of the invader DNA or to an AGC non-PAM that does not license interference in Synechocystis 6803 [24]. a Plates showing interference and growth of colonies following conjugation into wild type cells. b Assay in the Δslr0214-A1 deletion mutant. c Assay in the Δslr0214-B1 insertion mutant. d Conjugation efficiencies (numbers of colonies normalized to the wild type with the pNT plasmid control). The bars represent the mean ± SE of three biological replicates each DNA repair [38]. For the methylation of the HIP1-related motif via M.Ssp6803I, a role in DNA repair processes was previously suggested [5]. With regard to transformation, it was reported that the methylation of plasmid DNA in E. coli expressing M.Ssp6803I encoded by the Synechocystis 6803 gene slr0214 prior to transformation led to an 11-to 161-fold-higher efficiency in the subsequent integrative transformation of Synechocystis 6803 [39]. In contrast, expression of M.Ssp6803II from gene sll0729 methylating GGCC [7] had no measurable impact on transformation efficiencies [39]. Both of these DNA methyltransferases do not belong to a restriction-modification system. Therefore, the molecular basis for these observations has remained elusive.
During the study of methylation frequencies of single CGATCG sites, we made the remarkable observation that only a few sites were almost unmethylated, while the vast majority of sites was methylated almost 100%. One virtually methyl-free CGATCG site is situated on the plasmid pSYSA within the CRISPR-Cas systems encoding genes, which raised our attention and initiated the detailed functional study reported here. CRISPR-Cas systems involve DNA recombination at their very heart. Degradation products of the RecBCD repair complex were found to serve as templates for spacer acquisition as new spacers [40], especially in the naïve spacer acquisition in E. coli [41], with the RecBCD helicase function as the most important activity [42]. However, also vice versa, cas1 gene deletions were found to affect chromosome segregation and to lead to increased sensitivity towards DNA damage [43]. Another, recently discovered, bacterial phage resistance system called bacteriophage exclusion (BREX) [44] distinguishes self from non-self DNA by methylation of a specific DNA site [45]. It should be noticed that the extended GCGATCGG motif present in CRISPR1 repeats constitutes a 1-nt deviation from the cognate HIP1 sequence but left the recognition sequence for the DNA methyltransferase M.Ssp6803III intact. For all these reasons, the connection between DNA methylation and the CRISPR-Cas apparatus is worth investigation.
We noticed a striking absence of GGCC sites among the iCas and pCas modules of the CRISPR1 system. The cas3' , cas10d, cas7, cas5 and cas6-1 genes, together with the genes sll7009 and slr7008 for a WYL-domain regulator and a transposase, respectively, lack any GGCC site, which may indicate that they originate from another organism, possibly be transferred via horizontal gene transfer. Because this system belongs to the subtype I-D, this observation fits to the previously proposed hybrid character of this subtype, in which signature genes for a type I-C system were combined with a distinct type III gene arrangement [46]. Moreover, we show that the CRISPR1 system of Synechocystis 6803 contains M.Ssp6803III recognition sequences within its repeat sequences and that they are highly methylated while the associated cas genes showed hypomethylation at certain but not all m5 CGATCG sites. We verified the lowered methylation within the slr7010 gene encoding Cas3' for the positions 7392/7397 and 7998/8003 by Southern hybridization while the motif at position 7839/7844 was methylated. The reduced or lacking DNA methylation detected in this work might be related to the binding of one or several regulatory factors. We noticed that the undermethylated sites within slr7010 are located next to sll7009 that upon deletion caused an enhanced CRISPR1 expression leading to the classification of Sll7009 as a repressor [34].
We then asked whether the lacking methylation due to mutation of slr0214 would have an impact on the highly efficient DNA interference associated with this system but this was not the case. Moreover, the high transcription of the CRISPR1 repeat-spacer array and its dense and quantitatively high methylation obviously did not interfere with each other. This matches reports for HIP1 to not play a direct role in the regulation of gene expression [4].
However, we detected an approximately 50% reduced conjugation efficiency in the Δslr0214 mutant compared to the wild type. But, DNA methylation is not restricted to plasmid pSYSA containing the CRISPR system, hence, there is no reason to assume that the lack of M.Ssp6803I-mediated methylation on pSYSA was the only causative for the observed reduced conjugation efficiency. The detected correlation between the genome-wide lacking m5 CGATCG DNA methylation and the reduced conjugation efficiency cannot be related to DNA integration and recombination because we used a conjugative plasmid that replicates in the cell autonomously and does not require DNA recombination of integration. Moreover, it cannot be related to DNA restriction because M.Ssp6803I is not part of an RM system [7]. Generally, DNA methylation in bacteria can contribute to the transcriptional regulation of genes involved in diverse processes, ranging from biofilm formation, bacteriophage replication, transposition, the timing of chromosome replication and mismatchrepair to conjugation [47]. Therefore it is highly intriguing that instances were described in which DNA methylation affect conjugative transfer, e.g., of a plasmid in Salmonella enterica [48]. In that system, transcription of traJ is increased while transcription of finP that antagonizes traJ expression, is reduced in a dam mutant. The ratio between the methylation and transcription of these two genes accounts for the level of tra operon expression and the efficiency of conjugation of that plasmid [49,50]. Lacking DNA methylation leads to measurable changes in gene expression in Synechocystis 6803 as well [38]. Therefore, it is not unlikely that a yet unknown regulatory process that is affected via changed gene expression in mutant Δslr0214 possibly is also affecting plasmid uptake or replication resulting in lowered conjugative efficiency.