- Research article
- Open Access
Analysis of Tc1-Mariner elements in Sclerotinia sclerotiorum suggests recent activity and flexible transposases
BMC Microbiologyvolume 14, Article number: 256 (2014)
Sclerotinia sclerotiorum is a necrotrophic fungus that is pathogenic to many plants. Genomic analysis of its revealed transposable element expansion that has strongly influenced the evolutionary trajectory of several species. Transposons from the Tc1-Mariner superfamily are thought to be ubiquitous components of fungal genomes and are generally found in low copy numbers with large numbers of deleterious mutations in their transposase coding sequence.
This study shows that the genome of S. sclerotiorum has a large number of copies of Tc1-Mariner transposons, and in silico analysis shows evidence that they were recently active. This finding was confirmed by expressed sequence tag (EST) analysis. Fourteen new Tc1-Mariner transposon families that were distributed throughout the genome were identified, and in some cases, due to the excision/retention of introns, different transcripts were observed for the same family, which might be the result of an efficient strategy to circumvent mutations that generate premature stop codons in the RNA sequence. In addition, the presence of these introns shows that the transposase protein has a flexible coding sequence and, consequently, conformation. No evidence for RIP-like gene silencing mechanisms, which are commonly found in fungi, was found in the identified Tc1-Mariner elements, and analysis of the genomic insertion sites of these elements showed that they were widely distributed throughout the genome with some copies located near the 3′ regions of genes. In particular, EST analysis demonstrated that one of these copies was co-expressed with a gene, which showed the potential for these elements to undergo exaptation.
Fourteen novel Tc1-Mariner families were characterized. Some families had evidence of introns, which might or might not be excised depending on the family or element in question, and this finding demonstrates a possible strategy for overcoming possible mutations that generate premature stop codons in a RNA sequence. Tc1-Mariner elements likely play an important role in the structure and evolution of the S. sclerotiorum genome.
Transposable elements (TEs) encompass a wide range of DNA sequences that can move to new sites in the genome. For many years following their discovery in the mid 1940s, TEs were thought to be a genetic rarity and later, pejoratively, as genomic parasites. More recently, a significant role for TEs in genomic evolution has been demonstrated . Transposons are important tools for the evolution of several species because they increase genomic plasticity and diversity , modify gene structures , and are important sources for regulatory sequences ,.
Transposable elements can be divided into two classes that differ by the presence or absence of an RNA intermediate. Class I elements replicate by a copy-and-paste mechanism involving RNA intermediates that are subsequently reverse transcribed into double-stranded DNA by enzymes that are coded for by the transposable element (TE) itself. Class II elements, or DNA transposons, are divided into two subclasses. Subclass 1 consists of elements that transpose themselves by excision and integration, which results in both DNA strands being cleaved during the excision process. Transposons from subclass 2, on the other hand, duplicate before insertion. Subclass 1 contains two orders, the most widely known being the TIR (Terminal Inverted Repeated) order. This order contains nine superfamilies: Tc1-Mariner, Mutator, hAT, Merlin, Transib, P, PIF/Harbinger, CACTA and PiggyBac. Subclass 2 contains two orders: Helitron and Maverick. Two groups of non-autonomous TEs that lack one or more genes necessary for transposition also exist: MITEs (Miniature Inverted-repeat Terminal Elements), which are categorized as class 2, SINEs, which are members of the non-LTR (Long Terminal Repeat) retrotransposon group, and TRIMs (Terminal-repeat Retrotransposon In Miniature) and LARDs (Large Retrotransposon Derivates), which are in the LTR retrotransposon group .
Of the subclass 1 superfamilies, Tc1-Mariner is likely the most prevalent in organisms . Elements in this superfamily are generally between 1,300 and 2,400bp in length and have simple structures containing a single ORF that codes for the transposase protein and is flanked by terminal inverted repeats (TIRs) . The transposase has a conserved, three-amino acid sequence containing two aspartic acid (D) residues and one glutamic acid (E) (DDE). In some cases, a third aspartic acid can be observed (DDD). The catalytic DDE/D motif performs the excision and insertion activities, but it must interact with a divalent cation, usually Mg+2, to perform the transposition reaction . The transposase also contains helix-turn-helix (HTH) DNA binding motifs that are responsible for recognizing the TIRs . Due to the increasingly rapid availability of genomic sequences, identification of Tc1-Mariner elements and their potential evolutionary impacts have been shown in pathogenic fungi ,.
The most prominent effect of transposons on the genome is the induction of mutations. Because of their mobility and ability to recombine, TEs can interrupt genes or generate several types of rearrangements such as deletions, duplications and inversions. Thus, cells have evolved mechanisms to silence TEs, e.g., silencing by Repeat Induced Point Mutation (RIP). This mechanism was first described in Neurospora crassa, where the introduction of mutations into the DNA of this species was related with the sexual cycle during meiosis. The RIP complex recognizes duplicated sequences that are larger than 400bp and have identity that is greater than 80% and introduces transitions that convert C:G to T:A in both copies -. RIP appears to be widely distributed in ascomycete fungi .
The mutagenic activity of TEs can affect genomic sequences, therefore, and they could have potentially negative effects on the fitness of the host. However, mutations caused by transposons play important roles in genomic organization and are, thus, beneficial under some conditions ,. Substantial evidence has shown that TEs can act as a dynamic reservoir for novel cellular functions, and many endogenous genes have incorporated coding and regulatory sequences from TEs during evolution . Co-opting TEs to perform cellular functions can be considered an exaptation at the molecular level and has been observed in several species . In fact, TEs represent a natural and abundant source of regulatory sequences for host genes .
Sclerotinia sclerotiorum is a necrotrophic fungus that is pathogenic to a wide range of species (>400 species) and can persist in the environment for many years due to its ability to produce sclerotia. The S. sclerotiorum genome is estimated to contain 38.3Mb, 7% of which is composed of TEs . Analysis of the genetic diversity of TEs in S. sclerotiorum has suggested that a recent genomic remodeling event occurred that involved dramatic TE expansion . Specifically, Tc1-Mariner elements exist at high copy numbers and show low genetic variability, suggesting recent transposition events in the genome, unlike retroelements, which have a high number of degenerate copies and unpaired LTRs and indicates limited expansion . Due to the importance that the Tc1-Mariner element expansion may have on the organization and evolution of the S. sclerotiorum genome, this study sought to identify and characterize elements belonging to the Tc1-Mariner superfamily and to investigate the possible evolutionary impacts of these elements on the genome of this pathogen.
Tc1-Mariner superfamily in S. sclerotiorum
One hundred and fifty-seven different types of TEs from 15 different families were found and 50 of which were potentially active in the sequenced S. sclerotiorum genome (Table1). The Tc1-Mariner elements accounted for 0.8% of this genome. The transposons were between 1.8 and 2.3 Kb in length and had 36 to 70-bp-long TIRs. The ORFs of the potentially active elements coded for transposase sequences that contained between 453 and 574 amino acids (Table1), and the 5′ ends of representative TEs from each family revealed that the first four nucleotides (ACGT) were conserved across all families except the TcMar-Pogo family. The potentially active TEs also had duplicated TA sites and intact DDE, HTH_psq and HTH_Tnp_Tc5 motifs and had UTRs (UnTranslated Region) that were conserved across the same family, but the UTRs varied between 32 and 202 nucleotides in length in the various families (Figure1). Alignment of the elements that were identified in this study to transposon sequences in the RepBase database did not uncover any similarity between the elements in the sequences and those in the Repbase database, indicating that we discovered 14 novel families. Nevertheless, four elements had high identity (99%) with the Flipper element (GenBank accession number U74294) that had been identified in Botrytis cinerea and belonged to the TcMar-Pogo TE family . A search for copies of the Flipper element in the B. cinerea (http://genome.jgi.doe.gov/Botci1/Botci1.home.html) genome identified four copies of this element, three of which were potentially active by in silico analysis. The Flipper element was not identified in any other genomes that had been deposited into the various databases. The main differences in the organization and structure of the elements belonging to the 15 identified families are shown in Table1. Six MITEs ranging from 481 to 813 nucleotides in length were also found. Of these MITEs, one belonged to the Mariner-1_SS family, two belonged to the Mariner-8_SS family, and three belonged to the Mariner-14_SS family.
BLASTN alignment of Tc1-Mariner elements to the S. sclerotiorum transcript database revealed the presence of an intron in the transposase coding regions of six Tc1-Mariner transposon families (Table1 and Figure2). These introns were found in the Mariner-1_SS, Mariner-2_SS, Mariner-3_SS, Mariner-4_SS, Mariner-12_SS and TcMar-Pogo families and are 255, 146, 195, 141, 276 and 124 nucleotides in length, respectively. All of the introns had consensus 5′ GT and 3′ AG dinucleotides (Figure3).
Preferential insertion sites
Analysis of the genomic location of each TE insertion showed that they were distributed throughout the genome. Notably, potentially active transposons were identified approximately 50bp, 135bp and 300bp downstream of the translational stop codon of the serine/threonine kinase, polyprenyl 4-hydroxybenzoate transferase and MFS (Major Facilitator Superfamily) transporters genes, respectively. However, no ESTs containing these endogenous genes with transposon sequences were identified. TE sequences were also detected in the 3′ region of ESTs for a dehydrogenase containing a NADB_Rossman motif, and the sequences of these elements were found in the genomic sequence 87bp from the translational stop codon.
Transcriptional activity and transposase flexibility
Analysis of seven EST libraries showed that 136 of these sequences significantly aligned to TEs that were found in the S. sclerotiorum genome. Sequences for Tc1-Mariner transposons were found under all conditions: 52 were found in the library that was generated from developing apothecium after 55hrs of light exposure, two were from infected Brassica, three were from infected cushion samples, eight were from infected tomato, five were from mycelium that had been exposed to oxidative stress, 58 were from mycelium that had been exposed to pH7, and eight were from developing sclerotia.
Analysis of the S. sclerotiorum transcript database showed that in some families, such as Mariner-1_SS, alternative introns for the transposase were present that might be maintained or excised from the mRNA (Figure3 and Figure2). The same occurred with elements in the Mariner-4_SS and Mariner-12_SS families (Figure2). However, because the intron was located after the DDE motif in the Mariner-1_SS family, both transcripts could be made without directly interfering with the functional domains that were essential for transposition. In the Mariner-4_SS and Mariner-12_SS families, some elements could only make complete transcripts if the intron was removed because its retention would lead to early translational termination of the Tn1-4 (transposon 4 supercontig 1), Tn14-96 and Tn2-22 elements. In Mariner-4_SS, a point mutation in nucleotide 317 of the Tn2-22 transposase element ORF altered a TGG (Trp) codon to a TAG (stop) and, consequently, caused translation to be terminated prematurely. This mutation was located within an alternative intron in the HTH_Tnp_Tc5 motif region of the transposase, thus excision of the intron produced a transposase with only the HTH-psq and DDE motifs. However, in the Mariner-12_SS family, the Tn1-4 and Tn14-96 elements had mutations at nucleotides 1,079 and 1,213 of the transposase ORF, respectively, which fell within the DDE motif. The mutations introduced nucleotide substitutions that altered a TGG (Trp) codon in Tn1-4 and a CAA (Gln) codon in Tn14-96 to the stop codons TAG and TAA, respectively. These mutations were also found in an alternative intron for the transposase, which might or might not be maintained in most copies, but excision of the intron was necessary to produce a transposase with all of its motifs intact in Tn1-4 and Tn14-96 due to the mutations that created translational stop codons. In the TcMar-Pogo family, only transcripts where the intron was removed were found, despite the fact that in silico analysis showed that intron retention would still create a transposase with all of its functional domains intact (Figure2).
The Mariner-2_SS family also had alternative transposase introns, but, in this case, the intron had to be retained in the mature mRNA because its removal created early stop codons in potentially active copies. In contrast, excision of the intron in Tn15-99 created a complete ORF with all of its motifs intact (Figure2). Finally, only one element (Tn14-94) of the Mariner-3_SS family had an intron that would be potentially active if were removed (Figure2). Interestingly, other copies of potentially active elements in this family did not contain an intron, although alignment of the elements intron (Tn14-94) with other transposon sequences showed that this fragment was present in all of the identified TE sequences but had mutations in the start (GT) and end (AG) bases of the intron, which made up important splice donor and acceptor sites, respectively. Phylogenetic analysis of the nucleotide sequences that coded for the transposase in the Mariner-3_SS family had shown that the Tn14-94 element contained the ancestral sequence (Figure4), and this was also inferred for the sequence of the Tn15-99 element in the Mariner-2_SS family (data not shown).
Evidence for RIP and selective pressure in the transposase sequences
Analysis to detect events that were similar to RIP showed that all of the identified families had scores for TpA/ApT of < 0.86 and (CpA + TpG)/(ApC + GpT) of > 1.21, which suggested no evidence for RIP silencing in Tc1-Mariner element sequences (Table2). A low level of nucleotide diversity and a large amount of haplotype diversity was found in all alignments between elements in the same family (Table2). In addition, the Tajimas D neutrality test was performed and found to be insignificant (p > 0.10) for all of the alignments, except for the Mariner-4_SS family, which had Tajimas D test scores of −2.61 and p values of < 0.02 (Table2).
One hundred and fifty-seven Tc1-Mariner elements were identified, and these included 50 potentially active elements. To our knowledge, this is the largest number of potentially active Tc1-Mariner elements that has currently been found in a fungal genome. This value is highly significant when compared to the potentially active Tc1-Mariner elements in other fungi such as Paracoccideoides, Verticillium spp. , Mycosphaerella fijiensis and Lacaria bicolor. In addition, the six MITE elements and other copies that have truncated ORFS but contain preserved TIRs can be mobilized in trans by enzymes coded for by an intact copy ,. DDE, HTH_psq and HTH_Tnp_Tc5 motifs were identified in all of the potentially active copies.
Tc1-Mariner elements may also have three types of functional sequences that are involved in transposition: cleavage sites at the ends of the TIRs that contain 47 nucleotides, UTRs between the TIRs and the ORF that increase transposition efficiency, and DRs (direct repeats) within the TIRS that act as transposase linkage sites . All of the identified elements, except for elements belonging to the TcMar-Pogo family, have cleavage sites at their ends that contain ACGT, as found in elements from the DAHLIAE 1 and 2 families that were identified in Verticillium dahliae. Symmetric and conserved UTR regions were also found in elements from every family; however, DRs in the TIRs were not found. Nevertheless, each end/transposase combination appeared to create subtle versions for mobilization, which guaranteed a certain amount of specificity during transposition .
TEs with high sequence identity between S. sclerotiorum and B. cinerea were found. Elements similar to Flipper, which was first identified in B. cinerea, were also identified in S. sclerotiorum. This result indicates a possible horizontal transfer of the Flipper between S. sclerotiorum and B. cinerea. Both species are notorious plant necrotrophic fungi and share extensive syntenic blocks . Additionally, the Flipper element is widely used in genetic variability studies , and, thus, can be analyzed as a molecular marker in S. sclerotiorum.
Various transposase transcripts of the same family were identified due to intron retention/excision. Introns within class II DNA transposons have been reported in plant pathogens ,, and phylogenetic analysis of the Mariner-2_SS and Mariner-3_SS families demonstrated that elements that required the removal of the intron were ancestral. Therefore, the intron appeared to not be an evolved trait that was important to the element because it allowed the genome some control over transposition due to the dependence of the transposon on the host-splicing mechanism . Conversely, the presence of alternative introns in transposase allows the elements an efficient strategy to overcome possible mutations that generate early stop codons. In addition, the existence of transposase sequences that may or may not maintain the intron in the mature mRNA shows that the transposase for Tc1-Mariner elements has a flexible coding sequence and, consequently, a flexible conformation. This flexibility is likely related to the complex, synaptic organization of transposition (transpososome) -. Consistent with this finding, Nesmelova and Hackett  demonstrated that the catalytic domains of DDE-transposases had few similar sequences and significantly different sizes, and they suggested that transposases must be flexible enough to allow conformational rearrangements of their DNA binding domains and to provide a catalytic site for each transposition step.
Analysis of insertion sites showed that class II TEs inserted themselves near the coding sequences of important proteins such as serine threonine kinase, the MFS multidrug transporter and polyprenyl 4-hydroxybenzoate transferase. Serine/threonine kinase is an essential component of several regulatory pathways in fungi, including the mechanism for creation of turgor pressure in the appresorium and pathogenicity . The multidrug transporter protects the organism from toxic products such as fungicides , and the enzyme polyprenyl 4-hydroxybenzoate transferase is involved in ubiquitin biosynthesis . TEs near these genes or sequences involved in the same biological processes that these proteins are involved in were also identified in M. fijiensis. Because they are inserted downstream of these genes, the transposons can influence their expression. In fact, a TE sequence downstream and physically near the coding sequence of a NADB-Rossmann motif-containing dehydrogenase gene that is involved in a metabolic pathway such as glycolysis  has been shown to be co-expressed with the gene and detected in its EST. Analysis of genes in humans and rats have shown that the 3′ region of genes can be dynamically altered by TEs during evolution, which suggests that TEs can provide alternative polyadenylation sites when inserted downstream of endogenous genes . However, the only analysis of polyadenylation sites in fungi has been performed in Aspergillus oryzae. Therefore, because of the current lack of knowledge about the 3′ gene regulatory regions of fungi, additional studies are necessary to measure the possible involvement of TEs in the evolution of the 3′ ends of genes. Even if these insertions do not have any advantage for the host, they may be fixed in the population by genetic drift because strong evidence supports the idea that transposition is a significant source of exaptation events .
Active TEs in the Tc1-Mariner superfamily have been reported in fungi ,. Here, analyses to detect ORFs and nucleotide diversity have suggested that these elements were recently introduced and are potentially active. However, the presence of transposase sequences in S. sclerotiorum ESTs database only provides the information that TEs are transcribed. So, Western blot analysis for transposases of S. sclerotiorum should be performed in future work to suggest the mobility of Tc1-Mariner elements in this genome. Interestingly, the Mariner-4_SS family includes the largest number of potentially active elements (14). The negative and non-significant Tajimas D test for sequences that code for this familys transposase indicates that selection against genotypes carrying deleterious mutant alleles occurred. However, deviations from the infinite allele model are not only due to natural selection because a population that is growing will also contain an excess of rare alleles.
Despite strong evidence for Tc1-Mariner transposon activity, no evidence for gene-silencing mechanisms similar to RIP was found. Clutterbuck et al.  analyzed seven Tc1-Mariner elements in the S. sclerotiorum genome and did not find strong evidence that these copies were affected by RIP; however, they have suggested that gene silencing might be present because CpA and CpG dinucleotides are more commonly mutated than CpY dinucleotides. Here, two indices for detecting RIP-like mutations, TpA/ApT and (CpA + TpG)/(ApC + GpT), were used and indicated that RIP-like mutations were absent from the analyzed sequences. In addition, sequences from transposase coding regions have low nucleotide diversity, meaning few mutations occur between them, and they have high haplotype diversity, which indicates that these mutations are unique and, according to the Tajimas D test, neutral. Therefore, these results do not provide any evidence for RIP silencing in Tc1-Mariner elements. However, the absence of RIP-like mutations in Tc1-Mariner elements does not indicate the absence of the RIP mechanism in the genome of S. sclerotiorum because differences in the intensity with which RIP acts between the different transposable elements, within the same genome, has been reported for various genomes of fungi as Stagnospora nodorum, Aspergillus niger and Cochliobolus heterostrophus. Therefore, another type of activity control in Tc1-Mariner elements likely exists. Four other types of regulation for Tc1-Mariner elements in the S. sclerotiorum genome can be suggested. First, some elements depend on host regulatory factors for transposition, such as transcription factors, the existence of poly(A) sequences, epigenetic regulation and splice sites . In this case, one type of control could be observed because transposition of the TE copies with intron excision could be regulated by its dependence on the host splicing machinery ,. Second, TEs can be repressed by DNA methylation . Third, transcription of complete elements or MITEs that can form double-stranded RNA (dsRNA) can be controlled due to the presence of TIRs. These dsRNAs would then be processed by the short interfering RNA (siRNA) machinery and could silence copies of transcribed elements . Fourth, because many TE sequences in the S. sclerotiorum genome remain potentially active, monomers of transposase could form inactive or less-active oligomers that decrease transposition activity .
Consistent with the in silico evidence of recent activity, the analysis of seven S. sclerotiorum cDNA libraries showed that Tc1-Mariner element sequences were expressed under various conditions and, thus, were likely active in the genome. This fact suggests important ideas about the evolution of the S. sclerotiorum genome. First, it provides evidence that several elements could be transposing in the genome. Thus, an element could insert itself in a new location and inactivate a gene . In addition, when a Tc1-Mariner element transposes, it generates a double-stranded break in the DNA; thus, homologous recombination events that are catalyzed by the DNA repair system could occur . Second, complete transcription of the Tc1-Mariner elements or MITEs can form hairpins due to the complementarity of the TIRs and form a region of dsRNA that can be processed by the enzymatic machinery to form short siRNAs that can, in turn, silence these elements . Third, miRNA originating from transposons are evolutionarily new regulators that are involved in the regulation of endogenous genes ,,. In conclusion, the activity of these TEs may be allowed over evolutionary time in S. sclerotiorum because it provides the fungus with a large range of genetic variability that allows or has allowed the pathogen to parasitize a wide range of hosts.
Fourteen novel Tc1-Mariner families were characterized. Some families had evidence of introns, which might or might not be excised depending on the family or element in question, and this finding demonstrates a possible strategy for overcoming possible mutations that generate premature stop codons in a RNA sequence. This observance also indicates variation in the sequence and conformation of the transposase, which is likely due to the synaptic complex transposition (transpososome). Apparently, Tc1-Mariner TE activity occurred recently or has been tolerated throughout S. sclerotiorum evolution. The presence of these elements near gene regulatory regions may lead to exaptation of these elements by natural selection or genetic drift, and the activity of these transposons may result in recombination, inactivation or changes in gene expression that could provide an important source of genetic variability that allows the fungus to adapt to various stress conditions or exploit a wider range of hosts.
Identification and analysis of Tc1-Mariner TEs
The S. sclerotiorum genome was downloaded from the Broad Institute (http://www.broadinstitute.org/) database, and TE sequences in the S. sclerotiorum genome were identified and classified using RepeatMasker (A.F.A. Smit, R. Hubley and P. Green RepeatMasker at http://repeatmasker.org). This program identifies copies of TEs by comparing genomic sequences with sequences in a library of known TEs (RepBase 16.12: http://www.girinst.org/repbase/update/index.html) . In this study, a library of fungal TEs was used (fngrep.ref), and the following parameters were used for the search: RM_BLAST was used as the search model, slow search was used to make the search 0-5% more sensitive than the default, fungi was used to specify the species or group of sequences, and alignment was used to generate an output file of the alignments. However, this program only identifies regions in the genome where there is identity with the database sequences, which makes it impossible in many situations to determine the ends of the element. Therefore, TIRs were identified using Repeat Finder , and analysis of the open reading frames (ORFs) in transposase coding regions was performed in Expasy (http://expasy.org/) and Orf-finder (http://www.ncbi.nlm.nih.gov/projects/gorf/). Predicted ORFs were analyzed by BLASTN alignment to a database of S. sclerotiorum (http://www.broadinstitute.org/) transcripts. Putative TEs were then analyzed by BLASTX (http://www.ncbi.nlm.nih.gov/BLAST) alignment to the NCBI (National Center for Biotechnology Information) RefSeq_protein (Reference Sequence Protein) database to determine if DDE and HTH domains were present. The insertion sites or TSRs (Target Site Repeats), of the TEs were characterized by direct searches of the sequences flanking the TEs.
The resulting sequences were classified as complete elements and potentially active elements. Complete elements possess sequences that are similar to the proteins that make up the transposition machinery, such as conserved TIRs and TA target site duplications (TSDs), but lack intact ORFs. Potentially active elements are complete elements with intact motifs and ORFs that are typical for the Tc1-Mariner superfamily.
Families were defined using the classification system proposed by Wicker et al. . In this system, families are groups of TEs that contain more than 80% identity between coding regions, i.e., internal domains, or terminal repeats in at least 80% of the aligned sequences. Here, we used the transposase coding region to define families. To determine the existence of novel TE families, elements from each family were analyzed by BLASTN and a database of fungal TEs (fngrep.ref) in RepBase (http://www.girinst.org/Rpbase-Update.html) . Finally, elements were named using the nomenclature proposed by Kapitonov and Jurka , and representative TE sequences from novel families were submitted to the database at http://www.girinst.org/repbase/update/browse.php with the following identifiers: Mariner-1_SS, Mariner-2_SS, Mariner-3_SS, Mariner-4_SS, Mariner-5_SS, Mariner-6_SS, Mariner-7_SS, Mariner-8_SS, Mariner-9_SS, Mariner-10_SS, Mariner-11_SS, Mariner-12_SS, Mariner-13_SS e Mariner-14_SS.
After searching for intact TEs, approximately 5,000bp upstream and downstream of each TE was analyzed by BLASTX (http://www.ncbi.nlm.nih.gov/BLAST) alignment to the RefSeq_protein (Reference Sequence Protein) and S. sclerotiorum transcripts databases to determine the existence of sequences that coded for proteins near the TEs. The cutoff that was used for protein identification was an E-value of < 10−20 and identity of > 50%.
Evidence for RIP and selective pressure
Dinucleotide frequency analyses and RIP index calculations were performed using genomic DNA sequences from the ORF that coded for the transposase of each family. Sequences were aligned in Mega 4 , and only alignments containing pairs of sequences from the same family with 100% coverage and an identity that was greater than 80% were considered and later submitted to RipCal  to calculate the TpA/ApT and (CpA + TpG)/(ApC + GpT) indices. The TpA/ApT index is a simple index to measure the frequency of RIP products (TpA) and corrects for false positives that arise from ApT-rich regions. High TpA/ApT values indicate a strong RIP response. The (CpA + TpG)/(ApC + GpT) index is similar to the TpA/ApT index, in principle, but it measures the depletion of the RIP targets CpA and TpG. In this index, a low (CpA + TpG)/(ApC + GpT) score strongly suggests RIP. Standard reference values for RIP are TpA/ApT > 0.89 and (CpA + TpG)/(ApC + GpT) < 1.03 .
For the neutrality test, DNA sequences from the ORF that coded for the transposase in each family were used. Sequences were aligned using Mega 4 , and only alignments containing more than four sequences from the same family with 100% coverage and more than 80% identity were included and submitted to DnaSP v.5.10.01  to calculate Tajimas D value  and the statistical significance of the test. DnaSP v.5.10.01 was also used for descriptive analysis of nucleotide and haplotype diversity.
A total of 91,155 ESTs (Expressed Sequence Tag) from seven cDNA libraries (http://www.broadinstitute.org/), which were made from mRNA from developing sclerotia, developing apothecium after 55hrs of light exposure, mycelium at pH7, infected Brassica, infected tomato, samples from the infected cushion and mycelium under oxidative stress, were analyzed to determine if TE sequences were present. ESTs were aligned to Tc1-Mariner TEs that were found in the S. sclerotiorum genome by BLASTN, and ESTs with significant alignments (E-value < 10−5) were compared to the predicted gene transcripts in the S. sclerotiorum database from the NCBI.
Multiple sequence alignments and phylogenetic inferences
Multiple global alignments using nucleotide sequences coding for the transposase were performed using the ClustalW algorithm , and phylogenetic reconstruction of sequences that were aligned to the Mariner-3_SS family was performed using the Neighbor-joining method, which was implemented in the Mega 4 program . Trees were constructed using the Kimura 2-parameter model and Interior Branch Test for phylogenetic inference with bootstrap (5,000 replicates).
Availability of supporting data
The matrices and phylogenetic tree of this article are available in the TreeBase (accession number: 16387). The sequences of transposase are available in the Sclerotinia sclerotiorum database (http://www.broadinstitute.org/annotation/genome/sclerotinia_sclerotiorum/MultiHome.html): Tn1-10 (supercontig 1, 27535792755096), Tn7-53 (supercontig 7, 949055950516), Tn17-106 (supercontig 17, 641286642802), Tn18-111 (supercontig 18, 713987715504), Tn20-123 (supercontig 20, 495847497361), Tn14-94 (supercontig 14, 538355539881), Tn3-31 (supercontig 3, 24263812427898), Tn4-33 (supercontig 4, 818216819733), Tn7-50 (supercontig 7, 106098107615), Tn8-59 (supercontig 8, 382819384336), Tn9-69 (supercontig 9, 877401878918), Tn2-15 (supercontig 2, 11812711182788) and Tn6-45 (supercontig 6, 10006331002148).
This study was conceptualized planned by MFS and MVQ. MFS performed the in silico and preparation of the manuscript. MVQ coordinated and guided the research, assisted with data analysis and interpretation and helped to prepare the manuscript. EFA and ESGM assisted with the manuscript preparation and were co-mentors for MFS. JCFS assisted with data analysis. All authors have read and approved the final manuscript.
Fedoroff NV: Transposable elements, epigenetics, and genome evolution. Science. 2012, 338: 758-767. 10.1126/science.338.6108.758.
Beare PA, Unsworth N, Andoh M, Voth DE, Omsland A, Gilk SD, Williams KP, Sobral BW, Kupko JJ, Porcella SF, Samuel JE, Heinzen RA: Comparative genomics reveal extensive transposon-mediated genomic plasticity and diversity among potential effector proteins within the genus Coxiella. Infect Immun. 2009, 77: 642-656. 10.1128/IAI.01141-08.
Almeida LM, Silva IT, Silva WAS, Castro JP, Riggs PK, Carareto CM, Amaral ME: The contribution of transposable elements to Bos taurus gene structure. Gene. 2007, 390: 180-189. 10.1016/j.gene.2006.10.012.
Shapirova JA: Mobile DNA and evolution in the 21st century. Mob DNA. 2010, 1: 4-10.1186/1759-8753-1-4.
Yang L, Li C, Xia J, Jin Y: Domestication of transposable elements into MicroRNA genes in plants. Plos one. 2011, 6: e19212-10.1371/journal.pone.0019212.
Rebollo R, Romanish MT, Mager DL: Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu Rev Genet. 2012, 46: 21-42. 10.1146/annurev-genet-110711-155621.
Wicker T, Sabot F, Huan-Van A, Bennetzen JL, Capy P, Chalhoub B, Flavell A, Leroy P, Morgante M, Panaud O, Paux E, SanMiguel P, Schulman AH: A unified classification system for eukaryotic transposable elements. Nature. 2007, 8: 973-982.
Kalendar R, Flavell AJ, Ellis TH, Sjakste T, Moisy C, Schulman AH: Analysis of plant diversity with retrotransposon-based molecular markers. Heredity. 2011, 106: 520-530. 10.1038/hdy.2010.93.
Plasterk RHA, Izsvak Z, Ivics Z: Resident aliens - the Tc1/mariner superfamily of transposable elements. Trends Genet. 1999, 15: 326-332. 10.1016/S0168-9525(99)01777-1.
Baker TA, Luo L: Identification of residues in the Mu transposase essential for catalysis. Proc Natl Acad Sci U S A. 1994, 91: 6654-6658. 10.1073/pnas.91.14.6654.
Pietrokovski S, Henikoff S: A helix-turn-helix DNA-binding motif predicted for transposases of DNA transposons. Mol Gen Genet. 1997, 254: 689-695. 10.1007/s004380050467.
Marini MM, Zanforlin T, Santos PC, Barros RRM, Guerra ACP, Puccia R, Felipe MSS, Brigido M, Soares CMA, Ruiz JC, Silveira JF, Cisalpino PS: Identification and characterization of Tc1/mariner like DNA transposons in genome of the Paracoccidioides species complex. BMC Genomics. 2010, 11: 130-10.1186/1471-2164-11-130.
Amyotte SG, Tan X, Pennerman K, Jimenez-Gasco MM, Klosterman SJ, Ma LJ, Dobinson KF, Veronese P: Transposable elements in phytopathogenic Verticillium spp.: insights into genome evolution and inter- and intra-specific diversification. BMC Genomics. 2012, 13: 314-10.1186/1471-2164-13-314.
Selker EU, Cambareri EB, Jensen BC, Haack KR: Rearrangement of duplicated DNA in specialized cells of Neurospora. Cell. 1987, 51: 741-752. 10.1016/0092-8674(87)90097-3.
Cambareri EB, Jensen BC, Schabtach E, Selker EU: Repeat induced G-C to AT mutations in Neurospora. Science. 1998, 244: 1571-1575. 10.1126/science.2544994.
Selker EU: Premeiotic instability of repeated sequences in Neurospora crassa. Annu Rev Genet. 1990, 24: 579-613. 10.1146/annurev.ge.24.120190.003051.
Clutterbuck AJ: Genomic evidence of the repeat-induced point mutation (RIP) in filamentous ascomycetes. Fungal Genet Biol. 2011, 48: 306-326. 10.1016/j.fgb.2010.09.002.
Kempken F, Kck U: Transposons in filamentous fungi facts and perspectives. Biogeosciences. 1998, 20: 652-659.
Daboussi M-J, Capy P: Transposable elements in filamentous fungi. Annu Rev Microbiol. 2003, 57: 275-299. 10.1146/annurev.micro.57.030502.091029.
Volff J-N: Turning junk into gold: domestication of transposable elements and the creation of new genes in eukaryotes. BioEssay. 2006, 28: 913-922. 10.1002/bies.20452.
Feschotte C: Transposable elements and the evolution of regulatory networks. Nat Rev Genet. 2008, 9: 397-405. 10.1038/nrg2337.
Amselem J, Cuomo CA, van Kan JA, Viaud M, Benito EP, Couloux A, Coutinho PM, de Vries RP, Dyer PS, Fillinger S, Fournier E, Gout L, Hahn M, Kohn L, Lapalu N, Plummer KM, Pradier JM, Quvillon E, Sharon A, Simon A, ten Have A, Tudzynski B, Tudzynski P, Wincker P, Andrew M, Anthouard V, Beever RE, Beffa R, Benoit I, Bouzid O: Genomic analysis of the necrotrophic fungal pathogens Sclerotinia sclerotiorum and Botrytis cinerea. Plos Genet. 2011, 7: e1002230-10.1371/journal.pgen.1002230.
Levis C, Fortini D, Brygoo Y:Flipper, a mobile Fot1-like transposable element in Botrytis cinerea. Mol Gen Genet. 1997, 254: 674-680. 10.1007/s004380050465.
Hane JK, Oliver RP: RIPCAl: a tool for alignment-based analyses of repeat-induced point mutations in fungal genomic sequences. BMC Bioinformatics. 2008, 9: 478-10.1186/1471-2105-9-478.
Santana MF, Silva JCF, Batista AD, Ribeiro LE, Silva GF, Arajo EF, Queiroz MV: Abundance, distribution and potential impact of transposable elements in the genome of Mycosphaerella fijiensis. BMC Genomics. 2012, 13: 720-10.1186/1471-2164-13-720.
Labb J, Murat C, Morin E, Tuskan GA, Tacon FL, Martin F: Characterization of transposable elements in the ectomycorrhizal fungus Laccaria bicolor. Plos One. 2012, 7: e40197-10.1371/journal.pone.0040197.
Dufresne M, Hua-Van A, Wahab HA, MBarek SB, Vasnier C, Teysset L, Kema GHJ, Daboussi M-J: Transposition of a fungal miniature inverted-repeat transposable element through the action of a Tc1-like transposase. Genetics. 2007, 175: 441-452. 10.1534/genetics.106.064360.
Benjamim B, Yves B, Corinne A-G: Assembly of the Tc1 and mariner transposition initiation complexes depends on the origins of their transposase DNA binding domains. Genetica. 2007, 130: 105-120. 10.1007/s10709-006-0025-2.
Isenegger DA, Ades PK, Ford R, Taylor PWJ: Status of the Botrytis cinerea species complex and microsatellite analysis of transposon types in south Asia and Australia. Fungal Divers. 2008, 29: 17-26.
Fekete , Fekete E, Irinyi L, Karaffa L, rnyasi M, Asadollahi M, Sndor E: Genetic diversity of a Botrytis cinerea cryptic species complex in Hungary. Microbiol Res. 2012, 167: 283-291. 10.1016/j.micres.2011.10.006.
Ignacchiti MDC, Santana MF, Arajo EF, Queiroz MV: The distribution of a transposase sequence in Moniliophthora perniciosa confirms the occurrence of two genotypes in Bahia, Brazil. Trop Plant Pathol. 2011, 36: 276-286. 10.1590/S1982-56762011000500002.
Pereira JF, Almeida APMM, Cota J, Pamphile JA, Silva GF, Arajo EF, Gramacho KP, Brommonschenkel SH, Pereira GA, Queiroz MV:Boto, a class II transposons in Moniliophthora perniciosa, is the first representative of the PIF/harbinger superfamily in a phytopathogenic fungus. Microbiology. 2013, 159: 112-125. 10.1099/mic.0.062901-0.
Yuan JF, Beniac DR, Chaconas G, Ottensmeyer FP: 3D reconstruction of the Mu transposase and type 1 transpososome: a structural framework for Mu DNA transposition. Genes Dev. 2005, 19: 840-852. 10.1101/gad.1291405.
Richardison JM, Colloms SD, Finnegan DJ, Walkinshaw MD: Molecular architecture of the Mos1 paired-end complex: the structural basis of DNA transpositions in a eukaryote. Cell. 2009, 138: 1096-1108. 10.1016/j.cell.2009.07.012.
Nesmelova IV, Hackett PB: DDE transposase: structural similarity and diversity. Adv Drug Delivery Rev. 2010, 62: 1187-1195. 10.1016/j.addr.2010.06.006.
Liu H-H, Lu J-P, Zhang L, Dong B, Min H, Lin FC: Involvement of a Magnaporthe grisea serine/threonine kinase gene, MgATG1, in appressorium turgor and pathogenesis. Eukaryot Cell. 2007, 6: 997-1005. 10.1128/EC.00011-07.
Waard MA, Andrade AC, Hayashi K, Schoonbeek H-J, Stergiopoulos L, Zwiers LH: Impact of fungal drug transporter on fungicide sensitivity, multidrug resistance and virulence. Pest Manag Sci. 2006, 62: 195-207. 10.1002/ps.1150.
Burn MI, Hermn MD, Alcan FJ, Villalba JM: Stimulation of polyprenyl 4-hydroxybenzoate transferase activity by sodium cholate and 3-[(cholamidopropyl)dimethylammnonio]-1-propanesulfonate. Anal Biochem. 2006, 353: 15-21. 10.1016/j.ab.2006.03.029.
Kavanagh KL, Jornvall H, Persson B, Oppermann U: Medium-and short-chain dehydrogenase/reductase gene and protein families. Cell Mol Life Sci. 2008, 65: 3895-3906. 10.1007/s00018-008-8588-y.
Lee JY, Ji Z, Tian B: Phylogenetic analysis of mRNA polyadenylation sites reveals a role of transposable elements in evolution of the 3′-end of genes. Nucleic Acids Res. 2008, 36: 5581-5590. 10.1093/nar/gkn540.
Tanaka M, Sakai Y, Yamada O, Shintani T, Gomi K:In silico analysis of 3′-end-processing signals in Aspergillus oryzae using expressed sequence tags and genomic sequencing data. DNA Res. 2011, 18: 189-200. 10.1093/dnares/dsr011.
Bouvet GF, Jacobi V, Plourde KV, Bernier L: Stress-induced mobility OPHUIO1 and OPHIO22, DNA transposons of the dutch elm disease fungi. Fungal Genet Biol. 2008, 45: 565-578. 10.1016/j.fgb.2007.12.007.
Ogasawara H, Obata H, Hata Y, Takahashi S, Gomi K:Crawler, a novel Tc1/mariner-type transposable element in Aspergillus oryzae transposes under stress conditions. Fungal Genet Biol. 2009, 46: 441-449. 10.1016/j.fgb.2009.02.007.
Hane JK, Lowe RG, Solomon PS, Tan KC, Schoch CL, Spatafora JW, Crous PW, Kodira C, Birren BW, Galagan JE, Torriani SF, McDonald BA, Oliver RP: Dothideomyceteplant interactions illuminated by genome sequencing and EST analysis of the wheat pathogen Stagonospora nodorum. The Plant Cell. 2007, 19: 3347-3368. 10.1105/tpc.107.052829.
Braumann I, Berg M, Kempken F: Repeat induced point mutation in two asexual fungi, Aspegillusn niger and Penicillium chrysogenum. Curr Genet. 2008, 53: 287-297. 10.1007/s00294-008-0185-y.
Santana MF, Silva JCF, Mizubuti ESG, Arajo EF, Condon BJ, Turgeon BG, Queiroz MV: Characterization and potential evolutionary impact of transposable elements in the genome of Cochliobolus heterostrophus. BMC Genomics. 2014, 15: 536-10.1186/1471-2164-15-536.
Laski FA, Rio DC, Rubin GM: Tissue specificity of Drosophila P element transposition is regulated at the level of mRNA splicing. Cell. 1986, 44: 7-19. 10.1016/0092-8674(86)90480-0.
Muoz-Lpez M, Garca-Prez JL: DNA transposons: nature applications in genomics. Curr Genomics. 2010, 11: 115-128. 10.2174/138920210790886871.
Hollister JD, Gaut BS: Epigenetic silencing of transposable elements: a trade-off between reduced transposition and deleterious effects on neighboring gene expression. Mol Biol Evol. 2009, 19: 1419-1428.
Piriyapongsa J, Jordan IK: Dual coding of siRNAs and miRNAs by plant transposable elements. RNA. 2008, 14: 814-821. 10.1261/rna.916708.
Lohe AR, Hartl DL: Autoregulation of mariner transposase activity by overproduction and dominant-negative complementation. Mol Biol Evol. 1996, 13: 549-555. 10.1093/oxfordjournals.molbev.a025615.
Katiyar-Agarwal S, Jin H: Role of small RNAs in host-microbe interactions. Annu Rev Phytopathol. 2010, 48: 225-246. 10.1146/annurev-phyto-073009-114457.
Jurka J, Kapitonov VV, Pavlicek A, Klonowski P, Kohany O, Walichiewicz J: Repbase update, a database of eukaryotic repetitive elements. Cytogenet Genome Res. 2005, 110: 462-467. 10.1159/000084979.
Altschul SF, Madden TL, Schffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Kapitonov VV, Jurka J: A universal classification of eukaryotic transposable elements implemented in repbase. Nat Rev Genet. 2008, 9: 411-412. 10.1038/nrg2165-c1.
Tamura K, Dudley J, Nei M, Kumar S: MEGA4: molecular evolutionary genetics analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599. 10.1093/molbev/msm092.
Librado P, Rozas J: DnaSP ver. 5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009, 25: 1451-1452. 10.1093/bioinformatics/btp187.
Tajima K: Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989, 123: 585-595.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: Improving the sensitivity of progress multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-5680. 10.1093/nar/22.22.4673.
This work was financially supported by the Brazilian Agency CNPq (Conselho Nacional de Desenvolvimento Cientfico e Tecnolgico).
The authors declare that they have no competing interests.