- Research article
- Open Access
Multilocus sequence analysis of Treponema denticolastrains of diverse origin
BMC Microbiology volume 13, Article number: 24 (2013)
The oral spirochete bacterium Treponema denticola is associated with both the incidence and severity of periodontal disease. Although the biological or phenotypic properties of a significant number of T. denticola isolates have been reported in the literature, their genetic diversity or phylogeny has never been systematically investigated. Here, we describe a multilocus sequence analysis (MLSA) of 20 of the most highly studied reference strains and clinical isolates of T. denticola; which were originally isolated from subgingival plaque samples taken from subjects from China, Japan, the Netherlands, Canada and the USA.
The sequences of the 16S ribosomal RNA gene, and 7 conserved protein-encoding genes (flaA, recA, pyrH, ppnK, dnaN, era and radC) were successfully determined for each strain. Sequence data was analyzed using a variety of bioinformatic and phylogenetic software tools. We found no evidence of positive selection or DNA recombination within the protein-encoding genes, where levels of intraspecific sequence polymorphism varied from 18.8% (flaA) to 8.9% (dnaN). Phylogenetic analysis of the concatenated protein-encoding gene sequence data (ca. 6,513 nucleotides for each strain) using Bayesian and maximum likelihood approaches indicated that the T. denticola strains were monophyletic, and formed 6 well-defined clades. All analyzed T. denticola strains appeared to have a genetic origin distinct from that of ‘Treponema vincentii’ or Treponema pallidum. No specific geographical relationships could be established; but several strains isolated from different continents appear to be closely related at the genetic level.
Our analyses indicate that previous biological and biophysical investigations have predominantly focused on a subset of T. denticola strains with a relatively narrow range of genetic diversity. Our methodology and results establish a genetic framework for the discrimination and phylogenetic analysis of T. denticola isolates, which will greatly assist future biological and epidemiological investigations involving this putative ‘periodontopathogen’.
Periodontal disease is a chronic inflammatory infection that affects the tissues surrounding and supporting teeth [1–3]. It is highly prevalent in adult populations around the world, and is the primary cause of tooth loss after the age of 35 [2–4]. The term ‘periodontal disease’ encompasses a spectrum of related clinical conditions ranging from the relatively mild gingivitis (gum inflammation) to chronic and aggressive forms of periodontitis; where inflammation is accompanied by the progressive destruction of the gingival epithelial and connective tissues, and the resorption of the underlying alveolar bone. It has a highly complex, multispecies microbial etiology; typified by elevated populations of proteolytic and anaerobic bacterial species . Oral spirochete bacteria, all of which belong to the genus Treponema, have long been implicated in the pathogenesis of periodontitis and other periodontal diseases . One species in particular: Treponema denticola has been consistently associated with both the incidence and severity of periodontal disease [6–11].
Over the past few decades, a significant number of T. denticola strains have been isolated from periodontal sites in patients suffering from periodontal disease; predominantly from deep ‘periodontal pockets’ of infection that surround the roots of affected teeth. Clinical isolates of T. denticola have previously been identified and differentiated by a combination of cell morphological features; biochemical activities (e.g. proteolytic substrate preferences), immunogenic properties (e.g. serotyping, or reactivity towards monoclonal or polyclonal antibodies) as well as multilocus enzyme electrophoresis [12–17]. However, these approaches are generally tedious and technically demanding, and often yield inconsistent or ambiguous results.
To date, only two complete genome sequences are available for oral spirochete bacteria; those of T. denticola ATCC 35405 (type strain)  and Treponema vincentii LA-1 (ATCC 35580), which has been sequenced by researchers at the J. Craig Venter Institute as part of the Human Microbiome Project , but is as yet unpublished. The 2.84 Mbp single circular chromosome of T. denticola ATCC 35405 contains ca. 2,770 predicted protein-encoding genes, whilst the 2.51 Mbp T. vincentii genome is predicted to have ca. 2,600 protein encoding genes (NCBI GenBank accession number NZ_ACYH00000000). The syphilis spirochete Treponema pallidum is closely-related to T. denticola at the genetic level, but contains a much smaller ‘host-adapted’ genome ca. 1.14 Mbp in size .
Over recent years, multilocus sequence analysis (MLSA) has proven to be a powerful method for the discrimination, taxonomic classification and phylogenetic analysis of closely related microbial species, subspecies and strains [21–29]. MLSA involves the systematic comparison of the DNA sequences of sets of (conserved) genes, usually 2 to 10 in number, within a given set of strains or species. Commonly, the total gene sequence data for a single isolate is concatenated prior to analysis using a variety of distance-based or criterion-based computational methods. MLSA offers many advantages over ‘single gene’ approaches; most notably its greater sensitivity and resolving power, and its ability to identify or overcome conflicting signals, such as those arising from horizontal gene transfer [22, 23, 29].
Although studies have consistently associated T. denticola with periodontal disease, its precise pathogenic roles remain to be fully established. This issue has been complicated by the use of a variety of different T. denticola strains in previously reported biophysical analyses, cell culture-based investigations or animal infection models. Very little is presently known about how similar or disparate these isolates may be at the genetic level. This prompted us to utilize an MLSA-approach to systematically analyze the genetic composition of 20 of the most commonly used strains of T. denticola; originally isolated from patients with periodontal diseases who were living in Asia, Europe or North America. Our results reveal that there is considerable genetic diversity within this species. Phylogenetic analyses of multi-gene datasets indicate that the T. denticola strains studied share a common genetic origin, which is distinct from that of T. vincentii or T. pallidum and appear to have a clonal structure.
Selection of strains and genetic loci for sequence analysis
All six ATCC reference strains of T. denticola, as well as 14 other clinical isolates were selected for multilocus sequence analysis (see Table 1). These strains were originally isolated from the oral cavities of subjects with various forms of periodontal disease; who resided in China, Japan, the Netherlands, Canada or the USA. We subjectively chose these particular strains based on several main criteria: 1) their diverse geographical origin; 2) their inclusion in one or more previously-published scientific investigations; and 3) their reported differences in phenotypic properties. Using the genome sequence of the type strain (ATCC 35405), seven protein-encoding genes distributed throughout the single, circular chromosome were selected for genetic analysis: flaA, recA, pyrH, ppnK, dnaN, era and radC (see Table 2). This approach enabled us to obtain a representative snapshot of genomic composition within each strain. None of these genes are predicted to reside in regions of suspected prophage origin . Using a PCR-based strategy, the full length gene sequences for all seven genes were determined for each of the 19 other T. denticola strains. Details are shown in Table 3. Only the era gene from the ATCC 700768 strain could not be PCR-amplified using any primer set, and its sequence was determined by direct sequencing of purified chromosomal DNA. The gene sequences corresponding to the major rRNA component of the small ribosomal subunit (rrs, 16S rRNA) were also determined for each strain, to confirm their taxonomic assignment. In T. denticola, 16S rRNA is encoded by two genes (rrsA, rrsB), which have identical sequences and are positioned at distinct chromosomal loci (see Table 2) .
Inter-strain differences in nucleotide composition
We first compared the G + C content of each of the eight genes within the 20 T. denticola strains, to evaluate inter-gene and inter-strain variation. Results are summarized in Table 4. For all gene sequences, average G + C content (%) ranged from 32.4% to 52.4%. The rrsA/B gene had the highest average G + C content (52.4%), whilst the dnaN gene had the lowest (32.4%). The other six genes had similar overall levels of G + C content; ca. 40 − 45%. The G + C levels for individual genes exhibited very little variation between the strains (≤ ± 0.5%). Average overall G + C content for the eight genes in all 20 strains was ca. 42.5% (Additional file 1), which is slightly higher than the overall G + C content for the entire T. denticola ATCC 35405 genome, which is ca. 37.9% .
Multiple sequence alignments were separately constructed for the eight genes, using sequence data from each of the 20 T. denticola strains. The eight respective sets of gene sequences aligned well, and there were only minor inter-strain differences in gene lengths. The number of polymorphic sites differed considerably between the seven protein-encoding genes (see Table 4); being highest in the flaA (18.8%) and pyrH (18.4%) genes, and lowest in the dnaN gene (8.9%). The 16S rRNA (rrsA/B) genes had by far the lowest numbers of polymorphic sites (1.1%), indicating a strong conservation of sequence.
Phylogenetic analyses of T. denticolastrains using individual gene sequence data
Using data obtained from the NCBI GenBank, gene homologues from T. vincentii LA-1 (ATCC 35580) and T. pallidum SS14 were also included in our phylogenetic analyses for comparative purposes (see Additional file 2). Homologues of the flaA, recA, pyrH, ppnK, dnaN, era and radC genes are present in T. vincentii LA-1. The flaA, recA, pyrH, ppnK, dnaN and era genes; but not radC, are present in T. pallidum (e.g. subsp. pallidum SS14 strain ). We first determined the most appropriate nucleotide substitution models to use; for the analysis of the 8 individual gene datasets, as well as the combined multi-gene datasets from each strain (species). Accordingly, the optimal nucleotide-substitution models were identified using the Akaike Information Criterion (AIC), as described by Bos and Posada . The results are summarized in Additional file 3. The optimal nucleotide substitution model for the recA, radC, dnaN and era genes was the GTR + I + G model; whilst the GTR + G model was optimal for the 16S rRNA and flaA genes; and the K80 + I + G and SYM + G models were optimal for the ppnK and pyrH gene datasets, respectively. The respective optimal models were used for the phylogenetic analyses of the eight individual gene datasets, whilst the GTR + I + G model was used for the analysis of the concatenated seven-gene dataset (described below).
Phylogenetic reconstructions based on the eight individual gene sequences (16S rRNA, flaA, recA, pyrH, ppnK, dnaN, era and radC) were performed using both maximum likelihood (ML) and Bayesian (BA) approaches. The eight BA trees constructed are shown in an ultrametric form (i.e. topology only) in Figure 1. The eight corresponding ML trees are shown with branch lengths proportional to genetic distances in Additional file 4. It should be noted that due to the proportionately large genetic distances between the T. denticola, T. vincentii and T. pallidum taxa, the two out-groups are not shown in the ML trees; so that the relationships between the respective T. denticola strains are more easily visualized (see below). Taken together, the 8 respective pairs of phylogenetic trees generated using these two different approaches shared similar overall topologies (i.e. had a similar shape and branching order). The 20 strains were fairly poorly resolved in the phylogenetic trees obtained from the individual 16S rRNA, ppnK, radC and dnaN gene datasets; especially in the ML trees; each forming polytomies (multifurcations) with a lack of statistical support. The BA topologies of the flaA, recA, and pyrH genes were the best resolved; especially on the backbone, indicating that 15 strains formed a well-supported monophyletic clade. However, the strain compositions and inter-strain relationships were not entirely concordant with one another. The MS25 and GM-1 strains formed a strongly supported clade in the flaA, era, dnaN, recA and radC trees generated by both phylogenetic approaches [BA: posterior probability (PP) = 0.99 − 1.00; ML: bootstrap support (BS) = 91 − 100]. The ATCC 35404, NY531, NY535 and NY553 strains clustered together in a strongly-supported clade in the pyrH, dnaN and recA trees constructed using both BA and ML methods.
The range of intraspecific sequence similarity (%) was calculated for each gene, in order to determine how this measure of DNA sequence variation could be used to discriminate the 20 T. denticola strains (Figure 2). The pyrH gene had the highest levels of sequence polymorphism between the strains (83.9 − 100% similarity), closely followed by flaA (84.4 − 100%). The 16S rRNA gene had by far the lowest levels of inter-strain sequence variation (99.3 − 100% similarity). This indicated that the pyrH and rrsA/B gene sequences respectively had the best and worst strain-differentiating abilities. The levels of nucleotide diversity per site (Pi) within each of the eight genes are shown in Table 4. In the protein-encoding genes, Pi values ranged from ca. 0.033 (pyrH, recA) to 0.026 (dnaN).
Detection of recombination using concatenated multi-gene sequence data
Failing to account for DNA homologous recombination (i.e. horizontal genetic exchange) can lead to erroneous phylogenetic reconstruction and also elevate the false-positive error rate in positive selection inference. Therefore, we checked for evidence of recombination within each of the eight individual genetic loci in all 20 strains, by identifying possible DNA ‘breakpoints’ using the HYPHY 2.0 software suite . No evidence of genetic recombination was found within any gene sequences in any strain. This indicated that all the sites in the respective gene sequences shared a common evolutionary history.
Analysis of selection pressure at each genetic locus
Selection pressure was analyzed by determining the ratios of non-synonymous to synonymous mutations (ω = dN/dS) for each codon site within each of the seven protein-encoding genes, in each of the 20 strains. When ω < 1, the codon is under negative selection pressure, i.e. purifying or stabilizing selection, to conserve the amino acid composition of the encoded protein. Table 4 summarizes the global rate ratios (ω = dN/dS) with 95% confidence intervals, as well as the numbers of negatively selected codon sites for each of the genes investigated. It may be seen that global ratios for the seven genes were subject to strong purifying selection (ω < 0.106), indicating that there was a strong selective pressure to conserve the function of the encoded proteins. No positively-selected sites were found in any of the 140 gene sequences.
Phylogenetic analyses of T. denticolastrains using concatenated multi-gene sequence data
The DNA sequences of the seven protein-encoding genes were concatenated in the order: flaA − recA − pyrH − ppnK − dnaN − era − radC, for analysis using BA and ML approaches. The combined data matrix contained 6,513 nucleotides for each strain. The ML tree is shown with branch lengths proportional to genetic distances (Figure 3A), whilst the BA tree is shown in an ultrametric form (Figure 3B). Both the BA and ML trees clearly show that the T. denticola strains share a monophyletic origin. The genetic distances on the ML tree indicate that the T. denticola strains analyzed here are much more closely related to each other, than to T. vincentii or T. pallidum. Six analogous clades (labeled I–VI) comprising 18 strains were identified in both the ML and BA trees. Clade I consists of five strains: NY531, NY553, ATCC 35404, NY535 and OT2B; with moderate to strong statistical support (BA PP = 1.00, ML BS = 88). Clade II has two strains (ATCC 33520 and NY545) and is well-supported (BA PP = 1.00; ML BS = 92). Clade III contains the CD-1 and ATCC 35405 (type) strains, which are both North American in origin, with moderate to strong support (BA PP = 1.00; ML BS = 80). Clade IV contains 3 strains (ATCC 33521, ST10 and OMZ 852) with no statistical support. Clade V comprises four strains: MS25, GM-1, S2 and OKA3. Although this clade has no support, it is apparent that the two USA strains (MS25 and GM-1) form a well-supported clade (BA PP = 1.00, ML BS = 100), whereas the two Japanese strains (S2 and OKA3) form a clade with moderate to strong support (BA PP = 0.98, ML BS = 62). Clade VI comprises two strains from China (ATCC 700771 and OMZ 853), with strong support (BA PP = 0.97, ML BS = 94). The Chinese ATCC 700768 strain is found to be basal to the other 19 strains in the BA tree, and appears to be highly divergent in the ML tree. Since the ML tree is better resolved than the corresponding BA tree, we will primarily refer to the ML tree in the rest of this paper.
The oral spirochete bacterium Treponema denticola is postulated to play an important role in the pathogenesis of periodontal disease; in particular chronic periodontitis, which is estimated to affect ca. 10-15% of the global population [3, 4, 6–9]. It is also implicated in the etiology of acute necrotizing ulcerative gingivitis (ANUG)  and orofacial noma , two other tissue-destructive diseases of the orofacial region. However, T. denticola is commonly detected in the oral microbiota in dentulous adults; albeit at relatively low levels, and its precise etiopathogenic mechanisms remain to be established. The elucidation of more specific disease associations is presently hampered by the lack of a reliable method for strain identification, and a very poor understanding of how strains differ at the genetic level.
Here, we utilized a seven protein-coding gene multilocus sequence analysis (MLSA) approach, to characterize genomic diversity and evolutionary relationships in a small, but carefully-selected collection of T. denticola isolates of diverse geographical origin. Our results revealed that there are relatively high levels of genetic diversity amongst T. denticola strains; with gene sequence similarities ranging between ca. 84 − 100% between the strains. These levels are considerably higher than in T. pallidum; where strains of the pallidum and pertenue subspecies share ca. 100-99.6% genome sequence identity, and genetic differences are largely confined to recombination ‘hotspots’ or other areas of acquired DNA sequence . Whilst there were variations in the relative proportions of polymorphic sites present in the seven protein-encoding genes selected for analysis, all were under a strong (purifying) evolutionary pressure to conserve function. We found no evidence of genetic recombination in any gene sequence analyzed, indicating that genes had evolved as intact units in each strain. It is interesting to note that the flaA gene, which encodes an endoflagellar sheath protein that is a known a cell surface-exposed epitope , appeared to follow a similar evolutionary pathway as the pyrH and recA ‘housekeeping’ genes analyzed. Although we also sequenced the 16S rRNA (rrsA/rrsB) gene(s) from each strain, we did not add this to the concatenated multi-gene sequence for phylogenetic analysis. This was because it is present in two identical copies on the T. denticola genome , and may be under distinct evolutionary pressures, due to the fact that is not translated into a protein; e.g. it may have increased levels of nucleotide insertions or deletions (indels), or may have selection biases relating to its secondary structure .
Based on the concatenated 7-gene (flaA, recA, pyrH, ppnK, dnaN, era and radC) datasets, both the Bayesian (BA) and maximum likelihood (ML) topologies clearly indicated that all 20 T. denticola strains are monophyletic; i.e. they originated from a single common ancestor that was genetically distinct from T. vincentii and T. pallidum (see Figure 3). Our data also indicates that at the genetic level, T. denticola is more closely related to the oral treponeme T. vincentii, than the syphilis spirochete. Six well-defined clades (I-VI) were formed in both the BA and ML trees, which comprised 18 of the 20 strains analyzed. The OTK strain from the USA does not fall within any of the defined clades, possibly due to the relatively low sample size. The early-branching ATCC 700768 strain from China appears to be highly divergent from the other T. denticola taxa (discussed further below). The overall concordances in tree topologies obtained for the 7 individual genes, which are well-distributed around the ca. 2.8 Mbp chromosome, are consistent with T. denticola being predominantly clonal in nature. We did not attempt to estimate evolutionary timescales, as the precise dates of isolation are not known for these strains. Due to the high levels of sequence divergence and putatively clonal strain distributions, we speculate that T. denticola has been co-evolving in humans and animal hosts for a considerable period of time. However, genome sequence data from additional strains of known isolation date will be required to validate this proposition.
It should be noted that the majority of previous biophysical or culture-based investigations involving T. denticola have primarily utilized only three different (ATCC) strains: 35405T (Clade III), 35404 (Clade I) and 33520 (Clade II); which are all of North American origin [30, 31]. Our data suggests that these three strains (lineages) may not be wholly representative of the T. denticola strains distributed within global populations. Whilst our sample size is modest, the scope of our MLSA analysis was limited by the relative paucity of T. denticola strains presently available. Oral treponemes such as T. denticola are fastidious, capricious and notoriously difficult to isolate; and there are very few laboratories in the world that actively maintain strain collections.
The ATCC 700768 (OMZ 830, China), ATCC 700771 (OMZ 834, China), OMZ 853 (China) and OTK (USA) strains, located in basal positions in the phylogenetic trees, appear to be the most genetically distant from the genome-sequenced ATCC 35405 type strain (Canada). This genetic divergence is consistent with literature reports, which have stated that these strains have notable phenotypic differences. For example, the primary sequence, domain structure and immunogenic properties of the major surface protein (Msp) in the OTK strain, were shown to be quite distinct from those of the ATCC 35405 or 33520 strains [14, 45, 46]. In another study, Wyss et al. reported that the FlaA proteins from the ATCC 700768 and ATCC 700771 strains reacted positively towards the ‘pathogen-related oral spirochete’ (PROS) H9-2 antibody (raised against T. pallidum); whilst the ATCC 35405, 35404, 33521, 33520 and ST10 strains were unreactive .
It is highly notable that several sets of T. denticola strains with similar genetic compositions were isolated from subjects living on different continents; i.e. the MS25 (USA), GM-1 (USA), S2 (Japan) and OKA3 (Japan) strains in Clade V; the ATCC 33520 (USA) and NY545 (Netherlands) strains in Clade II; the ATCC 33521 (USA), ST10 (USA) and OMZ 852 (China) strains in Clade IV; and the ATCC 35404 (Canada), OT2B (USA), NY531 (Netherlands), NY535 (Netherlands) and NY553 (Netherlands) strains in Clade I. We tentatively propose that this indicates that there may be a number of T. denticola clonal lineages, or closely-related clusters of strains, which have global distributions. We also identified closely-related strains that had been isolated from different subjects residing in the same geographical location: e.g. the ATCC 700771 and OMZ 853 strains from China (Clade VI).
This study represents the first in-depth multilocus sequencing approach that has been used to analyze strains belonging to a species of oral spirochete bacteria. However, it is important to note that alternative MLSA schemes have previously been used to characterize intra-species variation in other (pathogenic) spirochetes. A 21 gene MLSA approach was notably used to probe the origins, evolutionary history and possible migratory routes of T. pallidum, the causative agent of syphilis . Genetic diversity within Borellia burgdorferi sensu lato, was similarly investigated using a seven gene MLSA system , enabling taxonomic relationships to be defined within this complex group of related (sub)-species. As far as other putative periodontal pathogens are concerned, Koehler and coworkers used a 10 gene MLSA system to investigate genetic relationships between 18 Porphyromonas gingivalis strains isolated from patients with periodontitis in Germany, and one isolate from the USA . This revealed the presence of high levels of horizonal gene transfer, i.e. a panmictic population structure; quite unlike what we have found for T. denticola here. Subsequent studies have revealed that both P. gingivalis and another another ‘periodontopathogen’: Aggregatibacter actinomycetemcommitans both had specific lineages with increased association with periodontal disease; with apparently differing levels of carriage in certain ethnic groups or geographical populations [48–50]. It remains to be established whether T. denticola also possesses lineages with increased association with periodontal disease.
As the seven selected genes appear to be well-conserved in treponeme species, we envisage our MLSA framework as being readily adaptable for strain typing, as well as establishing intra- and inter-species phylogenetic relationships within diverse treponeme populations. For example, one interesting application would be to explore similarities and evolutionary relationships between closely-related strains and species of treponeme bacteria found in the human oral cavity, versus those present in animal reservoirs; especially those associated with polymicrobial tissue-destructive infections [51, 52].
Our sequencing data clearly reveals that clinical isolates of the periodontal pathogen T. denticola have highly diverse genotypes. We define 6 distinct clonal lineages present within strains isolated from subjects living in Asia, Europe and North America. Several T. denticola lineages are present on different continents, which is consistent with the existence of strains with widespread, possibly global, distributions. Our results lay the foundations for future systematic molecular investigations aimed at establishing the ecological distributions, disease associations or phylogeny of treponemes belonging to this and other species.
Strain culture; gene amplification, cloning and sequencing
Treponema denticola strains were purchased from the American Type Culture Collection (ATCC) or generously provided by Dr. Barry McBride (University of British Columbia, Canada), Dr. Chris Wyss (University of Zurich, Switzerland) and Dr. E. Peter Greenberg (Washington University, USA). All strains were cultured anaerobically in TYGVS media supplemented with 10% rabbit serum as previously described . Genomic DNA was purified from 3-5 day old cultures using a Wizard Genomic DNA Purification Kit (Promega), using the manufacturer’s gram-negative protocol. PCR primers targeting the dnaN (TDE0231); recA (TDE0872); radC (TDE0973); ppnK (TDE1591); flaA (TDE1712); era (TDE1895) and pyrH (TDE2085) genes were designed using Omiga 2.0 (Oxford Molecular), based on the genome-sequenced ATCC 35405 strain , and are listed in Table 3. The rrsA/B genes were amplified using the TPU1 (5′-AGAGTTTGATCMTGGCTCAG-3′)  and C90 (5′-GTTACGACTTCACCCTCCT-3′) primers . PCR reactions were performed using a ‘touchdown’ method on a GeneAmp PCR System 9700 (Applied Biosystems). PCR reactions (50 μl) contained 10 μl of PyroBest Buffer II, 2 μl of genomic DNA (ca. 50 ng), 4 μl of dNTPs (2.5 mM each), 2 μl of each forward and reverse primer (10 μM each), and 0.25 μl of PyroBest DNA polymerase (1.25 U, TaKaRa). PCR cycling conditions consist of an initial denaturation (94°C, 90s); followed by 4-6 cycles of: denaturation (94°C, 20s), annealing (temperature as indicated in Table 3, 20s) decreasing 1°C every cycle, extension (72°C, 3 min); followed 26 cycles of denaturation (94°C, 15s), annealing (temperature as indicated, 15s), extension (72°C, 2 min); final extension (72°C, 7 min). PCR products were analyzed using 1% agarose gel electrophoresis and stained with ethidium bromide. PCR products were gel-purified using a QIAquick Gel Extraction Kit (Qiagen), and cloned into pCR2.1-TOPO vector using a TOPO TA Cloning Kit (Invitrogen) according to the manufacturer’s instructions. Ligation mixtures were electroporated into Escherichia coli DH10B cells, plated on Luria-Bertani (LB) 1% agar plates supplemented with kanamycin (50 μg/ml) and X-gal (5-bromo-4-chloro-indolyl-β-D-galactopyranoside, 20 μg/ml), and incubated overnight at 37°C. Plasmid DNA was purified from 4 or 5 colonies from each plate using the QIAprep Spin Miniprep Kit (Qiagen). At least three colonies containing PCR inserts were commercially sequenced in both directions (M13 forward and reverse primers) using an Applied Biosystems 3730xl DNA Analyzer. Direct sequencing of genomic DNA (Invitrogen, Hong Kong) using outward-facing internal primers was used as indicated to obtain genomic sequence data when PCR amplification proved unsuccessful.
Analysis of gene sequence similarity and phylogeny
Sequence data were edited and assembled in Omiga 2.0 and EMBOSS GUI (European Molecular Biology Open Software Suite  and gene alignments were manually checked and optimized using BioEdit v.7.0.9  and MEGA 4 . GC content and the location of polymorphic sites were analyzed using Omiga 2.0 and FaBOX  (http://www.birc.au.dk/software/fabox). All seven genes (flaA, recA, pyrH, ppnK, dnaN, era, and radC) were concatenated using Se-Al ver.2.0a11 , giving a final alignment of 6,780 nucleotides (including gaps). The range of intraspecific sequence similarity (%) for each gene was calculated using the sequence identity matrix program implemented in BioEdit. Nucleotide polymorphism in each gene was evaluated by quantifying the nucleotide diversity per site (Pi) using DNA Sequence Polymorphism software (DnaSP 5.10) .
Maximum Likelihood (ML) and Bayesian methods were used to analyze both individual genes, and concatenated gene sequence datasets. The optimal substitution model and gamma rate heterogeneity for individual genes and combined dataset were determined using the Akaike Information Criterion (AIC) in MrModeltest ver. 2.2 . Maximum likelihood (ML) trees were generated using GARLI ver. 0.96  with support calculated from 100 bootstrap replicates. Bootstrap support (BS) values ≥ 70% were considered to have strong support.
Partitioned Bayesian analyses (BA) were conducted using MrBayes v.3.1.2 , with two independent runs of Metropolis-coupled Markov chain Monte Carlo (MCMCMC) analyses, each with 4 chains and 1 million generations, with trees sampled every 100 generations. The level of convergence was assessed by checking the average standard deviation of split frequencies (<0.005). Convergence of the runs was also checked visually in Tracer ver. 1.5 , ensuring the effective sample sizes (ESS) were all above 200. Bayesian posterior probabilities (PP) were calculated by generating a 50% majority-rule consensus tree from the remaining sampled trees after discarding the burn-in (10%). PP values ≥ 0.95 indicate statistical support.
Detection of recombination and natural selection
A codon-based approach implemented in HYPHY 2.0  was used to analyze selection pressures within the seven individual protein-encoding genes, using a neighbor-joining model. Genetic algorithm recombination detection (GARD) was first used to identify any possible recombination breakpoints within each gene. Single likelihood ancestor counting (SLAC) was employed to calculate the global nonsynonymous (dN) and synonymous (dS) nucleotide substitution rate ratios (ω = dN/dS), with 95% confidence intervals; and to test the selection of variable codon sites based on the most appropriate nucleotide substitution model and tree topology, with a critical p-value of 0.05.
Nucleotide sequence accession numbers
Nucleotide sequences were submitted to GenBank under the following accession numbers: JF700256–JF700268 and KC415232−KC415235, [16S rRNA (rrsA/B)]; JF700269–JF700283 and KC415220−KC415223 (flaA); JF700284–JF700298 and KC415208−KC415211 (recA); JF700299–JF700313 and KC415216−KC415219 (pyrH); JF700314–JF700328 and KC415204−KC415207 (ppnK); JF700329–JF700343 and KC415228−KC415231 (dnaN); JF700344–JF700358 and KC415224−KC415227 (era); and JF700359–JF700373 and KC415212−KC415215 (radC). The Treponema vincentii LA-1 (ATCC 35580) and Treponema pallidum subsp. pallidum SS14 reference strains were selected as outgroups, using complete genomes obtained from GenBank under Accession numbers NZ_ACYH00000000 and NC_010741, respectively.
Multilocus sequence analysis
Akaike Information Criterion
Resampling of estimated log likelihoods
Metropolis-coupled Markov chain Monte Carlo
General Time Reversible model
Hasegawa, Kishino and Yano model
Gamma distribution of changes
Ribosomal ribonucleic acid
American type culture collection
Tryptone-yeast extract-gelatin-volatile fatty acids-serum (medium).
Darveau RP: Periodontitis: a polymicrobial disruption of host homeostasis. Nat Rev Microbiol. 2010, 8 (7): 481-490. 10.1038/nrmicro2337.
Loesche WJ, Grossman NS: Periodontal disease as a specific, albeit chronic, infection: diagnosis and treatment. Clin Microbiol Rev. 2001, 14 (4): 727-752. 10.1128/CMR.14.4.727-752.2001.
Pihlstrom BL, Michalowicz BS, Johnson NW: Periodontal diseases. Lancet. 2005, 366 (9499): 1809-1820. 10.1016/S0140-6736(05)67728-8.
Petersen PE, Ogawa H: Strengthening the prevention of periodontal disease: the WHO approach. J Periodontol. 2005, 76 (12): 2187-2193. 10.1902/jop.2005.76.12.2187.
Socransky SS, Haffajee AD: Periodontal microbial ecology. Periodontol 2000. 2005, 38: 135-187. 10.1111/j.1600-0757.2005.00107.x.
Ellen RP, Galimanas VB: Spirochetes at the forefront of periodontal infections. Periodontol 2000. 2005, 38: 13-32. 10.1111/j.1600-0757.2005.00108.x.
Sela MN: Role of Treponema denticola in periodontal diseases. Crit Rev Oral Biol Med. 2001, 12 (5): 399-413. 10.1177/10454411010120050301.
Dashper SG, Seers CA, Tan KH, Reynolds EC: Virulence factors of the oral spirochete Treponema denticola. J Dent Res. 2011, 90 (6): 691-703. 10.1177/0022034510385242.
Ishihara K: Virulence factors of Treponema denticola. Periodontol 2000. 2010, 54 (1): 117-135. 10.1111/j.1600-0757.2009.00345.x.
Simonson LG, Goodman CH, Bial JJ, Morton HE: Quantitative relationship of Treponema denticola to severity of periodontal disease. Infect Immun. 1988, 56 (4): 726-728.
Holt SC, Ebersole JL: Porphyromonas gingivalis, Treponema denticola, and Tannerella forsythia: the “red complex”, a prototype polybacterial pathogenic consortium in periodontitis. Periodontol 2000. 2005, 38: 72-122. 10.1111/j.1600-0757.2005.00113.x.
Chan EC, Siboo R, Keng T, Psarra N, Hurley R, Cheng SL, Iugovaz I: Treponema denticola (ex Brumpt 1925) sp. nov., nom. rev., and identification of new spirochete isolates from periodontal pockets. Int J Syst Bacteriol. 1993, 43 (2): 196-203. 10.1099/00207713-43-2-196.
Simonson LG, Rouse RF, Bockowski SW: Monoclonal antibodies that recognize a specific surface antigen of Treponema denticola. Infect Immun. 1988, 56 (1): 60-63.
Capone R, Wang HT, Ning Y, Sweier DG, Lopatin DE, Fenno JC: Human serum antibodies recognize Treponema denticola Msp and PrtP protease complex proteins. Oral Microbiol Immunol. 2008, 23 (2): 165-169. 10.1111/j.1399-302X.2007.00404.x.
Wyss C, Moter A, Choi BK, Dewhirst FE, Xue Y, Schupbach P, Gobel UB, Paster BJ, Guggenheim B: Treponema putidum sp. nov., a medium-sized proteolytic spirochaete isolated from lesions of human periodontitis and acute necrotizing ulcerative gingivitis. Int J Syst Evol Microbiol. 2004, 54 (Pt 4): 1117-1122.
Heuner K, Bergmann I, Heckenbach K, Gobel UB: Proteolytic activity among various oral Treponema species and cloning of a prtP-like gene of Treponema socranskii subsp. socranskii. FEMS Microbiol Lett. 2001, 201 (2): 169-176.
Dahle UR, Olsen I, Tronstad L, Caugant DA: Population genetic analysis of oral treponemes by multilocus enzyme electrophoresis. Oral Microbiol Immunol. 1995, 10 (5): 265-270. 10.1111/j.1399-302X.1995.tb00152.x.
Seshadri R, Myers GS, Tettelin H, Eisen JA, Heidelberg JF, Dodson RJ, Davidsen TM, DeBoy RT, Fouts DE, Haft DH, et al: Comparison of the genome of the oral pathogen Treponema denticola with other spirochete genomes. Proc Natl Acad Sci USA. 2004, 101 (15): 5646-5651. 10.1073/pnas.0307639101.
NIH Human Microbiome Project: [http://hmpdacc.org/]
Smajs D, Norris SJ, Weinstock GM: Genetic diversity in Treponema pallidum: implications for pathogenesis, evolution and molecular diagnostics of syphilis and yaws. Infect Genet Evol. 2012, 12 (2): 191-202. 10.1016/j.meegid.2011.12.001.
Gevers D, Cohan FM, Lawrence JG, Spratt BG, Coenye T, Feil EJ, Stackebrandt E, Van de Peer Y, Vandamme P, Thompson FL, et al: Opinion: Re-evaluating prokaryotic species. Nat Rev Microbiol. 2005, 3 (9): 733-739. 10.1038/nrmicro1236.
Hanage WP, Fraser C, Spratt BG: Fuzzy species among recombinogenic bacteria. BMC Biol. 2005, 3: 6-10.1186/1741-7007-3-6.
Hanage WP, Fraser C, Spratt BG: Sequences, sequence clusters and bacterial species. Philos Trans R Soc Lond B Biol Sci. 2006, 361 (1475): 1917-1927. 10.1098/rstb.2006.1917.
Santos SR, Ochman H: Identification and phylogenetic sorting of bacterial lineages with universally conserved genes and proteins. Environ Microbiol. 2004, 6 (7): 754-759. 10.1111/j.1462-2920.2004.00617.x.
Naser SM, Thompson FL, Hoste B, Gevers D, Dawyndt P, Vancanneyt M, Swings J: Application of multilocus sequence analysis (MLSA) for rapid identification of Enterococcus species based on rpoA and pheS genes. Microbiology. 2005, 151 (Pt 7): 2141-2150.
Thompson FL, Gevers D, Thompson CC, Dawyndt P, Naser S, Hoste B, Munn CB, Swings J: Phylogeny and molecular identification of vibrios on the basis of multilocus sequence analysis. Appl Environ Microbiol. 2005, 71 (9): 5107-5115. 10.1128/AEM.71.9.5107-5115.2005.
Richter D, Postic D, Sertour N, Livey I, Matuschka FR, Baranton G: Delineation of Borrelia burgdorferi sensu lato species by multilocus sequence analysis and confirmation of the delineation of Borrelia spielmanii sp. nov. Int J Syst Evol Microbiol. 2006, 56 (Pt 4): 873-881.
Harper KN, Ocampo PS, Steiner BM, George RW, Silverman MS, Bolotin S, Pillay A, Saunders NJ, Armelagos GJ: On the origin of the treponematoses: a phylogenetic approach. PLoS Negl Trop Dis. 2008, 2 (1): e148-10.1371/journal.pntd.0000148.
Vinuesa P: Multilocus Sequence Analysis and Bacterial Species Phylogeny Estimation, Chapter 3. Molecular Phylogeny of Microorganisms. Edited by: Oren A, Papke RT. 2010, Norfolk, UK: Caister Academic Press, 41-64.
Cheng SL, Siboo R, Quee TC, Johnson JL, Mayberry WR, Chan EC: Comparative study of six random oral spirochete isolates. Serological heterogeneity of Treponema denticola. J Periodontal Res. 1985, 20 (6): 602-612.
Jacob E, Allen AL, Nauman RK: Detection of oral anaerobic spirochetes in dental plaque by the indirect fluorescent-antibody technique. J Clin Microbiol. 1979, 10 (6): 934-936.
Weinberg A, Holt SC: Interaction of Treponema denticola TD-4, GM-1, and MS25 with human gingival fibroblasts. Infect Immun. 1990, 58 (6): 1720-1729.
Socransky SS, Listgarten M, Hubersak C, Cotmore J, Clark A: Morphological and biochemical differentiation of three types of small oral spirochetes. J Bacteriol. 1969, 98 (3): 878-882.
Hespell RB, Canale-Parola E: Amino acid and glucose fermentation by Treponema denticola. Arch Mikrobiol. 1971, 78 (3): 234-251. 10.1007/BF00424897.
Mikx FH: Comparison of peptidase, glycosidase and esterase activities of oral and non-oral Treponema species. J Gen Microbiol. 1991, 137 (1): 63-68.
Ter Steeg PF, Van Der Hoeven JS: Development of Periodontal Microflora on Human Serum. Microb Ecol Health Dis. 1989, 2 (1): 1-10. 10.3109/08910608909140195.
Ter Steeg PF, Van Der Hoeven JS, De Jong MH, Van Munster PJJ, Jansen MJH: Modelling the Gingival Pocket by Enrichment of Subgingival Microflora in Human Serum in Chemostats. Microb Ecol Health Dis. 1988, 1 (2): 73-84. 10.3109/08910608809140185.
Miyamoto M, Noji S, Kokeguchi S, Kato K, Kurihara H, Murayama Y, Taniguchi S: Molecular cloning and sequence analysis of antigen gene tdpA of Treponema denticola. Infect Immun. 1991, 59 (6): 1941-1947.
Matejkova P, Strouhal M, Smajs D, Norris SJ, Palzkill T, Petrosino JF, Sodergren E, Norton JE, Singh J, Richmond TA, et al: Complete genome sequence of Treponema pallidum ssp. pallidum strain SS14 determined with oligonucleotide arrays. BMC Microbiol. 2008, 8: 76-10.1186/1471-2180-8-76.
Bos DH, Posada D: Using models of nucleotide evolution to build phylogenetic trees. Dev Comp Immunol. 2005, 29 (3): 211-227. 10.1016/j.dci.2004.07.007.
Pond SLK, Frost SDW, Muse SV: HyPhy: hypothesis testing using phylogenies. Bioinformatics. 2005, 21 (5): 676-679. 10.1093/bioinformatics/bti079.
Gmur R, Wyss C, Xue Y, Thurnheer T, Guggenheim B: Gingival crevice microbiota from Chinese patients with gingivitis or necrotizing ulcerative gingivitis. Eur J Oral Sci. 2004, 112 (1): 33-41. 10.1111/j.0909-8836.2004.00103.x.
Paster BJ, Falkler JWA, Enwonwu CO, Idigbe EO, Savage KO, Levanos VA, Tamer MA, Ericson RL, Lau CN, Dewhirst FE: Prevalent bacterial species and novel phylotypes in advanced noma lesions. J Clin Microbiol. 2002, 40 (6): 2187-2191. 10.1128/JCM.40.6.2187-2191.2002.
Wyss C: Flagellins, but not endoflagellar sheath proteins, of Treponema pallidum and of pathogen-related oral spirochetes are glycosylated. Infect Immun. 1998, 66 (12): 5751-5754.
Fenno JC, Wong GW, Hannam PM, Muller KH, Leung WK, McBride BC: Conservation of msp, the gene encoding the major outer membrane protein of oral Treponema spp. J Bacteriol. 1997, 179 (4): 1082-1089.
Edwards AM, Jenkinson HF, Woodward MJ, Dymock D: Binding properties and adhesion-mediating regions of the major sheath protein of Treponema denticola ATCC 35405. Infect Immun. 2005, 73 (5): 2891-2898. 10.1128/IAI.73.5.2891-2898.2005.
Koehler A, Karch H, Beikler T, Flemmig TF, Suerbaum S, Schmidt H: Multilocus sequence analysis of Porphyromonas gingivalis indicates frequent recombination. Microbiology. 2003, 149 (Pt 9): 2407-2415.
Rylev M, Kilian M: Prevalence and distribution of principal periodontal pathogens worldwide. J Clin Periodontol. 2008, 35 (8 Suppl): 346-361.
Enersen M, Olsen I, Kvalheim O, Caugant DA: fimA genotypes and multilocus sequence types of Porphyromonas gingivalis from patients with periodontitis. J Clin Microbiol. 2008, 46 (1): 31-42. 10.1128/JCM.00986-07.
Enersen M, Olsen I, van Winkelhoff AJ, Caugant DA: Multilocus sequence typing of Porphyromonas gingivalis strains from different geographic origins. J Clin Microbiol. 2006, 44 (1): 35-41. 10.1128/JCM.44.1.35-41.2006.
Evans NJ, Brown JM, Demirkan I, Murray RD, Birtles RJ, Hart CA, Carter SD: Treponema pedis sp. nov., a spirochaete isolated from bovine digital dermatitis lesions. Int J Syst Evol Microbiol. 2009, 59 (Pt 5): 987-991.
Evans NJ, Brown JM, Murray RD, Getty B, Birtles RJ, Hart CA, Carter SD: Characterization of novel bovine gastrointestinal tract Treponema isolates and comparison with bovine digital dermatitis treponemes. Appl Environ Microbiol. 2011, 77 (1): 138-147. 10.1128/AEM.00993-10.
Fenno JC: Laboratory maintenance of Treponema denticola. Current Protocols in Microbiology. 2005, 12B.11.11-12B.11.21.
Choi BK, Paster BJ, Dewhirst FE, Gobel UB: Diversity of cultivable and uncultivable oral spirochetes from a patient with severe destructive periodontitis. Infect Immun. 1994, 62 (5): 1889-1895.
Dewhirst FE, Tamer MA, Ericson RE, Lau CN, Levanos VA, Boches SK, Galvin JL, Paster BJ: The diversity of periodontal spirochetes by 16S rRNA analysis. Oral Microbiol Immunol. 2000, 15 (3): 196-202. 10.1034/j.1399-302x.2000.150308.x.
Rice P, Longden I, Bleasby A: EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 2000, 16 (6): 276-277. 10.1016/S0168-9525(00)02024-2.
Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41: 95-98.
Kumar S, Nei M, Dudley J, Tamura K: MEGA: a biologist-centric software for evolutionary analysis of DNA and protein sequences. Brief Bioinform. 2008, 9 (4): 299-306. 10.1093/bib/bbn017.
Villesen P: FaBox: an online toolbox for FASTA sequences. Molecular Ecology Notes. 2007, 7 (6): 965-968. 10.1111/j.1471-8286.2007.01821.x.
Rambaut: Sequence Alignment Editor ver. 2.0. 1996, University of Oxford: Department of Zoology, [http://tree.bio.ed.ac.uk/software/seal/]
Librado P, Rozas J: DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009. 2009, 25 (11): 1451-1452.
Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14 (9): 817-818. 10.1093/bioinformatics/14.9.817.
Zwickl DJ: PhD thesis. Genetic algorithm approaches for the phylogenetic analysis of large biological sequence datasets under the maximum likelihood criterion. 2006, The University of Texas at Austin
Ronquist F, Huelsenbeck JP: MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics. 2003, 19 (12): 1572-1574. 10.1093/bioinformatics/btg180.
Rambaut A: Molecular evolution, phylogenetics and epidemiology: Tracer. 2009, [http://tree.bio.ed.ac.uk/software/tracer/]
We are grateful to Dr. Chris Wyss, Dr. Barry McBride and Dr. E. Peter Greenberg for providing us with reference strains and clinical isolates. RMW acknowledges financial support from the University of Hong Kong through the Infection and Immunology Strategic Research Theme and a Seed Funding grant (#200911159092); and the Research Grants Council of Hong Kong, via a General Research Fund (GRF) grant (#781911). YCFS and GJDS are supported by the Duke–NUS Signature Research Program funded by the Agency for Science, Technology and Research, and the Ministry of Health, Singapore.
The authors declare no competing interests; financial or otherwise.
Conceived the study: RMW. Designed and performed the practical experimental work: SM, MY, DCLB, YBH, WKL, RMW. Designed and performed the computational analyses: SM, MY, YCFS, DCLB, GJDS, RMW. Wrote the manuscript: SM, MY, YCFS, DCLB, GJDS, WKL, RMW. All authors have read and approved the final manuscript.
Electronic supplementary material
Table summarizing G + C content (%) for the eight genes selected for sequence analysis within the 20
Additional file 1: Treponema denticola strains.(PDF 8 KB)
Additional file 2: flaA , recA , pyrH , ppnK , dnaN , era and radC gene homologues present in Treponema pallidum SS14 and Treponema vincentii LA-1 (ATCC 35580).(PDF 8 KB)
Additional file 3: flaA − recA − pyrH − ppnK − dnaN − era − radC gene datasets analyzed in this study.(PDF 8 KB)
Additional file 4: flaA , recA , pyrH , ppnK , dnaN , era and radC gene datasets.(PDF 341 KB)
About this article
Cite this article
Mo, S., You, M., Su, Y.C. et al. Multilocus sequence analysis of Treponema denticolastrains of diverse origin. BMC Microbiol 13, 24 (2013). https://doi.org/10.1186/1471-2180-13-24
- Treponema denticola
- Periodontal disease
- Multilocus sequence analysis
- Oral microbiota
- Infectious diseases