- Research article
- Open Access
atpE gene as a new useful specific molecular target to quantify Mycobacteriumin environmental samples
BMC Microbiology volume 13, Article number: 277 (2013)
The environment is the likely source of many pathogenic mycobacterial species but detection of mycobacteria by bacteriological tools is generally difficult and time-consuming. Consequently, several molecular targets based on the sequences of housekeeping genes, non-functional RNA and structural ribosomal RNAs have been proposed for the detection and identification of mycobacteria in clinical or environmental samples. While certain of these targets were proposed as specific for this genus, most are prone to false positive results in complex environmental samples that include related, but distinct, bacterial genera. Nowadays the increased number of sequenced genomes and the availability of software for genomic comparison provide tools to develop novel, mycobacteria-specific targets, and the associated molecular probes and primers. Consequently, we conducted an in silico search for proteins exclusive to Mycobacterium spp. genomes in order to design sensitive and specific molecular targets.
Among the 3989 predicted proteins from M. tuberculosis H37Rv, only 11 proteins showed 80% to 100% of similarity with Mycobacterium spp. genomes, and less than 50% of similarity with genomes of closely related Corynebacterium, Nocardia and Rhodococcus genera. Based on DNA sequence alignments, we designed primer pairs and a probe that specifically detect the atpE gene of mycobacteria, as verified by quantitative real-time PCR on a collection of mycobacteria and non-mycobacterial species. The real-time PCR method we developed was successfully used to detect mycobacteria in tap water and lake samples.
The results indicate that this real-time PCR method targeting the atpE gene can serve for highly specific detection and precise quantification of Mycobacterium spp. in environmental samples.
Mycobacterium genus is constituted of several pathogenic species, including the M. tuberculosis complex (MTC) responsible for tuberculosis (i.e. M. tuberculosis, M. africanum, M. bovis, M. canettii, M. caprae, M. microti and M. pinnipedii), M. leprae responsible for leprosy, and non-tuberculous mycobacteria (NTM), which are environmental potentially pathogenic species causing mycobacteriosis . Detection of mycobacteria by bacteriological tools is generally time-consuming and difficult because most pathogenic mycobacteria are slow growing, such that other microorganisms overgrow NTM colonies . Identification of mycobacteria based on metabolic criteria is also problematic as current methods do not allow for proper identification of mycobacterial species and sub-species. Consequently, molecular tools have been developed using rrs, gyrA, gyrB, hsp65, recA, rpoB, sodA genes and 16S-23S internal transcribed spacer (ITS) genes, to detect and/or identify mycobacteria species by sequence analysis [3, 4]. In order to detect Mycobacterium genus in clinical and environmental samples, several studies have proposed targeting different loci of the 16S rRNA gene [5–17], or other housekeeping genes such as gyrB , rpoB, and hsp65. Nevertheless, in a recent study comparing several primers commonly used for mycobacterial detection or identification, we demonstrated that most of these primers present either a high specificity (i.e. the proportion of true negatives that are correctly identified by the test) but a low sensitivity (i.e. the proportion of true positives that are correctly identified by the test), or conversely a high sensitivity but a low specificity . Indeed, some of these methods fail to detect several mycobacterial species by PCR, while other primers lead to detection of closely related genera  which also belong to the Corynebacterium, Nocardia, Rhodococcus, Mycobacterium (CNM) group  and which are commonly present in water and soil samples. Consequently, new strategies must be used in order to design Mycobacterium genus targets with high levels of specificity and sensitivity that will be useful for studying mycobacteria in their habitat.
As new mycobacterial sequences are added into genetic databases, our knowledge of mycobacterial genomes is increasing and this may help to design new primers and probes that will be both specific and sensitive. Since the whole sequencing of the first mycobacterial genome in 1998  by Sanger sequencing method (M. tuberculosis H37Rv), the number of mycobacterial sequences has considerably increased due to advances in sequencing capacity and the appearance of high throughput sequencing techniques . Today, GenBank database provides access to whole genomes of seven other strains of the MTC (M. tuberculosis and M. bovis species), two strains of M. leprae, and eleven species and subspecies of pathogenic (P) and non-pathogenic (NP) NTM: M. abscessus (P), M. avium (P), M. avium subsp. paratuberculosis (P), M. gilvum (NP), M. marinum (P), M. smegmatis (NP), Mycobacterium sp. JLS (NP), Mycobacterium sp. KMS (NP), Mycobacterium sp. MCS (NP), M. ulcerans (P), M. vanbaalenii (NP), [24–26]. Moreover, three whole genomes of other NTM species were sequenced and are currently assembled (M. intracellulare, M. kansasii, M. parascrofulaceum). This increasing number of completely sequenced mycobacterial genomes led to the development of the MycoHit software, which permits gene- and protein-level comparisons across mycobacteria species, . This software was originally developed to detect horizontal gene transfers and mutations among whole mycobacterial genomes . However, MycoHit should also be useful for developing new primers and probes for mycobacteria detection and quantification in environmental and clinical samples.
In this paper, we used this tool for screening sensitive and specific targets of Mycobacterium spp.. We compared in silico proteins of whole mycobacterial genomes with those of non-mycobacterial genomes using the MycoHit software, in order to find conserved sequences among mycobacteria that will not be shared with non-mycobacterial species. Based on the screening results a primer pair and a probe targeting the atpE gene were designed and tested by real-time PCR. This novel target proved to be totally specific and sensitive. It also offers the advantage of targeting a gene present as a single copy in the genome. Thus this new real-time PCR method appears promising for water quality survey, and should be useful for studying the ecology of mycobacteria in aquatic, terrestrial and urban environments.
Specificity of genes commonly used for mycobacterial detection/identification
Excluding rrs gene and ITS (non-functional RNA elements and structural ribosomal RNAs), and according to our strategy of genome comparison (Figure 1) most of the genes commonly used for mycobacterial species identification (gyrA, gyrB, hsp65, recA, rpoB, sodA, groEL1, groEL2) code for proteins which present similar conformations in non-mycobacterial studied genomes (Additional file 1). Indeed, protein similarity levels of these genes, in comparison with M. tuberculosis H37Rv genome, were higher than 80% for the other 15 mycobacterial genomes studied (96 ± 2% for gyrA, 94 ± 5% for gyrB, 79 ± 5% for groEL1, 93 ± 4% for groEL2 which is an alternative gene name for hsp65, 99 ± 1% for recA, 96 ± 2% for rpoB, 81 ± 33% for sodA), and also for the 12 non-mycobacterial genomes studied (86 ± 5% for gyrA, 85 ± 5% for gyrB, 89 ± 3% for groEL1, 96 ± 2% for groEL2, 94 ± 3% for recA, 88 ± 4% for rpoB, 69 ± 22% for sodA).
Selection of exclusively conserved proteins in Mycobacteriumspp. genomes
Among the 3989 predicted proteins of M. tuberculosis H37Rv genome (Figure 2A and Additional file 1), about 54.6% (i.e. 2177 proteins) presented protein similarities above 50% with the other studied mycobacterial genomes (n = 15), and only 6.8% of these hypothetical conserved mycobacterial proteins (150 proteins: 150 number in the top of a bar in Figure 2B) displayed similarities less than 50% with the studied non-mycobacterial genomes (n = 12). Consequently, almost half of the M. tuberculosis H37Rv predicted proteins are potentially present in the 12 studied genomes of CNM group members. We chose to decrease the number of candidate proteins by restricting the panel of studied proteins to those exclusively conserved in the mycobacterial genomes, focusing on M. tuberculosis H37Rv proteins with similarity levels between 80% and 100% in comparison with other mycobacterial genomes (n = 15), and less than 50% similarity levels in comparison with genomes (n = 12) of the other CNM group genera. As a result, among the 3989 predicted proteins of M. tuberculosis H37Rv genome (Figure 2A), we selected 11 proteins (11 number in the top of a bar in Figure 2B). Among the 3989 predicted proteins of M. tuberculosis H37Rv proteins (Additional file 1), the selected candidate proteins (Table 1), were the subunits C (locus Rv1305) and A (locus Rv1304) of the ATP synthase, the cyclopropane mycolic acid synthase (CMAS) coded by the cmaA1 gene in M. tuberculosis H37Rv (locus Rv3392c), hypothetical PE or PPE family proteins (loci Rv0285 and Rv3022c), proteins coded by esxG, esxH and esxR genes in M. tuberculosis H37Rv (loci Rv0287, Rv0288, Rv3019c, respectively), and proteins such as a lipoprotein coding by lppM gene (locus Rv2172c), an oxidoreductase (locus Rv0197), and a small secreted protein (locus Rv0236A).
Mycobacterial molecular target design
Among the 11 selected mycobacterial proteins, protein alignments revealed that the ATP synthase subunit C (locus Rv1305), the oxidoreductase (locus Rv0197), and the small secreted protein (locus Rv0236A), are the less polymorphous among the 14 NTM species studied (Additional file 2) and even absent in other bacteria genus and thus seemed very promising for primers and probes design. The remaining 8 proteins that were selected, namely ATP synthase subunit A, CMAS coded by the cmaA1 gene, lipoprotein coding by lppM gene, as well as PE, PPE and proteins coded by esx genes esxG, esxH and esxR, were highly conserved in studies MTC species (tuberculosis and bovis) but very polymorphous in the 14 NTM species studied (Additional file 1), which did not allow us to design specific mycobacterial primers and probes, according to the rules of primer and probe design (Additional file 3).
DNA sequence alignment of the oxidoreductase and of the small secreted protein did not allow design of PCR primers with a minimal length of 18 oligonucleotides (Additional file 3). Only the DNA sequence alignment of the ATP synthase subunits C allowed designing a PCR primer pair and a probe. We designed the following primers and probe: forward primer FatpE 5′-CGGYGCCGGTATCGGYGA-3′ (Tm = 62°C), with the probe PatpE 5′-ACSGTGATGAAGAACGGBGTRAA-3′ (Tm = 68°C) which might be hydrolyzed by the reverse primer RatpE 5′-CGAAGACGAACARSGCCAT-3′ (Tm = 59°C, 182 bp).
Real-time PCR validation
Based on standard curve comparisons, our results showed reproducible amplification signals with similar Ct values for each genome equivalents of tested mycobacterial strains: M. avium, M. fortuitum, M. intracellulare, M. gordonae, and M. chelonae (Table 2). Detection limit was estimated at about 6 genome equivalents for M. chelonae by real-time PCR reaction by testing repetition of dilution limits (i.e. EC95 value: more than 95% of positive detection for these genome concentration) whereas quantification limits were estimated at about 100 genome equivalents. In the positive collection all 31 mycobacteria species were positively detected by the real-time PCR method. This collection includes NTM species, leprae species and MTC species as tuberculosis and bovis (Table 3). None of the non-mycobacterial environmental strains and none of the CNM collection strains , were detected before the end of the 40 PCR cycles (Table 3). These results indicate a sensibility of 100% (31/31) and a specificity of 100% (0/30).
In order to compare with culture-based method (Method A) , and evaluate the impact of extraction methods on the quantification process by the new real-time PCR, we used two DNA extraction procedures (Method B and C) on water distribution samples: a commercial kit (Method B) and a published phenol-chloroform extraction (Method C) . DNA extraction from tap water significantly influenced the result of mycobacteria detection by atpE real-time PCR (Figure 3A). Detection levels from DNA extracted by the kit (Method B) were significantly higher (Wilcoxon signed-rank test, n = 90, p = 0.002) than those from DNA extracted by phenol/chloroform procedure (Method C). The percentage of positive samples was significantly higher (Chi-square test, n = 180, df = 1, p = 0.021) when performing the real-time PCR with the DNA extracted by method B (33/90), compared to method C (19/90). In order to evaluate the new real-time PCR method, we compared the levels of mycobacteria detected in water distribution samples with a published culture method called method A . Using the method A, Mycobacterium spp. colonies were obtained from 76% of tap water samples.
Mycobacteria quantification in lake samples by real-time PCR targeting atpE gene, shows a vast diversity of mycobacteria concentration, ranging from 104 to 106 ge/L in water column and neuston samples, and 105 to 106 ge/g DW (dry weight) in sediment samples. Comparison with the previously published methods targeting 16S rRNA  shows a high correlation between the results (Figure 3B, Correlation test, n = 30, Rs = 0.571, p = 0.028).
Although gyrA, gyrB, hsp65, recA, rpoB, and sodA genes are appropriate for identification purposes [3, 4], our results emphasized that these genes seem inappropriate for specific detection of mycobacteria. Indeed, their high similarities with non-mycobacterial genes make specific target design delicate. These new results are in accordance with our previous observations that the molecular targets which were designed based on gyrB , rpoB or hsp65 genes, had low specificity . For example, the non-related Helicobacter pylori show positive amplification with several Mycobacterium specific primer pairs . Prospection for more specific targets in mycobacterial genomes seems consequently necessary in order to improve current detection tools based on proteins and/or DNA. The new atpE real-time PCR method that we propose is just as specific, but more sensitive than the previously proposed rrs real-time PCR method which cannot detect some mycobacterial species .
The proposed strategy is aimed at comparing mycobacterial and non-mycobacterial genomic proteins to reference genomic DNA of M. tuberculosis H37Rv, sorting proteins according to similarity requests and listing candidate proteins (Figure 1). We chose to perform protein-level comparisons in order to identify exclusively conserved proteins in Mycobacterium spp. because non-coding regions, as intergenic regions and insertion sequences, are known to be less conserved than coding regions in M. tuberculosis genomes . According to literature, our results emphasized that almost half of the M. tuberculosis H37Rv predicted proteins are potentially present in the genomes of CNM group members. More precisely, mycobacteria belong to Actinobacteria which may explain the presence of 48 to 73% shared genes among high G + C content microorganisms [31–34]. In addition, horizontal gene transfers from different bacteria widely present in soil or water, especially Rhodococcus sp., Nocardia sp. and Streptomyces sp. were previously considered to have happened in the Mycobacterium genus which may also explain the shared proteins with non-mycobacterial species [24, 27, 35]. These observations show that CNM group members must be taken into account in order to develop highly specific mycobacterial targets, considering that these bacteria are commonly found in aquatic and terrestrial environments [36, 37].
Our study showed that 11 proteins exclusively conserved in the 16 mycobacterial genomes studied could be selected using our genome comparison strategy (i.e. proteins coded by atpE, atpB, cmaA1, lppM, PE5, PPE48, esxG, esxH and esxR genes, as well as an oxidoreductase and a small secreted protein). Only the aptE gene could be used to design primers and a probe for mycobacteria detection. Concerning the other genes, the sequence polymorphism among NTM species did not allow designing molecular targets for Mycobacterium spp. detection. However, these genes could be of immunological or pathogenic importance. Indeed, PE and PPE family proteins represent 0.9 to 4.2% of the genome coding capacity of several mycobacteria [22, 25, 26, 35], and are suspected to play a major antigenic role in immune response . PE and PPE family proteins are often associated with mycobacterial esx gene clusters, which encode ATP dependent specific secretion system  and are required to export specific members of the 6-kDa early secreted antigenic target (ESAT-6) protein family . Together, ATP dependent specific secretion system and ESAT-6 protein family play a major role in the virulence and life cycle of mycobacteria [24, 26]. Nevertheless, PE and PPE family proteins, and proteins coded by esx gene clusters are very small and polymorphous among genomes of the 11 NTM species compared (Table 1). Mycobacterial cell wall is also important in pathology, and could procure interesting PCR targets. For instance, several studies emphasized that cyclopropanation of the mycolic acids is common among pathogenic mycobacteria but rare among saprophytic species . Although having sufficient length, proteins CMAS coded by the cmaA1 gene and lipoprotein coded by lppM gene in M. tuberculosis H37Rv, were also polymorphous among genomes of the 11 NTM species compared (Table 1) and thus could not be used to design a primer pair and a probe (Additional file 2). Nevertheless, polymorphism of mycobacterial mycolic acids is useful for mycobacteria identification [40, 41].
The atpE gene which codes ATP synthase subunit C in M. tuberculosis H37Rv genome (locus Rv1305) is exclusively conserved in the genomes of the 17 mycobacterial species studied (Additional file 2), and its length and relative conservation among mycobacteria make it an adequate molecular target in order to detect Mycobacterium genus. It is remarkable to see that the protein coded by atpE gene was also the target of the new antimycobacterial compound recently described: diarylquinoline R207910 . This compound shows a specific bactericidal effect on mycobacteria and none in other genera . In addition, our in vitro results demonstrated the specificity of the atpE gene (locus Rv1305), which codes for the ATP synthase protein subunit C. These results also showed that our strategy of target design based on MycoHit software (Figure 1) gave very useful results for designing highly specific primers and might be applied to other microorganism clusters.
In vitro validation of the real-time PCR targeting the atpE gene showed a very high specificity and sensitivity, as well as reproducible quantification of different mycobacteria species. The new real-time method was tested on a realistic number of mycobacterial species including several slow and rapid growing NTM, although not all the described mycobacterial species were tested. In addition, application of this real-time PCR method to environmental samples showed that Mycobacterium was detected in tap water samples. The discrepancy between the cultural and molecular techniques was previously described for other pathogens, and the lower level of prevalence obtained by the PCR methods was probably due to our concentration and extraction procedures. These protocol steps must be improved to detect low level of NTM even if the used spin column seemed more appropriate for DNA extraction from environmental samples compared to classical phenol-chloroform extraction. Moreover, culture method did not detect higher level of mycobacterial cells compared to the molecular one. Both methods have advantages and drawbacks, and it may explain the differences observed. For instance, molecular methods could detect dead bacteria, or viable but uncultivable bacteria. However, the real-time PCR targeting the atpE gene allows more accurate Mycobacterium spp. quantification, contrary to culture based method which is subjected to many drawbacks such as decontamination artifact (about 2 log10 reduction for M. chelonae), slow mycobacteria growth, clumping of mycobacterial cells, high hydrophobicity of mycobacteria and contamination of culture media by other fast growing environmental microorganisms .
Comparison of the method targeting atpE with previously described method targeting 16S rRNA, , showed a high correlation. Moreover the method targeting atpE gene presents two major advantages over the method targeting rrs gene. First, the new method detects all the tested mycobacterial strains, while the method targeting rrs gene cannot detect isolates of M. celatum, M. heckeshornense, and M. leprae. Second, the atpE gene is present in a single copy in the Mycobacterium genomes, while the 16S rRNA gene is present either in 1 or 2 copies in the genome . When comparing samples it will be simpler to interpret the data with a stable gene copy number, and probably give a better accuracy of the mycobacterial concentration.
One of the limitations of this study is that only 31 mycobacterial species were tested in vitro as positive controls whereas more than 150 mycobacterial species have been described so far . To date, we have confirmed the sensitivity of the atpE real-time PCR method using a large representative collection of mycobacterial species (31 species, e.g. around 20% of described species), including members of MTC (n = 2), M. leprae species (n = 1), slow growing NTM (n = 13), and rapid growing NTM (n = 15). Given the broad diversity of mycobacterial species we have tested in this study, we expect the method to be applicable to all species within the Mycobacterium genus. In addition, it is the first time that a sensitive and specific molecular target has been identified based on an in silico comparison of 16 mycobacterial (13 species) and 12 non-mycobacterial genomes (4 closely related species).
In conclusion, although our strategy did not take into account non-coding regions, such as insertion sequences, repetitive units, non-functional RNA, and structural ribosomal RNAs, the comparison of whole bacterial genomes for design of specific primers is a promising approach not only for mycobacteria but also for other cultured bacterial or archaeal groups for which whole sequenced genomes are accumulating in databases. Metagenomic libraries from environmental samples which are increasingly performed in microbial ecology studies  could also provide useful data for the design of specific targets toward uncultured Bacteria and Archaea. Using these databases, MycoHit, or other new software, may then be used to design new primers for real-time PCR detection or quantification, for in situ hybridization and other molecular tools. With this approach we were able to design primer pairs and a probe that target specific mycobacterial atpE gene, and could be used to detect and quantify very specifically mycobacteria in environmental samples. Although the atpE gene may not be appropriate for microdiversity studies, it appeared to be very useful for specific detection of the genus Mycobacterium in environmental samples. More generally, genome comparison used here showed its utility to identify specific genera’s targets, and could be used to identify specific proteins for antimicrobial design as previously emphasized .
In silicocomparison strategy
In order to detect M. tuberculosis genes, presenting homologue genes in other mycobacterial genomes, and not presenting homologue genes in non-mycobacteria genomes, we used the MycoHit software version 14.17 (Zipped copy of the files and instructions for this application are available in the Behr Research Lab, https://www.mcgill.ca/molepi/) and performed an alignment search with Stand Alone tblastn algorithm as previously described . Stand Alone tblastn algorithm has been chosen because coding sequences are known to be more conserved in mycobacterial genomes than non-coding sequences, as intergenic regions, insertion sequences, or phage sequences . Genome of M. tuberculosis H37Rv has been used as a reference of the Mycobacterium genus, because it is the most historically described mycobacterial genome . Based on the 3989 predicted proteins from M. tuberculosis H37Rv, corresponding to the query sequences used in order to search for matches in the genomic DNA of other organisms (Figure 1), a matrix of 107703 scores (3989 protein sequences blasted against 12 non-mycobacterial genomes and 15 mycobacterial genomes) was obtained. As previously described  and according to NCBI procedures , expected value was set at e-10. Following sequence comparisons, the MycoHit software allowed to sort scores according to similarity requests which were performed on the one hand toward mycobacterial genomes, and on the other hand toward non-mycobacterial genomes (Figure 1). A protein list of the reference target, which can be downloaded from NCBI web site (http://www.ncbi.nlm.nih.gov), allowed identification of the conserved mycobacterial proteins presenting no homology in non-mycobacterial genomes (Figure 1).
Mycobacterial genome database
In order to perform comparisons of pathogenic (P) and non-pathogenic (NP) mycobacterial genomes with M. tuberculosis H37Rv genome using MycoHit software, sequences were obtained at NCBI web site (http://www.ncbi.nlm.nih.gov) using the accession numbers: M. abscessus ATCC 19977 (CU458896.1) (P), M. avium 104 (CP000479.1) (P), M. avium subsp. paratuberculosis K10 (AE016958.1) (P), M. bovis subsp. bovis AF2122/97 (BX248333.1) (P), M. gilvum PYR-GCK (CP000656.1) (NP), M. marinum M (CP000854.1) (P), M. smegmatis MC2 155 (CP000480.1) (NP), Mycobacterium sp. JLS (CP000580.1) (NP), Mycobacterium sp. KMS (CP000518.1) (NP), Mycobacterium sp. MCS (CP000384.1) (NP), M. tuberculosis CDC1551 (AE000516.2) (P), M. tuberculosis H37Ra (CP000611.1) (NP), M. tuberculosis H37Rv (AL123456.2) (P), M. tuberculosis KZN 1435 (CP001658.1) (P), M. ulcerans Agy99 (CP000325.1) (P), and M. vanbaalenii PYR-1 (CP000511.1) (P). In order to avoid data lost during genome comparisons performed by MycoHit software, we have chosen to ignore some mycobacterial genomes. Since the number of coding proteins is much lower compared to other mycobacterial species, M. leprae Br4923 (FM211192.1) (P), and M. leprae TN (AL450380.1) (P) were ignored in the analysis (e.g. 1604 coding proteins in M. leprae Br4923 or 1605 coding proteins in M. leprae TN, against 6716 coding proteins in M. smegmatis MC2 155) [22, 24–26, 35]. Genomes of M. bovis BCG Pasteur 1173P2 (AM408590.1) (NP) and M. bovis BCG Tokyo 172 (AP010918.1) (NP) were also not taken into account, because these vicinal genomes present mutations . Moreover, genomes of M. intracellulare ATCC 13950 (ABIN00000000) (P), M. kansasii ATCC 12478 (ACBV00000000) (P) and M. parascrofulaceum BAA-614 (ADNV00000000) (P) were also not used during MycoHit proceedings, because their genomes were still not assembled at the moment we performed the first screening step of our analysis. Nevertheless, the genomes of M. leprae, M. bovis BCG, M. intracellulare, M. kansasii and M. parascrofulaceum were used during alignment of nucleic sequences of the most conserved proteins in mycobacterial genomes.
Non-mycobacterial genome database
We selected non-mycobacterial genomes of species from the CNM group using the following accession numbers: Corynebacterium aurimucosum ATCC 700975 (CP001601.1), C. diphtheriae NCTC 13129 (BX248353.1), C. efficiens YS-314 (BA000035.2), C. glutamicum ATCC 13032 (BX927147.1), C. jeikeium K411 (NC_007164), C. kroppenstedtii DSM 44385 (CP001620.1), C. urealyticum DSM 7109 (AM942444.1), Nocardia farcinica IFM 10152 (AP006618.1), Nocardioides sp. JS614 (CP000509.1), Rhodococcus erythropolis PR4 (AP008957.1), R. jostii RHA1 (CP000431.1), and R. opacus B4 (AP011115.1).
Primer pair and probe design
In order to check the homology of the selected mycobacterial sequences, the protein and DNA sequences of these selected proteins were aligned using the ClustalW multiple alignment of the BioEdit software 188.8.131.52 with 1000 bootstraps . Primer pair and probe was designed from the best fitted gene sequences (after protein screening and selection) by visual analysis and using the Beacon Designer software version 7.90 (Premier Biosoft International, Palo Alto, Calif.).
Real-time PCR validation
Reproducibility, sensitivity and specificity of the new real-time PCR method were estimated using DNA from a previously described microorganism collection, and according to Radomski et al. protocol . Reproducibility, efficiency, limits of detection and quantification of the real-time PCR methods  were estimated by quantification of several tenfold dilutions (10 replications of 400, 100, 40, 20, 4, 0.4 and 0.04 genome equivalent (ge) by reaction) of a known quantity of DNA extracted from four strains: M. avium, M. fortuitum, M. intracellulare and M. gordonae (identified from the national French reference laboratory collection). Specificity and sensitivity were estimated against 30 non-mycobacteria (negative) strains and 31 mycobacteria (positive), respectively. The collection contained reference and environmental strains of mycobacteria, as well as, strains of the closely related CNM group, and other non-actinobacteria strains isolated from the environment . Mycobacteria collection included MTC (n = 2) and leprae species (n = 1), as well as species of slow growing NTM (n = 13), and rapid growing NTM (n = 15). TaqMan® real-time PCR were performed in duplicate using an ABI7500 real-time PCR system (Applied Biosystems), a Lifetech 7500 software version 2.0.6 (Applied Biosystems) and TaqMan fast virus 1-STEP Master Mix with 6-carboxy-X-rhodamine (ROX) (Applied Biosystems). The TaqMan® probes were labeled (Eurogentec) with the fluorescent dyes 6-carboxyfluorescein (5′ end) and Black Hole Quencher (3′ end). All reactions were performed in a 25 μl reaction mixture volume (2.5 μl of DNA) with 500 nM of forward primer, 500 nM of reverse primer, 50 nM of probe and 5 mM of MgCl2. Reverse transcriptase was inactivated immediately (95 °C, 45 s) according to the manufacturer instruction, and real-time PCR consisted in 40 cycles of denaturation (95°C for 3 s), annealing and extension (both steps at 60°C for 30 s). Determinations of cycle threshold were performed by setting the instrument’s threshold line at 0.02 ∆Rn units (fluorescence gain above the baseline divided by the ROX channel signal).
In order to compare the new real-time PCR method to the culture method, 26 tap water distribution points in Paris (France) were sampled between April 2011 and July 2011, corresponding to 90 samples. Briefly, one liter of tap water was sampled in sterile plastic bottle, then centrifuged at 5000 × g for 2h and finally re-suspended in 1 ml of water. Mycobacteria density was estimated by culture (Method A) in all these samples following the procedure previously described by Le Dantec et al. . In parallel, DNA was extracted using two different methods: i) a bacterial DNA extraction kit (QIAamp DNA mini kit, Qiagen) according to the manufacturer recommendations (Method B), and ii) a phenol-chloroform extraction procedure according to Radomski et al.  (Method C). Extracted DNA was 10 fold diluted and mycobacteria density was estimated in duplicate using the new real-time PCR method.
Using environmental samples, the new atpE targeting method was also compared a previously described rrs targeting method . More precisely, samples collected from water column, sediment, and neuston of two urbanized lakes (Daumesnil Lake, Paris, France, and Créteil Lake, Créteil, France) were analyzed in triplicate. Water samples (column and neuston) were centrifuged 1 h at 7500 × g, and DNA was extracted using a MagNA Pure System (Roche). Sediment samples were lyophilized and DNA was isolated using FastDNA SPIN kit for Soil according to the manufacturer’s instructions (MP Biomedicals, Santa Ana, CA). Statistical analyses were carried out using R software v. 2.15 .
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
Cyclopropane mycolic acid synthase
Corynebacterium, Nocardia, Rhodococcus, Mycobacterium
Internal transcribed spacer
Mycobacterium tuberculosis complex
Kazda J: The chronology of mycobacteria and the development of mycobacterial ecology. The ecology of mycobacteria: Impact on animal’s and human’s health. Edited by: Kazda J, Pavlik I, Falkinham JO, Hruska K. 2009, Dordrecht Heidelberg London New York: Springer, 1: 1-11.
Radomski N, Cambau E, Moulin L, Haenn S, Moilleron R, Lucas FS: Comparison of culture methods for isolation of nontuberculous mycobacteria from surface waters. Appl Environ Microbiol. 2010, 76 (11): 3514-3520. 10.1128/AEM.02659-09.
Adékambi T, Drancourt M: Dissection of phylogenetic relationships among 19 rapidly growing Mycobacterium species by 16S rRNA, hsp65, sodA, recA and rpoB gene sequencing. Int J Syst Evol Microbiol. 2004, 54 (6): 2095-2105. 10.1099/ijs.0.63094-0.
Gomila M, Ramirez A, Lalucat J: Diversity of environmental Mycobacterium isolates from hemodialysis water as shown by a multigene sequencing approach. Appl Environ Microbiol. 2007, 73 (12): 3787-3797. 10.1128/AEM.02934-06.
Mendum TA, Chilima BZ, Hirsch PR: The PCR amplification of non-tuberculous mycobacterial 16S rRNA sequences from soil. FEMS Microbiol Lett. 2000, 185 (2): 189-192. 10.1111/j.1574-6968.2000.tb09060.x.
Garcia-Quintanilla A, Gonzalez-Martin J, Tudo G, Espasa M, Jiménez de Anta MT: Simultaneous identification of Mycobacterium genus and Mycobacterium tuberculosis complex in clinical samples by 5′-exonuclease fluorogenic PCR. J Clin Microbiol. 2002, 40 (12): 4646-4651. 10.1128/JCM.40.12.4646-4651.2002.
Nieminen T, Pakarinen J, Tsitko I, Salkinoja-Salonen M, Breitenstein A, Ali-Vehmas T, Neubauer P: 16S rRNA targeted sandwich hybridization method for direct quantification of mycobacteria in soils. J Microbiol Methods. 2006, 67 (1): 44-55. 10.1016/j.mimet.2006.02.015.
Dutil S, Veillette M, Mériaux A, Lazure L, Barbeau J, Duchaine C: Aerosolization of mycobacteria and legionellae during dental treatment: Low exposure despite dental unit contamination. Environ Microbiol. 2007, 9 (11): 2836-2843. 10.1111/j.1462-2920.2007.01395.x.
Böddinghaus B, Rogall T, Flohr T, Blocker H, Bottger EC: Detection and identification of mycobacteria by amplification of rRNA. J Clin Microbiol. 1990, 28 (8): 1751-1759.
Zolg JW, Philippi-Schulz S: The superoxide dismutase gene, a target for detection and identification of mycobacteria by PCR. J Clin Microbiol. 1994, 32 (11): 2801-2812.
Pryor M, Springthorpe S, Riffard S, Brooks T, Huo Y, Davis G, Sattar SA: Investigation of opportunistic pathogens in municipal drinking water under different supply and treatment regimes. Water Sci Technol. 2004, 50 (1): 83-90.
Niva M, Hernesmaa A, Haahtela K, Salkinoja-Salonen M, Sivonen K, Haukka K: Actinobacteria communities of borel forest soil and lake water are rich in mycobacteria. Boreal Environ Res. 2006, 11 (1): 45-53.
Leys NM, Ryngaert A, Bastiaens L, Wattiau P, Top EM, Verstraete W, Springael D: Occurrence and community composition of fast-growing Mycobacterium in soils contaminated with polycyclic aromatic hydrocarbons. FEMS Microbiol Ecol. 2005, 51 (3): 375-388. 10.1016/j.femsec.2004.09.015.
Uyttebroek M, Vermeir S, Wattiau P, Ryngaert A, Springael D: Characterization of cultures enriched from acidic Polycyclic Aromatic Hydrocarbon-contaminated soil for growth on pyrene at low pH. Appl Environ Microbiol. 2007, 73 (10): 3159-3164. 10.1128/AEM.02837-06.
Uyttebroek M, Breugelmans P, Janssen M, Wattiau P, Joffe B, Karlson U, Ortega-Calvo JJ, Bastiaens L, Ryngaert A, Hausner M, et al: Mycobacterium community and polycyclic aromatic hydrocarbons (PAHs) among different size fractions of a long-term PAH-contaminated soil. Environ Microbiol. 2006, 8 (5): 836-847. 10.1111/j.1462-2920.2005.00970.x.
Uyttebroek M, Spoden A, Ortega-Calvo JJ, Wouters K, Wattiau P, Bastiaens L, Springael D: Differential responses of Eubacterial, Mycobacterium, and Sphingomonas communities in Polycyclic Aromatic Hydrocarbon (PAH)-contaminated soil to artificially induced changes in PAH profile. J Environ Qual. 2007, 36 (1): 1403-1411.
Radomski N, Lucas FS, Moilleron R, Cambau E, Haenn S, Moulin L: Development of a real-time qPCR method for detection and enumeration of Mycobacterium spp. in surface water. Appl Environ Microbiol. 2010, 76 (11): 7348-7351.
Fukushima M, Kakinuma K, Hayashi H, Nagai H, Ito K, Kawaguchi R: Detection and identification of Mycobacterium species isolates by DNA microarray. J Clin Microbiol. 2003, 41 (6): 2605-2615. 10.1128/JCM.41.6.2605-2615.2003.
Kim BJ, Hong SK, Lee KH, Yun YJ, Kim EC, Park YG, Bai GH, Kook YH: Differential identification of Mycobacterium tuberculosis complex and nontuberculous mycobacteria by duplex PCR assay using the RNA polymerase gene (rpoB). J Clin Microbiol. 2004, 42 (3): 1308-1312. 10.1128/JCM.42.3.1308-1312.2004.
Tobler NE, Pfunder M, Herzog K, Frey JE, Altwegg M: Rapid detection and species identification of Mycobacterium spp. using real-time PCR and DNA-Microarray. J Microbiol Methods. 2006, 66 (1): 116-124. 10.1016/j.mimet.2005.10.016.
Murray RGE, Brenner DJ, Bryant MP, Holt JG, Krieg NR, Mouldier JW, Pfennig N, Snearth PHA, Staley JT, Lapage SP, et al: Bergey’s manual of systematic and bacteriology. 1989, Baltimore, USA: Williams and Wilkins, 2: 1
Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, et al: Deciphering the biology of Mycobacterium tuberculosis for the complete genome sequence. Nat Aust. 1998, 44 (6685): 393-537.
Nyrén P: The history of pyrosequencing. Methods Mol Biol. 2007, 373 (1): 1-14. 10.1016/j.jmb.2007.07.038.
Ripoll F, Pasek S, Schenowitz C, Dossat C, Barbe V, Rottman M, Macheras E, Heym B, Herrmann JL, Daffé M, et al: Non mycobacterial virulence genes in the genome of the emerging pathogen Mycobacterium abscessus. PLoS One. 2009, 4 (6): 1-12.
Li L, Bannantine J, Zhang Q, Amonsin A, May B, Alt D, Banerji N, Kanjilal S, Kapur V: The complete genome sequence of Mycobacterium avium subspecies paratuberculosis. Proc Natl Acad Sci U S A. 2005, 102 (35): 12344-13349. 10.1073/pnas.0505662102.
Stinear TP, Seemann T, Harrison PF, Jenkin GA, Davies JK, Johnson PDR, Abdellah Z, Arrowsmith C, Chillingworth T, Churcher C, et al: Insights from the complete genome sequence of Mycobacterium marinum on the evolution of Mycobacterium tuberculosis. Genome Res. 2010, 18 (1): 729-741.
Veyrier F, Pletzer D, Turenne C, Behr MA: Phylogenetic detection of horizontal gene transfer during the step-wise genesis of Mycobacterium tuberculosis. BMC Evo Biol. 2009, 196 (8): 1-14.
Le Dantec C, Duguet JP, Montiel A, Dumoutier N, Dubrou S, Vincent V: Occurrence of mycobacteria in water treatment lines and in water distribution systems. Appl Environ Microbiol. 2002, 68 (11): 5318-5325. 10.1128/AEM.68.11.5318-5325.2002.
Radomski N, Betelli L, Moilleron R, Haenn S, Moulin L, Cambau E, Rocher V, Gonçalves A, Lucas FS: Mycobacterium behavior in wastewater treatment plant, a bacterial model distinct from Escherichia coli and enterococci. Environ Sci Technol. 2011, 45 (12): 5380-5386. 10.1021/es104084c.
Cubillos-Ruiz A, Morales J, Zambrano MM: Analysis of the genetic variation in Mycobacterium tuberculosis strains by multiple genome alignments. BMC Res Notes. 2008, 7 (1): 110-120.
Casas Botero AE, Torem ML, Souza de Mesquita LM: Fundamental studies of Rhodococcus opacus as a biocollector of calcite and magnesite. Mine Eng. 2007, 20 (10): 1026-1032. 10.1016/j.mineng.2007.03.017.
Cocito C, Gilot P, Coene M, de Kesel M, Poupart P, Vannuffel P: Paratuberculosis. Clin Microbiol Rev. 1994, 3 (7): 328-345.
Gangadharam PRJ, Jenkins PA: Mycobacteria, basic acpects vol. 1. 1998, New York: International Thomson Publishing
Prescott LM, Harley JP, Klein DA, Bacq-Calberg CM, Dusart J: Les bactéries : Les Gram-positifs riches en G-C. Microbiologie. Edited by: Prescott J, Harley J, Klein D. 2003, Bruxelles: De Boeck Université, 1: 541-
Garnier T, Eiglmeier K, Camus JC, Medina N, Mansoor H, Pryor M, Duthoy S, Grondin S, Lacroix C, Monsempe C, et al: The complete genome sequence of Mycobacterium bovis. Proc Natl Acad Sci U S A. 2003, 100 (13): 7877-7882. 10.1073/pnas.1130426100.
Goodfellow M, Williams ST: Ecology of actionomycetes. Annu Rev Microbiol. 1983, 37: 189-216. 10.1146/annurev.mi.37.100183.001201.
Rowbotham TJ, Cross T: Ecology of Rhodococcus coprophilus and associated Actinomycetes in fresh water and agriculturl habitats. Microbiol. 1977, 100 (2): 231-240.
Voskuil MI, Schnappinger D, Rutherford R, Liu Y, Schoolnik GK: Regulation of the Mycobacterium tuberculosis PE/PPE genes. Tuberculosis (Edinb). 2004, 84 (3–4): 256-262.
Grogan DW, Cronan JE: Cyclopropane ring formation in membrane lipids of bacteria. Microbiol Mol Biol Rev. 1997, 61 (4): 429-441.
Butler WR, Ahearn DG, Kilburn JO: High-Performance Liquid Chromatography of mycolic acids as a tool in the identification of Corynebacterium, Nocardia, Rhodococcus, and Mycobacterium species. J Clin Microbiol. 1986, 21 (1): 182-185.
Thibert L, Lapierre S: Routine application of high-performance liquid chromatography for identification of mycobacteria. J Clin Microbiol. 1993, 31 (7): 1759-1763.
Petrella S, Cambau E, Chauffour A, Andries K, Jarlier V, Sougakoff W: Genetic basis for natural and acquired resistance to the diarylquinoline R207910 in mycobacteria. Antimicrob Agents Chemother. 2006, 50 (8): 2853-2856. 10.1128/AAC.00244-06.
Andries K, Verhasselt P, Guillemont J, Göhlmann HWH, Neefs JM, Winkler H, van Gestel JV, Timmerman P, Zhu M, Lee E, et al: A diarylquinolone drug active on the ATP synthase of Mycobacterium tuberculosis. Science. 2005, 307 (5707): 223-227. 10.1126/science.1106753.
Radomski N, Moilleron R, Lucas FS, Falkinham JO: Challenges in environmental monitoring of pathogens: Case study in Mycobacterium avium. Current research, technology and education topics in applied microbiology and microbial biotechnology. Edited by: Méndez-Vilas A. 2010, Badajoz: Formatex Research Center, 2: 1551-1561.
Fogel GB, Collins CR, Li J, Brunk CF: Prokaryotic genome size and SSU rDNA copy number: estimation of microbial relative abundance from a mixed population. Microb Ecol. 1999, 38 (2): 93-113. 10.1007/s002489900162.
Riesenfeld CS, Schloss PD, Handelsman J: Metagenomics: genome analysis of microbial communities. Annu Rev Genet. 2004, 38 (1): 525-552. 10.1146/annurev.genet.38.072902.091216.
Rosamond J, Allsop A: Harnessing the power of the genome in the search for new antibiotics. Science. 2000, 287 (5460): 1973-1976. 10.1126/science.287.5460.1973.
Yao J, Lin H, Doddapaneni H, Civerolo EL: nWayComp: a genome-wide sequence comparison tool for multiple strains/species of phylogenetically related microorganisms. In Silico Biol. 2007, 7 (2): 195-200.
Mahairas GG, Sabo PJ, Hickey MJ, Singh DC, Stover CK: Molecular analysis of genetic differences between Mycobacterium bovis BCG and virulent M. bovis. J Bacteriol. 1996, 178 (5): 1274-1282.
Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program fo windows 95/98/NT. Nucleic Acids Symp Ser. 1999, 41 (1): 95-98.
R Development Core Team: R foundation for statistical computing. R: a language and environment for statistical computing. 2012, Vienna, Austria, http://www.R-project.org,
We thank Pr. Jacques Printems from the laboratory of analysis and applied mathematics (CNRS UMR 8050) in Paris Est University for access to his computer (MacPro3.1, Quad-Core Intel Xeon) in order to perform tblastn algorithms, which run between 1 and 80 hours for each genome comparison according to the similarity levels with the reference genome. We also thank members of the R&D Biology lab from Eau de Paris, Claire Therial from LEESU, as well as, Michael Reed and Lynn Dery Capes from the Research Institute of the McGill University Health Centre.
The authors declare that they have no competing interests.
NR contributed in the experimental design, data acquisition and interpretation under the supervisions of FL, RM, EC, and LM, and was involved in writing the manuscript. For in silico comparisons, FJV provided technical assistance and MAB supervised data interpretation. HA and LM carried out the in vitro assays, and participated to statistical analyses and manuscript writing. All authors read and approved the manuscript.
Nicolas Radomski, Françoise S Lucas, Emmanuelle Cambau, Régis Moilleron and Laurent Moulin contributed equally to this work.