Large scale multiplex PCR improves pathogen detection by DNA microarrays
© Palka-Santini et al. 2009
Received: 29 May 2008
Accepted: 03 January 2009
Published: 03 January 2009
Skip to main content
© Palka-Santini et al. 2009
Received: 29 May 2008
Accepted: 03 January 2009
Published: 03 January 2009
Medium density DNA microchips that carry a collection of probes for a broad spectrum of pathogens, have the potential to be powerful tools for simultaneous species identification, detection of virulence factors and antimicrobial resistance determinants. However, their widespread use in microbiological diagnostics is limited by the problem of low pathogen numbers in clinical specimens revealing relatively low amounts of pathogen DNA.
To increase the detection power of a fluorescence-based prototype-microarray designed to identify pathogenic microorganisms involved in sepsis, we propose a large scale multiplex PCR (LSplex PCR) for amplification of several dozens of gene-segments of 9 pathogenic species. This protocol employs a large set of primer pairs, potentially able to amplify 800 different gene segments that correspond to the capture probes spotted on the microarray. The LSplex protocol is shown to selectively amplify only the gene segments corresponding to the specific pathogen present in the analyte. Application of LSplex increases the microarray detection of target templates by a factor of 100 to 1000.
Our data provide a proof of principle for the improvement of detection of pathogen DNA by microarray hybridization by using LSplex PCR.
Clinical microbiological diagnostics, environmental survey, food quality control and biodefence strategies have a common keystone: accurate and rapid identification of pathogenic microorganisms. Several molecular biology-based methods have been recently developed for microbial diagnostics and offer noticeable advantages over conventional techniques in microbiology. Among the molecular biology-based methods, DNA microarray technology presents the potential of direct and rapid identification of multiple DNA sequences [1–7]. A microarray displaying DNA probes corresponding to a collection of genes of a broad spectrum of pathogens is a powerful tool for simultaneous species identification, detection of virulence factors and antimicrobial resistance determinants . Major drawbacks in using DNA microarrays as a standard technique for pathogen detection are linked to the low representation of pathogen DNA in the analytes, but also to the relatively low sensitivity of fluorescence-based microarrays. The amount of specific pathogen DNA present in clinical, environmental, and food samples is sometimes as low as few femtograms [8–14], while the detection limit for genomic DNA in fluorescence-based microarrays, without any pre-amplification, is in the range of micrograms to nanograms [1, 3, 4, 7, 15].
A solution to overcome this intrinsic weakness of fluorescence-based microarrays is to specifically amplify the pathogen DNA fraction in the sample in order to increase the sensitivity level of detection. The question of random or selective pathogen DNA amplification prior to DNA microarray detection has been already addressed  and applications of multiplex PCR using a small number of primer pairs corresponding to the capture probes on low density microarrays have been published [16, 5, 6, 16–18]. We present here a further development of this approach, by proposing a large scale multiplex PCR adapted to the format of a prototype medium density microarray developed in our laboratory, employing up to 800 specific primer pairs. The limiting conditions for the LSplex PCR protocol are empirically determined and the resulting amplification biases are evaluated.
Template DNA was prepared from the following bacterial and fungal reference strains, obtained from the American Type Culture Collection (ATCC, Manassas, Va.), the Deutsche Sammlung von Mikroorganismen und Zellkulturen (DSMZ, Braunschweig, Germany) or the Collection de l'Institut Pasteur, (CIP, Paris, France): Staphylococcus aureus (ATCC 29213 and CIP 65.6), Staphylococcus epidermidis (ATCC 12228), Escherichia coli (ATCC 25922 and CIP 105893), Pseudomonas aeruginosa (ATCC 27853 and CIP 105765), Klebsiella pneumoniae (DSM 681), Proteus mirabilis (DSM 788), Enterococcus faecalis (ATCC 29212), Streptococcus pneumoniae (CIP 106577), Streptococcus mitis (CIP 104997), Candida albicans (ATCC 10231). A clinical isolate of S. aureus (T100) was also used in some experiments. Microorganisms were grown over night at 37°C with constant shaking at 220 rpm in 5 ml Luria-Bertani (LB) broth or tryptic soy broth (TSB, 30 g/l, Merck) containing 3 g/l yeast extract. Enterococci and Streptococci were grown in 10 ml TSB plus yeast without agitation under 5% CO2. Overnight cultures were harvested at 2,560 g for 10 min. After discarding the supernatant the pellet was washed in 1 ml TE (10 mM Tris-HCl, pH 7.5 and 1 mM EDTA) and recovered by centrifugation at 17,900 g for 10 min. Cell pellets were used for DNA preparation. Clinical samples were obtained from the routine microbiological laboratory were they were characterized by subculture and standard biochemical identification (VITEK2).
Total bacterial DNAs were extracted and purified by using the Bacterial Genomic DNA Purification Kit (EdgeBioSystems, Gaithersburg, MD, USA) following the instructions of the supplier. For Gram-positive bacteria the cell pellets were resuspended in 200 μL TES buffer (20 mM Tris-HCl, pH 7.5, 10 mM EDTA pH 8.0 and 50 mM NaCl) containing lysozyme (Sigma, Taufkirchen, Germany) in a final concentration of 0.8 g/L prior to extraction. In additon, lysostaphin (Sigma) was added to a final concentration of 0.2 g/L, to promote Staphylococcal lysis, or mutanolysin (0.5 U/μL; Sigma) was added to lyse Streptococci and Enterococci and incubated one hour at 37°C. Candida albicans DNA extraction was achieved by beating the cell pellet with glass beads (425–600 microns, Sigma) using a Tissue Lyser (Qiagen, Hilden, Germany) at maximum speed for 5 minutes and the DNeasy Tissue Kit (Qiagen) with an overnight Proteinase K (10 mg/L) treatment. DNA from cotton swabs was prepared by DNeasy Tissue Kit (Qiagen) followed by manufacturer's protocol for the purification of genomic DNA from Gram+ bacteria.
A total of 930 gene segments of Staphylococcus spp., Streptococcus spp., Enterococcus spp., Proteus spp., Klebsiella spp., Stenotrophomonas sp., Enterobacter sp., Acinetobacter spp., E. coli, P. aeruginosa, and Candida albicans and genes encoding resistance against antimicrobials were selected from the literature and databases. Next they were compared by BLAST analysis to all other sequences available in the NCBI database in order to avoid regions homologous with genes of other bacterial species and Homo sapiens. Primers for the selected sequences were designed with the help of Primer3 search  in order to produce amplicons of 200 to 800 bp length (primer sequences and their characteristics are shown in Additional file 1).
Negative controls comprising genes of Homo sapiens, Dictyostelium discoideum, Mus musculus and Hordeum vulgaris and positive controls (16S rRNA genes of several bacterial species) were also included. PCR products were cloned following the detailed protocol described elsewhere . All cloned gene segments were amplified from the plasmids and diluted in 25% DMSO at a concentration of 200 mg/L. For printing the microarrays a BioRobotics Microgrid 610 spotter (Genomic Solutions, Huntingdon, UK) and Ultra-GAPS™ coated glass slides (Corning Incorporated, Corning, USA) were used and conditions for printing were as described . The complete array of 930 gene amplicons was spotted in 2 replicates per slide, each replicate containing 2 spots of the same probe, therefore totaling 4 replicates of each probe. Each lot of microarrays was quality controlled by hybridization with 2 μg genomic DNA of reference strains of pathogens present on the array.
For testing the Large-scale Multiplex PCR (LSplex) approach, 800 primer-pairs were selected out of the 930 available primer-pairs. (Additional file 1).
LSplex was carried out with different amounts of pure culture bacterial DNA templates. A primer mix was used with a final concentration of in general 0.02 μM of each primer. Reactions in a total volume of 50 μL were performed with 2 U either of Taq DNA polymerase (Fermentas, St. Leon-Rot, Germany) (standard LSplex) or Vent exo- DNA polymerase (New England Biolabs, Frankfurt am Main, Germany) (optimized LSplex). Standard LSplex using Taq DNA polymerase amplification reactions contained 1× KCl PCR buffer (Fermentas), 2 mM MgCl2, and 0.2 mM of dATP, dCTP, gGTP, and dTTP (Sigma). Optimized LSplex using Vent exo- DNA polymerase amplification reactions contained 1× ThermoPolBuffer (New England Biolabs), 4 mM MgCl2, and 0.2 mM of dATP, dCTP, dGTP, and dTTP (Sigma). The cycling was performed in Trio T3 Thermocycler (Biometra, Goettingen, Germany) using protocol comprising an initial denaturing step at 94°C for 3 minutes, followed by 35 cycles of 94°C for 30 s, 55°C for 45 s and 72°C for 1 min. LSplex products were spin purified with the QIAquick PCR Purification Kit (Qiagen) and eluted with nuclease-free water (pH 8).
LSplex amplified products were labelled with fluorophores after or during amplification.
Purified LSplex products in a volume of 20 μL were labelled with 3 μL of either Cy5-dCTP or Cy3-dCTP (Amersham Pharmacia Biotech Europe, Freiburg, Germany) by random priming using Klenow Polymerase (50 units) (BioPrime DNA labelling Kit, Invitrogen, Karlsruhe, Germany) in the presence of 0.12 mM dATP, dGTP and dTTP and 0.06 mM dCTP, in a total volume of 50 μL. After 2 hours incubation at 37°C, the reaction was stopped by adding 5 μL of 0.5 M EDTA.
Labelling during PCR was performed directly, by incorporation of fluorescent nucleotides, or indirectly by incorporation of aminoallyl-modified nucleotides and subsequent staining of the amplified products with amino reactive fluorescent dyes. The LSplex PCR protocols using Taq or Vent exo- DNA polymerases were modified as follows: 1) for direct labelling the amount of dTTP was reduced to 0.15 mM and 0.05 mM of Alexa Fluor 546-14-dUTP was added (ChromaTide Labelled Nucleotides, Molecular Probes, Willow Creek, US). 2) for indirect labelling the amount of dTTP was reduced to 0.13 mM and 0.07 mM aminoallyl-dUTP was added (ARES DNA labelling Kit, Invitrogen). Amino-modified amplified DNA was spin purified with the QIAquick PCR Purification Kit (Qiagen), eluted in 60 μL nuclease-free water (pH 8), analyzed by spectrophotometry, freeze-dried (Lyovac GT2, Finn-Aqua, Huerth, Germany), resuspended in 5 μL nuclease-free water and subsequently stained with Alexa-fluor 555 or 647.
For some comparative experiments, bacterial or fungal pure culture genomic DNAs have been fragmented by sonification (Bandelin, Berlin, Germany) to an average size of 1000 bp and then also been labelled by random priming and Klenow Polymerase as described above (1. Labelling after amplification).
Finally, labelled LSplex products and genomic DNA were spin purified with the QIAquick PCR Purification Kit (Qiagen) and eluted in 60 μL elution buffer (10 mM Tris/HCl, pH 8.0). The labelling efficiency was evaluated by calculating the approximate ratio of bases to dye molecules. This ratio and the amount of recovered labelled DNA was determined by measuring the absorbance of the undiluted purified LS-Plex products at 260 nm and the absorbance of the dye at its absorbance maximum using a lambda40 UV-spectrophotometer (PerkinElmer) and plastic disposable cuvettes for the range from 220 nm to 700 nm (UVette; Eppendorf, Hamburg, Germany).
In order to provide a complete evaluation of the LSplex protocol using genus-specific and high complexity primer mixes, amplified products were hybridized to a prototype microarray designed to identify pathogenic microorganisms involved in sepsis.
All amplifications were performed at least twice for each condition indicated. Each experiment described in the present study represent co-hybridization of two different DNA samples (LSplex amplified and genomic DNA for comparison) labelled with Cy3, Alexa 546 or Alexa 555 and Cy5 or Alexa 647 respectively. After purification, DNA samples labelled with distinguishable fluorophores were pooled and 10 μg of Salmon Sperm DNA were added. The whole yield of one amplification reaction was used for one labeling and hybridization experiment. The mixture was frozen in liquid nitrogen and freeze-dried (Lyovac GT2, Finn-Aqua, Huerth, Germany) in the dark. Hybridization was automatically performed with a TECAN hybridization station (HS400, TECAN, Salzburg, Austria). The microarray slides were prewashed with 5 × SSC then 110 μL of pre-hybridization buffer (25% Formamide, 5 × SSC, 0.1% SDS, 10 mg/ml BSA) were added and incubated for 30 minutes at 42°C with mild agitation. Lyophilized labelled DNA was resuspended in 110 μL of hybridization buffer (25% Formamide, 5 × SSC, 0.1% SDS), denatured for 3 minutes at 90°C, and injected into the hybridization chambers. Hybridization was performed for 18 hours at 42°C. After hybridization the arrays were automatically washed at 42°C in 1 × SSC/0.1% SDS, three cycles of 30 sec wash time and 2 min soak time, then in 0.1 × SSC/0.1% SDS, five cycles of 30 sec wash time and 2 min soak time, in 0.1 × SSC, four cycles of 30 sec wash time and 2 min soak time and finally dried at 30°C with N2 (270 MPa) for 5 min.
Hybridized arrays were scanned with a GenePix Personal Axon 4100A laser scanner (Axon Instruments, Union city, CA). Laser light of wavelengths at 532 and 635 nm were used to excite Cy3/Alexa546/Alexa555 dyes and Cy5/Alexa647 dyes, respectively. Fluorescent images were analyzed by the GenePixPro software (v.6.0) and Acuity (v.4.0) (Axon Instruments). The intensity of fluorescence of each spot was measured and the mean of 4 replicate spots per probe was calculated. Local background fluorescence was also measured and subtracted from the mean fluorescence. Spots displaying fluorescence greater than mean fluorescence of all spots on the array plus two times standard deviation (SD) were considered as positive. The hybridization was considered successful if spiked and control spots produced positive signals. Presence of more than 5 positive spots from same species was interpreted as positivity of the sample for this pathogen species. The fidelity limit of LSplex was defined as minimal amount of DNA necessary to obtain the hybridization pattern with >95% correspondence to one from the 2 μg genomic DNA.
We have recently established a prototype medium-density gene-segment DNA microarray for the detection and genetic profiling of pathogens causing bloodstream infections . The limit of detection of such medium-density gene-segment DNA microarrays was previously identified and ranged between 10 and 100 ng of DNA . This microarray has been extended for the present study to represent specific gene fragments of more than 20 of the most prominent causative agents of sepsis . As expected the sensitivity of detection was not influenced by the extension of the microarray. This was confirmed experimentally by hybridizing decreasing amounts of bacterial genomic DNA (Additional file 2). At the nanogram level a striking reduction in the detection power was observed and the number of detected genes was gradually reduced. In order to improve the sensitivity of detection we focused on the development of an amplification protocol by multiplex PCR.
To demonstrate specificity of LSplex the amplified DNA was fluorescently labelled and hybridized with the pathogen-specific microarray.
Comparison of LSplex labelling methods
Final amount of DNA1 (μg)
labelling after amplification with Klenow DNA polymerase
1.5 h LSplex, 15 min purification; 2 h labelling, 15 min purification
direct incorporation of fluorescent nucleotides during Lsplex
Alexa Fluor 546-14-dUTP(1:3)3
1.5 h LSplex, 15 min purification
incorporation of amino-modified nucleotides during Lsplex staining with Amino-reactive dye
aminoallyl-dUTP (1:2)4 stained after PCR with Alexa Fluor 555
1.5 h Lsplex, 15 min purification; 1 h post staining, 15 min purification
In order to reduce the number of steps in the labeling procedure and to shorten the labeling time we attempted to label DNA by incorporation of modified nucleotides concomitantly to the amplification procedure. Additionally, the impact of different labeling methods on general LSplex specificity and sensitivity upon microarray hybridization were evaluated.
The possibility of directly incorporating fluorescent nucleotides during LSplex amplification was examined. Chromatide Alexa Fluor 546-47-dUTPs were used for amplification but resulted in a rather weak incorporation ratio (one fluorescent nucleotide each 139 bases) (Table 1). The corresponding hybridization profile of S. aureus specific probes was barely more informative than the one obtained with 10 ng of non-amplified genomic DNA (Fig. 2D and 2B).
The indirect labeling of LSplex products by incorporating aminoallyl-modified nucleotides during amplification, with subsequent staining by amino reactive fluorescent dyes, was a potential alternative to Klenow labeling with one tagged nucleotide per 64 bases. Some probes displayed reduced fluorescence when compared to the fluorescence levels obtained with LSplex amplification plus Klenow labeling (Fig. 2E). For example the 2nd catalase probe (cata), the 4th coagulase (coa), bsaG, all capsular polysaccharide type 5 related genes (cap5), the gamma hemolysin (hglA), and the enterotoxines G (seg) and T15 (set15) showed weaker signals but were nonetheless identified as positive. Notably, LSplex amplification combined with indirect labeling granted a saving of one hour time compared to LSplex amplification with subsequent Klenow-labeling (Table 1).
Next we wished to determine the minimum amount of target DNA efficiently supporting the optimized LSplex amplification protocol. Agarose gel electrophoresis was unable to detect the LSplex amplification products from templates containing less than 10 ng of DNA (105–106 genomic equivalents) from several bacterial species (not shown).
The applicability of fluorescence-based DNA microarrays for the direct detection and characterization of pathogens depends on amplification of the target DNA . To compensate for the low sensitivity of such a multi-capture probe detection system, microarray analysis can be preceded of pathogen isolation and clonal expansion as a source for abundant DNA. A pre-amplification of the target DNA using a single-step Large Scale multiplex PCR (LSplex) could avoid such a time-consuming procedure. Although it is generally accepted that Multiplex PCR is potentially an ideal co-adjuvant for DNA microarrays in pathogen detection  there is, nevertheless, a limitation in the number of distinct PCR products that can be generated. Up to date, multiplex PCR was only combined with low-density microarray formats  and required either several parallel multiplex PCR reactions [5, 17, 23] or subsequent PCR steps [6, 24].
The complex nature of the interference between multiple primer pairs and targets [25, 26, 21] has limited conventional multiplex PCR in solution phase to a dozen of primer pairs [27–29]. Antagonizing the typical hindrances of highly multiplexed PCR requires innovative technical platforms as for instance performing on-chip amplification with primers attached to a solid support . Another alternative approach applied to solution-phase highly multiplex PCR has been the replacement of target-specific primers with universal ones. However, this process involves multiple steps starting with enzymatic digestion of the template DNA, ligation to adapters, primer extension and finally two subsequent PCR reactions [30, 31]. Such multi-step approaches are time consuming and prone to contamination  and therefore have not been recommended for bacteriological routine diagnostics.
The coupling of a pre-processing multiplex PCR to a medium-density microarray format, displaying hundreds of probes for identification and virulence profile typing of several pathogenic species, requires an unbiased multi-target amplification corresponding to several dozens of specific capture probes characterizing a certain pathogen. Since the presence and concentration of the particular pathogen in a microbiological laboratory is unknown, the multiplex reaction should include as many primer pairs as capture probes are present on the microarray. Moreover, the reaction has to cope with femtograms of pathogen template DNA whose GC-content can range between 30 and 70% and which is mixed with nanograms of human DNA.
We have shown high fidelity amplification of specific DNA targets using pools of species-specific mixes of up to 800 primer pairs, which improves the sensitivity of the microarray detection of pathogens by a factor of 2 to 3-logs.
By using S. aureus DNA (strain ATCC 29213) as template for amplification, we demonstrated that LSplex tolerates the increase in primer mix complexity until at least 800 primer pairs, without significant reduction in the profiling fidelity. LSplex products amplified from 10 and even 1 ng of template generated fluorescent signals as strong as those produced by micrograms of genomic DNA. Nevertheless, the comparison between LSplex hybridization profiles and the ones obtained with 2 μg of S. aureus showed that some probes were poorly amplified with the high complexity primer mixes. These probes produced a strong fluorescent signal when hybridized with genomic DNA but upon the LSplex protocol they were not considered as positive since their fluorescence difference was less then 2 times SD to the mean fluorescence intensity of the whole microarray. This problem of under-amplification of some targets might be circumvented by a specific increase in the concentration of primer pairs amplifying these specific targets . Such a balancing strategy for individual primer pairs could be applied on the whole set of primers, following a broad comparison between hybridization profiles generated by genomic DNA of many reference strains of all species of interest and the LSplex amplified products. In this way, the amount of all primer pairs responsible for low amplification yield can be adjusted. Cut-off values supporting the decision between positive or negative signals are determined empirically and should be specifically adapted to different experimental setups. Although several calculation methods are described in the literature, they basically represent subjective evaluation of the signal to noise ratio. Some authors consider a signal positive when it is only two or three times higher than the assay background [33, 16], while others take only signals ten times higher .
The fact that the LSplex protocol could allow concomitant amplification and labelling represents a valuable feature for future application in diagnostics since it should reduce the total time required for providing the identification of the pathogen. The optimized LSplex protocol using Vent exo- performed reliable amplification and efficient incorporation of amino-allyl modified nucleotides, allowing indirect labelling of PCR products. However, direct incorporation of fluorescent nucleotides during the multiplex PCR under the same amplification conditions led to weak label incorporation making the separate labelling step necessary to achieve a good profiling fidelity. Alternatively, the use of labelled primers can be employed for obtaining fluorescent multiplex PCR products .
LSplex successfully amplified less than 10 nanograms of DNA from several different pathogens (Gram-positive, Gram-negative and fungi) generating signals in general stronger and more specific than the ones generated with 2–5 micrograms of DNA. LSplex improved the specificity of the hybridization assay and enriched the sample for the target sequences present in the template. Interestingly, Candida albicans produced non-detectable signals when 2 μg of genomic DNA are used for hybridization. After amplification of 10 ng of C. albicans DNA by LSplex protocol resulted in the clear hybridization pattern (Fig. 4).
We would like to emphasize that a reduction in the limit of sensitivity of the LSplex protocol to picograms or to femtograms would be desirable in order to detected pathogens directly from every clinical, food or environmental samples.
In the last two years the publication of several reports referring to rapid identification of bacterial species by multiplex PCR coupled to microarrays detection [5, 35, 6, 17, 16, 36–38, 17, 3, 37, 3, 4, 23, 7] demonstrated the usefulness of this approach and the growing interest in implementing it in routine diagnostics. It also underlines the necessity of finding robust protocols for amplifying the target DNA before microarray analysis.
Whole genome amplification (WGA) is a powerful technique for the amplification of total genomic DNA (e.g. for comparative hybridization ). However, the random priming employed in WGA will amplify every DNA in the sample. Therefore, the application of WGA is difficult if the DNA of interest is contaminated by unwanted DNA. This is the case in clinical microbiology settings where DNA extracted directly from patient sample contains a significant amount of human DNA. LSplex would amplify selectively the underrepresented bacterial DNA. The large set of primer pairs is potentially able to amplify as many gene segments as probes are immobilized on the prototype microarray but in practice, it is supposed to only amplify the gene-segments specific to the pathogens present in the analyte.
In parallel, real-time PCR-based assays for identification of pathogens were proposed since the sensitivity is adequate for direct detection and quantification [10–12, 40–43]. However, the information level obtained by this approach is incomparably lower than the one provided by medium or high density microarray analyses. Real-time PCR has a reduced potential for multiplexing because the current availability of only four to five channels for the simultaneous non-overlapping detection of different fluorophores . For this reason, real-time PCR is in general confined to a mere species identification based on single sequence polymorphism [10, 43] or to confirm the presence of a suspected pathogen by using a reduced number of specific primer pairs [44, 45] eventually completed by the detection of a few genes related to antibiotic resistance [46, 45]. In contrast, microarrays offer the possibility to profile pathogens by providing information at the strain level , by detecting virulence factors and genes determining the antibiotic resistance . The LSplex amplification protocol is a promising co-adjuvant for pathogen profiling by microarray analysis since it increases sensitivity and the specificity of detection. It also presents the flexibility of using hundreds of primer pairs, whose sequences are exchangeable in function of the pathogens targeted in the microarray. The single-step LSplex protocol, allowing labelling during amplification, could represent one piece of the methodological mosaic in a future time-saving bacteriological diagnostic approach.
We are grateful to Georg Plum and Paul Higgins for helpful comments on the manuscript. This work was supported by the DFG, the DFG Gottfried-Wilhelm-Leibniz-Program, the GEW Stiftung, Cologne, Germany and Köln Fortune.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.