- Research article
- Open Access
Rapid identification of Burkholderia mallei and Burkholderia pseudomalleiby intact cell Matrix-assisted Laser Desorption/Ionisation mass spectrometric typing
BMC Microbiologyvolume 12, Article number: 229 (2012)
Burkholderia (B.) pseudomallei and B. mallei are genetically closely related species. B. pseudomallei causes melioidosis in humans and animals, whereas B. mallei is the causative agent of glanders in equines and rarely also in humans. Both agents have been classified by the CDC as priority category B biological agents. Rapid identification is crucial, because both agents are intrinsically resistant to many antibiotics. Matrix-assisted laser desorption/ionisation mass spectrometry (MALDI-TOF MS) has the potential of rapid and reliable identification of pathogens, but is limited by the availability of a database containing validated reference spectra. The aim of this study was to evaluate the use of MALDI-TOF MS for the rapid and reliable identification and differentiation of B. pseudomallei and B. mallei and to build up a reliable reference database for both organisms.
A collection of ten B. pseudomallei and seventeen B. mallei strains was used to generate a library of reference spectra. Samples of both species could be identified by MALDI-TOF MS, if a dedicated subset of the reference spectra library was used. In comparison with samples representing B. mallei, higher genetic diversity among B. pseudomallei was reflected in the higher average Eucledian distances between the mass spectra and a broader range of identification score values obtained with commercial software for the identification of microorganisms. The type strain of B. pseudomallei (ATCC 23343) was isolated decades ago and is outstanding in the spectrum-based dendrograms probably due to massive methylations as indicated by two intensive series of mass increments of 14 Da specifically and reproducibly found in the spectra of this strain.
Handling of pathogens under BSL 3 conditions is dangerous and cumbersome but can be minimized by inactivation of bacteria with ethanol, subsequent protein extraction under BSL 1 conditions and MALDI-TOF MS analysis being faster than nucleic amplification methods. Our spectra demonstrated a higher homogeneity in B. mallei than in B. pseudomallei isolates. As expected for closely related species, the identification process with MALDI Biotyper software (Bruker Daltonik GmbH, Bremen, Germany) requires the careful selection of spectra from reference strains. When a dedicated reference set is used and spectra of high quality are acquired, it is possible to distinguish both species unambiguously. The need for a careful curation of reference spectra databases is stressed.
Burkholderia (B.) pseudomallei and B. mallei are genetically closely related bacterial species that can cause fatal disease in humans and animals. B. pseudomallei is a facultative intracellular soil bacterium and the cause of melioidosis, which has the highest prevalence in the hot and humid regions of Southeast Asia, and Northern Australia. The infection can be acquired by contact with contaminated soil or water by inhalation or percutaneously. Human infections can be asymptomatic, can present with focal lesions that may affect almost any organ, or may be disseminate resulting in pneumonia and septicaemia. In certain regions of Asia melioidosis is a major cause of human morbidity and acute systemic melioidosis has a case fatality rate of up to 50% even if treated [1, 2]. Melioidosis has been described in wild animals, but also in farm and pet animals and can be spread by animal trade and transport . Both species are pathovars of a single genomospecies which was divided historically in two separate species due to their clinical impact and host tropism. B. thailandensis is the third closely related species of the so-called “Pseudomallei complex” which has been out-grouped from the species B. pseudomallei based on arabinose fermentation and its markedly lower pathogenicity. B. thailandensis and B. pseudomallei are soil bacteria that share the same geographical distribution.
B. mallei is a gram-negative, non-motile obligate pathogen and the causative agent of glanders and farcy in equines (horses, donkeys, mules). In horses, glanders primarily presents with purulent nasal discharge, inflammation of the mucous membranes of the upper respiratory tract, and poor general condition, whereas farcy is a chronic cutaneous disease with formation of nodules which may develop into ulcers. Equines are the only known reservoir. Contact with infected animals, ingestion of glanderous meat and exposure to aerosols can cause B. mallei infections in humans. Human glanders is highly lethal and resembles melioidosis. Chronic and latent infections can exacerbate into the acute form even after 15 years in both diseases. Both bacterial species are intrinsically resistant to many antibiotics including ampicillin and broad- and expanded-spectrum cephalosporines due to the production of a beta-lactamase . B. mallei and B. pseudomallei have been classified by the CDC as priority category B biological agents.
Isolation and microbiological identification of B. pseudomallei and B. mallei from clinical samples can take up to one week. Commercial biochemical test systems for B. mallei are not available and B. pseudomallei may be misidentified as Chromobacterium violaceum or other bacteria [5–7]. Latex agglutination using a monoclonal antibody was shown to be a valuable technique for the rapid identification of B. pseudomallei in positive blood cultures, but no commercial tests are available [8, 9]. Real-time PCR systems have been developed for diagnosing and differentiating as rapid alternatives to biochemical tests, but few have been validated on clinical samples [10–13].
Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) for identification of bacteria has become a useful tool for the rapid identification of bacteria (see  for a recent review). In some studies intact cell mass spectrometry (ICMS) showed better correlation to genetic markers than conventional morphological classification . The technique and modified procedures including a digestion step (‘shotgun mass mapping’) have been successfully used for subspecies-level classification in some species . Characteristic features of ICMS are simple sample preparation procedures of whole cells, spectrum acquisition in the mass range between approximately 2,000 and 15,000 Da and analysis based upon comparison of sample spectra with reference spectra. By statistical approaches, similarity between mass spectra can be exploited for the identification of microorganisms. MALDI-TOF MS was also established for identification of non-fermenting gram-negative bacteria isolated from cystic fibrosis patients in Brazil . Patients with cystic fibrosis suffer primarily under infections with Pseudomonads, but Burkholderiae play also an important role. In the Brazilian study a comprehensive number of Burkholderia species was included and could be identified correctly in most cases. However, neither B. pseudomallei nor B. mallei were among the clinical isolates tested. Sporadic cases of melioidosis in cystic fibrosis patients have been described in the literature and seem to be an emerging problem [18–22]. Due to increased travel activity, international trade, climate change, and the potential threat of bioterrorist attacks infections caused by B. pseudomallei and B. mallei can become a serious problem.
The aim of this study was to evaluate the potential benefit of MALDI-TOF MS for the rapid and reliable identification and differentiation of B. pseudomallei and B. mallei.
Construction of a reference database
A custom made set of 34 reference spectra, which are called main spectra (MSP) in the MALDI Biotyper terminology (Bruker Daltonik GmbH, Bremen, Germany), was generated and used as the basis for all further calculations. This reference spectra set included all strains listed in Table 1 (B. mallei and B. pseudomallei) and additionally samples from B. ambifaria (DSM 16087), B. cenocepacia (ATCC BAA-245), B. dolosa (DSM 16088), B. glathei (ATCC 29195), B. multivorans (DSM 13243), B. stabilis (DSM 16586), and B. thailandensis (ATCC 700388). This set of 34 samples will be referred to as the ‘custom reference set’. The full set of MALDI Biotyper reference database entries will be referred to as ‘MALDI Biotyper reference set’. In a first analysis, spectra of the custom reference set were queried against a combined database composed of the custom reference set of 34 Burkholderia samples and the MALDI Biotyper reference set. For every queried spectrum, MALDI Biotyper software generates a score-based ranked list of organisms. The organism with the highest score is ranked first (‘top hit’) and its species is taken as the result of the query. MSP scores are calculated by an algorithm that compares the masses of a tested sample with those present in the MSP of the spectra collection by agglomeration of scores for every mass and performing a normalisation that will result in a final value between 0 (unrelated) and 1000 (identical). For convenience, the result is given as logarithmic value smaller than or equal to 3.0. Masses of the tested spectrum will be scored in a weighted fashion depending on their location within a narrower or a wider mass tolerance window centred on the masses of the MSP. Additionally, the score for every coinciding mass of the tested spectrum will be weighted according to the frequency with which the corresponding mass of the MSP has been found in the single spectra that were used for the construction of the MSP. Thus, scores carry information on the number of coinciding masses found in the tested spectrum and the MSP, the mass aberration that is observed between the corresponding masses of the tested spectrum and the MSP and the reproducibility of the respective masses of the MSP. Cut-off values for reliable species determination cannot be theoretically calculated and have to be determined empirically. According to the manufacturer, experience has shown that scores exceeding 2.0 will allow reliable genus identification and species identification in the majority of cases. Scores calculated for all spectra of the custom reference set among them are summarized in Figure 1. In the hit lists of all tested specimen, the highest-ranking entry represented the same species as the tested specimen, indicating that, within the given database, the standard MALDI Biotyper identification procedure reliably allows the determination of Burkholderia species including the differentiation between B. mallei and B. pseudomallei. Even though species identification was correct in all cases, the distribution of scores in Figure 1 gave rise to concern about the reliability of the discrimination of the three members of the Pseudomallei group: B. thailandensis produced relatively high scores with some of the B. mallei and B. pseudomallei samples, and B. pseudomallei generally produced relatively high scores with B. mallei. Therefore, a set of B. mallei and B. pseudomallei samples was additionally cultivated and processed in two different laboratories and queried using the custom reference set as database in order to challenge the identification procedure. It is known that cultivation conditions can influence the outcome of ICMS experiments. In an interlaboratory comparison that was performed in three laboratories with B. thailandensis we had observed that cultivation on different growth media (Columbia 5% Sheep Blood agar (CSB), chocolate agar, and McConkey agar) and different cultivation periods (24, 48 and 72 h) had a notable influence on the scores in the identification procedure (data not shown). To avoid any variance caused by differing growth conditions, all B. mallei and B.pseudomallei were grown on CSB and the cultivation period of 48 h was strictly observed.
Discrimination of B. mallei and B. pseudomallei
Scores between B. mallei samples listed in Table 1 ranged between 2.56 and 2.94, whereas those between B. pseudomallei samples ranged between 2.25 and 2.89. For B. mallei samples, the score range over 2.72 was completely reserved for correct species assignments and the top scores of all isolates reached this threshold. Due to the stronger variation of B. pseudomallei, such a well-defined threshold for correct species assignments could not be defined for this species.
As MSP will usually not be constructed for routine samples, the identification procedure with MALDI Biotyper was repeated with single spectra of the B. mallei and B. pseudomallei samples from Table 1. The results were very similar to those obtained with MSP. For B. mallei samples, scores between 2.60 and 2.93 were observed, whereas B. pseudomallei were recognized with scores in the range from 2.57 to 2.92. The top-ranking hit of the hit-list correctly indicated the species of all queried samples. Scores of all top-ranking hits exceeded 2.8. Construction of a score-based dendrogram of B. mallei and B. pseudomallei samples (Figure 2) with MALDI Biotyper software resulted in the expected clustering of the two species. Interestingly, the B. pseudomallei type strain ATCC 23343 separated notably from other B. pseudomallei representatives. This was at least in part caused by the appearance of two series of masses between 5,000 and 5,084 Da and 8,500 and 8,565 Da which were not detected in any of the other samples (Figure 3). The observation of multiple mass differences of 14 Da in these series suggests that they were caused by multiple methylations being specific for this strain. The mass series reproducibly appeared in all single spectra used to calculate the MSP of the B. pseudomallei strain ATCC 23343 and were also observed in independent replicates of the spectra with a freshly cultivated specimen. The identity of the modified molecule is unknown. A dendrogram was constructed from the MSP of the B. mallei and B. pseudomallei strains listed in Table 1 and the Burkholderia, Chromobacterium, and Rhodococcus species from Table 2 which were added from the MALDI Biotyper database (Figure 4). As expected, score-based distances between B. mallei and B. pseudomallei were smaller than between the other Burkholderia species and B. mallei/B. pseudomallei and B. thailandensis formed a distinct group which was separated from the other species of the Burkholderia genus.
The distance relations of B. mallei and B. pseudomallei were further analysed after transfer of the mass lists into statistical programming language R. Based on the mass alignment, a cluster analysis was performed, a distance matrix was calculated, and the distances within and between the B. mallei and B. pseudomallei strains were calculated. To test the influence of the peak intensities on the clustering behavior, cluster analysis was performed with the quantitative and qualitative data. For the latter purpose the quantitative alignment containing the intensities of every mass peak was transformed into a qualitative binary table by marking the absence or presence of a mass with 0 and 1, respectively. From both tables, distance matrices were calculated and visualized as Sammon-plots (Figure 5). For qualitative and quantitative data the average normalized distances between B. mallei strains were smaller than between B. pseudomallei strains (0.57 vs. 0.73 for the binary data and 0.46 vs. 0.71 when peak intensities of the spectra were included) confirming the score-based clustering in Figure 2 that suggests a higher variation among B. pseudomallei than among B. mallei strains. As a measure for the separation of the two species, the weighted ratio between the distances of B. mallei and B. pseudomallei strains within the species and the distances between the species was calculated. Transition from qualitative to quantitative data showed slight improvement (0.82 vs. 0.74) in the species separation indicating that peak intensities are relevant for the discrimination of the two species and should not be neglected. Cluster analysis with the quantitative data using the k-means algorithm indicated the presence of two clusters which were congruent with the two Burkholderia species whereas cluster analysis based on the qualitative data failed to do so. On basis of the qualitative data, which weights every mass equally for the calculation of the distance, B. pseudomallei ATCC 23343 was notably separated from all other spectra, most probably because of the multiple modifications shown in Figure 3.
As some B. mallei and B. pseudomallei specimen from the reference spectrum set produced quite high scores with the respective other species, it was essential to test the practicability of the custom reference set in a routine laboratory setting with samples prepared in a different laboratory. The panel of samples used for this test (Table 3, the ‘test set’) only partially overlapped with the custom reference set (Table 1) so that not only inter-laboratory variation was tested but also the ability of the custom reference set to discriminate newly appearing isolates like those from a glanders outbreak in the United Arabic Emirates in 2004.
Using the complete custom reference set as database a number of misclassifications of the test set isolates occurred. Considering the distribution of scores (Figure 1) and the distance relations between B. mallei and B. pseudomallei (Figure 5), this was not unexpected and obviously a consequence of the indiscriminate inclusion of all available B. mallei and B. pseudomallei samples into the custom reference set. Classification could be substantially improved by selecting combinations of isolates of B. mallei and B. pseudomallei to form a dedicated reference set which is optimized for the discrimination of the two species. To screen the complete custom reference set of B. mallei and B. pseudomallei for appropriate combinations of isolates, the outcome of a database query was simulated with all permutations of up to four members of each species. The smallest reference group yielding error-free results was composed of two B. mallei (M1, NCTC10247) and three B. pseudomallei (EF15660, PITT 225A, NCTC01688) isolates which are highlighted by an asterisk in Table 1. Not surprisingly, these isolates located close to the centers of their respective species in the Sammon plot visualization of the distance matrix (Figure 5).
Finally, multivariate statistics on basis of the four different statistical approaches (Genetic Algorithm, Support Vector Machine, Supervised Neural Network, Quick Classifier) available in ClinProTools 3.0 showed that B. mallei and B. pseudomallei could be well separated with cross validation results ranging between 98.95% and 100.00% (data not shown). Principal Component Analysis (PCA) carried out with ClinProTools 3.0 (Figure 6) further confirmed the separation of both species and also the broader distribution of B. pseudomallei in comparison with B. mallei.
Identification of taxon-specific biomarker ions
Mass spectra of the reference spectrum set were analysed for species-specific masses which may be used for species identification independent of the score values considered so far. For that purpose the mass lists of the MSP generated with MALDI Biotyper software were evaluated in detail. An alignment of all masses occurring in the spectra was constructed as a table in which every column represented the mass spectrum of a sample and every row the intensity of a mass occurring in a certain mass range. The alignment contained a total of 350 masses. One of them, 4,410 Da, was found in the spectra of all 34 samples and also in every one of the single spectra underlying the MSPs so that this mass can be considered a reliable indicator of the set of nine Burkholderia species that was analysed (B. mallei, B pseudomallei, B, thailandensis, B. ambifaria, B. cenocepacia, B. dolosa, B. glathe, B. multivorans, B. stabilis). Seven more masses (3,655 [doubly charged 7,309], 5,195, 6,551, 7,169, 7,309, 8,628 and 9,713 Da) were present in all B. mallei and B. pseudomallei samples but also in one or more of the other Burkholderia species. Considering the close relation of B. thailandensis with B. mallei and B. pseudomallei, mass 9,713 Da is of interest, which was specific for all B. mallei, B. pseudomallei, and B. thailandensis samples, i.e. the Pseudomallei group. Finally, 6,551 Da was present in all B. mallei and B. pseudomallei samples but in none of the other species, making it an effective discriminator between the B. mallei/pseudomallei group and the other representatives of the genus Burkholderia. Concerning the distinction of B. mallei and B. pseudomallei, statistical analysis with ClinProTools 3.0 software revealed a number of masses with significant class separation between the two species based on peak intensity. Most significant separation could be obtained based on the masses 7,553 and 5,794 which differ significantly in intensity between the two species.
In recent years MALDI-TOF MS has been introduced in microbiological laboratories as a time saving diagnostic approach supplementing morphological, biochemical, and molecular techniques for identification of microbes . In several studies the comparability with conventional identification procedures was assessed with generally good correlation, but discordances were seen on the species and even on the genus level [24, 25]. This proteomic profiling approach was successfully applied in routine identification of bacterial isolates from blood culture with the exception of polymicrobial samples and streptococci . The identification of Burkholderia spp. and other non-fermenting bacteria using MALDI-TOF MS was investigated in cystic fibrosis (CF) patients as Burkholderia spp. (mainly of the cepacia-complex) cause a relevant number of life-threatening infections in these patients [27–29]. It was demonstrated that MALDI-TOF MS is a useful tool for rapid identification in the routine laboratory. B. pseudomallei can be the cause of melioidosis in CF patients and travelers to tropical regions, but this bacterium and the closely related species B. mallei was not included in previous MALDI-TOF MS studies [18–22, 30, 31]. Natural catastrophes like the tsunami in Indonesia (2004) and occasional flooding in other tropical regions resulted in elevated incidence of melioidosis and several cases among travelers and tourists [32–36]. B. mallei and B. pseudomallei are biological agents which further underlines the need for rapid detection tools. Identification of Burkholderia ssp. and distinction of B. mallei and B. pseudomallei from other species was feasible. This was also true for the closely related bacterium B. thailandensis. All strains grew well within 48 hours and could then be readily prepared for MALDI-TOF MS.
Due to the close relationship of B. mallei and B. pseudomallei, it was not surprising that the search for species-identifying biomarker ions discriminating these species was not successful. Obviously, more complicated mass signatures are required for this purpose and, as we could show after separate statistical evaluation of qualitative and quantitative data, peak intensities also play a crucial role for the discrimination of B. mallei and B. pseudomallei. However, group-specific masses like 9,713 Da, standing for the Pseudomallei complex (B. mallei/B. pseudomallei/B. thailandensis) or 6,551, exclusively found in B. mallei and B. pseudomallei may be of use for the discrimination of these three species.
For the identification of B. mallei and B. pseudomallei samples under routine laboratory conditions, it was necessary to reduce the reference spectrum set to avoid misclassifications. Interestingly, the reference spectrum set optimized for spectrum-based discrimination neither contained the type strain ATCC 23344T (B. mallei) nor ATCC 23343T (B. pseudomallei). One reason for the exclusion of ATCC 23343 could be the occurrence of two peak series with repeating mass increments of 14 Da most probably representing polymethylated proteins. This strain has been shown to have unique immunological features. In an immunization experiment with a panel of 14 B. pseudomallei strains, ATCC 23343 induced monoclonal antibodies in mice which did not cross-react with any of the other B. pseudomallei strains . These peculiarities may indicate that this type strain has been genetically modified by frequent subcultivation or misuse of media. To our knowledge, similar modifications which may have an impact on classification of bacteria have not been reported to-date. These series were specific for the isolate and also for two molecules within the observed mass range.
In this study we have demonstrated that isolates of the closely related species B. mallei and B. pseudomallei can be identified using MALDI-TOF MS. Dangerous and cumbersome handling under BSL 3 conditions can be minimized by inactivation of the isolates with ethanol and subsequent MALDI-TOF MS analysis that requires much less time than nucleic acid amplification methods . The reference spectra exhibited a higher homogeneity among B. mallei than among B. pseudomallei. The type strain of B. pseudomallei ATCC 23343 was isolated decades ago and separated from the other B. pseudomallei specimens in the dendrograms which is probably due to polymethylation as indicated by two intensive series of mass increments of 14 Da. To our knowledge, this is the first report of such a modification in whole cell MALDI-TOF MS spectra of microorganisms. As expected for closely related species, especially when one of them, B. pseudomallei, displays the broad distribution of MALDI-phenotypes that was observed, the identification process requires the selection of spectra from representatives of the centers of their respective distance distributions. Uncritical inclusion of all available samples as references in a library was counterproductive for the identification process. Only by selecting an appropriate set of reference spectra (Table 3) it was possible to identify all strains. This underlines the need for careful curation of reference spectra databases used for the identification of microorganisms.
A comprehensive collection of B. mallei and B. pseudomallei strains, referred to as the ‘reference set’, were tested (Table 1) and compared with spectra of closely related and other clinically relevant bacteria (Table 2) included in the MALDI Biotyper Reference Library (version 3.0, Bruker Daltonics, Bremen, Germany). Strain identity was confirmed using Gram staining, motility testing, and real-time PCR assays targeting fliC and fliP as described previously [11, 12], and a species-specific DNA-microarray . Strains were obtained from the Friedrich-Loeffler-Institut, Jena, Germany and the Bundeswehr Institute of Microbiology in Munich, Germany. Strain Dubai 7 was kindly provided by the Central Veterinary Reference Laboratory, Dubai, UAE. Some strains originated from the Robert Koch Institute in Berlin, Germany, that coordinated a project of the European Union for the “Establishment of Quality Assurances for Detection of Highly Pathogenic Bacteria of Potential Bioterrorism Risk”. Spectra from the set of strains enlisted in Table 3, referred to as ‘test set’, were recorded with an Autoflex mass spectrometer (Bruker) in a second laboratory and queried against the reference set to test for robustness and inter-laboratory variation.
Intact cell mass spectrometry (ICMS)
Samples were prepared as described previously [16, 40]. Briefly, the bacteria were cultivated under BSL 3 conditions on a nutrient blood agar containing 3% glycerol at 37°C for 48 hours. Specimens from single colonies were thoroughly suspended in 300 μl water and precipitated by addition of 900 μL ethanol (98% v/v). This treatment inactivated the bacteria as was demonstrated by growth controls and the specimens could be further tested under BSL 1 conditions. After sedimentation for five minutes at 10,000 g min-1 the supernatant was carefully removed and the sediment suspended in 50 μL of 70% (v/v) formic acid. After mixing with 50 μL acetonitrile, the suspension was centrifuged as described above and the supernatant transferred to a fresh tube. 1.5 μL of the extract was spotted onto a steel MALDI target plate and allowed to dry at ambient temperature. Finally, the dried extract was overlaid with 2 μL of a saturated solution of α-Cyano-4-hydroxycinnamic acid in 50% acetonitrile/2.5% trifluoroacetic acid as matrix and was again allowed to dry. A custom-made database of reference spectra was generated using the MALDI Biotyper software (version 2.0, Bruker Daltonik GmbH, Bremen, Germany) following the guidelines of the manufacturer. Each sample was spotted onto six target spots of a steel MALDI target plate. Spectra were acquired with an UltraflexTM I instrument (Bruker Daltonik GmbH) in the linear positive mode in the range of 2,000 to 20,000 Da. Acceleration Voltage was 25 kV and the instrument was calibrated in the range between 3,637.8 and 16,952.3 Da with Bacterial Test Standard calibrant (BTS, Bruker Daltonik GmbH). Four single mass spectra with 500 shots each were acquired from each spot and a reference spectrum calculated from the 24 single spectra. Reference spectra contained the parameters peak mass and intensity and additional information on the reproducibility of the mass peaks, i.e. the frequency of occurrence of every peak in the underlying 24 single spectra. Reference spectra were generated within the mass range of 2,000 to 20,000 Da with the default parameter settings in the MALDI Biotyper software. The number of peaks was limited to 100 per reference spectrum and all peaks of a reference spectrum were normalized to the most intense peak with an intensity of 1.0. The minimum frequency of occurrence within the 24 single spectra was set to 50% for every mass. Peaklists of reference spectra were exported for further evaluation in the statistical programming language R.
To test the inter-laboratory variation and the robustness of the classification by using MALDI Biotyper software, a set of B. mallei and B. pseudomallei test samples from a second laboratory (Table 3) was queried against the reference spectra set described above. These spectra were recorded at the Bundeswehr Institute of Microbiology with an Autoflex mass spectrometer (Bruker Daltonik GmbH, Bremens). Spectra were generated for the test set in the same way as for the reference set. A query of all test samples was performed and the resulting scores were transferred into a data matrix, the ‘score matrix’, in which every column represented a member of the reference set and every line a test sample. The power of certain combinations of representatives of the two classes within the reference set to discriminate samples of the test set was tested as follows: the columns representing the combination of reference spectra to be evaluated were selected from the score matrix and every member of the test set was classified by assigning it to the class of the member of the reduced reference set with the highest score. The number of correct and incorrect assignments was then calculated for the test set. This procedure simulates a MALDI Biotyper query with a reduced number of spectra in the reference database.
Transfer of data into statistical programming language R
Peak lists of the reference spectra generated by the MALDI Biotyper software were converted into a format compatible with the R-package caMassClass which offers a number of functions to evaluate mass spectra  for further statistical analysis in R version 2.10.1. available at the R-project homepage . Peak lists were aligned by the msc.peaks.align command of caMassClass and transformed into a binary mass table where rows represented all unique masses of the aligned spectra set and every column represented the spectrum of one sample. The size of the mass ranges defining a unique peak in the alignment, designated as bin size, was restricted to a maximum of 2,000 ppm. Among other features, the algorithm of the msc.peaks.align command minimizes the bin size in the given range, maximizes the space between bins and ensures that no two peaks of the same spectrum are in the same bin. For the calculation of qualitative data, the presence of the respective mass in the spectrum of a sample was marked with 1, absence with 0, i.e. all mass intensities were removed. These tables were the basis for the calculation of distances (R-routine ‘dist’, parameter ‘binary’ for the distance measure) which were used for the construction of cladograms, Sammon plots , and k-means cluster analysis using the R-routines ‘hclust’ (parameter ‘ward’ for the agglomeration method) , ‘sammon’ (used with default settings) and ‘kmeans’ (three initial cluster centers, maximum of 100 iterations, Hartigan-Wong algorithm ).
Statistical analysis with ClinProTools software
Raw spectra from the specimens in Table 3 were imported into ClinProTools 3.0 software for statistical analysis. Each species was represented by 20 to 24 spectra to cover measurement variability. The multiple spectra of multiple species were imported as a “class” for the respective species. ClinProTools preformed a normalization and recalibration of mass spectra before further analysis, thereby reducing measurement variability effects significantly. Peak picking was performed based on the overall average spectrum over the whole mass range (signal to noise threshold of 5). Further spectra processing parameters were: baseline correction (convex hull), resolution (300 ppm), smoothing (Savitzky Golay, 5 cycles with 2 m/z width),
Multivariate statistical analyses were performed using the four supervised algorithms and PCA which are implemented in ClinProTools. For the Genetic Algorithm, models with maximum 5 peaks and 50 generations were calculated and k-nearest neighbor (kNN) classification was performed with 5 neighbors. Also for Support Vector Machine the maximum number of peaks was set to 5 and kNN classification was performed with 5 neighbors. Supervised Neural Network was calculated with automated optimization of peak number, maximum 25. For the Quick Classifier, a maximum number of differentiating peaks of 25 was allowed; selection of peaks was based on ranking in t-test. For PCA, “level” scaling was selected.
Dance DA: Melioidosis: the tip of the iceberg?. Clin Microbiol Rev. 1991, 4 (1): 52-60.
Deris ZZ, Hasan H, Siti Suraiya MN: Clinical characteristics and outcomes of bacteraemic melioidosis in a teaching hospital in a northeastern state of Malaysia: a five-year review. J Infect Dev Ctries. 2010, 4 (7): 430-435.
Sprague LD, Neubauer H: Melioidosis in animals: a review on epizootiology, diagnosis and clinical presentation. J Vet Med B Infect Dis Vet Public Health. 2004, 51 (7): 305-320. 10.1111/j.1439-0450.2004.00797.x.
Estes DM, Dow SW, Schweizer HP, Torres AG: Present and future therapeutic strategies for melioidosis and glanders. Expert Rev Anti Infect Ther. 2010, 8 (3): 325-338. 10.1586/eri.10.4.
Dance DA, Wuthiekanun V, Naigowit P, White NJ: Identification of Pseudomonas pseudomallei in clinical practice: use of simple screening tests and API 20NE. J Clin Pathol. 1989, 42 (6): 645-648. 10.1136/jcp.42.6.645.
Inglis TJ, Chiang D, Lee GS, Chor-Kiang L: Potential misidentification of Burkholderia pseudomallei by API 20NE. Pathology. 1998, 30 (1): 62-64. 10.1080/00313029800169685.
Lowe P, Engler C, Norton R: Comparison of automated and nonautomated systems for identification of Burkholderia pseudomallei. J Clin Microbiol. 2002, 40 (12): 4625-4627. 10.1128/JCM.40.12.4625-4627.2002.
Samosornsuk N, Lulitanond A, Saenla N, Anuntagool N, Wongratanacheewin S, Sirisinha S: Short report: evaluation of a monoclonal antibody-based latex agglutination test for rapid diagnosis of septicemic melioidosis. AmJTrop Med Hyg. 1999, 61 (5): 735-737.
Steinmetz I, Reganzerowski A, Brenneke B, Haussler S, Simpson A, White NJ: Rapid identification of Burkholderia pseudomallei by latex agglutination based on an exopolysaccharide-specific monoclonal antibody. J Clin Microbiol. 1999, 37 (1): 225-228.
Thibault FM, Valade E, Vidal DR: Identification and discrimination of Burkholderia pseudomallei, B. mallei,and B. thailandensis by real-time PCR targeting type III secretion system genes. J Clin Microbiol. 2004, 42 (12): 5871-5874. 10.1128/JCM.42.12.5871-5874.2004.
Tomaso H, Pitt TL, Landt O, Al Dahouk S, Scholz HC, Reisinger EC, Sprague LD, Rathmann I, Neubauer H: Rapid presumptive identification of Burkholderia pseudomallei with real-time PCR assays using fluorescent hybridization probes. Mol Cell Probes. 2005, 19 (1): 9-20. 10.1016/j.mcp.2004.08.001.
Tomaso H, Scholz HC, Al Dahouk S, Eickhoff M, Treu TM, Wernery R, Wernery U, Neubauer H: Development of a 5'-nuclease real-time PCR assay targeting fliP for the rapid identification of Burkholderia mallei in clinical samples. Clin Chem. 2006, 52 (2): 307-310.
Tomaso H, Scholz HC, Al Dahouk S, Pitt TL, Treu TM, Neubauer H: Development of 5' nuclease real-time PCR assays for the rapid identification of the burkholderia mallei//burkholderia pseudomallei complex. Diagn Mol Pathol. 2004, 13 (4): 247-253. 10.1097/01.pdm.0000137099.36618.cc.
Giebel R, Worden C, Rust SM, Kleinheinz GT, Robbins M, Sandrin TR: Microbial fingerprinting using matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI-TOF MS) applications and challenges. Adv Appl Microbiol. 2010, 71: 149-184.
Marklein G, Josten M, Klanke U, Muller E, Horre R, Maier T, Wenzel T, Kostrzewa M, Bierbaum G, Hoerauf A, et al: Matrix-assisted laser desorption ionization-time of flight mass spectrometry for fast and reliable identification of clinical yeast isolates. J Clin Microbiol. 2009, 47 (9): 2912-2917. 10.1128/JCM.00389-09.
Sauer S, Freiwald A, Maier T, Kube M, Reinhardt R, Kostrzewa M, Geider K: Classification and identification of bacteria by mass spectrometry and computational analysis. PLoSONE. 2008, 3 (7): e2843-
Fernandez-Olmos A, Garcia-Castillo M, Morosini MI, Lamas A, Maiz L, Canton R: MALDI-TOF MS improves routine identification of non-fermenting Gram negative isolates from cystic fibrosis patients. J Cyst Fibros. 2012, 11 (1): 59-62. 10.1016/j.jcf.2011.09.001.
Barth AL, de Abreu ESFA, Hoffmann A, Vieira MI, Zavascki AP, Ferreira AG, da Cunha LG, Albano RM, de Andrade Marques E: Cystic fibrosis patient with Burkholderia pseudomallei infection acquired in Brazil. J Clin Microbiol. 2007, 45 (12): 4077-4080. 10.1128/JCM.01386-07.
Corral DM, Coates AL, Yau YC, Tellier R, Glass M, Jones SM, Waters VJ: Burkholderia pseudomallei infection in a cystic fibrosis patient from the Caribbean: a case report. Can Respir J. 2008, 15 (5): 237-239.
Holland DJ, Wesley A, Drinkovic D, Currie BJ: Cystic Fibrosis and Burkholderia pseudomallei Infection: An Emerging Problem?. Clin Infect Dis. 2002, 35 (12): e138-e140. 10.1086/344447.
Schulin T, Steinmetz I: Chronic melioidosis in a patient with cystic fibrosis. J Clin Microbiol. 2001, 39 (4): 1676-1677. 10.1128/JCM.39.4.1676-1677.2001.
Visca P, Cazzola G, Petrucca A, Braggion C: Travel-associated Burkholderia pseudomallei Infection (Melioidosis) in a patient with cystic fibrosis: a case report. Clin Infect Dis. 2001, 32 (1): E15-E16. 10.1086/317528.
Seng P, Rolain JM, Raoult D, Brouqui P: Detection of new Anaplasmataceae in the digestive tract of fish from southeast Asia. Clin Microbiol Infect. 2009, 15 (Suppl 2): 88-90.
Ferreira L, Vega S, Sanchez-Juanes F, Gonzalez M, Herrero A, Muniz MC, Gonzalez-Buitrago JM, Munoz JL: [Identifying bacteria using a matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometer. Comparison with routine methods used in clinical microbiology laboratories]. Enferm Infecc Microbiol Clin. 2010, 28 (8): 492-497. 10.1016/j.eimc.2009.12.009.
Risch M, Radjenovic D, Han JN, Wydler M, Nydegger U, Risch L: Comparison of MALDI TOF with conventional identification of clinically relevant bacteria. Swiss Med Wkly. 2010, 140: w13095-
La Scola B, Raoult D: Direct identification of bacteria in positive blood culture bottles by matrix-assisted laser desorption ionisation time-of-flight mass spectrometry. PLoS One. 2009, 4 (11): e8041-10.1371/journal.pone.0008041.
Degand N, Carbonnelle E, Dauphin B, Beretti JL, Le Bourgeois M, Sermet-Gaudelus I, Segonds C, Berche P, Nassif X, Ferroni A: Matrix-assisted laser desorption ionization-time of flight mass spectrometry for identification of nonfermenting gram-negative bacilli isolated from cystic fibrosis patients. J Clin Microbiol. 2008, 46 (10): 3361-3367. 10.1128/JCM.00569-08.
Minan A, Bosch A, Lasch P, Stammler M, Serra DO, Degrossi J, Gatti B, Vay C, D'Aquino M, Yantorno O, et al: Rapid identification of Burkholderia cepacia complex species including strains of the novel Taxon K, recovered from cystic fibrosis patients by intact cell MALDI-ToF mass spectrometry. Analyst. 2009, 134 (6): 1138-1148. 10.1039/b822669e.
Vanlaere E, Sergeant K, Dawyndt P, Kallow W, Erhard M, Sutton H, Dare D, Devreese B, Samyn B, Vandamme P: Matrix-assisted laser desorption ionisation-time-of of-flight mass spectrometry of intact cells allows rapid identification of Burkholderia cepacia complex. J Microbiol Methods. 2008, 75 (2): 279-286. 10.1016/j.mimet.2008.06.016.
Currie BJ: Melioidosis: an important cause of pneumonia in residents of and travellers returned from endemic regions. Eur Respir J. 2003, 22 (3): 542-550. 10.1183/09031936.03.00006203.
O'Carroll MR, Kidd TJ, Coulter C, Smith HV, Rose BR, Harbour C, Bell SC: Burkholderia pseudomallei: another emerging pathogen in cystic fibrosis. Thorax. 2003, 58 (12): 1087-1091. 10.1136/thorax.58.12.1087.
Christenson B, Fuxench Z, Morales JA, Suarez-Villamil RA, Souchet LM: Severe community-acquired pneumonia and sepsis caused by Burkholderia pseudomallei associated with flooding in Puerto Rico. Bol Asoc Med P R. 2003, 95 (6): 17-20.
Ciervo A, Mattei R, Cassone A: Melioidosis in an Italian tourist injured by the tsunami in Thailand. J Chemother. 2006, 18 (4): 443-444.
Nieminen T, Vaara M: Burkholderia pseudomallei infections in Finnish tourists injured by the December 2004 tsunami in Thailand. Euro Surveill. 2005, 10 (3): E050303 050304-
Svensson E, Welinder-Olsson C, Claesson BA, Studahl M: Cutaneous melioidosis in a Swedish tourist after the tsunami in 2004. Scand J Infect Dis. 2006, 38 (1): 71-74. 10.1080/00365540500264738.
Wuthiekanun V, Chierakul W, Rattanalertnavee J, Langa S, Sirodom D, Wattanawaitunechai C, Winothai W, White NJ, Day N, Peacock SJ: Serological evidence for increased human exposure to Burkholderia pseudomallei following the tsunami in southern Thailand. J Clin Microbiol. 2006, 44 (1): 239-240. 10.1128/JCM.44.1.239-240.2006.
Feng SH, Tsai S, Rodriguez J, Newsome T, Emanuel P, Lo SC: Development of mouse hybridomas for production of monoclonal antibodies specific to Burkholderia mallei and Burkholderia pseudomallei. Hybridoma (Larchmt). 2006, 25 (4): 193-201. 10.1089/hyb.2006.25.193.
Lasch P, Nattermann H, Erhard M, Stammler M, Grunow R, Bannert N, Appel B, Naumann D: MALDI-TOF mass spectrometry compatible inactivation method for highly pathogenic microbial cells and spores. Anal Chem. 2008, 80 (6): 2026-2034. 10.1021/ac701822j.
Schmoock G, Ehricht R, Melzer F, Rassbach A, Scholz HC, Neubauer H, Sachse K, Mota RA, Saqib M, Elschner M: DNA microarray-based detection and identification of Burkholderia mallei, Burkholderia pseudomallei and Burkholderia spp. Mol Cell Probes. 2009, 23 (3–4): 178-187.
Karger A, Ziller M, Bettin B, Mintel B, Schares S, Geue L: Determination of serotypes of Shiga toxin-producing Escherichia coli isolates by intact cell matrix-assisted laser desorption ionization-time of flight mass spectrometry. Appl Environ Microbiol. 2011, 77 (3): 896-905. 10.1128/AEM.01686-10.
Tuszynski J: caMassClass: Processing & Classification of Protein Mass Spectra (SELDI) Data. 2010, http://CRAN.R-project.org/package=caMassClass.
R Development Core Team: R: A language and environment for statistical computing. R Foundation for Statistical Computing. 2009, Vienna, Austria
Sammon J: A non-linear mapping for data structure analysis. IEEE Trans Comp C. 1969, 18: 401-409.
Everitt B: Cluster analysis. 1974, London: Heinemann Educational Books
Hartigan J, Wong M: A K-means clustering algorithm. Appl Statistics. 1979, 28: 100-108. 10.2307/2346830.
We are grateful to Gabi Echle, Katja Fischer, Michaela Ganss, and Robert Schneider for their excellent technical assistance.
This work was supported by the EU, EAHC Agreement - No 2007 204.
AK performed MALDI-TOF MS experiments, data analysis and participated in drafting the manuscript. RS worked in the BSL3 laboratory, performed MALDI-TOF MS experiments and data analysis. MZ developed R-scripts and participated in the mathematical analysis of mass spectra and in solving classification problems. MCE coordinated the work in the BSL3 laboratory, performed cultivation and PCR assays. BB performed MALDI-TOF MS experiments and data analysis. FM worked in the BSL3 laboratory, performed cultivation and PCR assays. TM performed MALDI-TOF MS experiments and data analysis. MK performed data analysis and statistical examination. HCS worked in the BSL3 laboratory, performed cultivation and PCR assays, and critically reviewed the manuscript. HN critically reviewed the manuscript. HT participated in the design of the study, coordinated the experiments, and participated in drafting the manuscript. MK and TM are employees of Bruker Daltonik GmbH, the manufacturer of the MALDI Biotyper system used in this study. All authors read and approved the final manuscript.