Performance of optimized McRAPD in identification of 9 yeast species frequently isolated from patient samples: potential for automation

Background Rapid, easy, economical and accurate species identification of yeasts isolated from clinical samples remains an important challenge for routine microbiological laboratories, because susceptibility to antifungal agents, probability to develop resistance and ability to cause disease vary in different species. To overcome the drawbacks of the currently available techniques we have recently proposed an innovative approach to yeast species identification based on RAPD genotyping and termed McRAPD (Melting curve of RAPD). Here we have evaluated its performance on a broader spectrum of clinically relevant yeast species and also examined the potential of automated and semi-automated interpretation of McRAPD data for yeast species identification. Results A simple fully automated algorithm based on normalized melting data identified 80% of the isolates correctly. When this algorithm was supplemented by semi-automated matching of decisive peaks in first derivative plots, 87% of the isolates were identified correctly. However, a computer-aided visual matching of derivative plots showed the best performance with average 98.3% of the accurately identified isolates, almost matching the 99.4% performance of traditional RAPD fingerprinting. Conclusion Since McRAPD technique omits gel electrophoresis and can be performed in a rapid, economical and convenient way, we believe that it can find its place in routine identification of medically important yeasts in advanced diagnostic laboratories that are able to adopt this technique. It can also serve as a broad-range high-throughput technique for epidemiological surveillance.


Background
Over the past decades, patients have increasingly been colonized and infected with a variety of yeast species mainly due to immunocompromising conditions as well as due to increased use of invasive techniques and devices (for reviews see [1,2]). Therefore, clinical microbiology laboratories face an important challenge of rapid detection of pathogenic yeasts. However, accurate species identifica-tion is very much demanded in addition to mere detection, because susceptibility to antifungal agents, probability of resistance development and ability to cause disease vary in different species [3]. Although there are several rapid diagnostic procedures available based mainly on PCR amplification of yeast DNA that have been developed to facilitate diagnosis, conventional cultivation techniques followed by identification of pure culture still dominate the field. A profound change can hardly be expected in the foreseeable future except for rapid detection of selected yeasts species in specific types of samples, blood in particular. This is mainly because only the identification techniques based on pure culture examination are able to identify the whole spectrum of potentially pathogenic yeast species reliably. Also, only cultivation techniques make antifungal susceptibility testing and strain typing for epidemiological purposes possible. However, diagnostic laboratories and clinicians can hardly be satisfied with the potential of routinely available identification techniques in this field because these are typically either (i) economical and easy to perform but time-consuming, or (ii) rapid but costly and/or requiring special equipment or expertise. For reviews on phenotyping-and genotyping-based systems see [4,5].
We have recently proposed an innovative technique termed McRAPD (Melting curve of Random Amplified Polymorphic DNA) which has the potential to provide rapid and accurate pathogenic yeast identification grown in pure culture in an easy and economical way [6]. Here we have evaluated the performance of optimized McRAPD on a broader spectrum of yeast species frequently isolated from clinical samples and also examined the potential of automated and semi-automated interpretation of McRAPD data for identification purposes. We believe that because of its advantages over conventional phenotypic approaches and its competitive costs, McRAPD can find its place in routine identification of medically important yeasts.

Crude colony lysates perform satisfactorily in McRAPD
To achieve rapid and economical performance of the McRAPD identification approach, we used the simplified DNA extraction technique described by Steffan et al. [7]. However, since the recommended 0.3 μl volume of crude colony lysates added into McRAPD reaction mixture did not always provide satisfactory results with all the species included in our study, we first optimized this volume. Results of optimization are summarized in Figure 1. Apparently, the volume of crude colony lysates added into the reaction mixture had no or almost no influence on the banding pattern in most of the species, whereas there were marked differences in others (namely S. cerevisiae and C. glabrata). Based on these results, the amount of 1 μl was evaluated as the best, because lower volumes did not always guarantee reliable amplification, whereas in higher volumes it cannot be excluded that too much PCR inhibitors may enter the reaction mixture. For comparison and reference, the commercial kit YeaStar Genomic DNA Kit (Zymo Research, Orange, California, USA) was used in parallel with 1 μl of crude colony lysates. Results of this comparison represented by melting curves and banding patterns are summarized in Figure 2. When comparing the initial relative fluorescence of amplified samples, the use of DNA extracted by the commercial kit resulted in higher values on average, indicating higher yields. In 8 of the 9 Results of optimization of the amount of crude colony lysates added into reaction mixture species studied, no marked differences in melting curves based on kit versus crude lysates were observed, although some minor differences in the relative intensity of individual bands occurred in some of the species. Only 1 of the 9 species, namely C. glabrata, showed both markedly different banding patterns and melting curves, indicating that the performance of McRAPD with colony lysate was suboptimal in this case compared to the commercial kit. Our experience in routine experiments shows that the initial relative fluorescence intensity of a McRAPD sample after amplification should exceed the relative value of 15 at the standard 30% LED power as adjusted in melting protocol by user. When a sample does not meet this condition, repeating the assay including DNA extraction is strongly recommended for reliable results.
In addition, reproducibility of the simplified DNA extraction based on crude colony lysates was tested. DNA was extracted from 4 different yeast species, each represented by one strain, where 5 colonies were grown for different time periods in each strain and used for extraction. Sampling was performed in the interval between 12 and 24 h of colony growth, approximately every 3 h. Freshly prepared lysis buffer was always used for DNA extraction in each of the samples. The results clearly demonstrate that the time-point of colony sampling and different runs of the extraction procedure have little influence on the variability of McRAPD results ( Figure 3). Our data show, that crude colony lysates perform satisfactorily in McRAPD. Of  course, any DNA extraction technique may fail to provide adequate amplification occasionally and a commercial kit should on average secure better reproducibility compared to the technique of crude colony lysates. As widely accepted, commercial kits should also be generally more robust in hands of less experienced personnel. Our experience showed that accurate reproducible sampling of colonies by trained personnel was rather important to achieve reliable amplification with crude colony lysates. Also, using Zymolyase from different suppliers or even different batches of this enzyme from the same supplier can influence performance of the technique. Thus, the procedure needs to be optimized in each laboratory to achieve balance between the amount of cells added into lysing solution and activity of the Zymolyase. Adding too many cells can result in insufficient cell wall lysis and too high concentration of PCR inhibitors. On the contrary, an overload of Zymolyase can be a source of too large amount of contaminating DNA which can interfere with appropriate McRAPD performance, because the McRAPD approach has the capacity to amplify any DNA sample.

Inter-run variability is very low, whereas inter-strain differences can be a source of considerable variability of McRAPD data in some species
We have repeated McRAPD amplification with the same crude colony lysates during 3 consecutive days to test for the short-term stability of DNA in these lysates and to Reproducibility of McRAPD with crude colony lysates sampled from different colonies at different timepoints  Figure 4; no marked differences were observed indicating that the McRAPD technique itself performed highly reproducibly. We have also tested the influence of short-term storage of crude colony lysates at -20°C on proper performance and reproducibility of McRAPD and have not observed any marked variability (data not shown). On the contrary, considerable interstrain differences were observed when performing McRAPD in some species, whereas rather uniform data were observed in other species. The lowest interstrain variability was observed in C. guilliermondii, whereas the highest was observed in C. krusei ( Figure 5). It can be supposed, that the species showing typically simple fingerprints with just one or only a few intense bands and almost no interstrain variability should produce less variable melting curves, whereas those showing complex and variable fingerprints should produce rather variable melting curves. This assumption is in good agreement with the fingerprinting patterns of selected strains of C. guilliermondii and C. krusei, as demonstrated in Figure 5C, F. This figure also illustrates that the uniformly present shorter RAPD products (around 500 bp) are reflected in the uniform first portion of the melting domain in C. krusei (78-82.5°C), whereas those variably present longer RAPD products (> 900 bp) are reflected in the variable second portion of the domain (82.5-90°C, compare Figure 5D-F). Marked differences in interstrain variability in different species observed by us are not surprising, because previous studies showed rather different degrees of genotypic variability in different yeast species [8][9][10]. Thus, although our McRAPD protocol was previously optimised empirically to achieve the highest Short-term stability of crude colony lysates versus reproducibility of McRAPD data as well as evaluation of inter-run variability of McRAPD data uniformity of data within each species, some of the species studied have too many variables in their genotypes to provide uniform data with our protocol. Although this drawback can potentially hinder simple species identification, it might be compensated by the fact that detection of outstanding interstrain differences could provide valuable genotyping data along with identification in some of the species studied.

Different genotypes can be recognized within species based on McRAPD data
Clustering of McRAPD data was performed using the UPGMA algorithm performed with similarity coefficients obtained as described in Material and Methods (See addi-tional file 1: Similarity coefficients). This revealed distinct clades of isolates in some of the species, indicating the possibility to recognise distinct genotypes based on McRAPD data ( Figure 6 , 7, 8, 9, 10, 11, 12, 13 and 14). After correlating these clusters with the appearance of curves visually, thresholds for defining distinct McRAPD genotypes were established in dendrograms empirically (see red vertical lines in Figures 6,7,8,9,10,11,12,13 and 14). Strains belonging to each genotype are highlighted by different ground tint colors in the dendrograms corresponding with the same colors of curves in accompanying melting curve plots. Those strains not assigned to a specific genotype are not color-coded. When McRAPD data of a particular strain were markedly different com-Interstrain variability of McRAPD data in C. guilliermondii (parts A-C; lowest variability in this study) and C. krusei (parts D-F; highest in this study) pared to data obtained with all the other strains of the same species, RAPD fingerprint of this strain was first inspected and compared with the other strains to verify this discrepancy. In 4 such cases the isolates were originally identified as C. parapsilosis, but the RAPD fingerprint unequivocally indicated them as those belonging to the cryptic species C. metapsilosis, as demonstrated by comparison to a CBS reference strain and two MCO strains originally classified as groups II and III of C. parapsilosis [11,12] and later confirmed to belong to newly recognised species C. orthopsilosis and C. metapsilosis, respectively [13,14] ( Figure 15). Because C. orthopsilosis and C. metapsilosis cannot be easily differentiated from C. parapsilosis using conventional phenotypic identification techniques or using the ID 32C commercial set of assimilation tests (bioMérieux, Marcy l'Etoile, France), the result of McRAPD and RAPD identification cannot be considered as discrepant from the result of conventional phenotyping techniques. In the other cases of doubtful profiles (n = 12), McRAPD either suggested discrepant species identification result or did not suggest any identification. In such cases, the conventional phenotypic species identification was further verified using the ID 32C. Results of this verification are summarized in Table 1. In all cases where McRAPD suggested discrepant identification, further supported by detailed inspection of RAPD fingerprint (n = 9), ID 32C identified the strain in accordance with McRAPD. On the contrary, in all cases where McRAPD and RAPD did not suggest any unequivocal identification (n = 3), ID 32C identified the strain in accordance with conventional phenotypic identification techniques. In the latter cases, the McRAPD profile presumably reflects a unique genotype represented by a single isolate among the strains included in our study. If the original species identification was changed in the above mentioned cases, the original strain labelling which includes original species abbreviation did not change, but the change was indicated by an arrow and new abbreviation in all figures concerned, e.g. I3-CAGU3-01 → CAAL.
To see whether the strain clustering patterns resulting from McRAPD and conventional RAPD are consistent, McRAPD genotypes were color-coded by ground tint colors in the dendrogram of RAPD fingerprints using different color saturation for different genotypes (additional file 2: Dendrogram of RAPD fingerprints). Whereas McRAPD genotypes correlated very well with RAPD clustering in C. tropicalis, the correlation was limited in C. lusitaniae and no or almost no correlation was observed in C. albicans, C. krusei, and S. cerevisiae (no McRAPD genotypes were delineated in other species). This is mainly because of different data processing in conventional RAPD versus McRAPD. In RAPD, differences in overall amplification efficiency result in differences in intensity of banding patterns. Therefore, it is strongly recommended not to include weak bands into comparison of RAPD finger- prints, because these can appear or disappear in different amplification runs. Also, the relative intensity of strong bands cannot be reliably taken into account for comparison. That is why we used the band-based Jaccard coefficient for processing of RAPD fingerprints, which takes the position of a band into account but neglects its intensity. In contrast, raw fluorescence measured during melting in the McRAPD procedure truly reflects the relative representation of individual RAPD products (bands in electrophoresis) in the sample. Inter-sample and inter-run differences in overall fluorescence of samples are subsequently proportionally equilibrated during numerical normalization of melting data. Then, relative representation of individual RAPD products is reflected in the slope of a normalized curve or in the height of a peak in a derivative curve and this is taken into account during further evaluation.

McRAPD data can be used for automated species identification
Since McRAPD data are numerical, the possibility of automated processing aimed to provide accurate identification is self-intriguing. We considered two approaches to achieve this objective. Firstly, absolute differences between normalized melting curves can be easily calculated as described in Material and Methods; such calculation can be simply automated. This should allow us to compare the McRAPD profile of an unknown isolate to a set of profiles obtained with previously identified yeast strains, revealing the closest match. Performance of such automated identification is summarized in Table 2. Overall accurate identification rate was 80%, varying between 58.5 and 100% in different species.
Since the peaks observed in a first derivative plot may in some cases represent the overall characteristic shape of a melting curve better, we also tested performance of matching peaks positions for identification purposes as the second possible approach. However, identification of individual melting peaks in a derivative plot and comparison of these results to those characteristic for each species cannot be automated as easily. Therefore, we first evaluated the presence of individual peaks in each species and each genotype. To reduce the amount of processed data and to identify typical positions of peaks in derivative curves, average first derivative curves were first calculated for each species/genotype based on individual derivation values of each strain of the respective species/genotype. Average curves are summarized in additional file 3: Average derivative curves. To establish the relevance of each averaged peak for species/genotype identification, these were subsequently classified into three categories: (i) decisive which occurred in all strains of the respective species/ genotype, (ii) characteristic which occurred in 75-99% of UPGMA clustering of C. albicans strains based on normalized McRAPD data Figure 6 UPGMA clustering of C. albicans strains based on normalized McRAPD data. Clustering with empirically defined genotypes is demonstrated in part (A) and corresponding normalized melting curves are shown in part (B). All strains of the respective species included in the study are clustered and plotted; strains belonging to a specific genotype are highlighted by specific ground tint color in the dendrogram corresponding with the same color of curves in accompanying normalized melting curve plot and derivative plots.
strains of the respective species/genotype, and (iii) possible which occurred in less than 75% of strains. Presence of peaks in individual species/genotypes as described above is summarized in Table 3. Unfortunately, when we tested the reading of peaks positioning alone for yeast identification, unequivocal match was impossible in many cases (data not shown).
Therefore, we combined the two proposed approaches into one two-step approach. In the first step, the closest match was established between the McRAPD data of the unknown sample and a set of all the other McRAPD profiles in an automated way. Then, a derivative plot was checked for the presence of decisive peaks in the second step. When the examined peak was found to fit in the interval of average peak position ± 2 S.D., it was considered as matched to the average peak. If any of the average decisive peaks characteristic for the best matched species was missing in the examined strain, this best match was evaluated as incorrect identification and the second best match was further evaluated. If the automated identification suggested two very close matches with curves of different species, both concordant in decisive peaks with the examined strain, the characteristic peaks were evaluated and interpreted in favor of one of the matches. Performance of this two step approach was generally much better than the first-step approach alone, with overall accurate identification rate of 87%, varying between 72.7 and 100% in different species. Results of the evaluation are summarized in Table 2. Surprisingly, in C. tropicalis and C. pelliculosa, the two step approach showed lower sensitivity compared to the first-step alone. This indicates that in some cases the process of matching examined peaks with average decisive peaks ± 2 S.D. precludes correct identification. This is most likely because the interval defined by the position of a decisive peak ± 2 S.D. includes only 95% of the isolates of each genotype. Thus, some isolates among the remaining 5% can prove to match a closely positioned decisive peak of a different species, even if they in fact belong to the species originally suggested by firststep automated processing of McRAPD data.

Computer-aided visual matching of derivative plots shows excellent performance
Since the performance of proposed automated identification approach followed by matching the peaks positions has not reached the accuracy of identification based on traditional RAPD fingerprints, we further looked for other ways to best interpret the information present in melting curves. Simple visual inspection of a derivative curve obtained with the examined strain and its comparison to sets of curves obtained with isolates belonging to each clearly delineated species genotype appeared intuitively as the most promising alternative. To achieve this compari- son in an easy-to-manage way we developed a simple computer-aided plotting scheme. Using Microsoft Excel 2007 software, plots of all derivative curves assigned to each species/genotype were prepared in separate sheets using thin lines and the curve of a tested isolate was then imported into another sheet and automatically plotted into each of the plots using a bold line. Then, all of the plots of specific species/genotypes including the bold curve of the tested isolate were inspected visually and the best match was evaluated based on subjective judgment (see Figure 16 for an example). This evaluation was performed independently by two people in a blinded fashion, i.e. the evaluating person did not know the identity of any of the tested curves and the curves were selected in a random order for evaluation to avoid any bias. Later, a third person evaluated the accuracy of this subjective visual identification using a key generated during randomization. Altogether, 316 and 317 of 322 isolates were identified correctly, achieving excellent accuracy of 98.1-98.4% (for results in individual species see Table 2). In other words, 6 strains were misidentified by one evaluator and 5 strains by the other, where the 6 strains misidentified by one evaluator included the 5 strains misidentified by the other. This concordance indicates clearly that this failure was not caused by subjective error, but rather by lack of typical properties in the misidentified melting profiles. Closer inspection of the misidentified strains showed that they included one strain which showed a completely unique fingerprint and therefore was not identified by traditional RAPD fingerprinting, and other 2 strains which showed less characteristic fingerprints, albeit it was possible to identify them using traditional RAPD fingerprinting.

Discussion
Our results show that McRAPD offers a promising alternative to conventional phenotypic identification techniques. Surprisingly, simple visual inspection of derivative plots performed best among the approaches tested for interpretation of mere numerical McRAPD data. Its performance almost matched the performance of traditional RAPD fingerprinting. Compared to the automated processing developed and tested by ourselves, the time costs of simple visual evaluation were roughly equal when using a pre-made computer-aided plotting scheme. However, with a broader spectrum of yeast species and expanding database of McRAPD results, simple visual examination can become more time demanding and cumbersome. Therefore, it may be advantageous to test for a threshold score in automated matching which can guarantee flawless identification in the future. Then, the visual matching could be reserved for isolates failing to reach this score in automated matching. When looking at the accuracy of identification obtained in this study, this . All strains of the respective species included in the study are clustered and plotted; strains belonging to a specific genotype are highlighted by specific ground tint color in the dendrogram corresponding with the same color of curves in accompanying normalized melting curve plot and derivative plots. One strain not assigned to a specific genotype is not color-coded in dendrogram and its melting curve is plotted in black. This strain was later re-identified as C. parapsilosis.
should be regarded critically in the light of the fact that all of the evaluations were based on an artificially assembled set of strains. However, because this set comprised almost 95% of species typically isolated from clinical samples, real performance in routine settings should not differ too much. An ongoing prospective study being performed by ourselves should prove this assumption.
When evaluating the future potential of McRAPD, we should first consider the main advantages and disadvantages of the RAPD technique itself. It is well-known that RAPD is highly sensitive not only to minor inter-strain differences, but also to minor differences in experimental conditions, which can result in different profiles, compromising intra-and interlaboratory reproducibility. There are many factors that can influence the appearance or disappearance of bands, including Mg 2+ concentration, primer/template concentration ratio, Taq polymerase concentration and source, the model of thermal cycler etc. [15][16][17][18]. Since we aimed to use RAPD/McRAPD primarily not for strain typing but for species identification purposes, we optimised the amplification conditions in favour of low interstrain variability. This efficiently prevented problems with intralaboratory reproducibility, as clearly demonstrated in Figure 4 and discussed above. Of course, some problems may occur with interlaboratory reproducibility, mainly when using a different model of thermal cycler or a different Taq polymerase. However, machines using air heating and cooling of samples which secure rapid and accurate ramping and rapid cycling should be able to minimise this problem, as documented in our study using glass capillaries and RapidCycler 2 machine. Also, commercial polymerases with guaranteed performance are available globally. Therefore, we believe that these drawbacks can be at least compensated or even outweighed by the advantages of McRAPD. Firstly, RAPD itself is very easy and economical to perform, which makes it the second most widely used genotyping technique in yeast microbiology as illustrated by 92 citations in PubMed for "(RAPD OR AP-PCR) AND typing AND yeast" versus 139 for RFLP, 40 for PFGE, 30 for MLST, and 9 for AFLP. In addition, its usefulness for yeast species identification was documented by several groups independently [7,[19][20][21][22][23]. To the best of our knowledge, all of the other genotyping techniques are more laborious and less economical for the purpose of species identification. If there is a technology for melting analysis available, McRAPD is even easier and more economical to perform than RAPD, because it does not require gel electrophoresis. However, omitting the electrophoresis also means that a visual check of proper amplification is not possible. This can question the reliability of McRAPD results, because as in any PCR, RAPD amplification can also occur in negative controls, for reasons well documented earlier [24,25]. Thus, it should be fair to say that both approaches are roughly comparable, McRAPD being more rapid with a potential for future improvements. Since ID 32C offers the most extensive set of assimilation tests among commercially available yeast identification systems, it can be expected that other phenotyping approaches will show inferior performance. Thus, the need of special instrumentation and skills should be the only obstacle for general acceptance of McRAPD in routine diagnostic laboratories. Generally speaking, those laboratories being able to adopt McRAPD will be also able to adopt other genotyping techniques. Then, such techniques, Multi Locus Sequence Typing (MLST) in particular, should be the main competitors of McRAPD. Although MLST is more demanding concerning instrumentation, skills and labour, it has the advantage of unmatched interlaboratory UPGMA clustering of C. lusitaniae strains based on normalized McRAPD data Figure 11 UPGMA clustering of C. lusitaniae strains based on normalized McRAPD data. Clustering with empirically defined genotypes is demonstrated in part (A) and corresponding normalized melting curves are shown in part (B). All strains of the respective species included in the study are clustered and plotted; strains belonging to a specific genotype are highlighted by specific ground tint color in the dendrogram corresponding with the same color of curves in accompanying normalized melting curve plot and derivative plots. One strain not assigned to a specific genotype is not color-coded in dendrogram and its melting curve is plotted in black.
reproducibility, enabling global epidemiology. However, it can hardly be expected that MLST can present an economically affordable alternative for routine identification and prospective epidemiological surveillance in near future. It can rather be expected that its use will be limited to retrospective epidemiological studies. Thus, McRAPD offers a promising choice for routine identification of pathogenic yeast species; in case of failure, it could be supplemented by other techniques, the best of which appears to be single-locus sequencing in our opinion.

Conclusions
1. Crude colony lysates provide an economical, rapid and reliable alternative to elaborate DNA extraction techniques for the purposes of McRAPD when performed by skilled personnel.
2. Our optimized McRAPD protocol shows excellent intralaboratory reproducibility and is able to delineate specific genotypes in some of the species studied.
3. Computer-aided visual matching of first derivative plots shows best performance among the approaches tested for interpretation of mere numerical McRAPD data. Its performance almost matched the performance of tradi-tional RAPD fingerprinting and was comparable to the performance of the ID32C commercial system. 4. We believe that because of its advantages over conventional phenotypic identification approaches and competitive costs McRAPD can find its place in routine identification of medically important yeasts in advanced diagnostic laboratories being able to adopt the technique. It can also serve as a broad-range high-throughput technique for crude epidemiological surveillance.

Yeast strains
The 9 yeast species most frequently isolated from clinical samples in our settings, namely representing 94.3% of yeast species isolated from patient samples at our department, were included into the study. Among these, 7 more common species, i.e. Candida albicans (56.2%), C. glabrata (12.6%), C. krusei (8%), C. tropicalis (7.7%), Saccharomyces cerevisiae (3.1%), C. parapsilosis (2.5%), and C. lusitaniae (2%) were represented by at least 35 isolates each, whereas the less frequently isolated species C. guilliermondii (1.3%) and C. pelliculosa (1%) were represented by at least 15 isolates each. A few isolates of C. orthopsilosis and C. metapsilosis were also included into the study later, UPGMA clustering of C. pelliculosa strains based on normalized McRAPD data Figure 12 UPGMA clustering of C. pelliculosa strains based on normalized McRAPD data. Clustering with empirically defined genotypes is demonstrated in part (A) and corresponding normalized melting curves are shown in part (B). All strains of the respective species included in the study are clustered and plotted; strains belonging to a specific genotype are highlighted by specific ground tint color in the dendrogram corresponding with the same color of curves in accompanying normalized melting curve plot and derivative plots. Three strains not assigned to a specific genotype are not color-coded in dendrogram and their melting curves are plotted in black. One of these strains was later re-identified as C. krusei.
when described as cryptic species of C. parapsilosis [13]. See also additional file 4: Listing of clinical isolates and reference strains included in this study. The strains were stored in 20% BBL Skim Milk Powder supplemented with glycerol (BD, Franklin Lakes, New Jersey, USA) at -70°C until used.

Phenotypic identification
All of the isolates were identified using conventional phenotypic identification techniques, i.e. evaluation of micromorphology on rice agar and evaluation of biochemical properties using in-house prepared assimilation and fermentation tests [26] followed by interpretation using the identification key according to Fragner [27]. Selected isolates were also identified using the ID 32C commercial set (bioMérieux, Marcy l'Etoile, France) in accordance with manufacturer's instructions.

DNA extraction
Crude colony lysates described earlier as suitable for amplification were prepared by simple toothpick technique [7]. Briefly, a part of colony grown on SGA plate was picked up by a micropipette tip at latest one day after inoculation and transferred into 5 μl of freshly prepared lysing solution (1 M sorbitol, 5 mM MgCl 2 , 2 mM dithiothreitol, 12 U of Zymolyase, all from Sigma-Aldrich, St. Louis, Missouri, USA). The mixture was incubated for 30 min at 37°C and centrifuged (10,000 g for 5 min). The supernatant was transferred into a new tube, diluted with TE buffer to 300 μl and stored at -20°C until used. For comparison and reference, YeaStar Genomic DNA Kit (Zymo Research, Orange, California, USA) was also used for DNA extraction in selected strains following manufacturer's recommendations. Briefly, 1 ml of yeast submerged culture (approx. 1.5 × 10 7 cells) grown in YPG (1% of each yeast extract, peptone and glucose) in an Erlenmeyer flask shaken at 30°C was spun down and the pellet was subjected to enzyme lysis in 120 μl of YD Digestion Buffer (containing RNase A and Zymolyase) for 1 hour at 37°C. Then, 120 μl of YD Lysis Buffer and 250 μl of chloroform were added, mixed and spun down again. The aqueous supernatant was then loaded onto a fast spin-column, spun down, and the impurities were washed away using DNA Wash Buffer. Finally, DNA was eluted by 60 μl of water. All strains of the respective species included in the study are clustered and plotted; strains belonging to a specific genotype are highlighted by specific ground tint color in the dendrogram corresponding with the same color of curves in accompanying normalized melting curve plot and derivative plots. Three strains not assigned to a specific genotype are not color-coded in dendrogram and their melting curves are plotted in black. Two of these strains were later re-identified as C. lusitaniae and C. tropicalis.

McRAPD procedure
UPGMA clustering of selected C. parapsilosis, orthopsilosis and metapsilosis strains Republic), 250 μg/ml BSA and LCGreen dye at 1× concentration (Idaho Technology Inc., Salt Lake City, Utah, USA). Either 1 μl of crude colony lysate or 1 μl of DNA extracted using the YeaStar Genomic DNA Kit was added into the reaction. Amplification was performed in a Rapid Cycler 2 apparatus (Idaho Technology Inc., Salt Lake City, Utah, USA) applying an empirically optimized protocol of initial denaturation at 95°C, 5 min, followed by 45 cycles of denaturation at 95°C for 5 s, annealing at 48°C for 10 s, and extension at 72°C for 40 s, with ramping 1°C/s, followed by final extension at 72°C for 5 min.  Normalized curves column stays for accurate identification rate achieved when identification was based on automated determination of the numerically closest match of the examined curve with known strain. Normalized curve + matching of derivative peaks column stays for the same amended by checking for decisive peaks in derivative plot. Visual matching of derivative plots column stays for accurate identification rate achieved when identification was based on simple visual comparison of examined derivative plot with plots of known strains. Accurate identification rate achieved upon evaluation and matching of RAPD fingerprints is shown for reference in the last column. See Results and discussion for details.

Analysis of McRAPD data
(page number not for citation purposes) After the absolute distance was established in all pairs (combinations) of isolates, the relative distance 1.0 was assigned to the highest absolute value obtained in the most dissimilar (numerically distant) pair of isolates, abbreviated as AD max . Relative distance values for the remaining pairs of isolates were calculated as a fraction of the highest absolute value, i.e.: A matrix of relative distances was assembled for the isolates included into each comparison. Then, the matrix of relative distances was used to calculate tree data for a cladogram using the UPGMA method and Phylip software [28,29]. PhyloDraw 0.8 software [30,31] was used for cladogram construction.
For additional analysis, plots of the first negative derivation of fluorescence depending on temperature were also prepared based on melting data normalized previously.
To delineate the melting peaks better, smoothing of data was performed using the HR-1 analysis software as recommended by the manufacturer. In some cases this smoothing resulted in truncation of the left and/or right end of the derivative curve. This process was carefully observed to prevent any loss of potentially discriminatory peaks at both ends of the derivative curves. To prevent excessive simplification and loss of informative data, smoothing was performed only if it undoubtedly resulted in a distinct amelioration of peaks' discrimination.

Electrophoresis and analysis of banding patterns
After melting analysis was performed, each sample was also subjected to gel electrophoresis in 2% agarose gel at 5 V/cm for 3 hours. The gels were stained by ethidium bromide added into them during preparation at the final concentration of 1 μg/ml and resulting banding patterns were photographed. Comparison of fingerprints was performed using GelCompar II software (Applied Maths, Sint-Martens-Latem, Belgium) applying the Jaccard coefficient at 1.5% positioning tolerance. Dendrograms were constructed using the UPGMA algorithm.
Visual matching of derivative curves as used for species identification Figure 16 Visual matching of derivative curves as used for species identification. Plots of derivative curves obtained with all strains assigned to 9 selected species/genotypes versus the derivative curve obtained with a tested isolate are shown as an example to illustrate the visual matching approach. Curve obtained with the tested isolate is always plotted as a bold red line, whereas curves obtained with strains assigned to a known genotype/species are plotted as thin dark blue lines in separate plots as follows: