Genetic islands of Streptococcus agalactiae strains NEM316 and 2603VR and their presence in other Group B Streptococcal strains
© Herbert et al; licensee BioMed Central Ltd. 2005
Received: 10 December 2004
Accepted: 24 May 2005
Published: 24 May 2005
Streptococcus agalactiae (Group B Streptococcus; GBS) is a major contributor to obstetric and neonatal bacterial sepsis. Serotype III strains cause the majority of late-onset sepsis and meningitis in babies, and thus appear to have an enhanced invasive capacity compared with the other serotypes that cause disease predominantly in immunocompromised pregnant women. We compared the serotype III and V whole genome sequences, strains NEM316 and 2603VR respectively, in an attempt to identify genetic attributes of strain NEM316 that might explain the propensity of strain NEM316 to cause late-onset disease in babies. Fourteen putative pathogenicity islands were described in the strain NEM316 whole genome sequence. Using PCR- and targeted microarray- strategies, the presence of these islands were assessed in a diverse strain collection including 18 colonizing isolates from healthy pregnant women, and 13 and 8 invasive isolates from infants with early- and late-onset sepsis, respectively.
Side-by-side comparison of the strain NEM316 and strain 2603VR genomes revealed that they are extremely similar, with the only major difference being the capsulation loci and mobile genetic elements. PCR and Comparative Genome Hybridization (CGH) were used to define the presence of each island in 39 GBS isolates. Only islands I, VI, XII, and possibly X, met criteria of a true pathogenicity island, but no significant correlation was found between the presence of any of the fourteen islands and whether the strains were invasive or colonizing. Possible associations were seen between the presence of island VI and late-onset sepsis, and island X and early-onset sepsis, which warrant further investigation.
The NEM316 and 2603VR strains are remarkable in that their whole genome sequences are so similar, suggesting that the capsulation loci or other genetic differences, such as pathogenicity islands, are the main determinants of the propensity of serotype III strains to cause late-onset disease. This study supports the notion that GBS strain NEM316 has four putative pathogenicity islands, but none is absolutely necessary for disease causation, whether early- or late-onset sepsis. Mobile genetic elements are a common feature of GBS isolates, with each strain having its own peculiar burden of transposons, phages, integrases and integrated plasmids. The majority of these are unlikely to influence the disease capacity of an isolate. Serotype associated disease phenotypes may thus be solely related to differences in the capsulation loci.
Streptococcus agalactiae (Group B Streptococcus, GBS) is a Gram positive, facultative anaerobic bacterium that is the most common cause of neonatal and obstetric sepsis, and is an increasingly important cause of septicaemia in elderly and immunocompromised patients . Serotype III GBS causes approximately 37% of early-onset and 67% of late-onset neonatal GBS sepsis (compared with 13% and 5%, respectively, caused by serotype V), and is the predominant serotype causing late-onset meningitis [1, 2]. Serotype V prevails in invasive infection in non-pregnant adults (causing 29% of all such infections) . The genetic determinants of the propensity of serotype III GBS to cause late-onset sepsis and meningitis have not been fully elucidated, but the availability of whole genome sequences of a serotype III isolate (strain NEM316) and a serotype V isolate (strain 2306VR) brings this prospect closer [4, 5]. One possibility is that the serotype III GBS has pathogenicity islands (PAIs) that are not present in the other serotypes, and which confer an enhanced invasive potential. Glaser et al.  described fourteen regions of strain NEM316 that they considered to be putative PAIs. These islands are composed of 11 to 77 genes and contain most of the mobile elements in the NEM316 genome . Six of the islands are adjacent to tRNA genes, a feature of pathogenicity islands , and many known or putative virulence genes of GBS are contained within these regions. For instance, alp2  is in 'island IV', the cyl operon  is in 'island VI', and lmb and scpB are in 'island XII' . PAIs are defined by the following criteria: (1) they carry one or more virulence genes, (2) they are present in the genome of pathogenic bacterium but absent in non-pathogenic representatives of the same species, (3) they are frequently located adjacent to tRNA genes, (4) they are associated with mobile genetic elements and are often flanked by direct repeats (DR), (5) they are unstable and either the whole of the PAI or part of it may be deleted, and (6) often represent mosaic like structures rather than homogenous segments of horizontally acquired DNA .
We used the C. elegans database genome sequence graphical interface (AceDB) [11, 12] to compare the strain NEM316 and the strain 2603VR genome sequences to identify serotype III and V genomic differences, and to further define the putative PAIs in the NEM316 serotype III strain. We then conducted PCR amplification and targeted microarray-based comparative genome hybridization (CGH) studies aimed at delineating the nature of the putative PAIs.
NCBI and AceDB analysis of the sequenced serotype III and V genomes
Which islands appear to be real PAIs?
PAIs contain virulence and mobilization genes and are flanked by direct repeat (DR) sequences that are recognised by mobilization proteins . Potential PAIs must be distinguished from non-mobile regions of the chromosome that contain virulence genes adjacent to tRNA genes, and which have merely attracted mobile elements. Such mobile elements may themselves be genomic or metabolic islands but by definition they are not PAIs, unless they mobilize virulence genes and are associated with pathogenic strains .
The structure of 14 putative PAIs.
No. of genes
Island I is adjacent to tRNA-Ala, begins with a phage integrase family site-specific recombinase (gbs0211) and a Cro/CI transcriptional regulator (gbs0212), and harbors other mobilization genes. The region contains rgg (gbs0230), a homologue of a virulence regulator in S. pyogenes.
Inserted into the proximal end of island II, adjacent to tRNA-Leu, are 9 genes in strain NEM316 (gbs0236-0244) not present in strain 2603VR, and 7 genes in strain 2603VR (sag0245-0251) not present in strain NEM316. Gbs0236-0244 consists of genes encoding a phage integrase and other phage proteins. Sag0245-0251 consists of genes encoding hypothetical proteins and a Cro/CI family regulator. Neither of these two regions harbor known virulence genes. In the remainder of island II (gbs0245-0254 or sag0252-0264), there are no other mobilization genes or known virulence genes.
III, VII, VIII
Islands III, VII, VIII are near-identical copies of a chromosomally integrated plasmid, designated pNEM316-1.
Gbs0458-0470 (or sag0423-0433) contains several transcriptional regulators, including araC family members, and the virulence factor alp, but does not harbor any identifiable mobilization genes. Inserted into the distal end of island IV, adjacent to tRNA-Thr, are 16 genes in strain NEM316 (gbs0471-0486) not present in strain 2603VR, and 6 genes in strain 2603VR (sag0434-0439) not present in strain NEM316. Gbs0471-0486 contains an integrase (gbs0482) and a Cro/CI transcriptional regulator (gbs0475). Sag0434-0439 contains an IS256 family transposase (sag0434), a phage family site-specific recombinase (sag0438). Neither of these elements contain known virulence genes.
Inserted into the proximal end of island V, adjacent to tRNA-Arg, is a single gene in strain NEM316 (gbs0588; an integrase) that is not present in strain 2603VR, and 65 genes in strain 2603VR (sag0545-0609) that are not present in strain NEM316. Sag0545-0609 contains numerous prophage lambda genes. The remainder of island V (gbs0589-0598 or sag0610-0617) harbors genes encoding a cell membrane protein complex and a two-component regulator, vncSR, flanked by two transposase genes (for instance, sag0611 a degenerate transposase and sag0618 a truncated transposase). There are no genes known to be involved in virulence in island V.
Island VI contains the cyl locus (gbs0644-0655; sag0662-0673), encoding a β-hemolysin that has been shown unequivocally to be involved in virulence. The region preceding the cyl locus (gbs0616-0639) in strain NEM316 contains Tn5252 transposon genes, and is identical in strain 2603VR (sag0636-0657). Downstream of the cyl locus, in strain NEM316, there are neither mobilization genes nor other known virulence genes. In the middle of the island, three genes in strain NEM316 (gbs0656-0658; encoding a permease and hypothetical proteins) are not present in strain 2603VR, and 10 genes in strain 2603VR (sag0674-0683; protease, endopeptidase and permease genes) are not present in strain NEM316. The distal half of island VI contains genes encoding core metabolic enzymes, and does not contain mobile elements or virulence determinants.
Island IX contains genes with homology to those encoding a two-component regulatory system, a carbon starvation protein, and secreted proteins, but it does not contain any mobilization genes.
Island X appears to be mobile in that it is present in strain NEM316 but not in strain 2603VR, and it contains transferase, relaxase and some genes homologous with those in Tn5252. It also contains 3 LPXTG genes and a DNA methyltransferase. There are no known virulence genes.
Island XI is composed of three genes that are present in both strains NEM316 and 2603VR, and these are involved in murein hydrolase export. Eight genes in island XI are present in strain NEM316, but not in strain 2603VR. One of these is an integrase, and the element is adjacent to a tRNA gene. None of the 8 genes appears to have a role in virulence.
Island XII is a good candidate for a pathogenicity island. The virulence genes lmb (gbs1307), and scpB (gbs1308), encoding laminin binding protein and C5a peptidase, respectively, are at the proximal end of island XII, and are part of a large compound transposon. Upstream of lmb/scpB, gbs1296-1306, are five transposon (ISSdy1) or phage related genes, and downstream of lmb/scpB, gbs1309-1313 and gbs1338-1340, are other transposon (Tn5252) genes. In the distal half of island XII, 24 genes are present in strain NEM316 (gbs1314-1337; encoding phage and plasmid replication genes and the lac operon) that do not occur in strain 2603VR. In the same relative location in the genome, 20 genes (sag1253-1272; encoding heavy metal transporters) are present in strain 2603VR that do not occur in strain NEM316.
Inserted into the proximal end of island XIII, adjacent to tRNA-Lys, are 20 genes in strain NEM316 (gbs1965-1984; function mostly unknown, but not obvious virulence genes) that is not present in strain 2603VR, and 47 genes in strain 2603VR (sag1979-2025; containing several phage genes) that are not present in strain NEM316. The downstream half of the island (gbs1987-2011) is identical in strains NEM316 and 2603VR and contains genes encoding the CAMP factor, two proteases, core metabolic enzymes, two transporters, and a two-component regulator. There are no mobilization genes in this half of the island.
Inserted into the proximal end of island XIV are 16 genes in strain NEM316 (gbs2064-2079; containing numerous phage genes) that are not present in strain 2603VR, and 10 genes in strain 2603VR (sag2111-2120; containing phage genes) that are not present in strain NEM316. The remainder of the island contains genes encoding 2 two-component regulators, 2 membrane proteins and enzymes involved in metabolism, but no obvious virulence or mobilization genes.
Inserted into the ends of islands II, IV, V, XI, XIII and XIV, and the middle of island VI, are mobile elements that contain phage or transposon genes, but no known virulence genes (see table 1). The mobile elements in strain NEM316 are different from those in strain 2603VR at each of these sites of insertion. The putative PAIs do not otherwise contain mobilization genes or flanking DR sequences. Island IX contains a two-component regulator, but has no mobilization genes. These putative PAIs may therefore merely represent non-mobile regions of the genome into which phages and transposons have inserted.
Islands I, VI and XII contain virulence genes (rgg[4, 13, 14], the cyl locus, and lmb/scpB, respectively) flanked by mobilization genes that are present in both sequenced strains. Island X contains mobilization genes and is presumed to be mobile because it is only found in strain NEM316 and not strain 2603VR. It also contains genes encoding surface proteins that have an LPXTG signal sequence; these could potentially have a virulence role. Four regions of the GBS genome (islands I, VI, X and XII) may therefore be real PAIs.
Other genomic differences
Another disparity between the two GBS whole genome sequences is the gene gbs0048 (a Cro/CI transcriptional regulator) in strain NEM316, which has a different proximal half compared to its homologue sag0048 in strain 2603VR.
The presence of putative pathogenicity islands as defined by CGH
The presence of putative pathogenicity islands as defined by CGH. Results of PCR (figure 3) and CGH (figure 4) analyses. The genes and GBS strains shaded grey in table 2 are those included in CGH experiments, table 3. Sag0001 encodes dnaA, a positive control for PCR. The NEM316 strain is a positive control. The pNEM316-1 plasmid is located three times in the NEM316 genome, and in figure 1 is represented as 'islands III, VII and VIII'. The strains are divided into three groups: colonizing strains from healthy pregnant women, and strains causing early- and late-onset sepsis in babies; and are sub-divided into those strains for which we have PCR results, and those for which we have PCR and CGH data.
Some putative PAIs are almost always present in the strain collection
Islands II, IV, V, IX and XII-XIV are almost always present in every strain from our strain collection. An occasional gene could not be amplified in one or more strains. For instance, sag1246, located in the distal half of island XII, could not be amplified in strains J99, B9, MK3, M1, and J87 (see figure 3). However, sag1233, located in the proximal half of island XII, could be amplified in all strains. The whole genome comparison of strains NEM316 and 2603VR revealed inter-strain sequence divergence at the distal end of island XII, whereas the proximal end of island XII, containing lmb/scpB (sag1234-1235), is highly conserved between these two strains (see table 1). Amplification of sag1233 therefore best reflects the presence of the putative PAI. Sag1233 may be particularly hard to PCR amplify because sequence divergence affects primer annealing. Similar sequence divergence between strains may also explain our inability to amplify occasional genes in islands V and XIII.
These islands are present in all strains tested, whether isolated from disease or colonizing sites and therefore do not meet the PAI definition criteria (2) and (5), above: that they should be present in pathogenic but absent from non-pathogenic strains, and they should be unstable and delete with distinct frequencies. Colonizing is not necessarily synonymous with non-pathogenic, a fact that confounds interpretation of a genetic comparison of invasive and colonizing strains.
Some putative PAIs are almost always absent in the strain collection
Copies of pNEM316-1, represented by islands III, VII and VIII, are only found in strain NEM316, and are consistently absent from the other strains in our collection. Five genes were amplified from island X. They were only all consistently amplifiable from strain NEM316. Two or three genes from island X, however, were amplified in 4 strains other than NEM316, reflecting either the part presence of the island in these strains or marked sequence divergence. These islands, that are absent from most disease causing strains, are unlikely to be PAIs. However, the central part of island X is present in 5 strains known to have caused early-onset sepsis, and absent from all colonizing and late-onset sepsis strains.
Some putative PAIs are variably present in the strain collection
Two genes amplified from each of islands I and VI revealed a variable presence of these islands in the strains of our collection (see figure 3). Island I is at least part-present in 14 of 18 colonizing strains (78%), 8 of 13 early-onset sepsis strains (61%), and 6 of 8 late-onset sepsis strains (75%). Although island I meets the PAI criteria of being variably present in the species, there is no relationship between the whole or part presence of the island and whether the strain was colonizing or disease causing. The two genes amplified from island I were sag0224 and sag0234. Sag0234 is close to the only recognisable virulence gene in island I, rgg (sag0239; homologue of a virulence regulator in S. pyogenes), and thus amplification of this gene reflects the presence of the most important part of the island. Sag0234 homologues are present in 13 of 18 colonizing strains (72%), in 7 of 13 early-onset sepsis strains (53%), and in 6 of 8 late-onset sepsis strains (75%). Thus, in this relatively small collection, there is no relationship between the presence of the distal half of island I and whether the strain was a colonizing or disease causing isolate.
Island VI is at least part present in all colonizing strains, 12 of 13 early-onset sepsis strains, and all late-onset sepsis strains. There is therefore no relationship between the island and disease. The proximal marker gene sag0645 is closer to the cyl locus (encoding the β-hemolysin, a major contributor to virulence in GBS) than the distal marker gene, sag0685, and therefore possibly better reflects the presence of a PAI that contains Tn5252 transposon genes and the cyl locus. Sag0645 is present in 14 of 18 colonizing strains (78%), in 9 of 13 early-onset sepsis strains (69%), and in 7 of 8 late-onset sepsis strains (87.5%). Although these differences are not statistically significant, there is a trend towards the presence of this putative PAI in late-onset sepsis strains. A larger study is required to bear out this finding.
Comparative genomic hybridization analysis
Comparative genomic hybridisation (CGH) analysis was performed on 22 of the 39 strains assessed by PCR. These 22 strains were randomly selected and included 15 of the 18 colonizing strains, 3 of the 13 isolates that caused early-onset sepsis, and 4 of the 8 strains that caused late-onset sepsis.
For probes to the island genes, the results of CGH (figure 4) are near identical to those of PCR (figure 3), with only a few exceptions. Notable is the hybridization of strains Z50 and K1 DNA to the gbs0367 gene probe, suggesting that this gene, and therefore possibly the whole or part of pNEM316-1, is present in these strains. However, the presence of pNEM316-1 was not detected by PCR in these or any other strain except the control NEM316 strain. Thus, perhaps the gene sequence of pNEM316-1 is divergent in strains Z50 and K1 so that the primers for PCR were unable to anneal, or that CGH detected a similar gene to gbs0367. We propose that similar reasons account for the other few discrepancies that exist between the PCR and CGH results. In general, however, the CGH and PCR results are highly consistent.
Genes variably present in GBS strains, as defined by CGH analysis
sag0633, sag0647-8, sag0650, sag1406-7
sag0433, sag0645-6, sag0649, sag1333, sag1404, sag1407-8, sag1462 and sag2063
sag1294 and sag1585
sag1517, sag1998-90, sag1902 and sag1934
sag0048, sag0124, sag0169, sag 0637, sag0644, sag1128, sag1332, sag1359, sag1409 ( rogB ), sag1463 (encoding a RALP), sag1791, and sag1956-7 ( rgf )
Encoding other proteins
sag0031, sag 0624, sag0662 ( cyl operon), sag0664 ( cyl operon), sag0825, sag1190 ( pavA ), sag1283, sag1417, sag1442, sag1472, sag1510, sag1558, sag1603, sag1675, sag1772, and sag2043 ( cfb )
By combining the results of genome comparison and PCR/CGH analysis we can make the following arguments about the likelihood that each of the putative PAIs is a true PAI:
Island I may be a true PAI. It contains the virulence regulator rgg, which is flanked by mobilization genes, and the whole island is variably present in strains of our collection. It does not appear to be found preferentially in GBS isolates that are known to have caused disease, but the number of isolates tested in this study may be too small to tease out small contributions of a PAI to invasiveness. A confounding factor is that the colonizing isolates in our collection may have the capacity to cause disease. Thus, our colonizing and disease isolates do not simply reflect non-pathogenic and pathogenic strains, respectively. This study is not powered to identify small contributions of a putative PAI to the propensity of serotype III to cause late-onset sepsis. Only a very large study is likely to do this.
Island II is unlikely to be a true PAI. In strains NEM316 and 2603VR there are two different mobile elements inserted at the same relative genomic location into the proximal end of the island, neither of which appears to harbour virulence genes. This suggests that the proximal end of island II is a hot spot for the insertion of mobile elements. Furthermore, the distal half of island II does not appear to have mobilization machinery and is present in all the strains within our collection. Islands III, VII and VIII are near-identical copies of a chromosomally integrated plasmid, pNEM316, which contains no known virulence determinants and which is only present in strain NEM316, and is not present within other strains within our collection. Thus, this plasmid is unlikely to be a PAI.
Islands IV and V are unlikely to be PAIs for the same reason as island II. Island VI may be a true PAI as it contains the cyl locus adjacent to Tn5252 (present in both strains NEM316 and 2603VR), has a mosaic-like structure, and is variably present in our strain collection. We cannot show a relationship between the presence of island VI and strains causing disease, but this may be due to limitations of the power of this study. Island IX does not contain mobilization genes and is present in all strains within our collection, making it unlikely that it is a PAI. Island X is mobile, but does not contain obvious virulence determinants. The whole of island X is only found in strain NEM316, and parts of it within four other strains causing early-onset sepsis. There may thus be an association between the middle of island X, gbs1125-1135, and the capacity of an isolate to cause chorioamnionitis. Early-onset sepsis in a newborn baby reflects invasive disease in a pregnant mother, whereas the fetus is merely a vulnerable secondary host. The potential association between island X and early-onset sepsis needs a larger study for clarification. Island XI is mostly composed of a small mobile element present in strain NEM316, but not strain 2603VR, and does not contain known virulence genes.
Island XII contains mobilization and virulence genes, has a mosaic like structure, the distal end of it is variably present in strains of our collection. It could therefore be a PAI. Our study does not have the power to identify an association between the presence of the island and disease. Islands XIII and XIV are unlikely to be PAIs for the same reason as island II.
The majority of late-onset meningitis, and to a lesser extent late-onset sepsis, is caused by serotype III strains. There is likely to be a bacterial genetic basis for this invasive propensity. A comparison of the whole genome sequences of a serotype III isolate, NEM316, and a serotype V isolate, 2603VR, is remarkable in the degree of similarity of the two strains, but there are some dissimilarities. These include open reading frame annotation discrepancies, genes that show sequence divergence between strains, an MME, mobile DNA, and the capsulation loci. This study contributes to our understanding of pathogenesis by further delineating the nature of mobile elements in GBS. Individual GBS isolates probably carry their own unique aliquot of horizontally acquired genetic material. Only four (islands I, VI, X and XII) of 14 putative PAIs are likely to be real PAIs, but there is no absolute association of any of these four PAIs with strains causing disease. The strongest possible disease association is with island X and early-onset sepsis.
Strains and culture conditions
Strains employed for PCR and CGH analysis.
Z69A, Z72A, Z73, Z87A, Z111
Z73, Z34A, Z50, Z101A, Z117
B9, H11, J81, J88, J100, R1, WC3, NEM316
M1, MK2, J76, J95, B11, J90, K1
Z84A, Z12A, Z87A, Z95
GBS serotype III strain NEM316 and serotype V strain 2603VR genome sequences were compared through NCBI [24, 25] and using AceDB [11, 12], hosted by the University of Oxford Bioinformatics Centre . Additional information on domains and homologies were obtained through NCBI BLAST searches  and the NCBI Conserved Domain Search .
DNA was extracted from a 3 ml culture of each strain using spin column technology (DNAeasy; Qiagen), following the manufacturer's recommendations with the exception that lysozyme was replaced by mutanolysin (50 units per extraction) and the cell pellet was pre-incubated with this enzyme for 60 minutes at 37°C.
Double strand sequencing was conducted by the Department of Biochemistry Core Sequencing Facility, University of Oxford, using the same primers employed for the PCR using gel extracted (Qiagen) templates. Sequencing reactions used Big Dye version 3 (Applied Biosciences) and were analyzed on an ABI377 sequencer. Sequences were assembled, evaluated, and interpreted using Chromas v2.3 (Technelysium Pty Ltd) and ClustalW .
Primers for PCR
Gene assignment in strain NEM316
Gene assignment in strain 2603VR
5'-gtagctgatagtcctggc-3' and 5'-agtccccaactaaagcgc-3'
5'-aaatgggacacgtacgg-3' and 5'-attgccgccatctcaggg-3'
5'-caagcctttaatgctcgc-3' and 5'-aactgaaattccaatcgcc-3'
5'-tcatcgcgaaaatatggag-3' and 5'-cggtcttttagaaactgtgtcc-3'
5'-gacttatttcaagtttatgg-3' and 5'-acccttatatacgacagc-3'
pNEM316-1 (islands III, VII and VIII)
5'-atcgatttaggattcatgcc-3' and 5'-caacattcgcaaaataagcc-3'
5'-cctagatggcgtagaggcag-3' and 5'-ttgctcacagaccataagcg-3'
5'-tcacccctgagacgtttacc-3' and 5'-gatcgtaaccacggtttgct-3'
5'-attgatagatcttacttgcg-3' and 5'-tgatgcaatagctattggc-3'
5'-cagggtgttcaaggctacc-3' and 5'-caagcttacgcacccaag-3'
5'-ctttcctaaaacatatttgg-3' and 5'-atatggtaaaaacttaaggc-3'
5'-tagctcagtttgcgactgg-3' and 5'-ccaacttttgcatctgctg-3'
5'-aattcttgattgatgagcg-3' and 5'-tcagctttaatcaattccc-3'
5'-aagaccaaaagtggcgaac-3' and 5'-gcctttggattcattcctg-3'
5'-agcagttacttgatttgcc-3' and 5'-tcctgaattagctagtcgc-3'
5'-tctgcttgagataactccc-3' and 5'-caatagcagttatcaaaggg-3'
5'-cctagatggcgtagaggcag-3' and 5'-ttgctcacagaccataagc-3'
5'-tcgacgtgttttacggttg-3' and 5'-accgaagagatgatgacgac-3'
5'-gggccacactagaaactgc-3' and 5'-aaatccttcatcgctcctg-3'
5'-tcacccctgagacgttacc-3' and 5'-gatcgtaaccacggtttgct-3'
5'-tctctcggcgttattgtcc-3' and 5'-acaaaagcacaagcgactg-3'
5'-ctttactggcttcacttgg-3' and 5'-gttgatacaggcattgagc-3'
5'-gattactctaccagtgagg-3' and 5'-agaatagtctgcttcaccc-3'
5'-ctgacaattgctttgtttcg-3' and 5'-ggctaacccaaatgtaccg-3'
5'-gctcctctgattaatgccc-3' and 5'-caagctcttgttcggttgc-3'
5'-tttctgggaaaaatcagtgg-3' and 5'-ttttcccgaacaaatgatg-3'
Comparative genome hybridization
Fifteen gene-specific probes from within the islands were incorporated into a 384-probe GBS sub-microarray being developed to study regulatory networks in GBS (unpublished). The other probes were designed from 369 genes representing all the identifiable regulators (including homologues of Streptococcus pyogenes regulators such as rofA, rggB, mga), all the known GBS virulence factors, stress adaptation molecules, and proteins with LPXTG sorting signals, and many transporters (focussing on ABC and PTS systems). Probe regions were chosen using AceDB [11, 12], so that where the gene was present in both strains, a region of greater than 300 bp region was chosen that was near identical in each of the sequenced genomes that was devoid of repetitive elements. Primers were designed using Primer3 , with the product size set at an optimum of 300 bp (range 150–450 bp), the primer size at 19 bp (range 17–21), the primer Tm set at 58°C (range 54–63), the primer GC% at 40 (range 30–80), and the GC clamp option set to 1. The primers were synthesised commercially (Operon). See Additional file 1 for sequence information. Amplicons were generated using DNA extracted from the sequenced serotype III strain NEM316 . The printed probes were amplified from a 1:50 dilution of these products by second-round PCR using the first-round primers, once a single band of the correct size had been obtained from the first reaction, a similar single band was confirmed from the second round PCR. PCR products were checked using 96-well E-gels (Invitrogen). Probes were spotted onto Genetix amine microarray slides in Genetix amine spotting solution for amine slides using a Qarray Mini microarray printer (Genetix) using 150 micron tipped solid tungsten pins (Genetix). FluoroLink™ Cy3-dCTP and FluoroLink™ Cy5-dCTP (Amersham Pharmacia Biotech) were incorporated into 10 μg of chromosomal DNA using random hexamer primers (Invitrogen) and DNA polymerase I, Klenow fragment (Bioline, UK). Labelled DNA:DNA probe microarray hybridizations were conducted in 4x SSC, 0.2875% SDS at 65°C overnight.
Of the 384 gene-specific probes included on the array, seven probes were directed at serotype-specific capsular polysaccharide synthesis genes and were thus hybridization controls. Another five probes were directed at genes in pNEM316-1 and 'island X' that only infrequently hybridized to the DNA from the strains in the collection.
The probes were chosen in gene regions that were of low complexity, contained no repeats, and were identical according to our alignment of the NEM316 and 2603VR genomes using AceDB.
Identification of strain differences within the non-island genes was not the initial purpose of this study. However, such variation, in the context of the relative paucity of differences found in islands genes, indicates that allelic variants of the non-island genes may explain differences in strain behaviour. A larger scale project directed at these genes, using a microarray based upon a greater number of genome sequences than were available for this project, is needed to specifically investigate this type of divergence.
Probes to genes from each of islands I-XIV were included on the array, with the exception of 'islands IX and XI'. PCR analysis demonstrated that the 'island IX' region is consistently present in all strains in our collection, and our analysis did not support the notion that it contains mobile DNA. The major part of 'Island XI' is a small prophage, and we therefore expected it not to be relevant to the virulence of the organism. One probe was included to each 'island', except for two probes to 'islands I and V', and five probes to 'island X'.
CJBB is supported by a Wellcome Trust Entry Level Fellowship and LASS by a Wellcome Trust Project grant awarded to NJS. This project was also part-funded by OHSRC research grant 778.
- Schuchat A: Group B streptococcus. Lancet. 1999, 353: 51-56. 10.1016/S0140-6736(98)07128-1.View ArticlePubMedGoogle Scholar
- Weisner AM, Johnson AP, Lamagni TL, Arnold E, Warner M, Heath PT, Efstratiou A: Characterization of group B streptococci recovered from infants with invasive disease in England and Wales. Clin Infect Dis. 2004, 38: 1203-1208. 10.1086/382881.View ArticlePubMedGoogle Scholar
- Schuchat A: Epidemiology of group B streptococcal disease in the United States: shifting paradigms. Clin Microbiol Rev. 1998, 11: 497-513.PubMed CentralPubMedGoogle Scholar
- Glaser P, Rusniok C, Buchrieser C, Chevalier F, Frangeul L, Msadek T, Zouine M, Couve E, Lalioui L, Poyart C, Trieu-Cuot P, Kunst F: Genome sequence of Streptococcus agalactiae, a pathogen causing invasive neonatal disease. Mol Microbiol. 2002, 45: 1499-1513. 10.1046/j.1365-2958.2002.03126.x.View ArticlePubMedGoogle Scholar
- Tettelin H, Masignani V, Cieslewicz MJ, Eisen JA, Peterson S, Wessels MR, Paulsen IT, Nelson KE, Margarit I, Read TD, Madoff LC, Wolf AM, Beanan MJ, Brinkac LM, Daugherty SC, DeBoy RT, Durkin AS, Kolonay JF, Madupu R, Lewis MR, Radune D, Fedorova NB, Scanlan D, Khouri H, Mulligan S, Carty HA, Cline RT, Van Aken SE, Gill J, Scarselli M, Mora M, Iacobini ET, Brettoni C, Galli G, Mariani M, Vegni F, Maione D, Rinaudo D, Rappuoli R, Telford JL, Kasper DL, Grandi G, Fraser CM: Complete genome sequence and comparative genomic analysis of an emerging human pathogen, serotype V Streptococcus agalactiae. Proc Natl Acad Sci U S A. 2002, 99: 12391-12396. 10.1073/pnas.182380799.PubMed CentralView ArticlePubMedGoogle Scholar
- Hacker J, Blum-Oehler G, Muhldorfer I, Tschape H: Pathogenicity islands of virulent bacteria: structure, function and impact on microbial evolution. Mol Microbiol. 1997, 23: 1089-1097. 10.1046/j.1365-2958.1997.3101672.x.View ArticlePubMedGoogle Scholar
- Kong F, Gowan S, Martin D, James G, Gilbert GL: Molecular profiles of group B streptococcal surface protein antigen genes: relationship to molecular serotypes. J Clin Microbiol. 2002, 40: 620-626. 10.1128/JCM.40.2.620-626.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Doran KS, Chang JC, Benoit VM, Eckmann L, Nizet V: Group B streptococcal beta-hemolysin/cytolysin promotes invasion of human lung epithelial cells and the release of interleukin-8. J Infect Dis. 2002, 185: 196-203. 10.1086/338475.View ArticlePubMedGoogle Scholar
- Franken C, Haase G, Brandt C, Weber-Heynemann J, Martin S, Lammler C, Podbielski A, Lutticken R, Spellerberg B: Horizontal gene transfer and host specificity of beta-haemolytic streptococci: the role of a putative composite transposon containing scpB and lmb. Mol Microbiol. 2001, 41: 925-935. 10.1046/j.1365-2958.2001.02563.x.View ArticlePubMedGoogle Scholar
- Schmidt H, Hensel M: Pathogenicity islands in bacterial pathogenesis. Clin Microbiol Rev. 2004, 17: 14-56. 10.1128/CMR.17.1.14-56.2004.PubMed CentralView ArticlePubMedGoogle Scholar
- Durbin R, Thierry-Mieg JT: A C. elegans DataBase. 1991Google Scholar
- Saunders NJ, Peden JF, Hood DW, Moxon ER: Simple sequence repeats in the Helicobacter pylori genome. Mol Microbiol. 1998, 27: 1091-1098. 10.1046/j.1365-2958.1998.00768.x.View ArticlePubMedGoogle Scholar
- Chaussee MS, Sylva GL, Sturdevant DE, Smoot LM, Graham MR, Watson RO, Musser JM: Rgg influences the expression of multiple regulatory loci to coregulate virulence factor expression in Streptococcus pyogenes. Infect Immun. 2002, 70: 762-770. 10.1128/IAI.70.2.762-770.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Chaussee MS, Somerville GA, Reitzer L, Musser JM: Rgg coordinates virulence factor synthesis and metabolism in Streptococcus pyogenes. J Bacteriol. 2003, 185: 6016-6024. 10.1128/JB.185.20.6016-6024.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Saunders NJ, Snyder LA: The minimal mobile element. Microbiology. 2002, 148: 3756-3760.View ArticlePubMedGoogle Scholar
- Spellerberg B, Rozdzinski E, Martin S, Weber-Heynemann J, Lutticken R: rgf encodes a novel two-component signal transduction system of Streptococcus agalactiae. Infect Immun. 2002, 70: 2434-2440. 10.1128/IAI.70.5.2434-2440.2002.PubMed CentralView ArticlePubMedGoogle Scholar
- Beckert S, Kreikemeyer B, Podbielski A: Group A streptococcal rofA gene is involved in the control of several virulence genes and eukaryotic cell attachment and internalization. Infect Immun. 2001, 69: 534-537. 10.1128/IAI.69.1.534-537.2001.PubMed CentralView ArticlePubMedGoogle Scholar
- Gutekunst H, Eikmanns BJ, Reinscheid DJ: Analysis of RogB-controlled virulence mechanisms and gene repression in Streptococcus agalactiae. Infect Immun. 2003, 71: 5056-5064. 10.1128/IAI.71.9.5056-5064.2003.PubMed CentralView ArticlePubMedGoogle Scholar
- Pritzlaff CA, Chang JC, Kuo SP, Tamura GS, Rubens CE, Nizet V: Genetic basis for the beta-haemolytic/cytolytic activity of group B Streptococcus. Mol Microbiol. 2001, 39: 236-247. 10.1046/j.1365-2958.2001.02211.x.View ArticlePubMedGoogle Scholar
- Hassan AA, Abdulmawjood A, Yildirim AO, Fink K, Lammler C, Schlenstedt R: Identification of streptococci isolated from various sources by determination of cfb gene and other CAMP-factor genes. Can J Microbiol. 2000, 46: 946-951. 10.1139/cjm-46-10-946.View ArticlePubMedGoogle Scholar
- Holmes AR, McNab R, Millsap KW, Rohde M, Hammerschmidt S, Mawdsley JL, Jenkinson HF: The pavA gene of Streptococcus pneumoniae encodes a fibronectin-binding protein that is essential for virulence. Mol Microbiol. 2001, 41: 1395-1408. 10.1046/j.1365-2958.2001.02610.x.View ArticlePubMedGoogle Scholar
- Kreikemeyer B, McIver KS, Podbielski A: Virulence factor regulation and regulatory networks in Streptococcus pyogenes and their impact on pathogen-host interactions. Trends Microbiol. 2003, 11: 224-232.View ArticlePubMedGoogle Scholar
- Herbert MA, Beveridge CJE, Saunders NJ: Bacterial virulence factors in neonatal sepsis: group B streptococcus. Current Opinion in Infectious Diseases. 2004, 17: 225-229. 10.1097/00001432-200406000-00009.View ArticlePubMedGoogle Scholar
- Streptococcus agalactiae complete genome NEM316. 2002,http://www.ncbi.nlm.nih.gov/genomes/framik.cgi?db=Genome&gi=264
- Streptococcus agalactiae complete genome 2603V/R. 2002,http://wwwncbinlmnihgov/genomes/framikcgi?db=Genome&gi=252
- Computational Biology Research Group.http://www.compbio.ox.ac.uk/
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Marchler-Bauer A, Anderson JB, DeWeese-Scott C, Fedorova ND, Geer LY, He S, Hurwitz DI, Jackson JD, Jacobs AR, Lanczycki CJ, Liebert CA, Liu C, Madej T, Marchler GH, Mazumder R, Nikolskaya AN, Panchenko AR, Rao BS, Shoemaker BA, Simonyan V, Song JS, Thiessen PA, Vasudevan S, Wang Y, Yamashita RA, Yin JJ, Bryant SH: CDD: a curated Entrez database of conserved domain alignments. Nucleic Acids Res. 2003, 31: 383-387. 10.1093/nar/gkg087.PubMed CentralView ArticlePubMedGoogle Scholar
- Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.