- Research article
- Open Access
Complete genome sequence of a serotype 11A, ST62 Streptococcus pneumoniae invasive isolate
BMC Microbiology volume 11, Article number: 25 (2011)
Streptococcus pneumoniae is an important human pathogen representing a major cause of morbidity and mortality worldwide. We sequenced the genome of a serotype 11A, ST62 S. pneumoniae invasive isolate (AP200), that was erythromycin-resistant due to the presence of the erm(TR) determinant, and carried out analysis of the genome organization and comparison with other pneumococcal genomes.
The genome sequence of S. pneumoniae AP200 is 2,130,580 base pairs in length. The genome carries 2216 coding sequences (CDS), 56 tRNA, and 12 rRNA genes. Of the CDSs, 72.9% have a known predicted biological function. AP200 contains the pilus islet 2 and, although its phenotype corresponds to serotype 11A, it contains an 11D capsular locus. Chromosomal rearrangements resulting from a large inversion across the replication axis, and horizontal gene transfer events were observed. The chromosomal inversion is likely implicated in the rebalance of the chromosomal architecture affected by the insertions of two large exogenous elements, the erm(TR)-carrying Tn1806 and a functional prophage designated ϕSpn_200. Tn1806 is 52,457 bp in size and comprises 49 ORFs. Comparative analysis of Tn1806 revealed the presence of a similar genetic element, or part of it, in related species such as Streptococcus pyogenes and also in the anaerobic species Finegoldia magna, Anaerococcus prevotii and Clostridium difficile. The genome of ϕSpn_200 is 35,989 bp in size and is organized in 47 ORFs grouped into five functional modules. Prophages similar to ϕSpn_200 were found in pneumococci and in other streptococcal species, showing a high degree of exchange of functional modules. ϕSpn_200 viral particles have morphologic characteristics typical of the Siphoviridae family and are capable of infecting a pneumococcal recipient strain.
The sequence of S. pneumoniae AP200 chromosome revealed a dynamic genome, characterized by chromosomal rearrangements and horizontal gene transfers. The overall diversity of AP200 is driven mainly by the presence of the exogenous elements Tn1806 and ϕSpn_200 that show large gene exchanges with other genetic elements of different bacterial species. These genetic elements likely provide AP200 with additional genes, such as those conferring antibiotic-resistance, promoting its adaptation to the environment.
Streptococcus pneumoniae is a Gram-positive human pathogen responsible for serious diseases such as pneumonia, meningitis and sepsis. The reservoir of S. pneumoniae is represented by asymptomatic carriage in the nasopharynx, particularly in young children. The mechanism by which pneumococci become pathogenic is poorly understood, and probably depends on a complex interaction between bacterial virulence factors and the patients' immunological response. The emergence of antibiotic-resistant S. pneumoniae strains has represented an additional problem in the management of pneumococcal infections. S. pneumoniae strains that are resistant to commonly used antibiotics such as penicillins and macrolides are isolated from all areas of the globe.
So far, more than 90 different S. pneumoniae serotypes have been recognized on the basis of immunochemical differences in the polysaccharide capsule, and their number is likely to increase [7–10].
After implementation of the 7-valent pneumococcal conjugate vaccine (PCV7) in the USA, a profound change in the distribution of the serotypes colonizing children and causing disease has been observed [12, 13]. Some of the so-called non-vaccine serotypes, that is, serotypes not included in the pneumococcal conjugate vaccine, are becoming increasingly common and increasingly antibiotic resistant [14, 15].
Novel insights into the genome organization and metabolism of S. pneumoniae have been gained from analysis of complete genomes. To date, 23 pneumococcal strains, belonging to different serotypes including 1, 2, 3, 4, 5, 6B, 14, 19A, 19F and 23F, have been completely sequenced, while other strains have been partially sequenced or their sequencing is currently under way (http://genome.microbio.uab.edu/strep/info/; http://www.sanger.ac.uk/Projects/S_pneumoniae/; http://cmr.tigr.org; http://www.genomesonline.org; http://www.ncbi.nlm.nih.gov/genome/).
We have sequenced the complete genome of a clinical isolate (AP200) belonging to serotype 11A, Sequence Type (ST) 62, a non-vaccine serotype that is currently on the rise, being one of the most prevalent serotypes isolated both from carriage [16, 17] and invasive diseases in North America following the introduction of PCV7. According to Brueggemann et al., serotype 11A is more associated with asymptomatic carriage than with invasive disease, indicating a relatively low disease potential. However, serotype 11A strains, especially those belonging to ST62, are able to cause invasive disease with significant mortality [19, 20]. The draft genomes of two other serotype 11A, ST62 pneumococcal strains, SP11-BS70 and MLV-016 [GenBank: NZ_ABGH00000000], are currently available in public databases.
AP200 has been previously reported to harbour the transposon Tn1806, carrying the erythromycin resistance determinant erm(TR), which is uncommon in S. pneumoniae. The genome sequence yielded the whole sequence of Tn1806 and evidence for the presence of another exogenous element, a functional bacteriophage, designated ϕSpn_200.
Results and Discussion
General genome features
The AP200 chromosome is circular and is 2,130,580 base pairs in length. The main features of the sequence are shown in Figure 1 and Table 1. The initiation codon of the dnaA gene, adjacent to the origin of replication oriC, was chosen as base pair 1 for numbering the coding sequences. The overall GC content is 39.5%, but an unusual asymmetry in the GC skew is evident near positions 820,000-870,000, likely resulting from recent acquisitions through horizontal gene transfer. The genome carries 2216 coding sequences (CDS), 56 tRNA, and 12 rRNA genes grouped in four operons. Of the predicted CDSs, 1616 (72.9%) have a known predicted biological function; 145 (6.5%) are similar to hypothetical proteins in other genomes, and 455 (20.5%) have no substantial similarity to other predicted proteins.
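The GC content and GC skew mentioned above can be computed with a simple sliding window. The following is a minimal sketch, not the procedure used in this study: the function names, the window and step sizes, and the toy sequences are all illustrative assumptions. Skew is taken here as (G - C) / (G + C) per window, which is the usual definition.

```python
def gc_content(seq):
    """Return the GC content of a sequence as a percentage."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def gc_skew_windows(seq, window, step):
    """Return (window_start, skew) pairs, with skew = (G - C) / (G + C).

    The window and step sizes are hypothetical parameters; the article
    does not state which values were used for its GC skew plot.
    """
    seq = seq.upper()
    results = []
    for start in range(0, len(seq) - window + 1, step):
        chunk = seq[start:start + window]
        g, c = chunk.count("G"), chunk.count("C")
        skew = (g - c) / (g + c) if (g + c) else 0.0
        results.append((start, skew))
    return results


if __name__ == "__main__":
    toy = "GGGGCCCC"  # toy sequence, not AP200 data
    print(gc_content(toy))
    print(gc_skew_windows(toy, window=4, step=4))
```

On a real chromosome, a sign change in the skew normally marks the replication origin and terminus, so a local anomaly such as the one near positions 820,000-870,000 would stand out against the surrounding windows.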
The AP200 genome contains approximately 170 kb that are not present in TIGR4 [GenBank: NC_010380], the first sequenced pneumococcal strain. Besides the two exogenous elements, the large Tn1806 transposon and a temperate bacteriophage designated ϕSpn_200, the extra regions include the type 11A capsular locus, the pilus islet 2, and two metabolic operons (Additional file 1). Of the latter, one contains the genes of the arginine succinate pathway, which is present in most pneumococci as an alternative to the arginine deaminase pathway, and the second likely contains genes for uptake, metabolism and excretion of sulphur-containing amino-sugars. Three other operons containing uptake systems of unknown substrates are also present. Other regions of difference between TIGR4 and AP200 include the presence in the latter of a DpnII restriction system and a double glycine-type bacteriocin gene (Additional file 1). The extent and type of genomic variation between AP200 and TIGR4 are in line with the genetic diversity found within this species by other studies comparing a series of pneumococcal genomes [21, 25, 26].
Comparison of the AP200 genome with TIGR4 also revealed a large chromosomal inversion of approximately 163 kb across the replication axis and involving the termination site (Figure 2). Large-scale inversions are typically driven by homologous recombination among repeated regions. The AP200 inversion borders fall within the coding sequences of PhtB and PhtD, two proteins which are part of the histidine-triad protein family, characterized by the repeated histidine triad motif HxxHxH. This family is composed of 4 proteins (PhtA, PhtB, PhtD, and PhtE) showing high sequence similarity. PhtB and PhtD, which are involved in the AP200 chromosomal inversion, reach approximately 87% amino acid identity.
Chromosomal inversions are thought to be implicated in the rebalance of the chromosomal architecture when it is affected by insertions of large DNA regions, such as transposons, IS elements or prophages. In particular, it has been speculated that a chromosomal imbalance could be caused when large DNA fragments are inserted on one side of the replication axis, as in the case of the AP200 genome, where the large exogenous elements reside to the right of the replication axis. To date, the only pneumococcal genome described to carry a large chromosomal inversion is CGSP14. In CGSP14 the inversion also occurs across the termination site but involves a different region (Figure 2). Inversions are also present in 2 recently sequenced pneumococcal genomes, Taiwan 19F-14 [GenBank: NC_012469] and TCH8431/19A [GenBank: NC_014251], although they have not been described (Figure 2). In these strains, the chromosomal inversions involve much larger regions. These observations suggest that the synteny of the pneumococcal genome is not always conserved.
A striking feature of pneumococcal genomes is the over-representation of IS elements [23, 29]. AP200 contains 63 transposases and inactivated derivatives thereof (http://www-is.biotoul.fr/is.html). In order of frequency, the insertion sequences present in the genome are IS1239 (10 copies), IS1381-ISSpn7 (9 copies), IS1515 (8 copies), ISSpn2 and IS1167 (6 copies each), IS630, ISSpn1-3 and IS1380-ISSpn5 (4 copies each), and IS1202 (one copy). Interestingly, for 3 of these families, the number and insertion site of the IS elements present in AP200 differ from those present in the other two serotype 11A, ST62 strains, SP11-BS70 [GenBank: NZ_ABAC00000000] and MLV-016 [GenBank: NZ_ABGH00000000], although the draft genome status of these two strains makes it impossible to carry out a complete comparison. Only 3 out of 8 IS1515 insertions, and only 2 out of 4 of the IS1380-ISSpn5 insertions, are shared between AP200 and the other serotype 11A strains, while one of the IS1239 copies is present in AP200 only and is integrated in the comC gene, making AP200 unable to develop natural competence. The fact that the insertion sites for IS1239, IS1380, and IS1515 copies vary between ST62 strains suggests that these IS elements have maintained their ability to transpose within the strains. In AP200, one copy of IS1515 is inserted within the nanB gene, producing a truncated neuraminidase B. In addition to these known IS elements, 7 other uncharacterized elements are present in AP200, with copy numbers ranging from 1 to 3. These ISs have been named ISSpn_AP200_1 to ISSpn_AP200_7.
Notably, AP200 shares with the other serotype 11A, ST62 strains a unique mutation in the 23S rRNA (T552C) that is not present in the other sequenced pneumococci. This mutation has also been confirmed by Sanger sequencing.
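The histidine triad motif HxxHxH that defines the Pht family can be located in a protein sequence with a short pattern search. The sketch below is only an illustration: the function name and the toy peptide are hypothetical and are not taken from the PhtB or PhtD sequences. A lookahead is used so that overlapping occurrences would also be reported.

```python
import re

# H, any two residues, H, any residue, H. The lookahead lets overlapping
# motifs be found as separate matches.
HIS_TRIAD = re.compile(r"(?=(H..H.H))")


def find_histidine_triads(protein):
    """Return (0-based position, matched motif) for each HxxHxH occurrence."""
    return [(m.start(), m.group(1)) for m in HIS_TRIAD.finditer(protein.upper())]


if __name__ == "__main__":
    toy_peptide = "MKHAAHGHLLHGGHSHK"  # toy sequence, not a real Pht protein
    for position, motif in find_histidine_triads(toy_peptide):
        print(position, motif)
```

Repeated motifs of this kind, embedded in highly similar paralogous genes, are the sort of repeated regions that can serve as substrates for the homologous recombination described above.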
A plethora of virulence factors have been described in S. pneumoniae. Among them, the most important is the polysaccharide capsule, shielding pneumococci from the host natural immune defense. The capsular serotype of AP200 was identified as 11A according to the Quellung reaction, but sequence analysis revealed that the capsular locus matched closely that of serotype 11D. In particular, AP200 showed only 3 nucleotide changes when compared to the 11D capsular locus of the reference strain 70/86 [GenBank: CR931656]: two silent transitions in wze and wchA, respectively, and a G/A transition (G10118A) determining a change of a serine into an asparagine in the glycosyl transferase gene wcrL. The capsular loci of the two other ST62 serotype 11A strains, SP11-BS70 and MLV-016 [GenBank: NZ_ABGH00000000], also match the 11D capsular locus. SP11-BS70, like AP200, has been repeatedly tested using the Quellung reaction by us and by the pneumococcal reference laboratory at the Statens Serum Institute, consistently yielding serotype 11A. From these results it appears that these ST62 isolates have a serotype 11A phenotype, but possess an 11D capsular locus. The same conclusion has been reached by Moon Nahm's laboratory examining the serotype 11A isolates obtained at the Centers for Disease Control and Prevention in Atlanta, GA (M. Nahm, personal communication).
Sequence differences between capsular locus 11A and 11D cluster mainly in the insertion sequence (IS1202) flanking the 5' end of the locus and in the wcjE gene, encoding a putative O-acetyl transferase. While the biochemical structure of the type 11A capsule is known, that of the type 11D capsule has not been elucidated; therefore, it is unclear which structural difference underlies the immunological difference. In addition, serotype 11D is quite rare, since no isolates of this serotype appear in the MLST database or in recent large datasets. On the other hand, recent findings indicate that serotype 11A has a high degree of genetic heterogeneity. A new pneumococcal serotype, designated 11E, has been recently discovered among isolates previously identified as serotype 11A, and has been found to be associated with a mutated or disrupted wcjE gene. On the basis of these data and our results, it appears that serotype 11 is genotypically variable and it is likely that its typing scheme will be reconsidered in the near future.
Most of the other pneumococcal virulence factors are surface-exposed proteins such as the choline-binding proteins (CBPs) and the LPXTG proteins. Ten different CBP genes have been recognized in the genome of AP200, including pspA and pspC, which play an important role in pneumococcal pathogenicity [33, 34]. Both these proteins are characterized by an extensive polymorphism, likely reflecting the immunological selective pressure to which they are exposed. According to the classification of Hollingshead et al., which defines 6 immunologically relevant monophyletic groups (clades) on the basis of the divergence of the PspA central region, AP200 PspA belongs to clade 3. Similarly, the PspC protein has been divided into 11 major groups on the basis of unique sequence blocks. According to this classification, AP200 PspC corresponds to PspC3.
The LPXTG family includes proteins anchored to the peptidoglycan cell wall by the action of a sortase transpeptidase that recognises the motif LPXTG. Pili, recently discovered in pneumococci, are composed of LPXTG-type protein subunits, and can be of 2 types, encoded by 2 different islets, PI-1 and PI-2 [24, 37]. AP200 carries PI-2, which is found in only 20% of pneumococci and has been demonstrated to mediate adherence to the epithelial cells of the respiratory tract. The PI-2 region in AP200 is identical to that of the serotype 1 strain PN110, being flanked by the hemH and pepT genes, but is contained in the 163 kb inversion. Of the two other sequenced serotype 11A ST62 strains, only SP11-BS70 carries PI-2. A recent investigation on the prevalence of PI-2-carrying pneumococcal isolates in Atlanta, USA, highlighted the increase of serotypes carrying PI-2 among emerging non-PCV7 serotypes, including serotype 11A.
Four large surface zinc metalloproteinases have been described in S. pneumoniae, including the IgA protease, which cleaves human IgA1, the ZmpC proteinase, which cleaves human matrix metalloproteinase 9, and ZmpB and ZmpD, whose substrates have not yet been identified. The zinc metalloproteinases are involved in virulence and possess antigenic properties. AP200 carries three of them, iga, zmpB and zmpC, and lacks zmpD.
Mobile genetic elements of AP200
The Tn1806 transposon represents the sole erm(TR)-carrying genetic element reported in S. pneumoniae to date, and only a partial sequence was published by our group in 2008. Tn1806 is 52,457 bp in size, smaller than the size previously estimated by PCR mapping, has a GC content of 31.1%, and comprises 49 ORFs. The chromosomal insertion site (hsdM gene) of Tn1806 is characterized by the duplication of 3 nucleotides (GGG) representing the target sequence for the integration. Although various proteins related to mobilization are present, such as a TraG/TraD protein, a Type IV secretory protein, a relaxase and 3 recombinases at the right end (Figure 3 and Additional file 2), conjugation experiments have failed to show transferability of Tn1806 to other strains. Other putative antibiotic resistance genes are present in Tn1806 in the region flanking erm(TR), such as the two components of a tetronasin ABC-type efflux system and a spectinomycin phosphotransferase. A TetR family transcriptional regulator is located upstream of the tetronasin efflux system, likely being involved in its regulation [43, 44].
Tn1806 shows an overall similarity with the erm(TR)-carrying genetic element described in Streptococcus pyogenes MGAS10750, named ICE10750-RD.2. ICE, Integrative and Conjugative Element, identifies a new classification nomenclature, grouping self-transmissible genetic elements previously designated as transposons, conjugative transposons, genomic islands and plasmids, sharing a common mechanism of horizontal transfer via site-specific recombination. In this broad definition, Tn1806 can also be considered an ICE. Tn1806 is approximately 4 kb larger than ICE10750-RD.2 due to the presence of additional regions (Figure 3). Starting from the 5' end of the element, Tn1806 contains 3 additional ORFs homologous to hypothetical proteins of the chimeric element RD1 of S. pyogenes MGAS6180, 2 ORFs homologous to hypothetical proteins contained in the plasmid pAPRE01 of Anaerococcus prevotii DSM20548, and a retron-type reverse transcriptase inserted inside the adenine-specific DNA methylase gene. In addition, in Tn1806 downstream of erm(TR), 2 transposases replace a cytidine deaminase and a Zn-dependent hydrolase present in ICE10750-RD.2, while 2 hypothetical proteins replace an ORF that is predicted to encode a death-on-curing protein, part of a toxin-antitoxin system (Figure 3). The antibiotic-resistance region, including the erm(TR) flanking genes, is present in ICE10750-RD.2 as well as in other S. pyogenes erm(TR)-carrying elements recently described.
Comparative nucleotide analysis with current databases revealed that Tn1806 shows large regions of homology with other putative genetic elements present in the sequenced genomes of different bacterial species, including Finegoldia magna ATCC 29328 [GenBank: AP008971] and Clostridium difficile M120 [GenBank: FN665653], and with pAPRE01, a plasmid of A. prevotii DSM20548 [GenBank: CP001709]. All these species are anaerobic opportunistic pathogens; F. magna and A. prevotii share the same ecological niche, i.e. the oral cavity, with S. pneumoniae and S. pyogenes, while C. difficile is part of the intestinal microflora. The genetic elements of these three anaerobic species share a high nucleotide identity (88-95%), especially with the leftmost part of Tn1806 (Figure 4). Sequences with similarity to Tn1806 have also been found in the incomplete genome of Ureaplasma urealyticum serovar 9 ATCC 33175 [GenBank: NZ_AAYQ02000002] and in other incomplete genomes belonging to Anaerococcus spp. and Peptoniphilus spp. All these genetic elements share large fragments, with insertions/deletions or replacement of different modules that probably confer element-specific features. Modules can contain different accessory genes: one example is represented by the antibiotic-resistance region that is present in Tn1806 and ICE10750-RD.2, but is missing in the other genetic elements. In F. magna, this region is replaced by a module of similar size including multidrug ABC transporter proteins (Additional file 3). These elements, carried by different bacterial species, likely diversify and evolve through the reciprocal shuffling of regions in putative hot spots; the diversity likely reflects the adaptation to different niches and/or to the antibiotic selective pressure.
ϕSpn_200 prophage genome
The second exogenous region identified in AP200 corresponds to a prophage designated ϕSpn_200. The ϕSpn_200 genome is 35,989 bp in size with a GC content of 39.3%, which is consistent with that of S. pneumoniae. ϕSpn_200 is inserted between the adenylosuccinate synthetase and the tRNA-specific adenosine deaminase genes. Sequence analysis of the junctions between the ϕSpn_200 genome and the host chromosome revealed the presence of a 21-bp duplication (5'-CTTTTTCATAATAATCTCCCT-3'), likely derived from the recombination between the bacterial (attB) and the phage (attP) attachment sites. Confirmation that the 21-bp region corresponds to the attP site was obtained by sequencing the DNA of the phage circular forms.
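An integrated prophage bounded by a duplicated attachment core can be located by searching for the two copies of the core and measuring the DNA between them. The sketch below uses the 21-bp sequence reported above, but the function names and the toy chromosome are hypothetical assumptions, and the approach presumes the core occurs exactly twice on the searched strand.

```python
ATT_CORE = "CTTTTTCATAATAATCTCCCT"  # 21-bp duplication reported for ϕSpn_200


def find_all(seq, motif):
    """Return every 0-based start position of motif in seq (overlaps allowed)."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def length_between_cores(chromosome, core=ATT_CORE):
    """Return the length of the DNA lying between two copies of the core.

    Returns None unless the core is found exactly twice, since any other
    count would make the left/right junction assignment ambiguous.
    """
    hits = find_all(chromosome, core)
    if len(hits) != 2:
        return None
    left, right = hits
    return right - (left + len(core))


if __name__ == "__main__":
    # Toy chromosome: flank + core + 20-bp stand-in "prophage" + core + flank.
    toy = "AAGG" + ATT_CORE + "ACGT" * 5 + ATT_CORE + "TTCC"
    print(len(ATT_CORE), find_all(toy, ATT_CORE), length_between_cores(toy))
```

A real analysis would also need to search the reverse strand and tolerate mismatches between the two junction copies, neither of which is handled here.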
The genome of ϕSpn_200 includes a total of 47 ORFs organized into five modules: the lysogeny, the replication, the packaging, the structural, and the lytic modules (Figure 5A). Such modular organization, especially the presence of closely arranged lysogeny-related genes, resembles that of the Siphoviridae family infecting low-GC content Gram-positive bacteria. The predicted ORFs were compared with sequences from protein databases, and the regions of homology of the ϕSpn_200 genome are described in detail in Additional file 4.
The lysogeny module is located immediately downstream of the left-end att site; it is composed of the integrase, belonging to the family of tyrosine recombinases, the Cro/CI-like transcriptional regulator and the repressor involved in suppression of the phage lytic cycle (Figure 5A). The second module carries genes with regulatory functions implicated in the replicative processes. The third module includes genes implicated in the packaging of the phage genome concatemers into the empty capsid shell, such as the large terminase gene. The structural region encodes the morphogenetic proteins involved in the head and tail assembly. Among these proteins, the presence of PblB is noteworthy: it corresponds to the phage tail fiber, involved in tail/host recognition. This protein is also considered a phage-encoded virulence factor. In Streptococcus mitis, PblB is carried by the bacteriophage SM1 and, together with PblA, a protein that is missing in ϕSpn_200, it can enhance binding of the microorganism to platelets [51, 52]. No other potential virulence factor was identified in ϕSpn_200, but it must be considered that no function was assigned to 28 out of 47 phage ORFs. The last phage module includes genes implicated in cell lysis and phage progeny release, such as those encoding lysin and holin proteins. Holin acts by forming pores in the cytoplasmic membrane, thereby allowing lysin to reach the cell wall and begin cell lysis.
An almost identical prophage, inserted in the same chromosomal region at the identical attB attachment site, is present in the newly sequenced S. pneumoniae strain Hungary19A-6 [GenBank: CP000936], and in the draft genomes of CDC1873-00 [GenBank: NZ_ABFS01000005] and SP14-BS69 [GenBank: NZ_ABAD01000021] (Figure 6). Interestingly, a prophage inserted in the same site as ϕSpn_200, named ϕSpn_11, is also present in the SP11-BS70 genome. ϕSpn_11 and ϕSpn_200 represent different phages, although they share the integrase and the following ORF of the lysogeny module, 12 out of 21 genes of the replication module and all the lytic genes (Figure 6). Comparative analysis revealed that ϕSpn_200 shows various degrees of similarity with other streptococcal prophages. The ϕSpn_200 packaging and structural modules are highly similar to the corresponding regions of phage LambdaSa2 of Streptococcus agalactiae 2603 V/R, with an amino acid identity ranging from 53 to 92% (Figure 6). The presence in ϕSpn_200 of functional modules that are also carried by a different phage supports the modular theory of phage evolution, according to which the diversification of phage genomes rests mainly on the exchange of entire modules between different phage groups. Indeed, in pneumococcal phages the exchanging unit could also consist of a single gene, as suggested by the homology of single genes of the replication module of ϕSpn_200 with the corresponding genes of phage MM1 of S. pneumoniae, of phage SM1 of S. mitis and of LambdaSa2 of S. agalactiae 2603 V/R.
According to a recently published prophage typing system, the pneumococcal phages can be classified into three main groups, of which group 1 is the most abundant. On the basis of nucleotide homologies, ϕSpn_200 can be assigned to group 1.
Electron microscopic characterization and infection activity of ϕSpn_200
Concentrated supernatants of mitomycin-induced S. pneumoniae AP200 cultures were examined by transmission electron microscopy. Ultrastructural analysis revealed the presence of phage particles consisting of a small isometric head with a diameter of 56 ± 2 nm and a long flexible tail of 156.8 ± 2 nm, characteristics belonging to the Siphoviridae family (Figure 5B). A collar structure was observed at the position where head and tail meet (Figure 5B). Since only one prophage was detected in the genome of AP200, we concluded that the phage observed by electron microscopy was ϕSpn_200.
The infection activity of ϕSpn_200 was tested on the pneumococcal strain Rx1. The results demonstrated that ϕSpn_200 induced the formation of lysis plaques on the Rx1 culture plates (Additional file 5).
The number of sequenced bacterial genomes has been rapidly increasing in recent years thanks to the use of new technologies, such as the high-throughput Roche 454 pyrosequencing [60, 61]. S. pneumoniae serotype 11A is becoming an emergent serotype in the post-PCV7 era, and data concerning its genetic characteristics can be of importance for future vaccines. The reasons determining the increase in the incidence of pneumococcal infections due to non-vaccine serotypes, including serotype 11A, are complex and not yet fully understood. Multiple factors could take part in this phenomenon, such as geographical and temporal trends, the prevalence of these serotypes in the community, the ability to evade host defenses, and the acquisition of new genetic material that could potentially increase their invasive capacity or their resistance to antibiotics.
In this study, the entire genomic sequence of S. pneumoniae AP200, belonging to serotype 11A and ST62, has been obtained. Sequence analysis revealed chromosomal rearrangements and horizontal gene transfers. A large chromosomal inversion across the replication axis was found: it is likely that this inversion originated to maintain the genome stability affected by horizontal gene transfer events, as suggested by Ding et al. The presence of large genomic inversions is a phenomenon observed in other streptococcal species, where it could contribute to generating chromosomal shuffling and creating novel genetic pools [63–65].
Horizontal gene transfer events involved mainly two mobile elements, the erm(TR)-carrying genetic element Tn1806 and the functional prophage ϕSpn_200. The modular organization recognized inside the two exogenous elements, and their similarity to other elements of different bacterial species, confirm that they have undergone frequent DNA exchanging events, that appear to be the major contributors to the overall diversity of the genome of S. pneumoniae AP200.
Although the availability of complete pneumococcal genomes cannot provide a full explanation for the evolution and spread of a particular serotype or clone, it can contribute information on the pathogenic potential of this important microorganism. Regarding AP200, the presence of pilus islet 2 could confer a selective fitness advantage, mediating adherence to the nasopharyngeal epithelium, and could represent a target for future vaccines [24, 38]. In addition, the presence of the transposon Tn1806, conferring erythromycin resistance, is an advantage to the microorganism in view of the large use of macrolides in the community. Finally, in AP200 the discrepancy between the serotyping result and the sequence of the capsular locus deserves further investigation, also in view of the increasing use of PCR-based methods for serotype determination.
S. pneumoniae AP200 was isolated from the cerebrospinal fluid of an adult patient with meningitis in 2003. AP200 was found to belong to serotype 11A and to ST62, although previously it had been erroneously attributed to a different ST. ST62 is the predicted founder of CC62, to which most serotype 11A isolates belong (http://spneumoniae.mlst.net/). AP200 is resistant to erythromycin, with a MIC of 1 μg/ml, and shows inducible resistance to clindamycin due to the presence of the erm(TR) resistance gene.
Sample Preparation and High-density Pyrosequencing
Genomic DNA of AP200 (4 μg), prepared using the Cell and Blood Culture DNA Midi kit (Qiagen, Valencia, CA), was fragmented by nitrogen nebulization for 1 minute at a pressure of 45 psi. Fragmented DNA was purified using silica spin-columns (MinElute PCR purification kit, Qiagen, Valencia, CA) and subsequently analyzed by Agilent Bioanalyzer 2100 with the DNA 1000 Kit (Agilent Technologies, Palo Alto, CA, USA) to check the average fragment size. The double-stranded fragmented DNA was prepared as reported in the Roche-454 Library Preparation Manual to obtain the ssDNA library. The sample was analyzed with Agilent Bioanalyzer 2100 and the mRNA Pico Kit (Agilent Technologies), and was fluorometrically quantitated by RiboGreen RNA Quantitation Kit (Invitrogen Inc., Carlsbad, California). A second DNA library (insert size 2000-2500 bp) was prepared starting from 3 μg of total genomic DNA to perform Paired-Ends sequencing, following the Roche-454 Paired End Library Preparation Manual. The samples prepared for the standard shotgun and for the Paired-Ends sequencing were sequenced by means of the Genome Sequencer 454 FLX.
Sequencing Data analysis
A total of 263,671 high-quality sequences and 37,704,248 bp were obtained, corresponding to a 17-fold coverage of the genome. The 454 de Novo Assembler software was used to assemble the reads. This first automatic step produced 130 contigs, of which 91 were large contigs with a maximum size of 149,967 bp. The de novo assembly created 8 scaffolds for a total of 2,107,179 bp, the largest scaffold being 1,176,929 bp in size. Every added sequence read was checked manually to confirm the correct assembly. Gaps between and inside the 8 scaffolds, due to the difficult assembly of repetitive DNA and complex regions, were closed using a long-PCR strategy and Sanger sequencing. A manual inspection of the final assembly was required. Since homopolymeric stretches in the genome carry a high probability of frameshift errors during assembly, potential errors were checked by visual inspection of the sequence reads.
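The coverage figure follows directly from the numbers reported above: total sequenced bases divided by genome length. The short sketch below reproduces that arithmetic together with the mean read length and the fraction of the final genome represented in the initial scaffolds; the variable and function names are illustrative, and the values are copied from the text.

```python
GENOME_LENGTH_BP = 2_130_580   # finished AP200 chromosome
TOTAL_READS = 263_671          # high-quality 454 reads
TOTAL_BASES_BP = 37_704_248    # bases in those reads
SCAFFOLD_TOTAL_BP = 2_107_179  # sum of the 8 de novo scaffolds


def fold_coverage(total_bases, genome_length):
    """Average sequencing depth: sequenced bases per genome position."""
    return total_bases / genome_length


def mean_read_length(total_bases, read_count):
    """Average number of bases per read."""
    return total_bases / read_count


def percent_in_scaffolds(scaffold_total, genome_length):
    """Share of the finished genome length present in the scaffolds."""
    return 100.0 * scaffold_total / genome_length


if __name__ == "__main__":
    print(f"coverage: {fold_coverage(TOTAL_BASES_BP, GENOME_LENGTH_BP):.1f}x")
    print(f"mean read length: {mean_read_length(TOTAL_BASES_BP, TOTAL_READS):.0f} bp")
    print(f"in scaffolds: {percent_in_scaffolds(SCAFFOLD_TOTAL_BP, GENOME_LENGTH_BP):.1f}%")
```

The computed depth is slightly above 17, so the reported 17-fold coverage appears to be rounded down.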
Genome annotation and comparison
The generated sequences were annotated identifying coding genes by cross prediction from the FGENESB package http://www.softberry.com/, the GeneMark program  and the GLIMMER program . We considered an open reading frame (ORF) prediction to be good when it was identified by each of the three prediction tools. Discrepant ORFs were manually verified by the Artemis viewer  and by identification of putative ribosomal binding sites. Each gene was functionally classified by assigning a cluster of orthologous group (COG) number or a Kyoto encyclopedia of genes and genomes (KEGG) number, and each predicted protein was compared against every protein in the non- redundant (nr) protein databases http://ncbi.nlm.nih.gov. In order to associate a function with a predicted gene, we used a minimum cut-off of 30% identity and 80% coverage of the gene length, checking at least two best hits among the COG, KEGG, and non- redundant protein databases. The rRNA genes were identified by the FGENESB tool on the basis of sequence conservation, while tRNA genes were detected with the tRNAscan-SE program. The BLASTp algorithm was used to search for protein similarities with other pneumococcal genomes or deposited sequences referred in the present study, following these criteria: >50% similarity at the amino acid level and >50% coverage of protein length.
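The acceptance rules described above can be expressed as a few set operations and threshold checks. The following is a minimal sketch under stated assumptions: ORFs are represented as hypothetical (start, end, strand) tuples, the function names are illustrative, and the actual output formats of FGENESB, GeneMark and GLIMMER are not modelled.

```python
def consensus_orfs(fgenesb, genemark, glimmer):
    """ORFs predicted by all three tools (treated as good predictions)."""
    return set(fgenesb) & set(genemark) & set(glimmer)


def discrepant_orfs(fgenesb, genemark, glimmer):
    """ORFs missed by at least one tool (candidates for manual checking)."""
    union = set(fgenesb) | set(genemark) | set(glimmer)
    return union - consensus_orfs(fgenesb, genemark, glimmer)


def passes_function_cutoff(identity_pct, coverage_pct):
    """Function assignment rule: at least 30% identity and 80% coverage."""
    return identity_pct >= 30.0 and coverage_pct >= 80.0


def passes_similarity_cutoff(similarity_pct, coverage_pct):
    """BLASTp comparison rule: >50% similarity and >50% coverage."""
    return similarity_pct > 50.0 and coverage_pct > 50.0


if __name__ == "__main__":
    # Toy predictions as (start, end, strand); coordinates are made up.
    a = [(1, 300, "+"), (400, 900, "-"), (1000, 1500, "+")]
    b = [(1, 300, "+"), (400, 900, "-")]
    c = [(1, 300, "+"), (400, 900, "-"), (2000, 2600, "+")]
    print(sorted(consensus_orfs(a, b, c)))
    print(sorted(discrepant_orfs(a, b, c)))
    print(passes_function_cutoff(35.0, 85.0), passes_similarity_cutoff(50.0, 90.0))
```

Exact coordinate matching is a simplification: gene finders often disagree only on the start codon, so a practical comparison might key on the stop position and strand instead.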
AP200 was grown in BHI broth at 37°C to a turbidity corresponding to an OD620 of 0.2-0.3. Mitomycin C (Sigma-Aldrich, St. Louis, MO) was added to a final concentration of 0.1 μg/ml and the culture was incubated until lysis occurred, as shown by a decrease in turbidity. Cellular debris was pelleted at 16,000 g for 15 min. The induced supernatant was filtered through a 0.44-μm pore size filter (Millipore, Billerica, MA). For negative staining, the filtered supernatant was ultracentrifuged at 100,000 g for 2 h at 4°C. Suspensions of the pellet were placed on Formvar-carbon coated 400 mesh copper grids for 10 s, wicked with filter paper, placed on a drop of 2% sodium phosphotungstate, pH 7.00, for 10 s, wicked again and air-dried. Negatively stained preparations were observed with a Philips 208 electron microscope at 80 kV.
To obtain phage DNA, the phage pellet was lysed with sodium dodecyl sulfate (0.5%), EDTA (10 mM) and proteinase K (500 μg/ml) for 2 h at 37°C. Phage DNA was precipitated with a 10% volume of 3 M NaOAc (pH 5.2) and 2 volumes of ethanol at -70°C for 2 h, washed with 70% ethanol and resuspended in deionized H2O. To demonstrate circularization of the excised prophage, a PCR assay was carried out using the phage DNA as template and a divergent primer pair (FR9 5'-CTAGACTTGCGATAGCAGTTACC-3' and FR10 5'-GCTTGAACAATTAAGCCAAGCG-3') designed on the opposite ends of the prophage sequence. The PCR product was purified and sequenced using a Perkin-Elmer ABI 377 DNA sequencer (PE Applied Biosystems).
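The logic of the divergent-primer assay can be illustrated with a small in silico sketch: primers that point outward from the two ends of the prophage cannot yield a product on a linear template, but they face each other across the junction once the element has circularized. The template and primers below are invented toy sequences (not FR9/FR10 or the ϕSpn_200 sequence), the function names are hypothetical, and primer binding is modelled as exact string matching, which is a strong simplification.

```python
def revcomp(seq: str) -> str:
    """Reverse complement of a DNA sequence (upper-case A/C/G/T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def pcr_product(template: str, fwd: str, rev: str, circular: bool = False):
    """Return the predicted amplicon, or None if no product is expected.

    fwd matches the top strand; rev anneals to the top strand, so its
    binding site on the top strand is revcomp(rev).
    """
    rev_site = revcomp(rev)
    fwd_pos = template.find(fwd)
    rev_pos = template.find(rev_site)
    if fwd_pos < 0 or rev_pos < 0:
        return None  # at least one primer does not bind
    rev_end = rev_pos + len(rev_site)
    if fwd_pos <= rev_pos:
        return template[fwd_pos:rev_end]  # primers converge directly
    if circular:
        # Divergent primers: extension crosses the joined ends of the circle.
        return template[fwd_pos:] + template[:rev_end]
    return None  # divergent primers on a linear template


# Toy "prophage": the outward-facing primers sit at opposite ends.
toy_phage = "GGATCCAAGT" + "TTTTTTTTTT" + "CCGTACGATG"
toy_fwd = "CGTACGAT"            # near the right end, pointing rightward
toy_rev = revcomp("GATCCAAG")   # near the left end, pointing leftward

linear_result = pcr_product(toy_phage, toy_fwd, toy_rev, circular=False)
circular_result = pcr_product(toy_phage, toy_fwd, toy_rev, circular=True)

print("linear template:", linear_result)
print("circular template:", circular_result)
```

Only the circular form gives an amplicon, and that amplicon spans the junction between the two prophage ends, which is why sequencing the PCR product confirms circularization.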
To demonstrate phage activity, a plaque assay was performed. Briefly, 0.1 ml of filtered induced supernatant was pre-incubated with 0.9 ml of the pneumococcal indicator strain Rx1 at about 10^8 cells/ml for 30 min at 37°C. A 0.1-ml aliquot of this adsorption mix was added to 3 ml of 2% blood soft agar, poured onto a plate containing a layer of bottom agar and incubated overnight at 37°C.
Nucleotide sequence accession numbers
The AP200 genome sequence was submitted to the GenBank database [GenBank: CP002121]. The nucleotide sequence of Tn1806 was deposited as an update of GenBank accession number [GenBank: EF469826].
ORF: open reading frame
CDS: protein coding sequence
This work was supported in part by grants from the Italian Ministry of University and Research (FIRB 2005 "Costruzione di un Laboratorio Nazionale per lo Studio delle Resistenze Batteriche agli Antibiotici") and from the European Commission, 6th Framework, DRESP2 project and FP7-HEALTH-2007-B-222983. We are indebted to Fen Hu, Allegheny-Singer Research Institute, Pittsburgh, PA, USA, for providing strain SP11-BS70 and to Lotte Munch Lambertsen, Statens Serum Institut, Copenhagen, Denmark, for confirming the serotypes of the pneumococcal strains.
RC performed the analysis of genetic elements, the phage induction experiments and drafted the manuscript. RJPB, MI and GC performed the bioinformatic analysis and participated in genome comparison. MDG and FI participated in the analysis and comparison of the exogenous genetic elements. ER performed DNA preparation and generated the 454 sequencing data. FS and MM carried out the ultrastructural characterization of phage particles. LM participated in the genome comparison. GDB participated in the design of the study, its coordination and helped in revising the manuscript. MRO participated in the design of the study, carried out the genome comparison and helped in writing the manuscript. AP participated in the design of the study, its coordination and finalized the manuscript. All authors read and approved the final manuscript.