- Methodology article
- Open Access
Expression capable library for studies of Neisseria gonorrhoeae, version 1.0
BMC Microbiologyvolume 5, Article number: 50 (2005)
The sexually transmitted disease, gonorrhea, is a serious health problem in developed as well as in developing countries, for which treatment continues to be a challenge. The recent completion of the genome sequence of the causative agent, Neisseria gonorrhoeae, opens up an entirely new set of approaches for studying this organism and the diseases it causes. Here, we describe the initial phases of the construction of an expression-capable clone set representing the protein-coding ORFs of the gonococcal genome using a recombination-based cloning system.
The clone set thus far includes 1672 of the 2250 predicted ORFs of the N. gonorrhoeae genome, of which 1393 (83%) are sequence-validated. Included in this set are 48 of the 61 ORFs of the gonococcal genetic island of strain MS11, not present in the sequenced genome of strain FA1090. L-arabinose-inducible glutathione-S-transferase (GST)-fusions were constructed from random clones and each was shown to express a fusion protein of the predicted size following induction, demonstrating the use of the recombination cloning system. PCR amplicons of each ORF used in the cloning reactions were spotted onto glass slides to produce DNA microarrays representing 2035 genes of the gonococcal genome. Pilot experiments indicate that these arrays are suitable for the analysis of global gene expression in gonococci.
This archived set of Gateway® entry clones will facilitate high-throughput genomic and proteomic studies of gonococcal genes using a variety of expression and analysis systems. In addition, the DNA arrays produced will allow us to generate gene expression profiles of gonococci grown in a wide variety of conditions. Together, the resources produced in this work will facilitate experiments to dissect the molecular mechanisms of gonococcal pathogenesis on a global scale, and ultimately lead to the determination of the functions of unknown genes in the genome.
Neisseria gonorrhoeae (gonococcus), a Gram-negative diplococcus, is one of two pathogenic members of the Neisseriaceae family of bacteria. N. gonorrhoeae is the causative agent of the sexually transmitted disease, gonorrhea, one of the oldest documented infectious diseases. Gonorrheal disease has significant morbidity both in the US and worldwide. According to the Centers for Disease Control , >350,000 cases of gonorrhea were reported in the United States in 2002. The World Health Organization  estimates that over 19 million cases occur annually in the African continent alone. Treatment of gonorrhea is increasingly problematic due to the high frequency of acquisition of resistance to multiple antibiotics [3, 4] and to the observation that gonococcal infection does not elicit protective immunity . Gonorrheal infections, though not usually life-threatening, also enhance the transmission of HIV .
N. gonorrhoeae is strictly a human pathogen, with no known animal reservoir. The bacterium has no environmental niche, and cannot survive outside the human host. In adults, N. gonorrhoeae is acquired primarily through sexual contact. However, the eyes of newborn infants may be infected by passing through an infected birth canal, resulting in the condition, ophthalmia neonatorum, which can lead to blindness. In most cases, gonococcal infections are limited to the urogenital tract, causing urethritis in men and cervicitis in women. Occasionally, gonococci cross the epithelial barrier to enter the bloodstream causing septicemia, and transit to the joints resulting in arthritis. In women, ascending infections from the endocervix can result in pelvic inflammatory disease, salpingitis, tubal blockage and infertility. N. gonorrhoeae can also establish a carrier state in which apparently healthy individuals harbor culturable and infectious bacteria . Carriers are thought be important for disease dissemination. A recent study revealed that the gonococcal carriage rate in women was 6.7% in a major metropolitan area .
Due to the importance of N. gonorrhoeae to human health, much research effort has focussed on identifying virulence factors and elucidating the biochemical interactions of these factors with the host cell [9–11], with the goal of developing vaccines and alternative treatments. It is clear, however, that in order to fully understand the capabilities of this organism to cause disease and elude eradication, it will be necessary to ultimately determine the functions of a great deal more of the gene products encoded by the gonococcal genome. The recent genome sequencing makes possible a variety of genomic and proteomic studies of N. gonorrhoeae. To facilitate such studies, we have cloned into a bacteriophage lambda-based recombination cloning system (Gateway® , Invitrogen, Carlsbad, CA) 1624 of the 2189 predicted ORFs from the genome of N. gonorrhoeae strain FA1090 , and 48 of the 61 ORFs of the gonococcal genetic island (GGI) of strain MS11 [14, 15]. This clone-set allows the generation of transcriptional and translational fusions without the necessity of additional cloning and sequencing. Coupled to the construction of this clone set, DNA microarrays were generated by spotting the insert DNA onto glass slides. Preliminary experiments with the clone set and DNA arrays indicate that this system is suitable for studies of expression of genes from N. gonorrhoeae in heterologous systems as well as for the study of global gene expression in this organism.
Design of oligonucleotide primers
The goal of this project was to create a plasmid library representing the annotated ORFs of N. gonorrhoeae. The Gateway® Cloning System from Invitrogen  was selected for several reasons. First, Gateway® uses a recombination-based cloning method which has the added benefit that once an archival clone is sequence-validated, subsequent recombinants (ie. into expression vectors) do not need to be sequenced. Second, the initial clones lack transcriptional machinery such that the cloned ORFs are not expressed, thus avoiding problems from lethality due to troublesome gene products. Third, there are several expression and epitope-tagging vector options for the subsequent study of proteins encoded by the cloned ORFs, allowing a variety of approaches to studying their functions. The high efficiency also lends itself to high throughput approaches that are suitable for automation.
A total of 2071 unique primer pairs were successfully designed for the 2189 annotated ORFs of the FA1090 genome  and 61 ORFs of the GGI [14, 15]. These primers were gene specific, and their termini contain sequences for recombination cloning into the entry vector, pDONR221 (Invitrogen). All primers were designed such that the final recombination product yielded the native start codon at the 5' end of the gene, including the 206 of the 229 predicted ORFs with alternative (non-ATG) start codons. Since nearly half of the genes of this group (110/229) are annotated (that is encode putative proteins with significant similarity to proteins of known function), it is very possible that they are functional genes in the gonococcus and were thus included in the clone set design. Whether or not they encode functional proteins will ultimately depend on the results of future expression and mutation studies. The 23 ORFs of this subset not included were less than 400 bp in length and considered too small for the Gateway® system (see below). For the 149 ORFs encoding predicted proteins with an amino-terminal signal sequence (as identified by PSORT  and SignalP ), sequences encoding the signal sequence were removed and an ATG start codon placed at the 5' end of the remaining coding sequence. This was done to reduce problems of expression of hydrophobic signal sequences and to facilitate future expression and targeting studies for such recombinant proteins.
The primer design strategy was iterative, starting with an annealing temperature range of 62°–72°C and primer length set to 18 nt. All ORFs were included in the first iteration, and those ORFs for which a primer pair was not selected were subjected to subsequent iterations. In each subsequent iteration the primer length parameter was increased by one, up to a maximum primer length of 34 nt. Primers larger than this were not designed in part due to cost and convenience of oligo synthesis in our 96-well format. Next, the annealing temperature range was expanded 5°C in both the positive and negative directions and each primer size from 18 to 34 nt was tried again. No further iterations were attempted after the annealing temperature range exceeded 47°–87°C. Failing to meet either of these criteria resulted in a primer pair not being designed for the ORF. There were a total of 177 ORFs for which no primers were designed (see Additional file 2), 174 from FA1090 and 3 from the GGI. Most of these ORFs (165/177 = 93%) were less than 500 bp in length, and were not included since the Gateway® system is reportedly less efficient for cloning fragments of this small size.
To each of the gene specific primer sequences for the 5' ends of the ORFs was added a 21 nt sequence including a consensus ribosome binding site (Shine-Dalgarno sequence). To each of the gene specific primer sequences for the 3' ends of the ORFs was added 20 nt corresponding to the 3' end of the attB2 site necessary for recombination into pDONR221. These were the primary PCR primers, the sequences of which are available on request. A single pair of primers was then designed for a secondary amplification to generate gene specific products containing the attB sites for the recombinatory cloning step. The 5' secondary primer contained the 24 nt attB1 tail and the 21 nt sequence (Shine-Dalgarno) common to all of the primary 5' primer sequences. The 3' secondary primer contained the remaining 10 nt of the attB2 site and the 20 nt attB2 sequence common to all of the primary 3' primer sequences. The attB sequences were as recommended in the Gateway® manual.
PCR amplification of ORFs
The first round of PCR, using gene specific primers, included a total of 2071 different primer pairs. Plates 1 and 2 were organized as pilot reactions and contained primers designed to amplify products ranging from 143 bp to 3485 bp in length. Genomic DNA from N. gonorrhoeae strain FA1090 was used as a template for the reactions. Agarose gel analysis of the amplicons showed that 173 of the 192 reactions yielded a product of the expected size, an efficiency of 90%. Plates 3–21 were then arranged with increasing size of expected product, with plate 3 containing the smallest products and plate 21 the largest. Several primer pairs (from plates 1 and 2) were included in plates 3–21 as internal controls. Plate 22 was an additional control plate, with one half of the plate (rows A-D) duplicated on the other half (rows E-H). Plate 23 corresponded to the genes of the GGI present in strain MS11 [14, 15], and MS11 genomic DNA was used as the template for this plate of reactions. 5 μl of each reaction from the first round of PCR of each plate were run on agarose gels and scored for production of a product and whether it was of the expected size. The results showed 89% of the reactions to produce a product of the correct size.
Following the primary amplification with gene specific primers, all products (in the original 96-well format) were diluted 1:100 and an aliquot subjected to a secondary amplification using a pair of primers corresponding to the sequences common to all of the amplicons, and including additional sequences to generate a complete attB site for the cloning reaction. Aliquots of the secondary amplification were then used directly in the recombination reaction to generate entry clones.
Construction of the library
Secondary amplicons were inserted into pDONR221 by in vitro recombination between the attB sites introduced at the ends of the amplicons and the attP sequences of the vector, maintaining the 96-well format arrangement. Cloning reactions were then transformed into E. coli strain DH5α and a portion of the transformation mix plated on LB plates containing kanamycin. Individual transformants were screened by PCR to determine the presence of and size of the insert. For the first round of screening, four independent transformants from each reaction were screened, maintaining the original 96-well format. A product of 350 bp was observed if no insert was present, a positive clone was identified as having a product 350 bp larger than the size of the corresponding primary PCR product.
For the initial set of 2147 transformation reactions, clones corresponding to 1165 genes were identified in the first four transformants screened, an efficiency of 54%. This efficiency varied greatly with the predicted size of insert, the smallest inserts (plate 3) were 83% positive in the first four screened and the next largest inserts (plate 20) had 17% positive in the first four screened. Plate 21, which had the largest inserts, only yielded 3 transformants as positive after several rounds of screening. Additional transformants for those clones not identified in the first round of screening were individually cultured, screened by PCR, and positive clones frozen down as they were identified. This approach yielded an additional 354 clones.
Following the initial rounds of screening, a list of missing clones was generated and the amplification and cloning steps repeated, optimizing several parameters and analyzing on an individual basis. This approach yielded an additional 283 clones, for a total of 1802 which were subsequently sequenced. Arrangement of the clones in plates for the master set and for sequencing was on an "as identified" basis, such that they are not arranged as in the original 96-well format. Each of the 21 plates contain viable clones in up to 95 of the 96 available positions, with position H12 (and additional wells on some plates) left empty for controls, providing a unique identity for several of the plates.
Sequence verification of the clones
Transformants identified as having an insert of the predicted size were grown in 96-well plates and DNA isolated for sequence analysis. DNA isolation and sequencing was done by at the Human Genome Sequencing Center at Baylor College of Medicine (HGSC) using the same set of primers used to screen transformants for insert size. Sequence reads were posted onto an HGSC website and subsequently downloaded by FTP. DNA reads were processed initially using the STADEN DNA analysis software package . Binary files were converted into .exp files using PREGAP4 to generate text files for each individual sequence read, with sequence quality cutoffs. These data are provided in Additional File 1 (AF1 exp sequence files.zip), and a description of the labelling scheme for the files is in the Methods section.
The data were next analyzed by BLAST  against the N. gonorrhoeae genome sequence database . Clones expected to contain inserts from the GGI, not present in FA1090, were analyzed by BLAST 2  using sequence of the individual GGI ORFs (, GenBank accession number AY803022). BLAST results were then manually tabulated in a file containing the expected gene for each archived clone. Of the 1802 sequenced clones, 58 were expected to be duplicates, leaving 1744 unique clones expected in the clone set. 1550 of the sequences were readable and corresponded to a predicted ORF from N. gonorrhoeae, 1399 of them unique, corresponding to 151 duplicates. Some duplicates were expected, and the remainder likely due to cross contamination from neighboring wells. Of those sequence validated, 55 were not in positions predicted. Most of these were due to human error, such as obvious well transpositions and numbering transpositions. 26 of these, however, had inserts in a backwards orientation and were incomplete. These clones will not be usable in subsequent recombination reactions using the Gateway® system. Together, these data indicate that at present we have a collection of 1672 individual clones from N. gonorrhoeae, 48 of which are from the GGI, and 83% of which have been sequence validated. A list of genes in the clone set with sequencing result information can be found in Additional File 2 (AF2 NG clone set seq status.xls).
Overexpression of randomly chosen ORFs
In order to examine the flexibility of using the clone set to construct various derivatives for which the Gateway® system was designed, three randomly selected pDONR221 derivatives were used to create inducible glutathione-S-transferase (GST) fusions. Plasmids containing ORFs NG1490 (aspS, encodes aspartyl-tRNA synthetase), NG1561 (xthA, encodes exodeoxyribonuclease III), and NG1641 (pivNG, encodes a pilin gene inverting homolog, PivNG) are predicted to encode native proteins of 9.6, 29.0, and 36.2 kdal respectively. Plasmid DNA of pDONR221 derivatives containing these three genes were recombined with the destination vector pDEST15, an N-terminal GST fusion vector. The resulting recombinants were then transformed into E. coli BL21-AI for expression analysis. SDS-PAGE analysis of the proteins after a 2 hr induction with 0.2% L-arabinose is shown in Figure 1. The results show high levels of induction for each of the fusion proteins, with the sizes as predicted (GST adds 29 kdal).
DNA microarray production
Since a very small portion of the secondary amplicons were used for the cloning reactions (5 μl of a 50 μl reaction), the remaining products were used to produce a set of DNA arrays for gene expression analysis. Gel electrophoresis (data not shown) indicated that the efficiency after the secondary amplification was 81%, representing 1681 ORFs of a total of 2071. To generate a more complete DNA array, a second set of primers (360) were designed to amplify internal portions of those ORFs not visible following the secondary PCR. Primers to amplify sequences of two small RNAs were also designed: NG0892.1, ffs, encodes the 4.5S RNA component of the gonococcal signal recognition particle ; and NG0880.1, tmRNA, encodes an RNA that tags abnormal proteins in the cell arising from stalled ribosomes and targets them for proteolysis . Gel analysis of the amplicons produced from the internal PCR primers showed 294 of the 362 to produce products of the expected size, for an efficiency of 82%. Together, the cloning amplicons (1681) and internal ORF and RNA amplicons (294) represent a minimum of 1975 ORFs of the N. gonorrhoeae genome. This is a minimal estimate since all products (regardles of gel result) were to be spotted, and some products might be present, but at amounts too low to be visualized. DNA samples were processed and spotted onto glass slides as described in Methods. As a first test, the arrays were hybridized with a Cy3-labelled random nonamer oligonucleotide (Qiagen). A scan of the slide at 532 nm showed spots at the appropriate positions where DNA had been spotted, and blank spots at the buffer control spots. The next test was to hybridize the arrays with labelled genomic DNA. Total DNA from N. gonorrhoeae strain MS11 was digested with Rsa I and labelled by including Cy3-dCTP in a random-primed Klenow DNA polymerase reaction (Roche Applied Science, Indianapolis, IN). DNA arrays were hybridized and then scanned at 532 nm. Valid hybridization signals (at least one standard deviation above background) were detected for 98% of the spots expected to contain DNA, with those of the MS11-specific island comparable in intensity to those of FA1090 amplified DNA. Overall, the results showed valid signals for 2035 individual genes, with several in duplicate. The fact that this number is higher than expected based on agarose gel analysis of the amplicons before spotting (see above) indicates that some of the reactions produced a product, but at amounts too low to be visualized by ethidium bromide (EtBr) staining. Thus, these arrays represent 90% (2035/2250) of the predicted ORFs of N. gonorrhoeae (including 58 ORFs of the GGI; [14, 15]) and 98% of those for which primers were designed.
Despite the advent of antibiotics in the 1940's, disease due to infection with N. gonorrhoeae remains a major health problem worldwide. The reasons for this are multi-fold. First, resistance to antibiotics by N. gonorrhoeae continues to rise [1, 3, 4]. In addition, treatment with high levels of broad spectrum antibiotics (which is frequently done since patients often do not return for follow up treatment) kills many bacteria of the (often beneficial) normal flora as well as the disease-causing microbe. Second, there is an incredibly high frequency of asymptomatic gonococcal infection, occurring in 5–10% of infected men and up to 50% of infected women. This represents a major reservoir for transmission of the infection. Furthermore, undiagnosed and untreated gonococcal salpingitis can lead to fallopian tube blockage. Partial blockage can result in ectopic pregnancy, which can be life threatening, and complete blockage of the fallopian tubes often leads to infertility. Third, development of a vaccine to protect against gonorrhea has been seriously hampered by the observation that gonococcal infection does not elicit protective immunity . Patients can be reinfected following treatment, and can even be infected by multiple strains at a given time. Thus, alternative treatments and preventative strategies for gonococcal infection are sorely needed.
As a first step in the identification of such alternative treatments and preventatives, it will be necessary to more thoroughly understand the biology of the gonococcus and the molecular mechanisms involved in its interactions with the host environment. Much of the studies to date have focussed primarily on identifying the molecules on the surface of the bacterium that directly interact with the host, and the toxic moieties involved in damage to host cells. Many of the molecules identified are outer membrane (OM) components [23–27], and iron utilization proteins (reviewed in ). A few are secreted , or shed in blebs [30–32]. There has also been significant work in the identification of eukaryotic host cell receptors for gonococcal surface proteins (reviewed in ).
Interactions between gonococci and epithelial cells are beginning to be unraveled. Chen and Clark showed that contact with Hec-1-B human endocervical epithelial cells increases gonococcal infectiveness and that the process involves, in part, de novo protein synthesis by the bacterium . The gonococcal type IV pilus (Tfp) and Opa proteins promote attachment, invasion and trans-epithelial trafficking. The mechanisms underlying Tfp- and Opa-mediated virulence are not yet understood, but these surface structures modulate a series of events in the infected epithelial cell, among them Ca2+ fluxes [34, 35], cortical rearrangements , and receptor phosphorylation [37–39]. Tfp retraction enhances the activation of stress-responsive kinases and the transcription of cytoprotective genes in the infected cell , and triggers the infected cell to produce a molecule that alters bacterial motility behavior . Finally, binding of gonococci to primary urethral cells up-regulates anti-apoptotic factors . These and other observations indicate that gonococcal infection requires the active participation of both the bacterium and the host cell.
The recent completion of the annotated genome sequence of N. gonorrhoeae [13, 43], coupled with the development of high throughput methods for the analysis of gene expression and function, provide an opportunity to significantly advance the study of gonococcal biology and pathogenesis. Like many sequenced genomes, nearly half (44%) of the genes of the annotated gonococcal genome encode hypothetical proteins of unknown function. Furthermore, many of the annotations are based on homologies at the nucleotide and/or amino acid level, and the actual function of the gonococcal proteins have not been demonstrated. In order to realize the full potential of information gleaned from the genome sequence of this (and any) organism, it will be necessary to assign functions to all of the genes of the genome. The newly emerging fields of functional genomics and proteomics offer much promise towards achieving the goal of eventual assignment of functions for each and every gene in a given organism.
In this work, we describe the initial phases of the construction of an expression-capable clone set representing the annotated ORFs of the gonococcal genome using a recombination-based cloning system. The advantages of the system used for this set are numerous. 1) The original sequences in the clone set contain only the ORFs, not the gene expression sequences, thus avoiding the issue of expression-related lethality of the recombinants. 2) The clones can be transferred to a number of expression systems (prokaryotic and eukaryotic), allowing the regulation of genes for overproduction of proteins, or the production of proteins out of the context of the particular environments so as to study their functions. 3) The clones can also be transferred to vectors that result in epitope fusions, such as hexa-histidine, GST, green fluorescent protein (GFP), Lumio™, etc., to the proteins of interest. Protein fusions are useful in localizing proteins (within the bacterium or infected cell), in determining protein-interacting partners, in allowing smaller step purification protocols for structural and activity studies, and for antibody production. This ability has been demonstrated by constructing IPTG-inducible GST-fusions from three random clones from this set (Fig. 1). 4) The clone set is also catalogued in such a way that individual clones of interest are easily identified and recovered from the clone bank (see Additional file 2). 5) Entry clones can also be used to create knockouts by in vitro transposition [44, 45] or shuttle mutagenesis  followed by transformation into naturally competent gonococci . This system is also ammenable to automation, thus increasing the potential output and consistency in the data obtained.
The N. gonorrhoeae clone set thus far includes 1672 of the 2250 predicted ORFs of the genome , of which 83% are sequence-validated. Included in this set are 48 of the 61 ORFs of the MS11 GGI [14, 15]. While this clone set is not yet complete, we believe these initial efforts have resulted in generating a valuable resource for the Neisseria research community. It is hoped that others in the community will share compatible reagents and add to the clone set, making it more comprehensive over time.
Coupled to the clone set construction, a PCR-amplicon based DNA microarray was generated. DNA microarrays are a powerful tool that allow one to measure relative transcript levels for essentially each gene of the genome simultaneously. These DNA arrays represent 2035 ORFs of the N. gonorrhoeae genome: 1977 from strain FA1090  and 58 from the MS11 GGI , comprising 90% of the genes of the genome. Preliminary studies show that these arrays are suitable for examining global gene expression in N. gonorrhoeae.
Many bacterial pathogens are known to respond to changes in their physical environment, often integrating responses to several environmental signals via complex regulatory networks to control expression of a variety of genes [49–53]. The examples of regulatory systems characterized in gonococci are few, with the best characterized being the response to iron availability [54, 55] and antimicrobial compounds [56, 57]. The advent of microarray technologies has opened avenues of research on global gene expression in both prokaryotes and eukaryotes, providing opportunities for studying a variety of organisms, including such genetically intractable microbes as Trepanema pallidum  and Chlamydia trachomatis . Thus, the use of DNA arrays will allow us to more fully explore the response of N. gonorrhoeae to environmental signals at the gene expression level. Since genes are typically only transcribed when the gene product function is required, expression profiles and cluster analyses will allow us to begin to determine the functions of unknown genes in the genome. Thus far, use of these DNA arrays has led to the identification of a regulator involved in the modulation of gonococcal gene expression upon adherence to epithelial cells in .
In summary, the tools described in this work represent a resource which will facilitate experiments to dissect the molecular mechanisms of gonococcal pathogenesis on a global scale. Combining these tools with the gonococcal infection models (tissue culture , organ culture [61–63], and the mouse model ), will allow us to make significant advances in the study of this important pathogen, thus providing us with the knowledge necessary to design therapeutics with which to treat and prevent gonococcal disease.
Gene predictions and initial sequence preparation
All ORF IDs for strain FA1090 reference records can be found at the STDGEN Neisseria gonorrhoeae annotated genome sequence database . All gene sequences were prepared to include the natural start and stop codons. NG0540 and NG0634 have internal stop codons in the sequence database, and nothing was done to correct for this. Sequences of the ORFs of the MS11 GGI  have been deposited in the GenBank database with the accession number AY803022.
Signal peptide identification
In order to determine whether the ORFs contained signal sequences, the programs PSORT  and SignalP  were employed. If both programs predicted a cleavable signal peptide for a given ORF sequence, that constituted "high support". The result of the signal peptide analyses showed 149 sequences with high support for a signal peptide. For those gene nucleotide sequences with a high support signal peptide, the nucleotide sequence representing the signal peptide was removed prior to cloning primer design. When both programs predicted different cleavage sites, the cleavage site that represented the shorter signal peptide was chosen. The rationale for this choice was that it would be better to include a bit of the signal peptide in the PCR product than to exclude a bit of the mature protein in the PCR product.
In the first cycle, the forward and reverse primers were fixed at the same length. This was due to the ease at which Primer3  could be used. The input file was all gene sequences for which no signal peptide sequence was detected (see above). For the second cycle, the prim.aux file was manually inspected. This file contained information about successful primer picks where only a left or right primer could be picked. The strategy was to combine primers of different length. The potential for primer-dimer formation using this strategy was also assessed.
N.g. ORF cloning
Primers for each of the genes were purchased from Illumina, Inc (San Diego, CA) and were designed to add sequences corresponding to part of the attB site necessary for recombination into the Gateway® entry vector, pDONR221 (Invitrogen). Primary amplification was done using genomic DNA at a concentration of ~10 ng/reaction and gene specific primers at 2.5 μM. Reaction mix contained dNTPs, reaction buffer, MgCl2 and Taq polymerase as recommended by the manufacturer (Roche). Reactions conditions for primary PCR were as follows: denaturation at 94°C for 10 min; 10 cycles of 94°C 30 sec, 50°C 1 min, 74°C 1–5 min (depending on length of predicted product); 20 cycles of 94°C 30 sec, 55°C 1 min, 74°C 1–5 min (depending on length of predicted product); and a final extension at 74°C for 10 min. Following the primary amplification with the primer set, products were diluted 1:100 and 1 μl used as template for a secondary amplification using a pair of primers corresponding to the partial attB site common to all of the amplicons, and including additional sequences to generate a complete attB site for the cloning reaction. Reactions conditions for secondary PCR were as follows: denaturation at 94°C for 1 min; 5 cycles of 94°C 15 sec, 45°C 30 sec, 68°C 2 min; 15 cycles of 94°C 15 sec, 55°C 30 sec, 68°C 2 min. 5 μl of the 50 μl PCR was removed and used for cloning, and the remainder used for agarose gel analysis and printing of DNA arrays (N.g.array version 1.0, see below). Cloning reactions were performed according to the manufacturer's instructions (Invitrogen), transformed into E. coli strain DH5α, and transformants selected for kanamycin resistance. Individual transformants were picked in to wells of 96-well plates containing 100 μl L broth containing kanamycin (50 mg/l) and the same toothpick then used to place a small amount of bacteria directly into another plate containing a PCR cocktail. PCR was done using the M13 universal primers, which flank the att sites of the entry vector, pDONR221. A product of 350 bp was observed if no insert was present, providing an internal control for the PCR reactions. Individual clones were identified, stocked in duplicate, and grown for DNA isolation for sequencing.
Sequencing runs from each end of the insert of each of the clones was determined to verify the ORF inserted. Complete sequence verification, (ie. both strands completely across the insert) was not done as it was determined to be impractical. DNA sequencing reactions were performed at the Baylor College of Medicine HGSC. Additional File 1 is a zipped file containing each of the sequence reads as .exp files generated using PREGAP4, and can be opened and read using word processing software. The file is separated into folders labelled SeqPlate #, which refers to sequencing plate number (1–21), and corresponds to the SP# designation in the list of clones in Additional File 2. SP18 sequencing reactions were done twice (SeqPlate 18-1, SeqPlate 18-2) as a sequencing reaction control, and the reactions of SP5 were analyzed three times (SeqPlate 5-1, SeqPlate 5-2, SeqPlate 5-3) as controls. SP20 does not exist, SP22 was not sent for sequencing, and the quality of SP17 sequence was too poor to be readable. The naming of the individual read files is as follows: BGACA(project code) # (1, 2, or 3 = reaction) D or F (primer D = forward, F = reverse) # (box, not same as SP) # (01–96; well, 1 = A1, 2 = B1, 9 = A2, and so on) A or B (run). For example: BGACA3D1701A (found in SeqPlate 1 file) corresponds to SP1 well A1 sequenced with the forward primer (5' end of ORF), result from the third reaction and the first gel.
Gene expression analysis
pDONR221 derivatives chosen from the clone set were recombined with the destination vector, pDEST15 (Invitrogen), an N-terminal GST fusion vector. Plasmid DNA from the pDONR221 derivatives was isolated and incubated with pDEST15 in a LR recombination reaction performed according to the manufacturer's instructions. The recombination mixes were transformed into E. coli DH5α and transformants were selected on LB plates containing 50 mg/l Carbenicillin (Cb50). To test the possibility of false positive clones, the overexpression clones were tested for growth on chloramphenicol (20 mg/l), on which expression recombinants should not grow since the chloramphenicol resistance gene of pDEST15 is replaced by the insert. The resulting plasmids were purified and transformed into E. coli BL21-AI (Invitrogen) which expresses T7 RNA polymerase from the araBAD promotor, and transformants were selected on LB Cb50 plates. BL21-AI strains containing the pDEST15 derivatives were grown overnight and used to inoculate fresh LB medium containing Cb50 to an OD600 of 0.05. Expression of the GST-fusion proteins in E. coli BL21-AI was induced at an OD600 of 0.5 by addition of L-arabinose to a final concentration of 0.2%. Aliquots were removed after 2 hr and total proteins electropheresed on 10% polyacrylamide SDS gels .
DNA microarray construction
PCR amplicons remaining from the cloning reactions (described above) were concentrated and then spotted in duplicate onto TeleChem SuperAmine glass slides (TeleChem International, Inc., Sunnyvale, CA) using a GeneMachines Omnigrid 100 (GeneMachines, Inc., San Carlos, CA) with 16 TeleChem Chipmaker 3 pins at the Genome Technology Support Facility (GTSF) at Michigan State University. Preparation of probes and hybridization conditions were as described . Hybridized microarray slides were scanned using a GenePix 4000B scanner (Axon Instruments, Union City, CA) and images were processed and analyzed using GenePix version 4.1 software.
Availability of clone set
The master library was used to generate a limited set of stock plates, which are stored at various locations as reference stocks. An additional set is stored at Michigan State University and is regarded as the working copy, which is the sole source of material for distribution. Interested parties should contact Cindy Arvidson (corresponding author) to arrange for a copy of the clone set. The clones will only be available as an intact set, individual clones or groups of clones are not available.
Availability of microarrays
DNA arrays will be generated by re-amplification of the amplicons used for the recombination-cloning reactions using primers recognizing the common sequences at the ends of the amplicons at the MSU GTSF. Interested parties should contact Cindy Arvidson (corresponding author) to request slides, which will be produced upon request.
Centers for Disease Control and Prevention.http://www.cdc.gov
World Health Organization.http://www.who.org
Fox KK, Knapp JS, Holmes KK, Hook EW, Judson FN, Thompson SE, Washington JA, Whittington WL: Antimicrobial resistance in Neisseria gonorrhoeae in the United States, 1988-1994: the emergence ofdecreased susceptibility to the fluoroquinolones. J Infect Dis. 1997, 175: 1396-1403.
Shlaes D, Levy S, Archer G: Antimicrobial resistance: new directions. ASM News. 1991, 57: 455-458.
Schneider H, Hale TL, Zollinger WD, Seid RCJ, Hammack CA, Griffiss JM: Does an experimental gonococcal infection protect human volunteers from subsequent reinfection?. Abst Tenth Intnl Path Neisseria Conf. 1996, 10: 13-
Laga M, Manoka A, Kivuvu M, Malele B, Tuliza M, Nzila N, Goeman J, Behets F, Batter V, Alary M, Heyward WL, Ryder RW, Piot P: Non-ulcerative sexually transmitted diseases as risk factors for HIV-1 transmission in women: results from a cohort study. AIDS. 1993, 7: 95-102.
Fletcher JD, Stratton J, Chandler CS, Sparling PF: Epidemiologic notes and reports penicillin-resistant gonorrhea -- North Carolina. MMWR. 1983, 32: 273-275.
Turner CF, Rogers SM, Miller HG, Miller WC, Gribble JN, Chromy JR, Leone PA, Cooley PC, Quinn TC, Zenilman JM: Untreated gonococcal and chlamydial infection in a probability sample of adults. JAMA. 2002, 287: 726-733. 10.1001/jama.287.6.726.
Hauck CR, Meyer TF: 'Small' talk: Opa proteins as mediators of Neisseria-host-cell communication. Curr Opin Microbiol. 2003, 6: 43-49. 10.1016/S1369-5274(03)00004-3.
Merz AJ, So M: Interactions of pathogenic Neisseriae with epithelial cell membranes. Annu Rev Cell Dev Biol. 2000, 16: 423-457. 10.1146/annurev.cellbio.16.1.423.
Pujol C, Eugene E, Morand P, Nassif X: Do pathogenic neisseriae need several ways to modify the host cell cytoskeleton?. Microbes Infect. 2000, 2: 821-827. 10.1016/S1286-4579(00)90367-8.
Walhout AJ, Temple GF, Brasch MA, Hartley JL, Lorson MA, van den Heuvel S, Vidal M: GATEWAY recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes. Meth Enzymol. 2000, 328: 575-592.
Brettin T: Neisseria gonorrhoeae annotated genome sequence database.http://www.stdgen.lanl.gov
Dillard JP, Seifert HS: A variable genetic island specific for Neisseria gonorrhoeae is involved in providing DNA for natural transformation and is found more often in disseminated infection isolates. Mol Microbiol. 2001, 41: 263-277. 10.1046/j.1365-2958.2001.02520.x.
Hamilton HL, Dominguez NM, Schwartz KJ, Hackett KT, Dillard JP: Neisseria gonorrhoeae secretes chromosomal DNA via a novel type IV secretion system. Mol Microbiol. 2005, 55: 1704-1721. 10.1111/j.1365-2958.2005.04521.x.
Nakai K, Kanehisa M: Expert system for predicting localization sites in Gram-negative bacteria. Proteins. 1991, 11: 95-110. 10.1002/prot.340110203.
Nielsen H, Engelbrecht J, Brunak S, von Heijne G: Identification of prokaryotic and eukaryotic signal peptides and prediction of their cleavage sites. Protein Engineering. 1997, 10: 1-6. 10.1093/protein/10.1.1.
STADEN DNA sequence analysis package.http://staden.sourceforge.net
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database searchprograms. Nucl Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.
Tatusova TA, Madden TL: Blast 2 sequences - a new tool for comparing protein and nucleotide sequences. FEMS Microbiol Lett. 1999, 174: 247-250. 10.1016/S0378-1097(99)00149-4.
Frasz C, Arvidson CG: A role for both DNA and RNA in GTP hydrolysis by the Neisseria gonorrhoeae signal recognition particle receptor. J Bacteriol. 2003, 185: 801-808. 10.1128/JB.185.3.801-808.2003.
Gueneau de Novoa P, Williams KP: The tmRNA website: reductive evolution of tmRNA in plastids and other symbionts. Nucleic Acids Res. 2004, 32: D104-108. 10.1093/nar/gkh102.
Fischer SH, Rest RF: Gonococci possessing only certain P.II outer membrane proteins interact with human neutrophils. Infect Immun. 1988, 56: 1574-1579.
Punsalang AP, Sawyer WD: Role of pili in the virulence of Neisseria gonorrhoeae. Infect Immun. 1973, 8: 255-263.
Rudel T, Scheurerpflug I, Meyer TF: Neisseria PilC protein identified as type-4 pilus tip-located adhesin. Nature. 1995, 373: 357-359. 10.1038/373357a0.
Swanson J: Studies on gonococcus infection. IV. Pili: their role in attachment of gonococci to tissue culture cells. J Exp Med. 1973, 137: 571-589. 10.1084/jem.137.3.571.
van Putten JP, Duensing TD, Carlson J: Gonococcal invasion of epithelial cells driven by P.IA, a bacterial ion channel with GTP binding properties. J Exp Med. 1998, 188: 941-952. 10.1084/jem.188.5.941.
Schryvers AB, Stojiljkovic I: Iron acquisition systems in the pathogenic Neisseria. Mol Microbiol. 1999, 32: 1117-1123. 10.1046/j.1365-2958.1999.01411.x.
Simpson DA, Hausinger RP, Mulks M: Purification, characterization, and comparison of the immunoglobulin A1 proteases of Neisseria gonorrhoeae. J Bacteriol. 1988, 170: 1866-1873.
Gregg CR, Melly MA, Hellerqvist CG, Coniglio JG, McGee ZA: Toxic activity of purified lipopolysaccharide of Neisseria gonorrhoeae for human fallopian tube mucosa. J Infect Dis. 1981, 143: 432-439.
Griffiss J, Schneider H, Mandrell R, Yamasaki R, Jarvis G, Kim J, Gibson B, Hamadeh R, Apicella M: Lipooligosaccharides: The principal glycolipids of the Neisserial outer membrane. Rev Infect Dis. 1988, 10: S287-S295.
Melly MA, McGee ZA, Rosenthal RS: Ability of monomeric peptidoglycon fragments from Neisseria gonorrhoeae to damage human fallopian-tube mucosa. J Infect Dis. 1984, 149: 378-386.
Chen JCR, Bavoil P, Clark VL: Enhancement of the invasive ability of Neisseria gonorrhoeae by contact with Hec1B, an adenocarcinoma endometrial cell line. Mol Microbiol. 1991, 5: 1531-1538.
Ayala BP, Vasquez B, Clary S, Tainer JA, Rodland K, So M: The pilus-induced Ca2+ flux triggers lysosome exocytosis and increases the amount of Lamp1 accessible to Neisseria IgA1 protease. Cell Microbiol. 2001, 3: 265-275. 10.1046/j.1462-5822.2001.00112.x.
Kallstrom H, Islam MS, Berggren PO, Jonsson AB: Cell signaling by the type IV pili of pathogenic Neisseria. J Biol Chem. 1998, 273: 21777-21782. 10.1074/jbc.273.34.21777.
Merz AJ, Enns CA, So M: Type IV pili of pathogenic neisseriae elicit cortical plaque formation in epithelial cells. Mol Microbiol. 1999, 32: 1316-1332. 10.1046/j.1365-2958.1999.01459.x.
Chen T, Grunert F, Medina-Marino A, Gotschlich EC: Several carcinoembryonic antigens (CD66) serve as receptors for gonococcal opacity proteins. J Exp Med. 1997, 185: 1557-1564. 10.1084/jem.185.9.1557.
Gray-Owen SD, Dehio C, Haude A, Grunert F, Meyer TF: CD66 carcinoembryonic antigens mediate interactions between Opa- expressing Neisseria gonorrhoeae and human polymorphonuclear phagocytes. EMBO J. 1997, 16: 3435-3445. 10.1093/emboj/16.12.3435.
Lee SW, Bonnah RA, Higashi DL, Atkinson JP, Milgram SL, So M: CD46 is phosphorylated at tyrosine 354 upon infection of epithelial cells by Neisseria gonorrhoeae. J Cell Biol. 2002, 156: 951-957. 10.1083/jcb.200109005.
Howie HL, Glogauer M, So M: The N. gonorrhoeae type IV pilus stimulates mechanosensitive pathways and cytoprotection through a pilT-dependent mechanism. PLoS Biol. 2005, 3: e100-10.1371/journal.pbio.0030100.
Lee SW, Higashi DL, Snyder A, Merz AJ, Potter L, So M: PilT is required for PI(3,4,5)P3-mediated cross-talk between N. gonorrhoeae and epithelial cells. Cell Microbiol. 2005, in press:
Binnicker MJ, Williams RD, Apicella MA: Gonococcal porin IB activates NF-kappaB in human urethral epithelium and increases the expression of host antiapoptotic factors. Infect Immun. 2004, 72: 6408-6417. 10.1128/IAI.72.11.6408-6417.2004.
Roe BA, Lin SP, Song L, Yuan X, Clifton S, Dyer D: Gonococcal genome sequencing project. URL http://www.genome.ou.edu/gono/html. 2002
Du Y, Arvidson CG: Identification of ZipA, an SRP-targeted protein from Neisseria gonorrhoeae. J Bacteriol. 2003, 185: 2122-2130. 10.1128/JB.185.7.2122-2130.2003.
Du Y, Lenz J, Arvidson CG: Global gene expression and the role of sigma factors in Neisseria gonorrhoeae in interactions with epithelial cells. Infect Immun. 2005, 73: 4834-4845. 10.1128/IAI.73.8.4834-4845.2005.
Seifert HS, Ajioka RS, Paruchuri D, Heffron F, So M: Shuttle mutagenesis of Neisseria gonorrhoeae: pilin null mutations lower DNA transformation competence. J Bacteriol. 1990, 172: 40-46.
Goodman SD, Scocca JJ: Identification and arrangement of the DNA sequence recognized in specific transformation of Neisseria gonorrhoeae. Proc Natl Acad Sci USA. 1988, 85: 6982-6986.
Dempsey JF, Litaker W, Hadhure A, Snodgrass TL, Cannon JG: Physical map of the chromosome of Neisseria gonorrhoeae FA1090 with locations of genetic markers, including opa and pil genes. J Bacteriol. 1991, 173: 5476-5486.
Cotter PA, Miller JF: In vivo and ex vivo regulation of bacterial virulence gene expression. Curr Opin Microbiol. 1998, 1: 17-26. 10.1016/S1369-5274(98)80138-0.
Dorman CJ, Porter ME: The Shigella flexneri virulence gene regulatory cascade: A paradigm of bacterial control mechanisms. Mol Microbiol. 1998, 29: 677-684. 10.1046/j.1365-2958.1998.00902.x.
Lucas RL, Lee CA: Unravelling the mysteries of virulence gene regulation in Salmonella typhimurium. Mol Microbiol. 2000, 36: 1024-1033. 10.1046/j.1365-2958.2000.01961.x.
Pettersson J, Nordfelth R, Dubinina E, Bergman T, Gustafsson M, Magnusson KE, Wolf-Watz H: Modulation of virulence factor expression by pathogen target cell contact. Science. 1996, 273: 1231-1233.
Skorupski K, Taylor RK: Control of the ToxR virulence regulon in Vibrio cholerae by environmental stimuli. Mol Microbiol. 1997, 25: 1003-1009. 10.1046/j.1365-2958.1997.5481909.x.
Sebastian S, Agarwal S, Murphy JR, Genco CA: The gonococcal fur regulon: identification of additional genes involved in major catabolic, recombination, and secretory pathways. J Bacteriol. 2002, 184: 3965-3974. 10.1128/JB.184.14.3965-3974.2002.
West S, Sparling F: Response of Neisseria gonorrhoeae to iron limitation: alterations in expression of membrane proteins without apparent siderophore production. Infect Immun. 1985, 47: 388-394.
Hagman KE, Shafer WM: Transcriptional control of the mtr efflux system of Neisseria gonorrhoeae. J Bacteriol. 1995, 177: 4162-4165.
Lee EH, Rouquette-Loughlin C, Folster JP, Shafer WM: FarR regulates the farAB-encoded efflux pump of Neisseria gonorrhoeae via an MtrR regulatory mechanism. J Bacteriol. 2003, 185: 7145-7152. 10.1128/JB.185.24.7145-7152.2003.
McKevitt M, Patel K, Smajs D, Marsh M, McLoughlin M, Norris SJ, Weinstock GM, Palzkill T: Systematic cloning of Treponema pallidum open reading frames for protein expression and antigen discovery. Genome Res. 2003, 13: 1665-1674. 10.1101/gr.288103.
Nicholson TL, Olinger L, Chong K, Schoolnik G, Stephens RS: Global stage-specific gene regulation during the developmental cycle of Chlamydia trachomatis. J Bacteriol. 2003, 185: 3179-3189. 10.1128/JB.185.10.3179-3189.2003.
Shaw JH, Falkow S: Model for invasion of human tissue culture cells by Neisseria gonnorrhoeae. Infect Immun. 1988, 56: 1625-1632.
McGee ZA, Johnson AP, Taylor-Robinson D: Human fallopian tubes in organ culture: preparation, maintenance, and quantitation of damage by pathogenic microorganisms. Infect Immun. 1976, 13: 608-618.
McGee ZA, Johnson AP, Taylor-Robinson D: Pathogenic mechanisms of Neisseria gonorrhoeae: observations on damage to human Fallopian tubes in organ culture by gonococci of colony type 1 or type 4. J Infect Dis. 1981, 143: 413-422.
McGee Z, Stephens D, Hoffman L, Schlech III W, Horn R: Mechanisms of mucosal invasion by pathogenic Neisseria. Rev Infect Dis. 1983, 5: S708-S714.
Jerse AE: Experimental gonococcal genital tract infection and opacity protein expression in estradiol-treated mice. Infect Immun. 1999, 67: 5699-5708.
Rozen S, Skaletsky H: Bioinformatics Methods and Protocols. Edited by: Krawetz SMS. 2000, , Humana Press, Totowa, NJ, 365-386.Primer3 on the WWW for general users and for biologist programmers. Methods in Molecular Biology
Laemmli UK: Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature. 1970, 227: 680-685. 10.1038/227680a0.
Work in the Arvidson laboratory was supported by start-up funds to C.G.A. from the Colleges of Human and Osteopathic Medicine at Michigan State University, and an Intramural Research Grant from the MSU Vice President for Research and Graduate Studies. Work performed at the Los Alamos National Laboratory was supported by the Molecular Foundations of Pathogenesis project, U.S. Department of Energy Contract W-7405-ENG-36 and by funds from the LANL Laboratory Directed Research Development Program to M.R.A. Work in the So laboratory was supported in part by NIH grants AI34560, AI47260 and AI49973 to M. S. Alexandra Friedrich was supported by a fellowship within the Postdoc-Programme of the German Academic Exchange Service (DAAD).
The authors are extremely grateful to thank Jeffrey Landgraf of the Michigan State University GTSF for help and advice in the production and use of DNA arrays and Robert Britton of the MSU Department of Microbiology and Molecular Genetics for advice and the use of his array scanner and software. We also thank Audrey Butcher and Catherine Beauduy (Arvidson laboratory) for their participation in analyzing potential clones for the clone set and Anita Amin, Judy Hernandez, Donna Muzny, and the Baylor College of Medicine Human Genome Sequencing Center for assitance in sequencing the clone set.
Thomas Brettin: design of clone set construction, primer design
Michael R. Altherr: design of clone set construction, design and supervision of high-throughput PCR amplification of ORFs and recombination cloning into entry vectors
Ying Du: construction and tests of DNA microarrays, amplification and cloning of several ORFs not obtained in the initial batch cloning
Roxie M. Mason: cataloging of all cloning reactions processed at MSU (Arvidson lab), screening of transformants and preparation of clones for sequencing
Alexandra Friedrich: recombination of selected clones into expression vectors and analysis of proteins produced (Figure 1)
Laura Potter: initial project design, screening and cataloging of all cloning reactions processed at OHSU (So lab), screening of transformants and preparation of clones for sequencing
Chris Langford: batch BLAST analysis of sequencing results
Thomas J. Keller: batch BLAST analysis of sequencing results, wrote new program to faciliate analysis of sequence data
Jason Jens: processing of sequence data, batch program design for analysis of subset of clones from Gonococcal Genetic Island
Heather Howie: screening of transformants and preparation of clones for sequencing
Nathan J. Weyand: screening of transformants and preparation of clones for sequencing
Susan Clary: screening of transformants and preparation of clones for sequencing, amplification and cloning of several ORFs not obtained in the initial batch cloning
Kimberly Prichard: primer design
Susi Wachocki: high-throughput PCR amplification of ORFs and cloning into entry vectors
Erica Sodergren: sequencing of clone set
Joseph P. Dillard: contributed data of the Gonococcal Genetic Island prior to publication, participated in project design
George Weinstock: initial project design
Magdalene So: initial project design, supervison of all work performed at OHSU, substantial writing of manuscript
Cindy Grove Arvidson: initial project design, supervison of all work performed at MSU, screening of transformants from LANL, cloning of several ORFs not cloned in initial high-throughput cloning, collated and organanized all sequence data, primary writing of manuscript, corresponding author