DNA-binding by Haemophilus influenzae and Escherichia coli YbaB, members of a widely-distributed bacterial protein family
© Cooley et al; licensee BioMed Central Ltd. 2009
Received: 21 November 2008
Accepted: 13 July 2009
Published: 13 July 2009
Genes orthologous to the ybaB loci of Escherichia coli and Haemophilus influenzae are widely distributed among eubacteria. Several years ago, the three-dimensional structures of the YbaB orthologs of both E. coli and H. influenzae were determined, revealing a novel "tweezer"-like structure. However, a function for YbaB had remained elusive, with an early study of the H. influenzae ortholog failing to detect DNA-binding activity. Our group recently determined that the Borrelia burgdorferi YbaB ortholog, EbfC, is a DNA-binding protein. To reconcile those results, we assessed the abilities of both the H. influenzae and E. coli YbaB proteins to bind DNA to which B. burgdorferi EbfC can bind.
Both the H. influenzae and the E. coli YbaB proteins bound to tested DNAs. DNA-binding was not well competed with poly-dI-dC, indicating some sequence preferences for those two proteins. Analyses of binding characteristics determined that both YbaB orthologs bind as homodimers. Different DNA sequence preferences were observed between H. influenzae YbaB, E. coli YbaB and B. burgdorferi EbfC, consistent with amino acid differences in the putative DNA-binding domains of these proteins.
Three distinct members of the YbaB/EbfC bacterial protein family have now been demonstrated to bind DNA. Members of this protein family are encoded by a broad range of bacteria, including many pathogenic species, and results of our studies suggest that all such proteins have DNA-binding activities. The functions of YbaB/EbfC family members in each bacterial species are as-yet unknown, but given the ubiquity of these DNA-binding proteins among Eubacteria, further investigations are warranted.
After the genome sequence of H. influenzae strain KW20 rd (also known as H. influenzae Rd) was determined in 1995 , the "Structure 2 Function Project" was established to crystallize recombinant proteins from H. influenzae genes of unknown function http://s2f.umbi.umd.edu/. Among these orphan gene products was the H. influenzae DUF 149 group member annotated as open reading frame (ORF) HI0442, and tentatively named "YbaB" . H. influenzae YbaB (YbaBHi) crystallized as a homodimer, with the central portion forming 3 antiparallel β-strands, long α-helices at the amino- and carboxy-termini (α-helices 1 and 3, respectively), and a short α-helix bridging the β-folded region and α-helix 3 (α-helix 2). The two subunits of the homodimer interface at the β-strand region, α-helix 2 and the initial residues of α-helix 3, while α-helix 1 and the terminal portion of α-helix 3 project away from the dimerization region. This distinctive structure that has been described as resembling a set of tweezers . Although the researchers who initially characterized YbaBHi speculated that it may be a DNA-binding protein, studies conducted at that time failed to detect binding to any of their analyzed DNA probes .
The Escherichia coli chromosome carries an orthologous gene that has been referred to as "ORF 12" (Fig. 1) [4–6]. Recombinant E. coli YbaB (YbaBEc) has also been crystallized and information about its unpublished three-dimensional structure is available on-line http://www.rcsb.org/pdb/explore.do?structureId=1PUG. The determined structures of YbaBEc and YbaBHi are nearly identical. A function for YbaBEc appears not to have been investigated prior to the current work.
The spirochete Borrelia burgdorferi produces a protein named EbfC that shares 29% identical and 56% similar amino acids with YbaBHi (Fig. 1). Our laboratories recently discovered that EbfC binds a specific DNA sequence 5' of the spirochete's erp loci [7–10]. Those results suggested that orthologous proteins may also be DNA-binding proteins. We therefore re-examined the properties of YbaBHi, and found that it does bind to certain DNAs. YbaBEc was also demonstrated to be a DNA-binding protein.
Results and discussion
In control experiments, purified YbaB proteins were treated either by incubation with 1 mg/ml proteinase K for 30 min or by heating in a boiling water bath for 10 min. EMSA of either protease-treated or boiled YbaB preparations did not yield reduced-mobility complexes or reduce the levels of free DNA probe (data not shown), demonstrating that the DNA-binding activity in the purified YbaB preparations was due to the native forms of the proteins.
A previous study did not detect binding of YbaBHi to any tested DNA, leading to the conclusion that this protein does not bind DNA in a completely sequence-independent manner . The present work demonstrated that YbaBHi, and the homologous protein of E. coli, do bind to certain DNAs. EbfC, the orthologous protein of the spirochete B. burgdorferi, binds specifically to the DNA sequence GTnAC and, with a lower affinity, to DNA lacking that sequence [8, 10]. The E. coli and H. influenzae YbaB proteins both exhibited preferences for certain tested DNA sequences, but neither showed the same high affinity for GTnAC as did the spirochetal ortholog. Both YbaB proteins also showed a marked preference for DNA derived from the B. burgdorferi erpAB promoter over poly(dI-dC). Such large differences in affinities for target and non-target sequences may account for the previous failure to detect DNA-binding by YbaBHi . These results suggest that YbaBEc and YbaBHi have higher affinities for some DNA sequences than for others, but whether those preferences depend upon a specific nucleotide sequence(s), A+T content, and/or DNA topology remain to be determined. The three-dimensional structure of dimeric YbaB resembles "tweezers", with α-helices 1 and 3 of each monomeric subunit protruding from the dimerization domains . The spacing between the α-helical protrusions is approximately 15 Å at the base of the dimerization domain and approximately 22 Å at the distal ends of the α-helices , similar to the diameter of B-form duplex DNA (~20Å ). Site-directed mutagenesis studies of the orthologous B. burgdorferi EbfC demonstrated that certain amino acid substitutions in either α-helix 1 or 3 of EbfC eliminate DNA-binding, without affecting dimerization . It is noteworthy that many of the α-helix 1 and 3 residues of EbfC are distinct from residues in both YbaBEc and YbaBHi (Fig. 1), consistent with the differences in DNA preferences between the E. coli and H. influenzae YbaB proteins and their spirochetal ortholog. YbaB/EbfC orthologs of other bacterial species likewise exhibit sequence variations in their α-helices 1 and 3, suggesting that they may also possess unique DNA-binding properties.
The function(s) of YbaB/EbfC proteins remains to be determined. Many bacterial ybaB/ebfC orthologs are located between dnaX and recR, a synteny that has led to suggestions of roles in DNA replication or recombination [3, 5, 6, 15–18]. While the abilities of the examined orthologs to bind DNA may support those hypotheses, several lines of evidence suggest that YbaB/EbfC proteins perform functions that are independent of DNA recombination or replication. Proteomic analyses of cultured H. influenzae detected production of YbaB without accompanying production of DNA repair proteins . A ybaB recR double mutant of Streptomyces coelicolor exhibited recombination defects that could be complemented with recR alone . The ybaB/ebfC orthologs of some bacterial species are not linked to recR or any other recombination-related gene and some, such as the B. burgdorferi, do not even encode RecR [8, 20]. Several bacteria, such as H. influenzae, have ybaB genes located distantly from their dnaX . Moreover, some ybaB family genes can be transcribed independently of their upstream genes, using promoter elements within the 5' gene [4, 6, 21–23].
We demonstrated that YbaBHi is in fact a DNA-binding protein. It exhibits an element of specificity, in that the protein preferentially bound to B. burgdorferi erp Operator 2 DNA over poly-dI-dC and, apparently, the DNA sequences examined by an earlier research group . Consistent with those data, the E. coli YbaB ortholog was also determined to be a DNA-binding protein. For both orthologs, the basic unit of DNA-binding is a homodimer, consistent with results from analyses of soluble proteins and crystallization data. The solved structures of YbaBEc and YbaBHi are distinct from any other known DNA-binding protein. Genes encoding orthologs of YbaB/EbfC proteins are found throughout the Eubacteria, including many important human pathogens, suggesting that these proteins perform important function(s). Thus, continued study of these unique proteins may provide insight regarding critical bacterial processes that might be exploited for infection control.
Bacterial gene sequences
Bacterial protein sequences orthologous to YbaBHi, YbaBEc and B. burgdorferi EbfC were identified by BlastP, using the predicted sequences of those three proteins as queries http://blast.ncbi.nlm.nih.gov/Blast.cgi. Amino acid sequences were aligned using Clustal X, with default parameters . Orthologs from the following bacteria were chosen as representative of different bacterial classifications: the α proteobacterium Rickettsia rickettsiae (accession number NC_009882), the β proteobacterium Neisseria gonorrhoeae (NC_002946.2), the gamma proteobacteria Vibrio cholerae (NC_002505.1) and Pseudomonas putida (NC_010501.1), the delta proteobacterium Bdellovibrio bacteriovorus (NC_005363.1), the firmicutes Clostridium perfringens (NC_003366.1), Bacillus subtilis (NC_000964.2), Enterococcus faecalis (NC_004668.1), and Streptococcus pneumoniae (NC_003098.1), the actinomycete Mycobacterium tuberculosis (NC_000962.2), and the bacteroidete Bacteroides capillosus (NZ_AAXG02000011.1).
Recombinant YbaBHi protein was produced from pET15b-HI0442 (a gift of Osnat Herzberg, University of Maryland) . Recombinant YbaBEc was produced by first PCR amplifying the ybaBEc gene from total genomic DNA using the oligonucleotide primers 5'-CACCCGTGATTGAGGAGGAAACCTATG-3' and 5'-CAGCGGGCTGGTTTGCATCAG-3'. The resulting amplicon was cloned into pET200-TOPO (Invitrogen, Carlsbad, CA), and the insert completely sequenced on both strands. Recombinant B. burgdorferi EbfC was produced using the previously-described plasmid construct p462-M5 .
Each plasmid was individually used to transform E. coli Rosetta pLysS (Novagen, San Diego, CA), and production of recombinant proteins induced by addition of isopropylthiogalactopyranoside. Bacteria were lysed by sonication in 30 mM imidazole, 0.5 M NaCl, 20 mM NaPO4, pH = 7.4, and cleared by centrifugation. The recombinant proteins were purified using His-Trap HP columns and an AKTA-FPLC equipped with a UPC-900 UV absorbance monitor and a Frac920 fraction collector (GE Healthcare, Piscataway, NJ). Proteins were eluted with a constantly increasing gradient between the lysis buffer and 0.75 M imidazole, 20 mM NaPO4, 0.5 M NaCl, pH = 7.4. Proteins were then dialyzed against 1 × e0 buffer (50 mM Tris [pH = 7.5], 1 mM dithiothreitol, 1 mM phenylmethanesulfonyl fluoride, and 100 μl/l Tween-20). Glycerol was added to a final concentration of 10% (vol/vol), and aliquots were snap frozen in liquid nitrogen and stored at -80°C. Purity of protein preparations was assessed by sodium dodecylsulfate-polyacrylamide gel electrophoresis (SDS-PAGE), followed by staining with Coomassie brilliant blue. BCA (bicinchoninic acid) protein assays (Pierce, Rockford, IL), calibrated with bovine serum albumin (Pierce), were used to determine protein concentrations.
Electrophoretic mobility shift assays (EMSA)
All EMSAs were performed at least three times. Biotin-labeled DNA probes were produced based upon the sequence of the B. burgdorferi strain B31 erpAB 5'-noncoding DNA, to which the orthologous EbfC protein is known to bind [7, 8, 10]. Probe b-WT corresponds with bp -160 through -36 (relative to the start of translation) of the erpAB operon, and contains two consensus EbfC-binding sites [8, 10] (Fig. 2). Probe b-WT was produced by PCR using oligonucleotide primers bio-A14A (5'-biotin-TTGTAATGAGTAGTGCATTTG-3') and R8 (5'-GCAATATTTCAAAGATTTAAA-3') from DNA template pBLS591 . That same oligonucleotide primer pair was used to produce probe b-C2 from mutant template pSRJ-2, a derivative of pBLS591 in which EbfC-binding site II was changed to CACAACA (Fig. 2) . Probes b-C20, b-C30, b-C40 and b-C50 were also produced using primers bio-A14A and R8, from mutant templates pSRJ-20, pSRJ30, pSRJ40 and pSRJ50, respectively, derivatives of pSRJ-2 in which single bp mutations were introduced to site I (Fig. 2) . Each PCR reaction product was separated by agarose gel electrophoresis and DNA visualized by ethidium bromide staining. Amplicons were extracted from gels into nuclease-free water using Wizard SV (Promega, Madison, WI), and quantified by spectrophotometric determination of absorbance at 260 nm.
EMSAs were performed using 100 pM biotin-labeled DNA fragment and varying concentrations of purified recombinant YbaBEc or YbaBHi. Binding conditions consisted of 50 mM Tris-HCl (pH = 7.5), 1 mM dithiothreitol, 8 μl/ml protease inhibitor (Sigma-Aldrich, St. Louis, MO), 2 μl/ml phosphatase inhibitor cocktail II (Sigma-Aldrich), and 10% glycerol. Protein and DNA were mixed together, in final volumes of 10 ml, and allowed to proceed toward equilibrium for 20 minutes at room temperature, then subjected to electrophoresis through 6% DNA retardation gels (Invitrogen) for 9000 V-min. DNA was electrotransferred to Biodyne B nylon membranes (Pierce), cross-linked by ultraviolet light, and biotinylated DNA detected using Chemiluminescent Nucleic Acid Detection Modules (Pierce).
Competition for DNA binding by poly(dI-dC) was assessed using the above binding conditions, 2 fmol (0.082 ng) labeled probe b-WT and either 1.2 μg/ml YbaBEc or 2.1 μg/ml YbaBHi. After 20 min incubation at room temperature, either no or 0.1, 0.5, 1, 2 or 4 ng poly(dI-dC) was added to each tube, followed by an additional 20 min incubation at room temperature. DNA-protein mixtures were subjected to electrophoresis and detection as described above.
It is important to note that in this approach, values of stoichiometry and equilibrium constant are not fully independent (fitted values of Ka and n are related by -n log [P] = log Ka). As a result, the parameters returned are the most likely values (in the least squares sense) that are internally-consistent. A similar analysis strategy has been described previously .
In studies of this kind, accurate measurement of Ka values require good estimates of the free protein concentration, [P]. In the present experiments, the protein concentrations (range ~10-8 M to ~10-6 M) exceeded by far the total DNA concentration (10-10 M). Thus, even in the presence of additional DNA binding (up to ~10 protein molecules/DNA), free protein concentration [P] is well-approximated by the total protein concentration, [P]total.
A Superdex 75 10/300 GL column (GE Healthcare) was prepared with a mobile phase consisting of 200 mM NaCl, 50 mM Tris-HCl (pH 7.5), and 1% (vol/vol) glycerol. The column was run with a flow rate of 0.20 ml per min using a Waters 600 pump and controller equipped with a Waters 996 photodiode array UV/Vis detector (Waters, Milford, MA). A calibration curve was created using an MW-GF-70 low-molecular-weight calibration kit (Sigma-Aldrich, St. Louis, MO), and the void volume, V0, was determined by injection of 200 μl of 1 mg/ml blue dextran in elution buffer with 5% glycerol. The remaining protein standards, bovine lung aprotinin (6.5 kDa), horse heart cytochrome c (12.4 kDa), bovine carbonic anhydrase (29 kDa), and bovine serum albumin (66 kDa), were individually prepared in elution buffer with 5% glycerol to total concentrations of 0.3 mg/ml each, and the volume with which the protein eluted, Ve, was determined. The molecular-mass calibration curve was generated by plotting the log (molecular mass) versus Ve/Vo (5). A 200-μl sample of recombinant YbaBHi (approximately 0.2 mg/ml) was then injected and its elution profile compared to the established curve to determine molecular masses of each elution peak.
The work was funded by NIH grant R01-AI044254 to Brian Stevenson and R01-GM070662 to Michael Fried. Sean Riley was supported in part by NIH Training Grant in Microbial Pathogenesis T32-AI49795 and a University of Kentucky Graduate School Dissertation Year Fellowship. We thank Osnat Herzberg for the generous gift of the YbaB-producing plasmid, and Amy Bowman, Catherine Brissette, Logan Burns, Tomasz Bykowski, Ashutosh Verma, Erin Welsh, and Michael Woodman for assistance during these studies and comments on the manuscript.
- Marchler-Bauer A, Anderson JB, Cherukuri PF, DeWeese-Scott C, Geer LY, Gwadz M, He S, Hurwitz DI, Jackson JD, Ke Z, et al: CDD: a conserved domain database for protein classification. Nucleic Acids Res. 2005, 33: D192-196.PubMed CentralPubMedView ArticleGoogle Scholar
- Fleischmann RD, Adams MD, White O, Clayton RA, Kirkness EF, Kerlavage AR, Bult CJ, Tomb JF, Dougherty BA, Merrick JM, et al: Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science. 1995, 269: 496-512.PubMedView ArticleGoogle Scholar
- Lim K, Tempczyk A, Parsons JF, Bonander N, Toedt J, Kelman Z, Howard A, Eisenstein E, Herzberg O: Crystal structure of YbaB from Haemophilus influenzae (HI0442), a protein of unknown function coexpressed with the recombinational DNA repair protein RecR. Proteins. 2003, 50: 375-379.PubMedView ArticleGoogle Scholar
- Flower AM, McHenry CS: Transcriptional organization of the Escherichia coli dnaX gene. J Mol Biol. 1991, 220: 649-658.PubMedView ArticleGoogle Scholar
- Mahdi AA, Lloyd RG: The recR locus of Escherichia coli K-12: molecular cloning, DNA sequencing and identification of the gene product. Nucleic Acids Res. 1989, 17: 6781-6794.PubMed CentralPubMedView ArticleGoogle Scholar
- Yeung T, Mullin DA, Chen K, Craig EA, Bardwell JCA, Walker JR: Sequence and expression of the Escherichia coli recR locus. J Bacteriol. 1990, 172: 6042-6047.PubMed CentralPubMedGoogle Scholar
- Babb K, McAlister JD, Miller JC, Stevenson B: Molecular characterization of Borrelia burgdorferi erp promoter/operator elements. J Bacteriol. 2004, 186: 2745-2756.PubMed CentralPubMedView ArticleGoogle Scholar
- Babb K, Bykowski T, Riley SP, Miller MC, DeMoll E, Stevenson B: Borrelia burgdorferi EbfC, a novel, chromosomally-encoded protein, binds specific DNA sequences adjacent to erp loci on the spirochete's resident cp32 prophages. J Bacteriol. 2006, 188: 4331-4339.PubMed CentralPubMedView ArticleGoogle Scholar
- Stevenson B, Bykowski T, Cooley AE, Babb K, Miller JC, Woodman ME, von Lackum K, Riley SP: The Lyme disease spirochete Erp lipoprotein family: structure, function and regulation of expression. Molecular Biology of Spirochetes. Edited by: Cabello FC, Godfrey HP, Hulinska D. 2006, Amsterdam: IOS Press, 354-372.Google Scholar
- Riley SP, Bykowski T, Cooley AE, Burns LH, Babb K, Brissette CA, Bowman A, Rotondi M, Miller MC, DeMoll E, et al: Borrelia burgdorferi EbfC defines a newly-identified, widespread family of bacterial DNA-binding proteins. Nucleic Acids Res. 2009, 37: 1973-1983.PubMed CentralPubMedView ArticleGoogle Scholar
- Fried MG, Crothers DM: Equilibria and kinetics of Lac repressor-operator interactions by polyacrylamide gel electrophoresis. Nucl Acids Res. 1981, 9: 6505-6525.PubMed CentralPubMedView ArticleGoogle Scholar
- Fried MG, Crothers DM: Equilibrium studies of the cyclic AMP receptor protein-DNA interaction. J Mol Biol. 1984, 172: 241-262.PubMedView ArticleGoogle Scholar
- Klotz IM: Ligand-Receptor Interactions. 1997, New York: WileyGoogle Scholar
- Varshavsky A: Electrophoretic assay for DNA-binding proteins. Methods Enzymol. 1987, 151: 551-565.PubMedView ArticleGoogle Scholar
- Bork JM, Cox MM, Inman RB: The RecOR proteins modulate RecA protein function at 5' ends of single-stranded DNA. EMBO J. 2001, 20: 7313-7322.PubMed CentralPubMedView ArticleGoogle Scholar
- Morimatsu K, Kowalczykowski SC: RecFOR proteins load RecA protein onto gapped DNA to accelerate DNA strand exchange: a universal step of recombinational repair. Mol Cell. 2003, 11: 1337-1347.PubMedView ArticleGoogle Scholar
- Flower AM, McHenry CS: The γ subunit of DNA polymerase III holoenzyme of Escherichia coli is produced by ribosomal frameshifting. Proc Natl Acad Sci USA. 1990, 87: 3713-3717.PubMed CentralPubMedView ArticleGoogle Scholar
- Peláez AI, Ribas-Aparicio RM, Gómez A, Rodicio MR: Structural and functional characterization of the recR gene of Streptomyces. Mol Genet Genomics. 2001, 265: 663-672.PubMedView ArticleGoogle Scholar
- Kolker E, Purvine S, Galperin MY, Stolyar S, Goodlett DR, Nesvizhskii AI, Keller A, Xie T, Eng JK, Yi E, et al: Initial proteome analysis of model microorganism Haemophilus influenzae strain Rd KW20. J Bacteriol. 2003, 185: 4593-4602.PubMed CentralPubMedView ArticleGoogle Scholar
- Fraser CM, Casjens S, Huang WM, Sutton GG, Clayton R, Lathigra R, White O, Ketchum KA, Dodson R, Hickey EK, et al: Genomic sequence of a Lyme disease spirochaete, Borrelia burgdorferi. Nature. 1997, 390: 580-586.PubMedView ArticleGoogle Scholar
- Chen K, Saxena P, Walker JR: Expression of the Escherichia coli dnaX gene. J Bacteriol. 1993, 175: 6663-6670.PubMed CentralPubMedGoogle Scholar
- Rezuchova B, Miticka H, Homerova D, Roberts M, Kormanec J: New members of the Escherichia coli σE regulon identified by a two-plasmid system. FEMS Microbiol Lett. 2003, 225: 1-7.PubMedView ArticleGoogle Scholar
- Engels S, Ludwig C, Schweitzer J, Mack C, Bott M, Schaffer S: The transcriptional activator ClgR controls transcription of genes involved in proteolysis and DNA repair in Corynebacterium glutamicum. Mol Microbiol. 2005, 57: 576-591.PubMedView ArticleGoogle Scholar
- Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG: The Clustal X window interface: flexible strategies for multiple sequence alignment aided by quality analyses tools. Nucleic Acids Res. 1997, 24: 4876-4882.View ArticleGoogle Scholar
- Adams CA, Fried MG: Analysis of protein-DNA equilibria by native gel electrophoresis. Protein interactions: Biophysical approaches for the study of complex reversible systems. Edited by: Schuck P. 2007, New York: Academic Press, 417-446.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.