- Research article
- Open Access
Identification of new genes in Sinorhizobium melilotiusing the Genome Sequencer FLX system
© Mao et al; licensee BioMed Central Ltd. 2008
Received: 09 January 2008
Accepted: 02 May 2008
Published: 02 May 2008
Sinorhizobium meliloti is an agriculturally important model symbiont. There is an ongoing need to update and improve its genome annotation. In this study, we used a high-throughput pyrosequencing approach to sequence the transcriptome of S. meliloti, and search for new bacterial genes missed in the previous genome annotation. This is the first report of sequencing a bacterial transcriptome using the pyrosequencing technology.
Our pilot sequencing run generated 19,005 reads with an average length of 136 nucleotides per read. From these data, we identified 20 new genes. These new gene transcripts were confirmed by RT-PCR and their possible functions were analyzed.
Our results indicate that high-throughput sequence analysis of bacterial transcriptomes is feasible and next-generation sequencing technologies will greatly facilitate the discovery of new genes and improve genome annotation.
Sinorhizobium meliloti is a micro-symbiont associated with legume plants. This soil bacterium inhabits nodules on the roots of host legume plants, where it reduces atmospheric nitrogen to organic nitrogenous compounds that can be utilized by its hosts. Because of its agricultural and ecological importance, S. meliloti has been extensively studied as a model symbiont. The S. meliloti 1021 genome sequence and the initial annotation of the genome were completed in 2001 [1–4]. The S.meliloti genome comprises three replicons, the 3.65 Mb chromosome, the 1.35 Mb megaplasmid pSymA, and the1.68 Mb megaplasmid pSymB . According to RefSeq , the S. meliloti 1021 genome has 6205 predicted protein-encoding genes. Among these, more than one-third were annotated as "hypothetical" or "unknown". Many research papers have been published on S. meliloti since its genome sequence was completed. Also, more genomes of closely related species such as Brucella spp., Rhodopseudomonas palustris, and S. medicae WSM419 have been sequenced. Comparative genomics including newly sequenced genomes provides new information about the genome of S. meliloti. There is an ongoing need to update and improve its genome annotation. So far, there are no systematic efforts of direct sequencing of its entire transcriptome. Microarray data are available, but most microarray designs are based on annotated genes [6, 7]. High-density whole-genome tiling arrays are not yet available.
The goal of this study was to develop a high-throughput experimental approach to search for new genes of S. meliloti missed in the previous genome annotation [1–4]. We used pyrosequencing  to sequence the transcriptome of S.meliloti. The GS FLX system from Roche and 454 Life Sciences can generate more than 100 million bases per sequencing run with an average yield of greater than 400,000 reads of average length of 250 bases. This platform provides a broad range of applications including whole genome sequencing [9–11], transcriptome and gene regulation studies [12–15], metagenomics analysis  and amplicon sequencing [17, 18]. Although pyrosequencing has been used to sequence microbial genomes, relatively few applications of transcriptome analysis have been reported. Here, we present the first report of sequencing a bacterial transcriptome using the GS FLX platform as an experimental approach for gene discovery.
GS FLX sequencing results
# Sequence reads
Average sequence length
# Sequences aligned to genes
# Sequences in rRNA operons
# Sequences not aligned to the genome (e<0.01)
Validation of new genes using RT-PCR
Summary of new genes
Co-transcribed with upstream gene
Co-transcribed with downstream gene
Target description for predicted genes
putative dioxygenase, slightly similar to catechol 1,2-dioxygenase protein [S. meliloti 1021]
hypothetical protein BMEII0534 [B. melitensis 16M]
two component transcriptional regulator, LuxR family [S. medicae WSM419]
hypothetical protein Smed_0338 [S. medicae WSM419]
hypothetical protein Smed_0524 [S. medicae WSM419]
hypothetical protein Smed_1270 [S. medicae WSM419]
conserved hypothetical signal peptide protein [S. medicae WSM419]
Nodulation protein nolR
hypothetical protein pRL110117 [R. leguminosarum bv. viciae 3841]
hypothetical protein Cvib_0070 [P. vibrioformis DSM 265]
Functional annotation of the new genes
The sequences of putative new genes were searched against the NR database from NCBI and SwissProt from EBI  using BLASTX  and Smith-Waterman programs (; Table 2). Both programs produced very similar results: 10 of the 20 new genes had no significant hits, and the other 10 had either a full length or partial match with proteins in the NR database (Table 2). The genes with no significant hits were relatively short with lengths ranging from 120 to 366 nt.
Four predicted genes showed significant matches with genes with known or putative functions (Table 2). VBISMc2940 had 89% similarity to a conserved hypothetical signal peptide protein from S. medicae WSM419. VBISMb0839 had 69% similary to a two component transcriptional regulator, LuxR family from S. medicae WSM419. VBISMa1337 partially matched to a putative dioxygenase with 25% similarity. VBISMc3282 matched to nodulation protein NolR with 59% similarity. The NolR protein is a transcriptional regulator for common nodulation genes as well as the three nodD copies present in S. meliloti. Previous studies have shown that nolR gene in S. meliloti strain 1021 has a single insertion in the C-terminal coding sequence which abolishes the DNA-binding ability of the NolR protein [30, 31]. Thus, Rm1021 has no NolR activity. NolR- strains nodulate host plants less efficiently than NolR+ strains. In the RefSeq database, VBISMc3282 was not previously annotated as a gene although it is a mutant form of the nolR gene. Here, we demonstrate and confirm that this mutant gene is expressed.
No neighboring genes were detected to be co-transcribed with VBISMc2940, VBISMb0839 or VBISMa1337, while VBISMc3282 was co-transcribed with the downstream gene SMc01535, a hypothetical protein. The other six genes with BLAST hits matched only hypothetical proteins (Table 2).
Gene expression levels
Because the RNA amplification step was linear (Methods), we expect that the cDNA samples we prepared represent relative mRNA levels in the cell. Thus, for a full sequencing run, with high coverage, the number of sequences would be a good indication of gene expression levels. Due to the low coverage of our pilot experiment, we cannot yet estimate gene expression levels based on number of sequences for each gene. However, we expect that most of the genes that showed five or more matches to our transcriptome sequences should be highly expressed in the cell population (Additional file 3). The known genes with high sequence copy number are consistent with our knowledge about the high expression level of those genes under the same growth condition (our unpublished microarray data).
Our study demonstrated that there are many genes missed in the initial genome annotation and it is useful to have large-scale transcriptome analysis to reveal these genes and validate their status. Our results showed that sequencing bacterial transcriptomes using the GS FLX system is feasible and it helps to discover new genes and improve the genome annotation. A full GS FLX sequencing run can produce an average yield of more than 400,000 reads which is 20-fold greater than the yield from our titration run for this study. Even with 90% rRNA population in the sample, there will still be more than 40,000 reads that are non-rRNA transcripts. This provides an average 6X coverage of non-rRNA genes. Our pilot experiment with only 1854 reads already identified 20 new genes. With a full sequencing run, which produces more than 20-fold reads than the titration run, we expect to discover many more new gene transcripts that have been previously missed. However, a full sequencing run with 6X coverage of non-rRNA genes will still not be sufficient to discover all possible new genes expressed, especially for ones with low expression levels, and considering that conditions under which genes are expressed may not be known or studied by any particular set of experiments. According to the previous microarray studies, about 70-80% annotated genes are expressed under the same growth conditions as used in this study ( and our unpublished data). Nevertheless, our study suggests two ways to improve the results: the first is to more effectively remove rRNA or use a normalized cDNA library; the second is to employ "deep" sequencing techniques, either by performing multiple GS FLX runs, or by using Illumina  or ABI  methods which produce millions of reads, but of smaller average length.
Our study indicated that there are still many genes missed in the initial genome annotation of S. meliloti. High-throughput sequence analysis of bacterial transcriptomes is feasible for the identification of new genes. Next-generation sequencing technologies will greatly facilitate the gene discovery process and improve genome annotation.
Cell culture and RNA isolation
Sinorhizobium meliloti strain1021 was grown at 30°C in TY medium  to mid-exponential phase (OD600 = 0.6). Cell growth was stopped by adding 1/9th volume of stop solution (5% buffer equilibrated phenol pH 7.4 in ethanol) and placed on ice. Cells were collected by centrifugation in a microcentrifuge at maximum speed for 3 minutes. The cell pellets were stored in -80°C. Total RNA was isolated by using Qiagen RNeasy bacterial RNA purification kit (Qiagen, Valencia, CA). The total RNA was treated with DNase I on mini-RNeasy column before eluted with RNase free water. For RT-PCR experiments, an additional DNase I treatment was done after RNA was eluted from the RNeasy mini column to ensure that there was no genomic DNA contamination. 20 μl of total RNAs eluted from the RNeasy mini-column were treated with 5 μl DNase I (Qiagen) in 10 μl RDD buffer, 1 μl RNase inhibitor (Invitrogen, Carlsbad, CA) and 64 μl RNase free water (Qiagen) at 25°C for 30 minutes. The RNAs were then extracted with phenol/chloroform and precipitated with ethanol using standard protocols. 16s and 23s rRNAs were then depleted using the MICROBExpress™ Bacterial mRNA Enrichment Kit (Ambion, Austin, TX). Total RNAs and rRNA depleted RNAs were quantified and analyzed on the Agilent 2100 Bioanalyzer. 7 μg of total RNA per reaction was used. After 16s and 23s rRNAs were depleted, about 0.5 μg (7%) RNAs was recovered. The total RNA samples had RNA integrity number (RIN value) of 8.0 or better. As shown in Figure 2, more than 90% 16s and 23s rRNAs were depleted. We analyzed more than 10 independent rRNA-depleted preparations on the Agilent 2100 Bioanalyzer. The results were consistent and showed that the 16s and 23s rRNA peaks were greatly reduced in these preparations but not completely removed (Figure 2). In addition, two small peaks immediately before 16s and 23s rRNAs could not be removed by the Ambion MicrobExpress kit. The two peaks were consistently present in all of our RNA preparations.
Two RNA samples were prepared for RNA amplification. Sample 1 was 16s and 23s rRNA depleted RNA sample as described above. Sample 2 was the 16s and 23s rRNA depleted RNA sample ligated to a 3' RNA adptor (5'-PO4-UUCGCUGUUC UUAGCGGCCG CAUGCUC-idT-3'; idT: 3' inverted deoxythymidine) (Dharmacon Research, Lafayette, CO) and a 5' RNA adptor (5'-OH-AUGUGCGCGA CUUCCUGUAG ACGGAACGCU AGAAGAAA-OH-3') (Dharmacon Research). 3' and 5' adaptor ligations were done as described in Argaman et al. 2001 .
RNA amplification and cDNA preparation
To obtain enough cDNA for sequencing, the 16s and 23s rRNA depleted RNAs (sample 1 and 2) were amplified using Nugen WT-Ovation Pico RNA amplification system . 5 ng of starting RNA was used. The SPIA™ amplified single strand cDNA (2.5 ug) was then taken through a second strand cDNA synthesis, using the following conditions: 5X 2nd strand reaction mix 30 μl (Invitrogen), dNTP, 10 mM 3 μl (Invitrogen), E. coli DNA ligase 1 μl (Invitrogen), E. coli DNA polymerase I 4 μl (Invitrogen), RNase H 1 μl (Invitrogen), RNase-free water 91 μl (Ambion). The reaction mix was incubated at 16°C for 2 hours. The cDNA was then purified using the Qiagen PCR clean up kit resulting 4 ug of cDNA quantified by using the Nanodrop spectrophotometer. 1 ug of cDNA of each sample was size selected (>100 bp) using Roche's GS FLX library Preparation Guide recommendations (no nebulization was necessary due to the size range of the cDNA GS FLX library Preparation Guide), and a single stranded library was created.
GS FLX sequencing and data filtering
The DNA sequencing libraries for the two samples were combined with the sequencing beads in 4 different concentrations to determine the optimal conditions for emPCR amplification. All 8 preparations were sequenced in 8 lanes of a GS FLX sequencing plate using the standard Roche/454 protocols. Sequencing data was obtained after a 7 hour run on the GS FLX. The 54,162 raw reads from GS FLX sequencing run that passed the sample key code filter (initial bases TCAG) were further filtered by the 454 software to eliminate 10,521 mixed reads (with two or more different DNA strands/bead), 15,604 excessively short reads (less than about 50 bp), and 9,032 interrupted reads ("dots"). 35% of the raw reads passed all filters in this titration run to provide the 19,005 high quality reads used in this study.
Sequence analysis and mapping to S. melilotigenome
GS FLX sequences passed filtering criteria were BLASTN-aligned to S. meliloti genome. Sequences with matching to rRNA operons were filtered. The remaining sequences were BLASTed against RefSeq genes and our predicted new genes from our gene annotation pipeline.
5 μg DNase I treated total RNA was reverse transcribed using superscript II with 4 pmoles of equally mixed gene-specific primers for each candidate gene selected (cm012b-cm0031b, Table S1). Primers were designed using Primer3 . For PCR, each 40 μl reaction includes 0.5 μl of 40 μl reverse transcription reaction, 20 μl of 2X GoTaq Green Master Mix (Promaga, Madison, WI) and 0.5 μM of primer pair of each gene (IDT, Coralville, IA). PCR conditions were 95°C 2 min, 30 cycles of 95°C 45 s, 52°C 45 s, 72°C 60 s, and a final cycle of 72°C for 10 min. PCR products were examined by electrophoresis in a 2.5% agarose/TAE/EtBr gel. Sequencing was performed using the BigDye Terminator Cycle Sequencing Kit (Applied Biosystems) and analyzed on an Applied Biosystems model 3730 automated capillary DNA sequencer.
We thank Timothy Driscoll for helpful discussions, Jessica Kraszewski for assistance in the GS FLX runs and Chunxia Wang for providing microarray data. This work was funded by the Commonwealth Research Initiative (CRI) from the Commonwealth of Virginia and by the Virginia Bioinformatics Institute.
- Barnett MJ, Fisher RF, Jones T, Komp C, Abola AP, Barloy-Hubler F, Bowser L, Capela D, Galibert F, Gouzy J, Gurjal M, Hong A, Huizar L, Hyman RW, Kahn D, Kahn ML, Kalman S, Keating DH, Palm C, Peck MC, Surzycki R, Wells DH, Yeh KC, Davis RW, Federspiel NA, Long SR: Nucleotide sequence and predicted functions of the entire Sinorhizobium meliloti pSymA megaplasmid. Proc Natl Acad Sci USA. 2001, 98 (17): 9883-9888. 10.1073/pnas.161294798.PubMed CentralView ArticlePubMedGoogle Scholar
- Capela D, Barloy-Hubler F, Gouzy J, Bothe G, Ampe F, Batut J, Boistard P, Becker A, Boutry M, Cadieu E, Dreano S, Gloux S, Godrie T, Goffeau A, Kahn D, Kiss E, Lelaure V, Masuy D, Pohl T, Portetelle D, Puhler A, Purnelle B, Ramsperger U, Renard C, Thebault P, Vandenbol M, Weidner S, Galibert F: Analysis of the chromosome sequence of the legume symbiont Sinorhizobium meliloti strain 1021. Proc Natl Acad Sci USA. 2001, 98 (17): 9877-9882. 10.1073/pnas.161294398.PubMed CentralView ArticlePubMedGoogle Scholar
- Finan TM, Weidner S, Wong K, Buhrmester J, Chain P, Vorholter FJ, Hernandez-Lucas I, Becker A, Cowie A, Gouzy J, Golding B, Puhler A: The complete sequence of the 1,683-kb pSymB megaplasmid from the N2-fixing endosymbiont Sinorhizobium meliloti. Proc Natl Acad Sci USA. 2001, 98 (17): 9889-9894. 10.1073/pnas.161294698.PubMed CentralView ArticlePubMedGoogle Scholar
- Galibert F, Finan TM, Long SR, Puhler A, Abola P, Ampe F, Barloy-Hubler F, Barnett MJ, Becker A, Boistard P, Bothe G, Boutry M, Bowser L, Buhrmester J, Cadieu E, Capela D, Chain P, Cowie A, Davis RW, Dreano S, Federspiel NA, Fisher RF, Gloux S, Godrie T, Goffeau A, Golding B, Gouzy J, Gurjal M, Batut J: The composite genome of the legume symbiont Sinorhizobium meliloti. Science. 2001, 293 (5530): 668-672. 10.1126/science.1060966.View ArticlePubMedGoogle Scholar
- Pruitt KD, Tatusova T, Maglott DR: NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007, 35 (Database issue): D61-5. 10.1093/nar/gkl842.PubMed CentralView ArticlePubMedGoogle Scholar
- Barnett MJ, Toman CJ, Fisher RF, Long SR: A dual-genome Symbiosis Chip for coordinate study of signal exchange and development in a prokaryote-host interaction. Proc Natl Acad Sci USA. 2004, 101 (47): 16636-16641. 10.1073/pnas.0407269101.PubMed CentralView ArticlePubMedGoogle Scholar
- Ruberg S, Tian ZX, Krol E, Linke B, Meyer F, Wang Y, Puhler A, Weidner S, Becker A: Construction and validation of a Sinorhizobium meliloti whole genome DNA microarray: genome-wide profiling of osmoadaptive gene expression. J Biotechnol. 2003, 106 (2-3): 255-268. 10.1016/j.jbiotec.2003.08.005.View ArticlePubMedGoogle Scholar
- Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437 (7057): 376-380.PubMed CentralPubMedGoogle Scholar
- Green RE, Krause J, Ptak SE, Briggs AW, Ronan MT, Simons JF, Du L, Egholm M, Rothberg JM, Paunovic M, Paabo S: Analysis of one million base pairs of Neanderthal DNA. Nature. 2006, 444 (7117): 330-336. 10.1038/nature05336.View ArticlePubMedGoogle Scholar
- Pearson BM, Gaskin DJ, Segers RP, Wells JM, Nuijten PJ, van Vliet AH: The complete genome sequence of Campylobacter jejuni strain 81116 (NCTC11828). J Bacteriol. 2007, 189 (22): 8402-8403. 10.1128/JB.01404-07.PubMed CentralView ArticlePubMedGoogle Scholar
- Poinar HN, Schwarz C, Qi J, Shapiro B, Macphee RD, Buigues B, Tikhonov A, Huson DH, Tomsho LP, Auch A, Rampp M, Miller W, Schuster SC: Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science. 2006, 311 (5759): 392-394. 10.1126/science.1123360.View ArticlePubMedGoogle Scholar
- Bainbridge MN, Warren RL, Hirst M, Romanuik T, Zeng T, Go A, Delaney A, Griffith M, Hickenbotham M, Magrini V, Mardis ER, Sadar MD, Siddiqui AS, Marra MA, Jones SJ: Analysis of the prostate cancer cell line LNCaP transcriptome using a sequencing-by-synthesis approach. BMC Genomics. 2006, 7: 246-10.1186/1471-2164-7-246.PubMed CentralView ArticlePubMedGoogle Scholar
- Berezikov E, Thuemmler F, van Laake LW, Kondova I, Bontrop R, Cuppen E, Plasterk RH: Diversity of microRNAs in human and chimpanzee brain. Nat Genet. 2006, 38 (12): 1375-1377. 10.1038/ng1914.View ArticlePubMedGoogle Scholar
- Emrich SJ, Barbazuk WB, Li L, Schnable PS: Gene discovery and annotation using LCM-454 transcriptome sequencing. Genome Res. 2007, 17 (1): 69-73. 10.1101/gr.5145806.PubMed CentralView ArticlePubMedGoogle Scholar
- Gowda M, Li H, Alessi J, Chen F, Pratt R, Wang GL: Robust analysis of 5'-transcript ends (5'-RATE): a novel technique for transcriptome analysis and genome annotation. Nucleic Acids Res. 2006, 34 (19): e126-10.1093/nar/gkl522.PubMed CentralView ArticlePubMedGoogle Scholar
- Krause L, Diaz NN, Bartels D, Edwards RA, Puhler A, Rohwer F, Meyer F, Stoye J: Finding novel genes in bacterial communities isolated from the environment. Bioinformatics. 2006, 22 (14): e281-9. 10.1093/bioinformatics/btl247.View ArticlePubMedGoogle Scholar
- Sogin ML, Morrison HG, Huber JA, Mark Welch D, Huse SM, Neal PR, Arrieta JM, Herndl GJ: Microbial diversity in the deep sea and the underexplored "rare biosphere". Proc Natl Acad Sci USA. 2006, 103 (32): 12115-12120. 10.1073/pnas.0605127103.PubMed CentralView ArticlePubMedGoogle Scholar
- Taylor KH, Kramer RS, Davis JW, Guo J, Duff DJ, Xu D, Caldwell CW, Shi H: Ultradeep bisulfite sequencing analysis of DNA methylation patterns in multiple gene promoters by 454 sequencing. Cancer Res. 2007, 67 (18): 8511-8518. 10.1158/0008-5472.CAN-07-1016.View ArticlePubMedGoogle Scholar
- Snyder EE, Kampanya N, Lu J, Nordberg EK, Karur HR, Shukla M, Soneja J, Tian Y, Xue T, Yoo H, Zhang F, Dharmanolla C, Dongre NV, Gillespie JJ, Hamelius J, Hance M, Huntington KI, Jukneliene D, Setubal JC, Sobral BW: PATRIC: the VBI PathoSystems Resource Integration Center. Nucleic Acids Res. 2007, 35 (Database issue): D401-6. 10.1093/nar/gkl858.PubMed CentralView ArticlePubMedGoogle Scholar
- Delcher AL, Harmon D, Kasif S, White O, Salzberg SL: Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 1999, 27 (23): 4636-4641. 10.1093/nar/27.23.4636.PubMed CentralView ArticlePubMedGoogle Scholar
- Borodovsky M, McIninch J: Recognition of genes in DNA sequence with ambiguities. Biosystems. 1993, 30 (1-3): 161-171. 10.1016/0303-2647(93)90068-N.View ArticlePubMedGoogle Scholar
- Lukashin AV, Borodovsky M: GeneMark.hmm: new solutions for gene finding. Nucleic Acids Res. 1998, 26 (4): 1107-1115. 10.1093/nar/26.4.1107.PubMed CentralView ArticlePubMedGoogle Scholar
- Tech M, Pfeifer N, Morgenstern B, Meinicke P: TICO: a tool for improving predictions of prokaryotic translation initiation sites. Bioinformatics. 2005, 21 (17): 3568-3569. 10.1093/bioinformatics/bti563.View ArticlePubMedGoogle Scholar
- Suzek BE, Ermolaeva MD, Schreiber M, Salzberg SL: A probabilistic method for identifying start codons in bacterial genomes. Bioinformatics. 2001, 17 (12): 1123-1130. 10.1093/bioinformatics/17.12.1123.View ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- NCBI. [http://www.ncbi.nlm.nih.gov]
- Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, Lewis S: The generic genome browser: a building block for a model organism system database. Genome Res. 2002, 12 (10): 1599-1610. 10.1101/gr.403602.PubMed CentralView ArticlePubMedGoogle Scholar
- SwissProt. [http://www.ebi.ac.uk/swissprot]
- Smith TF, Waterman MS: Identification of common molecular subsequences. J Mol Biol. 1981, 147 (1): 195-197. 10.1016/0022-2836(81)90087-5.View ArticlePubMedGoogle Scholar
- Cren M, Kondorosi A, Kondorosi E: An insertional point mutation inactivates NolR repressor in Rhizobium meliloti 1021. J Bacteriol. 1994, 176 (2): 518-519.PubMed CentralPubMedGoogle Scholar
- Wais RJ, Wells DH, Long SR: Analysis of differences between Sinorhizobium meliloti 1021 and 2011 strains using the host calcium spiking response. Mol Plant Microbe Interact. 2002, 15 (12): 1245-1252. 10.1094/MPMI.2002.15.12.1245.View ArticlePubMedGoogle Scholar
- Illumina. [http://www.illumina.com]
- ABI. [http://www.appliedbiosystems.com]
- Beringer JE: R factor transfer in Rhizobium leguminosarum. J Gen Microbiol. 1974, 84 (1): 188-198.PubMedGoogle Scholar
- Argaman L, Hershberg R, Vogel J, Bejerano G, Wagner EG, Margalit H, Altuvia S: Novel small RNA-encoding genes in the intergenic regions of Escherichia coli. Curr Biol. 2001, 11 (12): 941-950. 10.1016/S0960-9822(01)00270-6.View ArticlePubMedGoogle Scholar
- Nugen. [http://www.nugeninc.com]
- Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.PubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.