BACTIBASE second release: a database and tool platform for bacteriocin characterization
© Hammami et al; licensee BioMed Central Ltd. 2010
Received: 4 August 2009
Accepted: 27 January 2010
Published: 27 January 2010
BACTIBASE is an integrated open-access database designed for the characterization of bacterial antimicrobial peptides, commonly known as bacteriocins.
For its second release, BACTIBASE has been expanded and equipped with additional functions aimed at both casual and power users. The number of entries has been increased by 44% and includes data collected from published literature as well as high-throughput datasets. The database provides a manually curated annotation of bacteriocin sequences. Improvements brought to BACTIBASE include incorporation of various tools for bacteriocin analysis, such as homology search, multiple sequence alignments, Hidden Markov Models, molecular modelling and retrieval through our taxonomy Browser.
The provided features should make BACTIBASE a useful tool in food preservation or food safety applications and could have implications for the development of new drugs for medical use. BACTIBASE is available at http://bactibase.pfba-lab-tun.org.
The dramatic rise in antibiotic-resistant pathogens has renewed efforts to identify, develop and redesign antibiotics. Bacteriocins are non-toxic inhibitors of bacteria and thus represent potential alternatives or complements to conventional antibiotics in the treatment of infections and in livestock production.
Bacteriocins were first identified almost 100 years ago. A heat-labile substance in Escherichia coli V culture supernatant was found toxic to E. coli S and given the name "colicin". It was thus decided that bacteriocins would be named after the producing species . Fredericq demonstrated that colicins were proteins and that the inhibitory activity depended on the presence of specific receptors on the surface of sensitive cells and was therefore limited to specific species or strains . Since then, bacteriocins have been found among most families of bacteria and many actinomycetes and described as universally produced, including by some members of the Archaea [3, 4]. Klaenhammer estimates that 99% of all bacteria probably produce at least one bacteriocin and the only reason we have not isolated more is that few researchers are looking for them .
Two main features distinguish the majority of bacteriocins from conventional antibiotics: bacteriocins are ribosomally synthesized and have a relatively narrow killing spectrum (3). They make up a highly diverse family of proteins in terms of size, microbial target, mode of action and release and mechanism of immunity and can be divided into two broad groups: those produced by Gram-negative bacteria and those produced by Gram-positive bacteria [6, 7].
Construction and content
The content and format of BACTIBASE have been described previously . While the general format has remained essentially unchanged, data retrieval and presentation have been improved. Data collection and annotation was done essentially the same way as for version 1 and the dataset is currently limited to natural sources. All microbiological information was collected from the literature by PubMed search. A physicochemical dataset was designed using SciDBMaker  and then provided with empirical formula, mass, length, isoelectric point, net charge, the number of basic, acidic, hydrophobic and polar residues, hydropathy index, binding potential index, instability index, aliphatic index, half-life in mammalian cells, yeast and E. coli, cysteine and glycine content, extinction coefficient, absorbance at 280 nm, absent and most prevalent amino acids, secondary (α-helix or β-strand) and tertiary structure (when available), physical method used for structural determination (e.g. NMR spectroscopy or X-ray diffraction) and critical residues for activity, whenever information was available. The Jmol applet http://www.jmol.org was included for tertiary structure visualization. The statistical interface provides data on peptide sequence, function and structure. Data were analyzed using SPSS software (version 17, SPSS Inc.) and medians and standard deviations were calculated. The following is a brief description of the database content.
The entire database is linked to the Bibliography section, which lists all published scientific articles consulted on the subject of each bacteriocin. The 'news' link points to the latest hundred published review articles in PubMed.
Bacteriocin structural analysis tool set
Several useful tools for protein analysis have been integrated into the platform. Users may search bacteriocin homologies using not only the BLAST program  but also FASTA  and SSEARCH . Multiple sequence alignment may be done using CLUSTALW , MUSCLE  and T-COFFEE  and displayed graphically using the embedded JalView applet . We used hidden Markov modeling (HMM) to produce bacteriocin profiles for each known family. The HMMER program was used to provide statistical descriptions of family consensus sequences  in order to allow users to identify the bacterial family that produces the bacteriocins most similar to their sequences.
Understanding of the molecular function of bacteriocins has been enhanced greatly by insight gained from three-dimensional structure. During the past decade, the use of homology modeling to study protein structure has become widespread. This technique generates a model of a protein using an experimental structure of a related protein as a template. We thus incorporated the program MODELLER  into the platform, which implements comparative protein structure modeling by satisfaction of spatial restraint. A sub-database of bacteriocins for which experimental structures have been developed was built. Users should note that only bacteriocins are used as templates in the homology modeling process. A modeling pipeline has been developed for automatic homology modeling from an initial bacteriocin sequence. This feature should be very useful for the in silico design of novel bacteriocins. The ability to develop novel bacteriocin-based-drugs that target prokaryotic as well as eukaryotic cells may open new possibilities for the design of improved antibiotics possessing refined characteristics.
Linking to the BACTIBASE database
It remains very easy to link directly to a specific BACTIBASE entry. With our new domain name, users may link directly to records using their BACTIBASE ID in the format http://bactibase.pfba-lab-tun.org/bacteriocinsview.php?id=BAC059, which will allow links to be maintained even if the bacteriocin data changes.
The forum section is provided to allow anyone to exchange information or ask questions regarding bacteriocins.
Comparison with other databases
In the last decade, several databases dealing with AMPs were developed http://bactibase.pfba-lab-tun.org/links.php. The AMSDb (see: http://www.bbcm.univ.trieste.it/~tossi/amsdb.html), ANTIMIC , APD2 , and CAMP  databases cover all AMPs sequences from diverse origins. Alternatively, some databases focus on AMPs produced by bacteria (BACTIBASE ), plants (PhytAMP ) and shrimp (PenBase ). While AMSdb database covers only AMPs of eukaryotic origin, ANTIMIC database contains about 1700 AMPs from diverse origins (eukaryotes, prokaryotes). Regrettably, this resource was discontinued. The Antimicrobial Peptide Database (APD2) is the most popular of the currently available public collections (containing 944 antibacterial peptides of eukaryotic and prokaryotic origin) . Recently, a new database containing a large Collection of Anti-Microbial Peptides (CAMP) was developed and holds 3782 antimicrobial sequences . While lantibiotics are the class I of bacteriocins, the CAMP database lists them as a distinct family from bacteriocins. This may confuse novice users. Although APD2 and CAMP databases contain very general information about peptides of all types having antibacterial, antifungal or antiviral activities and originating from either eukaryotic or prokaryotic cells, bacteriocins are not described with a useful amount of detail in either of these databases. Not only does BACTIBASE (version 2, July 2009) contain significantly more antimicrobial peptides of bacterial origin, than the APD2 and CAMP databases (177 in BACTIBASE versus ~120 in APD2 and ~68 in CAMP), but also every entry in BACTIBASE is much more detailed. BACTIBASE features, for example, physicochemical and structural information, detailed lists of target organisms and a description of the mode of action for each bacteriocin -- data not available in APD2 or any other online resource (to the best of our knowledge). Also, BACTIBASE hosts a rich and highly usable collection of references, where (i) each entry has been supplied with a short annotation summarizing its topic in ~10 words or less, (ii) is cross-linked to PubMed, and (iii) can be conveniently exported to Citation Manager Software of user's choice. The database provides several tools for bacteriocin sequence analysis (unavailable in APD2; unavailable or static in CAMP), such as homology search, multiple sequence alignments, Hidden Markov Models and molecular modeling. All this makes BACTIBASE a truly unique resource for bacteriocins.
We are currently developing a system for automatic updating of the database. New types of data will be added in the near future. Subsequent development will include integrating a system that automates the prediction of bacteriocin functional amino acids as well as enriching the platform with useful tools for bacteriocin characterization. We also hope to develop new methods/techniques for structural and functional classification of bacteriocins.
The purpose of the database is to serve the research community by organizing information relevant to all types of bacteriocins from all groups of Bacteria. With the inclusion of most known bacteriocin sequences, BACTIBASE 2 has grown into an integrated knowledge base for bacteriocin investigators. It is our hope that the implementation of 'Molecule Authorities' will transform BACTIBASE into a community-driven database (via notes) and that this trend will continue so that the individual investigators will verify or contribute to verifying every entry. We thank all investigators who have provided or will provide valuable feedback regarding the individual entries in this database. As more information about bacteriocins becomes available, the database will be expanded and improved accordingly. While database updating and developments continue, we welcome your comments, suggestions, or corrections.
Availability and requirements
BACTIBASE can be accessed freely at http://bactibase.pfba-lab-tun.org.
Authors thank Dr. Stephen Davids for his critical reading of the manuscript.
- Gartia A: Sur un remarquable exemple d'antagonisme entre deux souches de colibacille. Compt rend soc biol. 1925, 93: 1040-1041.Google Scholar
- Fredericq P: Sur la pluralité des récepteurs d'antibiose de E. coli. CR Soc Biol (Paris). 1946, 140: 1189-1194.Google Scholar
- Riley MA, Wertz JE: Bacteriocins: evolution, ecology, and application. Annu Rev Microbiol. 2002, 56: 117-137. 10.1146/annurev.micro.56.012302.161024.View ArticlePubMedGoogle Scholar
- Shand RF, Leyva KJ: Archaeal antimicrobials: an undiscovered country. Archaea: new models for prokaryotic biology. Edited by: Blum P. 2008, Norfolk: Caister Academic, 233-242.Google Scholar
- Klaenhammer TR: Bacteriocins of lactic acid bacteria. Biochimie. 1988, 70: 337-349. 10.1016/0300-9084(88)90206-4.View ArticlePubMedGoogle Scholar
- Gordon DM, Oliver E, Littlefield-Wyer J: The diversity of bacteriocins in Gram-negative bacteria. Bacteriocins: ecology and evolution. Edited by: Riley MA, Chavan M. 2007, Berlin: Springer, 5-18. full_text.View ArticleGoogle Scholar
- Heng NCK, Wescombe PA, Burton JP, Jack RW, Tagg JR: The diversity of bacteriocins in Gram-positive bacteria. Bacteriocins: ecology and evolution. Edited by: Riley MA, Chavan M. 2007, Berlin: Springer, 45-92. full_text.View ArticleGoogle Scholar
- Hammami R, Zouhir A, Ben Hamida J, Fliss I: BACTIBASE: a new web-accessible database for bacteriocin characterization. BMC Microbiol. 2007, 7: 89-10.1186/1471-2180-7-89.PubMed CentralView ArticlePubMedGoogle Scholar
- Hammami R, Zouhir A, Naghmouchi K, Ben Hamida J, Fliss I: SciDBMaker: new software for computer-aided design of specialized biological databases. BMC Bioinformatics. 2008, 9: 121-10.1186/1471-2105-9-121.PubMed CentralView ArticlePubMedGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMed CentralView ArticlePubMedGoogle Scholar
- Pearson WR, Lipman DJ: Improved tools for biological sequence comparison. Proc Natl Acad Sci USA. 1988, 85: 2444-2448. 10.1073/pnas.85.8.2444.PubMed CentralView ArticlePubMedGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignmennt through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.PubMed CentralView ArticlePubMedGoogle Scholar
- Edgar RC: MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004, 32: 1792-1797. 10.1093/nar/gkh340.PubMed CentralView ArticlePubMedGoogle Scholar
- Notredame C, Higgins DG, Heringa J: T-Coffee: a novel method for fast and accurate multiple sequence alignment. J Mol Biol. 2000, 302: 205-217. 10.1006/jmbi.2000.4042.View ArticlePubMedGoogle Scholar
- Waterhouse AM, Procter JB, Martin DM, Clamp M, Barton GJ: Jalview Version 2--a multiple sequence alignment editor and analysis workbench. Bioinformatics. 2009, 25 (9): 1189-91. 10.1093/bioinformatics/btp033.PubMed CentralView ArticlePubMedGoogle Scholar
- Durbin R, Eddy S, Krogh A, Mitchison G: Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids. 1998, Cambridge: Cambridge University PressView ArticleGoogle Scholar
- Sali A, Blundell TL: Comparative protein modelling by satisfaction of spatial restraints. J Mol Biol. 1993, 234: 779-815. 10.1006/jmbi.1993.1626.View ArticlePubMedGoogle Scholar
- Brahmachary M, Krishnan SP, Koh JL, Khan AM, Seah SH, Tan TW, Brusic V, Bajic VB: ANTIMIC: a database of antimicrobial sequences. Nucleic Acids Res. 2004, D586-589. 10.1093/nar/gkh032. 32 Database
- Wang G, Li X, Wang Z: APD2: the updated antimicrobial peptide database and its application in peptide design. Nucleic Acids Res. 2009, D933-937. 10.1093/nar/gkn823. 37 Database
- Thomas S, Karnik S, Barai RS, Jayaraman VK, Idicula-Thomas S: CAMP: A useful resource for research on antimicrobial peptides. Nucleic Acids Res. 2010, D774-D780. 10.1093/nar/gkp1021. 38 Database
- Hammami R, Ben Hamida J, Vergoten G, Fliss I: PhytAMP: a database dedicated to antimicrobial plant peptides. Nucleic Acids Res. 2009, D963-968. 10.1093/nar/gkn655. 37 Database
- Gueguen Y, Garnier J, Robert L, Lefranc MP, Mougenot I, de Lorgeril J, Janech M, Gross PS, Warr GW, Cuthbertson B: PenBase, the shrimp antimicrobial peptide penaeidin database: sequence-based classification and recommended nomenclature. Dev Comp Immunol. 2006, 30 (3): 283-288. 10.1016/j.dci.2005.04.003.View ArticlePubMedGoogle Scholar