EnzyBase: a novel database for enzybiotic studies
© Wu et al; licensee BioMed Central Ltd. 2012
Received: 12 October 2011
Accepted: 11 April 2012
Published: 11 April 2012
Enzybiotics are becoming increasingly recognized as potential alternative therapies for drug-resistant bacteria. Although only a few enzybiotics are currently well characterized, much information is still missing or is unavailable for researchers. The construction of an enzybiotics database would therefore increase efficiency and convenience in investigating these bioactive proteins and thus help reduce or delay the recent increase in antibiotic resistance.
In the present manuscript, we describe the development of a novel and original database called EnzyBase, which contains 1144 enzybiotics from 216 natural sources. To ensure data quality, we limited the source of information to authoritative public databases and published scientific literature. The interface of EnzyBase is easy to use and allows users to rapidly retrieve data according to their desired search criteria and blast the database for homologous sequences. We also describe examples of database-aided enzybiotics discovery and design.
EnzyBase serves as a unique tool for enzybiotic studies. It has several potential applications, e.g. in silico enzybiotic combination as cocktails, and novel enzybiotic design, in response to continuously emerging drug-resistant pathogens. This database is a valuable platform for researchers who are interested in enzybiotic studies. EnzyBase is available online at http://biotechlab.fudan.edu.cn/database/EnzyBase/home.php.
Antibiotic abuse is, in part, responsible for the dramatic increase in the resistance of pathogens to traditional antibiotics . Superbugs, such as MRSA and NDM-1, frequently and seriously threaten public safety [2, 3]. Consequently, the need to develop new classes of antibiotics with novel mechanisms of action against drug-resistant pathogens is becoming very urgent. Enzybiotics [4–8] and antimicrobial peptides (AMPs) have attracted much attention as potential substitutes for conventional antibiotics.
In the present manuscript, enzybiotics are referred to as bacterial cell wall-degrading enzymes, including lysins, bacteriocins, autolysins, and lysozymes. The most important characteristics of enzybiotics are their novel mechanisms of antibacterial action and capacity to kill antibiotic-resistant bacteria . Another significant feature of certain enzybiotics is their low probability of developing bacterial resistance . Compared with AMPs, enzybiotics are large, heat-labile, and narrow-spectrum types of antimicrobial proteins. Consequently, enzybiotics are not always suitable antimicrobial agents. Despite this, certain enzybiotics have been well characterized and widely used. Lysostaphin [12–15] and lysozymes [16–18] are the most studied enzybiotics in regards to their clinical or food applications. Furthermore, despite their apparent limitations in medicine, their potency against multi-drug-resistant pathogens should not be ignored. Therefore, an enzybiotic specific database that not only mobilizes research on enzybiotics, but also makes it more efficient and convenient, needs to be constructed.
Over the past decade, many databases have been developed for AMPs. These databases, including APD [19, 20], ANTIMIC , CAMP , BACTIBASE [23, 24], PhytAMP , PenBase , Defensins , CyBase , and peptaibols Peptaibol , contain AMP sequences from diverse origins or specific families and accordingly have accelerated and stimulated research on AMPs. Conversely, the majority of the sequenced enzybiotics are stored in the manually annotated UniProt/Swiss-Prot  database or scattered in the scientific literature. As a result, it is difficult to find information on enzybiotics for recent users. Developing a central database that stores information on enzybiotics is warranted by investigators to promote their research on enzybiotics discovery and design.
The idea of constructing a database that stores information on enzybiotics arose from our own research experience. We found that we constantly had to query information on enzybiotics from public databases, such as UniProt, and scientific literature. Thus, we decided to construct a database that simplified our research efforts, and comprehensively collected this information. EnzyBase, a novel and original database for enzybiotics studies, was developed and currently contains 1144 enzybiotics from 216 natural sources. This database provides a platform for current users to comprehensively and conveniently research enzybiotics and can be useful for exploring and designing novel enzybiotics for medical use.
Construction and content
EnzyBase was built on an Apache HTTP Server (V2.2.14) with PHP (V5.2.13) and MySQL Server (V5.1.40) as the back-end, and Personal Home Page (PHP), HyperText Markup Language (HTML) and Cascading Style Sheets (CSS) as the front-end. Apache, MySQL, and PHP were preferred as they are open-source software and platform independent, respectively, making them suitable for academic use. The web server and all parts of the database are hosted at Information Office of Fudan University, Shanghai, China.
All enzybiotic sequences were collected manually from the annotated UniProt/Swiss-Prot database or scientific literature. Each enzybiotic without the UniProt link had been excluded. The enzybiotics collected in EnzyBase database are primarily from natural sources, with the exception of genetically-modified sequences. Additional physicochemical data of each enzybiotic was either calculated via Bioperl programs or identified from scientific literature via a PubMed search. All of the collected information was classified and filled into six relational tables in MySQL. For each enzybiotic, a unique identification number (i.e., enzy id) was assigned, beginning with the prefix EN. Each entry also contains general data, such as protein name, protein full name, producer organism, simple function annotation and protein sequence, domains, 3D structure, and relevant references. For all proteins that already exist in the UniProt, Interpro , and/or PDB  databases, hyperlinks to these databases were created in EnzyBase. Additional physicochemical data, including calculated isoelectric point (pI) and charge at pI, are also provided. Moreover, minimal inhibitory concentrations (MICs) are included, if data are available. The BlastP program (BLASTP V2.2.25+) [33, 34] was used for sequence homology searches against EnzyBase.
Utility and discussion
As a web-based database, all data can be accessed and retrieved directly from the web browser. The database browse interface provides the users with a function of navigating the entire database, whereas the search interface provides the users with the function of retrieving their desired information using either the "quick" or "advanced" options. A "quick" search can be performed using only keywords, while the "advanced" search offers the possibility to specify seven separate fields, namely enzy id, uniprotKB entry number (i.e., uniprot id), protein name, producer organism, domains, target organism, and MIC value. The user can query the database by either one condition (excluding MIC, which requires the type of target organism to be initially stated) or a combination of various conditions. Every enzybiotic has its own results page that contains comprehensive information, including general information, antibacterial activities, sequence, structures, domains, and references. The general information consists of enzy id, protein name, protein full name, producer organism, protein mass, calculated pI, antibacterial activity, and simple function annotations. EnzyBase also provides hyperlinks to other databases, such as UniProt, InterPro, PDB, and PubMed, which allows for easier navigation within the World Wide Web pertaining to additional information on enzybiotics. The tools interface permits the use of BLASTP against EnzyBase, which enables users to search the database for homologous sequences, and then copy obtained results for subsequent research. Owing to limitations of disk space on the host site, we did not implement a local BLASTP against the NCBI database but instead supplied a hyperlink to the BLASTP on the NCBI website. The statistical info interface provides data on sources for enzybiotics, the distribution of sequence length, protein mass, calculated protein pI, and domains (please refer to the 'Statistical description and findings' section below for more information). The guide interface provides simple instructions for potential users on how to use the functions of EnzyBase. Additionally, the forum tools, which are based on UseBB, a free forum software, have been integrated into the database to provide information on updates, bug reports, and user discussions.
Statistical description and findings
Top 10 sources of enzybiotics in EnzyBase
Numbers of enzybiotics
Top 10 domains in EnzyBase
Numbers of enzybiotics
The EnzyBase can be used as a tool to aid researchers in exploring the use of enzybiotics or for designing novel enzybiotics. The most prominent weakness of enzybiotics is their narrow spectrum of antibacterial activity. However, a combination of enzybiotics with different spectra of antibacterial activities and/or different mechanisms of action could be used against a broad spectrum of bacterial infections and/or their resistant strains. Through the use of EnzyBase, users can quickly find a series of enzybiotics with optimum antibacterial activities against specific pathogens, and then combine them as a cocktail to measure their therapeutic effect against bacterial infectious diseases. Similar approaches have been successfully used to design phage cocktail therapies for the treatment of infections . For novel enzybiotics design, users could search for potential domains with high antibacterial activities against specific pathogens on EnzyBase and then combine them to create chimeric enzybiotics. For instance, to search for effective antimicrobial proteins against mastitis-causing pathogens, researchers created a novel chimeric peptidoglycan hydrolase fusion protein between lysostaphin and the endolysin of phage B30, which possesses their respective enzymatic domains, and is capable of degrading both streptococcal and staphylococcal peptidoglycans . Thus, the quantity and quality of the data entered in EnzyBase appears to be very important for successfully applying it in such research applications.
In the future, we plan to implement updates, assess the data quality continuously, and integrate some structural analysis tools, such as RasMol , and certain web2.0 functions, such as Wiki, into EnzyBase to improve its interactivity with users and improve research in the field of enzybiotics design and structure function exploration.
In summary, EnzyBase is a comprehensive and web-accessible database of enzybiotics. The current version of EnzyBase has 1144 entries. The database can be queried either by using simply keywords or by combinatorial conditions searches. EnzyBase may aid in enhancing our current understanding of enzybiotics and their mechanisms of action. Its potential applications include the in silico development of combinations of enzybiotics (e.g., cocktails) and the construction of novel enzybiotics against various bacterial infectious diseases. Thus, the database may have implications in the development of new drugs for medical applications.
Availability and requirements
EnzyBase is freely available for academic users at http://biotechlab.fudan.edu.cn/database/EnzyBase/home.php.
We would like to thank all of our colleagues at the State Key Laboratory of Genetic Engineering at Fudan University and Shanghai High-Tech United Bio-Technological R&D Co., Ltd., of China for their contributions in the literature search and discussions regarding this manuscript. This work was supported in part by the major scientific and technological specialized project of China for 'Significant New Formulation of New Drugs' (grant #: 2008ZX09101-032) and the 'Yangtze River Delta' joint scientific and technological project of China (grant 10495810600).
- English BK, Gaur AH: The use and abuse of antibiotics and the development of antibiotic resistance. Adv Exp Med Biol. 2010, 659: 73-82. 10.1007/978-1-4419-0981-7_6.PubMedView Article
- Heddini A, Cars O, Qiang S, Tomson G: Antibiotic resistance in China-a major future challenge. Lancet. 2009, 373: 30-PubMedView Article
- Levy SB, Marshall B: Antibacterial resistance worldwide: causes, challenges and responses. Nat Med. 2004, 10: S122-129. 10.1038/nm1145.PubMedView Article
- Nelson D, Loomis L, Fischetti VA: Prevention and elimination of upper respiratory colonization of mice by group A streptococci by using a bacteriophage lytic enzyme. Proc Natl Acad Sci USA. 2001, 98: 4107-4112. 10.1073/pnas.061038398.PubMedPubMed CentralView Article
- Veiga-Crespo P, Ageitos JM, Poza M, Villa TG: Enzybiotics: a look to the future, recalling the past. J Pharm Sci. 2007, 96: 1917-1924. 10.1002/jps.20853.PubMedView Article
- Hermoso JA, Garcia JL, Garcia P: Taking aim on bacterial pathogens: from phage therapy to enzybiotics. Curr Opin Microbiol. 2007, 10: 461-472. 10.1016/j.mib.2007.08.002.PubMedView Article
- Loessner MJ: Bacteriophage endolysins-current state of research and applications. Curr Opin Microbiol. 2005, 8: 480-487. 10.1016/j.mib.2005.06.002.PubMedView Article
- Fischetti VA: Bacteriophage lytic enzymes: novel anti-infectives. Trends Microbiol. 2005, 13: 491-496. 10.1016/j.tim.2005.08.007.PubMedView Article
- Gordon YJ, Romanowski EG, McDermott AM: A review of antimicrobial peptides and their therapeutic potential as anti-infective drugs. Curr Eye Res. 2005, 30: 505-515. 10.1080/02713680590968637.PubMedPubMed CentralView Article
- Borysowski J, Weber-Dabrowska B, Gorski A: Bacteriophage endolysins as a novel class of antibacterial agents. Exp Biol Med (Maywood). 2006, 231: 366-377.
- Kusuma C, Jadanova A, Chanturiya T, Kokai-Kun JF: Lysostaphin-resistant variants of Staphylococcus aureus demonstrate reduced fitness in vitro and in vivo. Antimicrob Agents Chemother. 2007, 51: 475-482. 10.1128/AAC.00786-06.PubMedPubMed CentralView Article
- Bastos MdCdF, Coutinho BG, Coelho MLV: Lysostaphin: a staphylococcal bacteriolysin with potential clinical applications. pharmaceuticals. 2010, 3: 1139-1161. 10.3390/ph3041139.PubMed CentralView Article
- Yang G, Gao Y, Feng J, Huang Y, Li S, Liu Y, Liu C, Fan M, Shen B, Shao N: C-terminus of TRAP in Staphylococcus can enhance the activity of lysozyme and lysostaphin. Acta Biochim Biophys Sin (Shanghai). 2008, 40: 452-458. 10.1111/j.1745-7270.2008.00415.x.View Article
- Kumar JK: Lysostaphin: an antistaphylococcal agent. Appl Microbiol Biotechnol. 2008, 80: 555-561. 10.1007/s00253-008-1579-y.PubMedView Article
- Rainard P: Tackling mastitis in dairy cows. Nat Biotechnol. 2005, 23: 430-432. 10.1038/nbt0405-430.PubMedView Article
- Tenovuo J: Clinical applications of antimicrobial host proteins lactoperoxidase, lysozyme and lactoferrin in xerostomia: efficacy and safety. Oral Dis. 2002, 8: 23-29. 10.1034/j.1601-0825.2002.1o781.x.PubMedView Article
- Donovan DM: Bacteriophage and peptidoglycan degrading enzymes with antimicrobial applications. Recent Pat Biotechnol. 2007, 1: 113-122. 10.2174/187220807780809463.PubMedView Article
- Gil-Montoya JA, Guardia-Lopez I, Gonzalez-Moles MA: Evaluation of the clinical efficacy of a mouthwash and oral gel containing the antimicrobial proteins lactoperoxidase, lysozyme and lactoferrin in elderly patients with dry mouth-a pilot study. Gerodontology. 2008, 25: 3-9. 10.1111/j.1741-2358.2007.00197.x.PubMedView Article
- Wang Z, Wang G: APD: the Antimicrobial Peptide Database. Nucleic Acids Res. 2004, 32: D590-592. 10.1093/nar/gkh025.PubMedPubMed CentralView Article
- Wang G, Li X, Wang Z: APD2: the updated antimicrobial peptide database and its application in peptide design. Nucleic Acids Res. 2009, 37: D933-937. 10.1093/nar/gkn823.PubMedPubMed CentralView Article
- Brahmachary M, Krishnan SP, Koh JL, Khan AM, Seah SH, Tan TW, Brusic V, Bajic VB: ANTIMIC: a database of antimicrobial sequences. Nucleic Acids Res. 2004, 32: D586-589. 10.1093/nar/gkh032.PubMedPubMed CentralView Article
- Thomas S, Karnik S, Barai RS, Jayaraman VK, Idicula-Thomas S: CAMP: a useful resource for research on antimicrobial peptides. Nucleic Acids Res. 2010, 38: D774-780. 10.1093/nar/gkp1021.PubMedPubMed CentralView Article
- Hammami R, Zouhir A, Ben Hamida J, Fliss I: BACTIBASE: a new web-accessible database for bacteriocin characterization. BMC Microbiol. 2007, 7: 89-10.1186/1471-2180-7-89.PubMedPubMed CentralView Article
- Hammami R, Zouhir A, Le Lay C, Ben Hamida J, Fliss I: BACTIBASE second release: a database and tool platform for bacteriocin characterization. BMC Microbiol. 2010, 10: 22-10.1186/1471-2180-10-22.PubMedPubMed CentralView Article
- Hammami R, Ben Hamida J, Vergoten G, Fliss I: PhytAMP: a database dedicated to antimicrobial plant peptides. Nucleic Acids Res. 2009, 37: 963-968. 10.1093/nar/gkn655.View Article
- Gueguen Y, Garnier J, Robert L, Lefranc MP, Mougenot I, de Lorgeril J, Janech M, Gross PS, Warr GW, Cuthbertson B, et al.: PenBase, the shrimp antimicrobial peptide penaeidin database: sequence-based classification and recommended nomenclature. Dev Comp Immunol. 2006, 30: 283-288. 10.1016/j.dci.2005.04.003.PubMedView Article
- Seebah S, Suresh A, Zhuo S, Choong YH, Chua H, Chuon D, Beuerman R, Verma C: Defensins knowledgebase: a manually curated database and information source focused on the defensins family of antimicrobial peptides. Nucleic Acids Res. 2007, 35: D265-268. 10.1093/nar/gkl866.PubMedPubMed CentralView Article
- Wang CK, Kaas Q, Chiche L, Craik DJ: CyBase: a database of cyclic protein sequences and structures, with applications in protein discovery and engineering. Nucleic Acids Res. 2008, 36: D206-210.PubMedPubMed CentralView Article
- Whitmore L, Wallace BA: The Peptaibol Database: a database for sequences and structures of naturally occurring peptaibols. Nucleic Acids Res. 2004, 32: D593-594. 10.1093/nar/gkh077.PubMedPubMed CentralView Article
- Wu CH, Apweiler R, Bairoch A, Natale DA, Barker WC, Boeckmann B, Ferro S, Gasteiger E, Huang H, Lopez R, et al.: The Universal Protein Resource (UniProt): an expanding universe of protein information. Nucleic Acids Res. 2006, 34: D187-191. 10.1093/nar/gkj161.PubMedPubMed CentralView Article
- Hunter S, Apweiler R, Attwood TK, Bairoch A, Bateman A, Binns D, Bork P, Das U, Daugherty L, Duquenne L, et al.: InterPro: the integrative protein signature database. Nucleic Acids Res. 2009, 37: D211-215. 10.1093/nar/gkn785.PubMedPubMed CentralView Article
- Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE: The Protein Data Bank. Nucleic Acids Res. 2000, 28: 235-242. 10.1093/nar/28.1.235.PubMedPubMed CentralView Article
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25: 3389-3402. 10.1093/nar/25.17.3389.PubMedPubMed CentralView Article
- Schaffer AA, Aravind L, Madden TL, Shavirin S, Spouge JL, Wolf YI, Koonin EV, Altschul SF: Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements. Nucleic Acids Res. 2001, 29: 2994-3005. 10.1093/nar/29.14.2994.PubMedPubMed CentralView Article
- Goodridge LD: Design of phage cocktails for therapy from a host range point of view. Enzybiotics: antibiotic enzymes as drugs and therapeutics. Edited by: Villa TG, Veiga-crespo P. 2010, New Jersey: John Wiley &Sons, Inc.,Publication, 199-218. 1
- Donovan DM, Dong S, Garrett W, Rousseau GM, Moineau S, Pritchard DG: Peptidoglycan hydrolase fusions maintain their parental specificities. Appl Environ Microbiol. 2006, 72: 2988-2996. 10.1128/AEM.72.4.2988-2996.2006.PubMedPubMed CentralView Article
- Sayle RA, Milner-White EJ: RASMOL: biomolecular graphics for all. Trends Biochem Sci. 1995, 20: 374-10.1016/S0968-0004(00)89080-5.PubMedView Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.