- Research article
- Open Access
Comparative fecal metagenomics unveils unique functional capacity of the swine gut
© Lamendella et al; licensee BioMed Central Ltd. 2011
- Received: 13 September 2010
- Accepted: 15 May 2011
- Published: 15 May 2011
Uncovering the taxonomic composition and functional capacity within the swine gut microbial consortia is of great importance to animal physiology and health as well as to food and water safety due to the presence of human pathogens in pig feces. Nonetheless, limited information on the functional diversity of the swine gut microbiome is available.
Analysis of 637, 722 pyrosequencing reads (130 megabases) generated from Yorkshire pig fecal DNA extracts was performed to help better understand the microbial diversity and largely unknown functional capacity of the swine gut microbiome. Swine fecal metagenomic sequences were annotated using both MG-RAST and JGI IMG/M-ER pipelines. Taxonomic analysis of metagenomic reads indicated that swine fecal microbiomes were dominated by Firmicutes and Bacteroidetes phyla. At a finer phylogenetic resolution, Prevotella spp. dominated the swine fecal metagenome, while some genes associated with Treponema and Anareovibrio species were found to be exclusively within the pig fecal metagenomic sequences analyzed. Functional analysis revealed that carbohydrate metabolism was the most abundant SEED subsystem, representing 13% of the swine metagenome. Genes associated with stress, virulence, cell wall and cell capsule were also abundant. Virulence factors associated with antibiotic resistance genes with highest sequence homology to genes in Bacteroidetes, Clostridia, and Methanosarcina were numerous within the gene families unique to the swine fecal metagenomes. Other abundant proteins unique to the distal swine gut shared high sequence homology to putative carbohydrate membrane transporters.
The results from this metagenomic survey demonstrated the presence of genes associated with resistance to antibiotics and carbohydrate metabolism suggesting that the swine gut microbiome may be shaped by husbandry practices.
- Antibiotic Resistance Mechanism
- Seed Subsystem
- Variable Microbiome
- Chicken Cecum
- Fecal Metagenome
The animal gastrointestinal tract harbors a complex microbial network and its composition reflects the constant co-evolution of these microorganisms with their host environment . Uncovering the taxonomic composition and functional capacity within the animal gut microbial consortia is of great importance to understanding the roles they play in the host physiology and health. Since animal feces can harbor human pathogens, understanding the genetic composition of fecal microbial communities also has important implications for food and water safety. The structure and function of the gut microbial community has received significant attention for decades, although most of the work was restricted by the use of culture-based techniques. Recently, sequence analysis of the 16S rRNA gene has shed new light on the diversity and composition of microbial communities within several animal gut systems . While 16S rRNA gene-based techniques have revealed impressive microbial diversity within gut environments, this approach offers only limited information on the physiological role of microbial consortia within a given gut environment.
Random sequencing of metagenomes has allowed scientists to reveal significant differences in metabolic potential within different environments , including microbial populations associated with host-microbial partnerships. Specifically, the publicly available database IMG/M  contains 596 Mb of sequencing data, representing 1,424, 000 genes from 17 different gut microbiomes. Studying gut metagenomes has particularly helped in uncovering several important biological characteristics of these microbiomes. For example, when 13 human gut metagenomes were compared, Kurokawa et al  found that adult and infant type gut microbiomes have enriched gene families sharing little overlap, suggesting different core functions within the adult and infantile gut microbiota. This study also demonstrated the presence of hundreds of gene families exclusively found in the adult human gut, suggesting various strategies are employed by each type of microbiota to adapt to its intestinal environment . Other gut microbiome studies support these significant differences in core and variable gene content from different animal hosts and environments [1, 6–12]. Thus, comparing the gene content of multiple gut microbiomes can help elucidate the ecological underpinnings of gut systems.
Thus far, the functional genetic potential of the pig distal gut microbiota has not been studied using metagenomics, although it is reasonable to assume that the swine gut supports similar genetic complexity to the human gut system, as they both prefer omnivorous feeding behavior and harbor similar bacterial groups as determined by several phylogenetic studies [13–15]. In this study we used metagenomic data analyses to characterize the swine fecal microbiome with respect to species composition and functional content. In order to search for the potential presence of unique gene functions harbored by the swine gut microbiome, we performed a comparative metagenomic approach, in the context of phylogenetic and functional composition.
Taxonomic distribution of swine fecal metagenomic sequences
Summary of pyrosequencing data from Yorkshire swine fecal samples
Yorkshire Pig Fecal Metagenome GS20
Yorkshire Pig Fecal Metagenome FLX
Total no. of sequences
Total sequence size (bp)
Average sequence length (bp)
Ribosomal Database Project 16S rDNA hits
Greengenes 16S rDNA hits
Both GS20 and FLX metagenomic swine fecal datasets were dominated by Firmicutes and Bacteroidetes phyla (Figure 1), which is consistent with several molecular phylogenetic studies of mammalian gut environments, including the swine gut [2, 8, 10, 14]. Archaeal sequences constituted less than 1% of total rRNA gene sequences retrieved in either swine metagenome, and were dominated by the Methanomicrobia and Thermococci, which is consistent with previous molecular diversity studies of pig manure . While these populations are only a very small fraction of the total microbiota , methanogens contribute significantly to the metabolic potential within in a gut environment . The majority of eukaryotic sequences derived from the swine metagenomes are related to Chordata (i.e., host phylum), fungi, and the Viridiplantae (i.e., feed). Sequences sharing high sequence homology to Balantidium coli were obtained in both swine metagenomes. The latter is a protozoan pathogen that causes balantadiasis in mammalian hosts, including human and swine. Since the samples were collected from healthy animals, these sequences might be associated with non-pathogenic B. coli strains or with pathogenic strains in asymptomatic animals. Viral sequences were rare, comprising less than 1% of the total metagenomic sequences when compared to the SEED database (Additional File 1, Fig. S1). The low abundance of viral sequences retrieved from the swine fecal metagenomes is consistent with viral proportions retrieved in termite, chicken, and cattle gastrointestinal metagenomes, and may be a direct result of limited representation of viral genetic information in currently available databases .
A closer look at the taxonomic distribution of the numerically abundant bacterial orders derived from the swine metagenomes revealed that Clostridiales, unclassified Firmicutes, Bacteroidales, Spirochaetales, unclassified gammaproteobacteria, and Lactobacillales were the top six most abundant bacterial groups (Additional File 1, Fig. S2). At the genus-level taxonomic resolution, Prevotella species were the most abundant, comprising 19-22% of 16S rRNA gene sequences within both swine fecal metagenomes (Additional File 1, Fig. S3). Of the classified Clostridiales, Sporobacter was the next most abundant genus within both the swine fecal metagenomic datasets. Anaerovibro, Clostridium, and Streptococcus genera encompassed at least 5% of rRNA gene sequences in either the GS20 or FLX datasets.
Comparative gut metagenomics using 16S rRNA gene sequences
Diversity of swine gut microbiome
Diversity analyses of the gut microbiomes using 16S rRNA gene sequences
Pig Feces GS20
Pig Feces FLX
Pig Feces Total
Human Infant Total
Human Adult Total
Functional classification of the swine gut metagenome
To predict the metabolic potential within the swine fecal microbiome, both the MG-RAST and the IMG/M-ER annotation pipelines were used. The broad functional classifications of the swine fecal metagenomic reads were expected from previous metagenomic studies of the chicken cecum, cow rumen, human distal gut, and the termite gut. Similar proportions of broad level SEED subsystem classification were retrieved for both the GS20 and FLX swine fecal metagenomes (Additional File 1, Fig. S6). However, only 10% of sequences retrieved from the GS20 pig fecal metagenome were assigned to 574 subsystems, while more than 25% of all FLX reads were classified into 714 subsystems. This is compatible with the longer reads produced by the latter instrument, which allows for more robust gene predictions. When both pig fecal metagenomes were annotated using proxygenes within the JGI IMG/M ER pipeline, nearly one third of all GS20 and FLX pig fecal metagenomes were assigned to Pfams, and over 20% were assigned to COGs. This finding suggests that the proxygene method for gene-centric approaches to metagenomic studies is more robust than the direct BLASTx assignment strategy. Diversity analyses of Subsystems, COGs, and Pfams retrieved from swine metagenomes and other gut metagenomes tested in this study, revealed that larger sequencing efforts generate significantly more functional classes (Additional File 2, Tables S4 & S5). For example, an additional 150 Subsystems, 896 COGs, and 1271 Pfams were retrieved from the FLX run as compared to the GS20 metagenome, suggesting additional sequencing efforts for all gut microbiomes are necessary to cover the high functional diversity in gut environments.
Carbohydrate metabolism was the most abundant SEED subsystem (MG-RAST annotation pipeline) representing 13% of both swine fecal metagenomes (Additional File 1, Fig. S6). Genes associated with cell wall and capsule, stress, and virulence were also very abundant in both metagenomes. Approximately 16% of annotated reads from swine fecal metagenomes were categorized within the clustering-based subsystems, most of which have unknown or putative functions. Additionally, 75% to 90% of metagenomic reads were not assigned to subsystems, suggesting the need for improved binning and coding region prediction algorithms to annotate these unknown sequences.
To improve the meaning of metagenomic functional analysis, we applied statistical methods to compare the 29 broad level functional subsystems that are more or less represented in the different microbiomes. As was expected, all gut metagenomes were dominated by carbohydrate metabolism subsystems with amino acid, protein, cell wall and capsule, and virulence subsystems represented in relatively high abundance as well. Protein metabolism and amino acid subsystems were significantly more abundant in chicken, pig, and cow gut metagenomes (Additional File 1, Fig. S7). Additionally, the termite, fish, and pig gut had a higher proportion of reads classified to the chemotaxis and motility subsystems as compared to other gut metagenomes.
Comparative gut metagenomics
In this study, we examined the functional similarity of the Yorkshire pig fecal metagenome by comparing it to currently available metagenomic projects. Hierarchical clustering of functional profiles derived from gut metagenomes available in the MG-RAST database revealed that the GS20 and FLX swine fecal datasets shared approximately 70% similarity to other human metagenomes (Figure 4B). This analysis also showed the swine gut metagenome clustered more closely with chicken cecal and cow rumen metagenomes than to the human gut metagenomes (Figure 4B).
List of cell wall and capsule SEED subsystem functions overabundant in swine fecal metagenome
putative glycosyltransferase - possibly involved in cell wall localization and side chain formation of rhamnose-glucose polysaccharide
Phosphoglucosamine mutase (EC 22.214.171.124)
COG3178: Predicted phosphotransferase related to Ser/Thr protein kinases
3-deoxy-D-manno-octulosonate 8-phosphate phosphatase (EC 126.96.36.199)
O-antigen export system, permease protein
Glutamine synthetase, clostridia type (EC 188.8.131.52)
D-glycero-D-manno-heptose 1-phosphate guanosyltransferase
UDP-glucose 4-epimerase (EC 184.108.40.206)
Capsular polysaccharide synthesis enzyme Cap8D
D-alanine--D-alanine ligase B (EC 220.127.116.11)
PTS system, N-acetylglucosamine-specific IIB component (EC 18.104.22.168)
Mannose-1-phosphate guanylyltransferase (GDP) (EC 22.214.171.124)
2-Keto-3-deoxy-D-manno-octulosonate-8-phosphate synthase (EC 126.96.36.199)
capsular polysaccharide biosynthesis protein, putative
Capsular polysaccharide synthesis enzyme Cap8L
Summary of BLASTX results of pig fecal assembled contigs
Number of Reads
tetracycline resistant protein TetQ
Bacteroides sp. D1
tetracycline resistance protein
Faecalibacterium prausnitzii A2-165
Bacteroides finegoldii DSM 17565
Faecalibacterium prausnitzii A2-165
ABC transporter, ATP-binding protein
Bacillus thuringiensis serovar pondicheriensis BGSC 4BA1
hypothetical protein PRABACTJOHN 03572
Parabacteroides johnsonii DSM 18315
Faecalibacterium prausnitzii A2-165
Prevotella tannerae ATCC 51259
hypothetical protein COPEUT 02459
Coprococcus eutactus ATCC 27759
conserved hypothetical protein
Bacteroides sp. 4 3 47FAA
Bacteroides thetaiotaomicron VPI-5482
resolvase, N domain protein
Faecalibacterium prausnitzii A2-165
replication initiator protein A
Faecalibacterium prausnitzii A2-165
Bacteroides fragilis 3 1 12
hypothetical protein CLOSS21 01510
Clostridium sp. SS2/1
hypothetical protein BACCOP 00975
Bacteroides coprocola DSM 17136
conserved hypothetical protein
Oxalobacter formigenes HOxBLS
Sphingobacterium spiritivorum ATCC 33300
Bacteroides sp. 2 1 7
hypothetical protein BACCOP 00975
Bacteroides coprocola DSM 17136
conserved hypothetical protein
Magnetospirillum gryphiswaldense MSR-1
Cyanothece sp. PCC 8802
Bacteroides sp. 2 1 7
hypothetical protein ALIPUT 01364
Alistipes putredinis DSM 17216
Lentisphaera araneosa HTCC2155
hypothetical protein CLONEX 03424
Clostridium nexile DSM 1787
No significant similarity found
No significant similarity found
No significant similarity found
No significant similarity found
Interestingly, a majority of these transposable elements belonged to the Bacteroidetes genomes. These genetic elements have been shown to aid in the adaptation of this diverse group of bacteria to the distal gut environments . Many of the genetic features unique to the swine fecal metagenome encoded cell surface features of different Bacteroidetes populations, suggesting the adaptation of Bacteroidetes populations to distinct niches within the swine distal gut microbiome. While the precise role of diet, antibiotic usage, and genetics on shaping the ecology of the distal pig gut will require further study, it should be noted that industrialization of the swine industry has lead to the frequent use antibiotics to supplement the pig diet to maintain and increase meat production.
Studying the swine distal gut metagenome also shed light on the diversity and high occurrence of antibiotic resistance mechanisms employed by the microbiome (Additional File 1, Fig. S11). Antibiotics are widely used as additives in food or water within swine feeding operations to prevent and treat animal disease and to promote animal growth . Seepage and runoff of swine waste into both surface and groundwater with antibiotics and antibiotic-resistant bacteria poses a significant threat to public health. Nearly 6% of all assigned metagenomic reads retrieved from both swine fecal metagenomes were involved in antibiotic resistance mechanisms. Interestingly, tetracycline resistance was the most abundant class of virulence subsystems within the swine fecal metagenome, which may be explained by the fact that this antibiotic class was used in the diet supplied to the animals associated with this study. This antibiotic class is reported as comprising nearly half of the total amount of antibiotics used in commercial swine operations .
Resistance to fluoroquinolones was also well represented in the swine fecal metagenome, and may be explained by the increase of its non-therapeutic use within pig feed. While early studies indicated there was a low risk of fluoroquinolone resistance, recent studies are showing the use of fluoroquinolones is among the most important factors associated with finding resistant E. coli and Campylobacter in animal operations . Interestingly, there was no history of fluoroquinolone use on the swine farm from which these samples were collected. Fluoroquinolone resistance has been found on farms with no history of fluoroquinolone use, suggesting that resistant organisms, such as Campylobacter have the ability to spread between pig farms. Genes with high sequence similarity to methicillin-resistant Staphylococcus subsystem were also retrieved in this study. This finding is important considering MRSA carriage has been elevated in swine and exposed farmers and veterinarians , suggesting that MRSA infection is a significant risk in swine farm resident and worker cohorts.
More than 12% of virulence subsystems identified in the pig fecal metagenome were classified as multi-drug resistance mechanisms, suggesting the pig gut could be a hot-spot for multiple-antibiotic resistant bacteria. One subsystem, the MexA-MexB-OprM multiple drug efflux pump was found exclusively in the swine fecal metagenome. This antibiotic resistance mechanism has been detected only in Pseudomonas aeruginosa strains known to carry resistance in cystic fibrosis patients  and has not been previously described in distal gut environments. Additionally, more than 10% of virulence-associated sequences were assigned to yet-to-be-described virulence subsystems, suggesting that unknown virulence mechanisms are at work within the distal gut. Altogether, the high abundance of metagenomic sequences assigned to known and unknown antibiotic resistance subsystems suggests that functional metagenomics is an adequate tool for assessing the prevalence of antibiotic resistance within high cell density environments.
Comparative metagenomics of proteins involved in the cell wall and capsule subsystems revealed several unique glycosyl transferases and carbohydrate uptake systems. This unique pool of glycosyl transferases may provide a capacity for diversification of surface polysaccharide structures helping shape the genetic functional potential of this gut ecosystem. For example, the acquisition of new types of carbohydrate-binding proteins, transporters, and degradation enzymes through horizontal gene transfer may allow for the utilization of a wider array of substrates that may be utilized for energy harvesting . Pfams and COGs related to virulence factors such as adhesions were numerous within the gene families unique to the swine fecal metagenomes (Additional File 2, Table S6). Proteins involved in carbohydrate transport and attachment were both abundant and unique to the distal swine gut with more than 50 metagenomic contigs sharing high sequence homology to putative carbohydrate membrane transporters. Other proteins involved in carbohydrate metabolism were unique to the swine metagenome including glycosyl hydrolases, cellobiohydrolases, gluconolactonases, maltodextrin metabolism, and pectin lyases. The identification of unique gene families provides one line of evidence that the variable microbiome is a result of the microbial interaction with its surrounding environment. Because the environment surrounding gut microbes can vary among host species, a direct result of this level of functional diversity may be the generation of swine-specific microbiomes. Many proteins of unknown functions were also unique to the swine fecal metagenome, suggesting that some of them may be engaged in novel functions that have important biological meaning.
The high functional similarity between the pig and human metagenome is not surprising in light of the fact that they are mammalian omnivores with similar digestive tract structures and functions. Results from 16S rRNA gene sequence analyses suggest that bacterial gut communities are similar among omnivorous mammals . Similarities at the phylogenetic level between pig and human guts include the large presence of Firmicutes and members of the Bacteroidetes as the most abundant Gram-negative bacteria in their gastrointestinal tracts . While differences in the relative abundance of Lactobacilli phylotypes have been noted, our data provides for the first time a functional perspective on how similar pigs and humans gut systems in spite of the differences in microbial community structure. In contrast, the functional similarities shared between the swine fecal metagenome and the termite gut was surprising and suggestive of previously unknown shared metabolic capabilities between these gut environments. For example, the pig and termite were the only two hosts possessing a suite of functions involved in archaeal lipid biosynthesis (Additional File 2, Fig. S13), suggesting an intimate relationship between the swine and archaeal gut populations . Swine-specific methanogenic populations have been demonstrated in previous studies [17, 27]. Similarities in cell wall and capsule profiles between the swine samples and termite gut may indicate that these functions can endow the swine gut with diversification of surface polysaccharide structures, allowing the host immune system to accommodate a diverse microbiota . Presence of novel carbohydrate binding proteins and transporters also suggest the swine gut is capable of exploiting a diverse array of substrates.
Similarities in functional gene profiles (SEED subsystem abundance) among swine, chicken cecal and cow rumen metagenomes as compared to human gut metagenomes were unexpected considering the similarity shared between pig and human gut anatomy and physiology. These results suggest that that some microbial functions within the swine gut are shared among other agricultural animals, with arguably very different gastrointestinal anatomy and physiologies. For example, the elevated abundance of genes associated with protein turnover in pigs, chicken, and cow gut metagenomes is consistent with an increased use of amino acids for protein accretion in food production animals and is also consistent with the high protein diet fed to the pigs in this study. Additionally, the high abundance and diversity of carbohydrate utilization subsystems found in this swine metagenome may be a result of the high level of complex polysaccharides found in the diet. Altogether these data suggest that agricultural animal husbandry practices can impose significant selective pressures on the gut microbiota, regardless of gut type.
Surprisingly, this pig fecal metagenome revealed the presence of motile Treponema and Anaerovibrio genera. The presence of sequences associated with Treponema in this study (i.e., 3-4% of all sequences swine fecal metagenome) suggests an order of magnitude higher abundance than a previous study in which swine gut microbiota revealed a very low abundance of Spirochetes using a culture independent method (i.e., 0.3% of all phylotypes) . This genus has been previously detected in swine colonic samples but their presence in elevated levels is normally associated with swine dysentery. Discrepancies in community composition between cloning-based methods and non-cloning based methods have been reported in the literature, primarily attributed to PCR amplification biases [28, 29]. While many mammalian gut microbial communities are dominated by non-motile microbes, the termite hindgut and the fish gut harbor motile populations of bacteria, which are known to possess complex social behaviors [12, 30, 31]. This study revealed the pig gut may harbor previously unknown social dynamics, which may be relevant for maintaining compartmentalization and promoting niche selection within monogastric systems.
Herein, we report the first shotgun metagenomic pyrosequencing approach to study the microbiome of the swine distal gut. The overall goal of this study was to characterize the swine fecal microbiome with respect to species composition and functional content. Comparative metagenomic analyses identified unique and/or overabundant taxonomic and functional elements within swine distal gut microbiomes. These genetic attributes may help us better understand the microbial genetic factors that are relevant to swine health. Genes associated with the variable portion of gut microbiomes clustered by host environment with surprising hierarchical trends, suggesting that the variable microbiome content of a given host species may be reflective of the host ecology. While a larger metagenomic database that includes information on intra-host variation is needed for swine and other gut systems, this study provides a baseline for understanding the complexity of the swine gut microbial ecology, while also highlighting striking similarities and differences when compared to other animal gastrointestinal environments.
Fecal Sample Collection
Fecal samples were collected from eight, six-month old Yorkshire pigs from a large swine operation located in Northeastern Ohio, which housed more than 1,000 head of swine at the time of collection. Swine were weaned eight weeks after birth. Their diets consisted of a high-energy corn-soybean meal diet containing 14.00% crude protein, 0.63% lysine, 3.00% crude fat, 4.00% crude fiber, 0.55%- 0.70% calcium, 0.52% phosphorus, 0.35%-0.50% salt, 0.3 ppm selenium, 80 ppm zinc. (Kalmbach Feeds, OH). In addition, swine were supplemented with feed grade antibiotics for improvement in growth performance. Antibiotics consisted of chlortetracycline and penicillin at the concentration of 20 g per ton of feed. Fecal samples were transported to the laboratory on ice within four hours of collection, and stored at -20°C until further processing. Fecal DNA was extracted with the FastDNA SPIN Kit (MP Biomedicals, Inc., Solon, OH) according to the manufacturer's instructions using 0.25 g of each fecal sample. Total DNA was quantified using a NanoDrop® ND-1000 UV spectrophotometer (NanoDrop Technologies, Wilmington, DE).
Pyrosequencing and Gene Annotation
A total of 24 μg (3 μg of each fecal DNA extract, n = 8) were pooled and sent for pyrosequencing to 454 Life Sciences, where two different sequencing runs were performed. The first run was performed using Genome Sequencer GS20 platform while the Genome Sequencer FLX instrument was used for the second run. Each pig fecal metagenomic sequencing run was assembled de novo using the Newbler assembly software by 454 Life Sciences. The metagenomes used in this paper are freely available from the SEED, JGI's IMG/M, and NCBI Short Read Archive. The NCBI genome project ID and GOLD ID for swine fecal GS20 and FLX metagenomic sequencing runs generated in this project are 39267 and Gm00197, respectively.
Raw sequencing reads from both datasets were submitted to the Joint Genome Institute's IMG/M-ER annotation pipeline using the proxygene method for gene annotation [4, 32]. Additionally, both metagenome runs were annotated using the "Phylogenetic Analysis" tool within the MG-RAST pipeline . The BLASTn algorithm (e-value less than1 × 10-5 and a sequence match length greater than 50 nucleotides) was used to identify small subunit rRNA genes from RDP , SILVA SSU , and Greengenes databases . Within the MG-RAST pipeline, the "Metabolic Analysis" tool was used to search sequences from pig fecal metagenomes against the SEED database using the BLASTx algorithm (e-value less than 1×10-5 and a sequence match length greater than 30 nucleotides) .
Comparative Metagenomics and Statistical Analyses
Comparative metagenomics was performed using both the IMG/M and MG-RAST pipelines. GS20 and FLX pig metagenomic runs were compared to the current publicly available gut metagenomes within each of these databases. Within the IMG/M pipeline, the two pig metagenomic runs were compared against three lean mouse (Mus musculus strain C57BL/6J) cecal metagenomes (Metagenome names: Mouse Gut Community lean1-3), two healthy human fecal metagenomes (Metagenome names: Human Gut Community Subject 7-8), and one termite (Nasutitermes sp) hindgut metagenome (Metagenome name: Termite hindgut). Descriptive information about these mouse, human and termite metagenomes can be found in the GOLD database under Gm00071, Gm00052, Gm00013 GOLD IDs, respectively. Within IMG/M the "Compare Genomes" tool was chosen to extract COG and Pfam protein profiles from the swine, mouse, human, and termite gut microbiomes. These profiles were then normalized for sequencing coverage by calculating the percent distribution, prior to downstream statistical analysis. To find over-abundant or unique functions to a given metagenomic dataset, a two-way hierarchical clustering of normalized COG and Pfam abundances was performed using the Bioinformatics Toolbox with Matlab version 2009a. Additionally, to determine if unique or overabundant functions were statistically meaningful, the binomial test within the Shotgun FunctionalizeR program was employed .
The GS20 and FLX pig fecal datasets were also compared against gut metagenomes available within the MG-RAST metagenomic annotation pipeline. The two pig fecal metagnonomic datasets were compared against the following MG-RAST metagenomic projects: cow rumen (Cow Rumen Project: 444168.3), chicken cecum (FS-CAP Project:4440285.3), human infant subjects In-A, In-B, In-D, In-E, In-M and In-R (Human Faeces Projects: 4440946.3, 4440945.3, 4440948.3, 4440950.3, 4440949.3, 4440951.3), human adult subjects F1-S, F1-T, F1-U, F2-V, F2-W, F2-X, and F2-Y (Human Faeces Projects: 4440939.9, 4440941.3, 4440940.3, 4440942.3, 4440943.3, 4440944.3, and 4440947.3), healthy fish gut (Fish Gut Project: 4441695.3), and lean mouse cecum (Human Faeces Project: 4440463.3). Within MG-RAST, phylogenetic information was extracted from these gut metagenomes using RDP , SILVA SSU , and Greengenes databases (e-value less than 1 × 10-5 and a sequence match length greater than 50 nucleotides). These taxonomic profiles were then normalized for differences in sequencing coverage by calculating percent distribution, prior to downstream statistical analysis. A non-parametric Wilcoxon exact test was used to statistically compare the taxonomic composition in any two metagenomes.
Additionally, within MG-RAST, the functional annotations (hits to SEED Subsystems) were extracted (e-value less than 1 × 10-5 and a sequence match length greater than 50 nucleotides) to compare functional attributes across these gut metagenomes. In order to identify statistically significant and biologically meaningful differences between the swine gut and other endiobiotic microbiomes, we employed the two-way Fisher's exact test with a Benjamin-Hochberg FDR multiple test correction within STAMP v1.0.2 .
Observed richness, Chao1 estimator, abundance-based coverage estimator (ACE), jackknife estimator, and bootstrap estimator were used to evaluate community richness. Community diversity was described using Shannon, non-parametric Shannon, and Simpson indices within Mothur v 1.5.0 . Sampling coverage was calculated using Good's coverage for the given operational taxonomic unit (OTU) definition, while the Boneh estimate was used to calculate the number of additional OTUs that would be observed for an additional 500 SSU reads. The aforementioned rRNA diversity indices and rarefaction curves were calculated using Mothur v 1.5.0 program with default parameters  and calculations for each index can found in the Mothur manual (http://www.mothur.org/wiki/Mothur_manual). Functional diversity was assessed using SEED Subsystems , COG, and Pfam abundances from all available gut metagenomes. Diversity estimators used included Shannon-Weiner, Simpson's lambda, and Pielou's evenness analyses for measuring species richness and evenness. Functional diversity estimates, K- dominance plots, Principal Components Analysis, and clustering were performed using the PRIMER-E ecological software package .
The U.S. Environmental Protection Agency, through its Office of Research and Development, funded and managed, or partially funded and collaborated in, the research described herein. It has been subjected to the Agency's administrative review and has been approved for external publication. Any opinions expressed in this paper are those of the author(s) and do not necessarily reflect the views of the Agency, therefore, no official endorsement should be inferred. Any mention of trade names or commercial products does not constitute endorsement or recommendation for use. This work was also partly funded by the United States Environmental Protection Agency Traineeship and National Science Foundation grant to DBO.
- Ley RE, Peterson DA, Gordon JI: Ecological and evolutionary forces shaping microbial diversity in the human intestine. Cell. 2006, 124: 837-848. 10.1016/j.cell.2006.02.017.PubMedView ArticleGoogle Scholar
- Ley RE, Hamady M, Lozupone C, Turnbaugh PJ, Ramey RR, Bircher JS, Schlegel ML, Tucker TA, Schrenzel MD, Knight R, Gordon JI: Evolution of mammals and their gut microbes. Science. 2008, 320: 1647-1651. 10.1126/science.1155725.PubMedPubMed CentralView ArticleGoogle Scholar
- Hugenholtz P, Tyson GW: Microbiology metagenomics. Nature. 2008, 455: 481-483. 10.1038/455481a.PubMedView ArticleGoogle Scholar
- Markowitz VM, Ivanova N, Szeto E, Palaniappan K, Chu K, Dalevi D, Chen IM, Grechkin Y, Dubchak I, Anderson I, Lykidis A, Mavromatis K, Hugenholtz P, Kyrpides NC: IMG/M: a data management and analysis system for metagenomes. Nucleic Acids Res. 2008, 36: D534-D538.PubMedPubMed CentralView ArticleGoogle Scholar
- Kurokawa K, Itoh T, Kuwahara T, Oshima K, Toh H, Toyoda A, Takami H, Morita H, Sharma VK, Srivastava TP, Taylor TD, Noguchi H, Mori H, Ogura Y, Ehrlich DS, Itoh K, Takagi T, Sakaki Y, Hayashi T, Hattori M: Comparative metagenomics revealed commonly enriched gene sets in human gut microbiomes. DNA Res. 2007, 14: 169-181. 10.1093/dnares/dsm018.PubMedPubMed CentralView ArticleGoogle Scholar
- Brulc JM, Antonopoulos DA, Miller ME, Wilson MK, Yannarell AC, Dinsdale EA, Edwards RE, Frank ED, Emerson JB, Wacklin P, Coutinho PM, Henrissat B, Nelson KE, White BA: Gene-centric metagenomics of the fiber-adherent bovine rumen microbiome reveals forage specific glycoside hydrolases. Proc Natl Acad Sci. 2009, 106: 1948-1953. 10.1073/pnas.0806191105.PubMedPubMed CentralView ArticleGoogle Scholar
- Dinsdale EA, Edwards RA, Hall D, Angly F, Breitbart M, Brulc JM, Furlan M, Desnues C, Haynes M, Li L, McDaniel L, Moran MA, Nelson KE, Nilsson C, Olson R, Paul J, Brito BR, Ruan Y, Swan BK, Stevens R, Valentine DL, Thurber RV, Wegley L, White BA, Rohwer F: Functional metagenomic profiling of nine biomes. Nature. 2008, 452: 629-632. 10.1038/nature06810.PubMedView ArticleGoogle Scholar
- Qu A, Brulc JM, Wilson MK, Law BF, Theoret JR, Joens LA, Konkel ME, Angly F, Dinsdale EA, Edwards RA, Nelson KE, White BA: Comparative metagenomics reveals host specific metavirulomes and horizontal gene transfer elements in the chicken cecum microbiome. PLoS ONE. 2008, 3: e2945-10.1371/journal.pone.0002945.PubMedPubMed CentralView ArticleGoogle Scholar
- Tringe SG, von Mering C, Kobayashi A, Salamov AA, Chen K, Chang HW, Podar M, Short JM, Mathur EJ, Detter JC, Bork P, Hugenholtz P, Rubin EM: Comparative metagenomics of microbial communities. Science. 2005, 308: 554-557. 10.1126/science.1107851.PubMedView ArticleGoogle Scholar
- Turnbaugh PJ, Ley R, Mahowald M, Magrini V, Mardis E, Gordon J: An obesity-associated gut microbiome with increased capacity for energy harvest. Nature. 2006, 444: 1027-1031. 10.1038/nature05414.PubMedView ArticleGoogle Scholar
- Turnbaugh PJ, Ley RE, Hamady M, Fraser-Liggett CM,R, Knight R, Gordon JI: The Human Microbiome Project. Nature. 2007, 449: 804-810. 10.1038/nature06244.PubMedPubMed CentralView ArticleGoogle Scholar
- Warnecke F, Luginbuhl P, Ivanova N, Ghassemian M, Richardson TH, Stege JT, Cayouette M, McHardy AC, Djordjevic G, Aboushadi N, Sorek R, Tringe SG, Podar M, Martin HG, Kunin V, Dalevi D, Madejska J, Kirton E, Platt D, Szeto E, Salamov A, Barry K, Mikhailova N, Kyrpides NC, Matson EG, Ottesen EA, Zhang XN, Hernandez M, Murillo C, Acosta LG: Metagenomic and functional analysis of hindgut microbiota of a wood-feeding higher termite. Nature. 2007, 450: 560-565. 10.1038/nature06269.PubMedView ArticleGoogle Scholar
- Castillo M, Skene G, Roca M, Anguita M, Badiola I, Duncan SH, Flint HJ, Martín-Orúe SM: Application of 16S rRNA gene-targetted fluorescence in situ hybridization and restriction fragment length polymorphism to study porcine microbiota along the gastrointestinal tract in response to different sources of dietary fibre. FEMS Microbiol Ecol. 2007, 59: 138-146. 10.1111/j.1574-6941.2006.00204.x.PubMedView ArticleGoogle Scholar
- Leser TD, Amenuvor JZ, Jensen TK, Lindecrona RH, Boye M, Moller K: Culture-independent analysis of gut bacteria: the pig gastrointestinal tract microbiota revisited. Appl Environ Microbiol. 2002, 68: 673-690. 10.1128/AEM.68.2.673-690.2002.PubMedPubMed CentralView ArticleGoogle Scholar
- Lin C, Markowitz LVM, Mavromatis K, Ivanova NN, Chen IM, Chu K, Kyrpides NC: IMG ER: a system for microbial genome annotation expert review and curation. Bioinformatics. 2009, 25: 2271-2278. 10.1093/bioinformatics/btp393.View ArticleGoogle Scholar
- Snell-Castro R, Godon JJ, Delgenès JP, Dabert P: Characterisation of the microbial diversity in a pig manure storage pit using small subunit rDNA sequence analysis. FEMS Microbiol Ecol. 2005, 52: 229-242. 10.1016/j.femsec.2004.11.016.PubMedView ArticleGoogle Scholar
- Lamendella R, Santo Domingo JW, Yannarell AC, Ghosh S, Di Giovanni G, Mackie RI, Oerther DB: Evaluation of swine-specific PCR assays used for fecal source tracking and analysis of molecular diversity of swine-specific "Bacteroidales" populations. Appl Environ Microbiol. 2009, 75: 5787-5796. 10.1128/AEM.00448-09.PubMedPubMed CentralView ArticleGoogle Scholar
- Jensen BB: Methanogenesis in monogastric animals. Environ Monit Assess. 1996, 42: 99-112. 10.1007/BF00394044.PubMedView ArticleGoogle Scholar
- Fangman TJ, Hardin LE, Grellner G, Carlson MS, Zulovich JM, Coleman JL: Performance and disease status of pigs grown in a wean-to-finish facility compared to pigs grown in a conventional nursery and grower-finisher facility. J Swine Health Prod. 2001, 9: 71-76.Google Scholar
- USDA National Animal Health Monitoring System: Part I. Reference of Swine Health and Management in the United States. 2001, United StatesGoogle Scholar
- Taylor NM, Clifton-Hadley FA, Wales AD, Ridley A, Davies RH: Farm-level risk factors for fluoroquinolone resistance in E. coli and thermophilic Campylobacter spp. on finisher pig farms. Epidemiol Infect. 2009, 137: 1121-1134. 10.1017/S0950268808001854.PubMedView ArticleGoogle Scholar
- van den Broek, van Cleef BA, Haenen A, Broens EM, van der Wolf PJ, van den Broek MJ, Huijsdens XW, Kluytmans JA, Van de Giessen AW, Tiemersma EW: Methicillin-resistant Staphylococcus aureus in people living and working in pig farms. Epidemiol Infect. 2009, 137: 700-708. 10.1017/S0950268808001507.View ArticleGoogle Scholar
- Li XZ, Nikaido H, Poole K: Role of mexA-mexB-oprM in antibiotic efflux in Pseudomonas aeruginosa. Antimicrob Agents Chemother. 1995, 39: 1948-1953.PubMedPubMed CentralView ArticleGoogle Scholar
- Boneca IG: The role of peptidoglycan in pathogenesis. Curr Opinion Microbiol. 2005, 8: 46-53. 10.1016/j.mib.2004.12.008.View ArticleGoogle Scholar
- Lindemann MD: Supplemental folic acid: a requirement for optimizing swine reproduction. Journal of Animal Science. 1993, 71: 239-246.PubMedGoogle Scholar
- Ufnar JA, Ufnar DF, Wang SY, Ellender RD: Development of a swine-specific fecal pollution marker based on host differences in methanogen mcrA genes. Appl Environ Microbiol. 2007, 73: 5209-17. 10.1128/AEM.00319-07.PubMedPubMed CentralView ArticleGoogle Scholar
- Boucher Y, Kamekura M, Doolittle WF: Origins and evolution of isoprenoid lipid biosynthesis in archaea. Mol Microbiol. 2004, 52: 515-527. 10.1111/j.1365-2958.2004.03992.x.PubMedView ArticleGoogle Scholar
- Webster G, Newberry CJ, Fry JC, Weightman AJ: Assessment of bacterial community structure in the deep sub-seafloor biosphere by 16S rDNA-based techniques: a cautionary tale. J Micro Methods. 2003, 55: 155-164. 10.1016/S0167-7012(03)00140-4.View ArticleGoogle Scholar
- Wilson KH, Wilson WJ, Radosevich JL, DeSantis TZ, Viswanathan VS, Kuczmarski TA, Andersen GL: High-density microarray of small-subunit ribosomal DNA probes. Appl Environ Microbiol. 2002, 68: 2535-2541. 10.1128/AEM.68.5.2535-2541.2002.PubMedPubMed CentralView ArticleGoogle Scholar
- Ingham CJ, Ben-Jacob E: Swarming and complex pattern formation in Paenibacillus vortex studied by imagine and tracking cells. BMC Microbiology. 2008, 8: 36-10.1186/1471-2180-8-36.PubMedPubMed CentralView ArticleGoogle Scholar
- Kirchhof G, Eckert BBM, Stoffels MJI, Baldani JI VM, Reis VM, Hartmann A: Herbaspirillum frisingense sp. nov., a new nitrogen-fixing bacterial species that occurs in C4-fibre plants. Int J Syst Evol Microbiol. 2001, 51: 157-168.PubMedView ArticleGoogle Scholar
- Dalevi D, Ivanova NN, Mavromatis K, Hooper S, Szeto E, Hugenholtz P, Kyrpides NC, Markowitz VM: Annotation of metagenome short reads using proxygenes. Bioinformatics. 2008, 24: i7-13. 10.1093/bioinformatics/btn276.PubMedView ArticleGoogle Scholar
- Meyer F, Paarmann D, D'Souza M, Olson R, Glass EM, Kubal M, Paczian T, Rodriguez A, Stevens R, Wilke A, Wilkening J, Edwards RA: The Metagenomics RAST server - A public resource for the automatic phylogenetic and functional analysis of metagenomes. BMC Bioinformatics. 2008, 9: 386-10.1186/1471-2105-9-386.PubMedPubMed CentralView ArticleGoogle Scholar
- Cole JR, Chai B, Farris RJ, Wang Q, Kulam-Syed-Mohidee AS, McGarrell DM, Bandela AM, Cardenas E, Garrity GM, Tiedje JM: The ribosomal database project (RDP-II): introducing myRDP space and quality controlled public data. Nucleic Acids Res. 2007, 35: 169-172.View ArticleGoogle Scholar
- Pruess E, Quast C, Knittel K, Fuchs B, Ludwig W, Peplies J, Glöckner FO: SILVA: a comprehensive online resource for quality checked and aligned ribosomal RNA sequence data compatible with ARB. Nuc Acids Res. 2007, 35: 7188-7196. 10.1093/nar/gkm864.View ArticleGoogle Scholar
- DeSantis TZ, Hugenholtz P, Larsen N, Rojas M, Brodie EL, Keller K, Huber T, Dalevi D, Hu P, Andersen GL: Greengenes, a chimera-checked 16S rRNA gene database and workbench compatible with ARB. Appl Environ Microbiol. 2006, 72: 5069-5072. 10.1128/AEM.03006-05.PubMedPubMed CentralView ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. J Mol Biol. 1990, 215: 403-410.PubMedView ArticleGoogle Scholar
- Kristiansson E, Hugenholtz P, Dalevi D: ShotgunFunctionalizeR: An R-package for functional analysis of metagenomic data. Bioinformatics. 2009, 25: 2737-2738. 10.1093/bioinformatics/btp508.PubMedView ArticleGoogle Scholar
- Parks DH, Beiko RG: Identifying biologically relevant differences between metagenomic communities. Bioinformatics. 2010, 26: 715-721. 10.1093/bioinformatics/btq041.PubMedView ArticleGoogle Scholar
- Schloss PD, Westcott SL, Ryabin T, Hall JR, Hartmann M, Hollister EB, Lesniewski RA, Oakley BB, Parks DH, Robinson CJ, Sahl JW, Stres B, Thallinger GG, Van Horn DJ, Weber CF: Introducing mothur: open source, platform-independent, community-supported software for describing and comparing microbial communities. Appl Environ Microbiol. 2009, 75: 7537-41. 10.1128/AEM.01541-09.PubMedPubMed CentralView ArticleGoogle Scholar
- Overbeek R, Begley T, Butler RM, Choudhuri JV, Chuang HY, Cohoon M, de Crécy-Lagard V, Diaz N, Disz T, Edward R, Fonstein M, Frank ED, Gerdes S, Glass EM, Goesmann A, Hanson A, Iwata-Reuyl D, Jensen R, Jamshidi N, Krause L, Kubal M, Larsen N, Linke B, McHardy AC, Meyer F, Neuweger H, Olsen G, Olson R, Osterman A, Portnoy V, Pusch GD, Rodionov DA, Rückert C, Steiner J, Stevens R, Thiele I, Vassieva O, Ye Y, Zagnitko O, Vonstein V: The subsystems approach to genome annotation and its use in the project to annotate 1000 genomes. Nucleic Acids Res. 2005, 33: 5691-702. 10.1093/nar/gki866.PubMedPubMed CentralView ArticleGoogle Scholar
- Clarke KR, Gorley RN: PRIMER-E. 2001, PRIMER-E Ltd, Plymout, UKGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.