Analysis and comparison of the pan-genomic properties of sixteen well-characterized bacterial genera

Trost, Brett; Haakensen, Monique; Pittet, Vanessa; Ziola, Barry; Kusalik, Anthony

doi:10.1186/1471-2180-10-258

Table 3 Results of protein content cohesiveness experiments

From: Analysis and comparison of the pan-genomic properties of sixteen well-characterized bacterial genera

		Core proteomes				Unique proteomes
S	N _I	$N_{C}^{A}$	$N_{C}^{R}$	P _C	$N_{C}^{>}$	$N_{U}^{A}$	$N_{U}^{R}$	P _U	$N_{U}^{>}$
Bacillus anthracis	3	4941	2123	**	0/25	168	1	**	0/25
Bacillus cereus	4	2881	1840	**	0/25	2	0	-	0/25
Bacillus thuringiensis	2	4255	2864	**	5/25	4	7	n.s.	7/25
Brucella abortus	3	2699	2603	**	6/25	2	1	*	4/25
Brucella suis	2	3025	2760	**	2/24	5	4	n.s.	5/24
Burkholderia ambifaria	2	5609	3798	**	1/25	198	17	**	0/25
Burkholderia cenocepacia	3	5908	3352	**	0/25	168	0	**	0/25
Burkholderia mallei	4	3623	3086	**	1/25	18	0	-	0/25
Burkholderia pseudomallei	4	4972	3086	**	0/25	45	0	-	0/25
Clostridium botulinum	8	1514	763	**	0/25	10	0	-	0/25
Clostridium perfringens	3	2110	1085	**	0/25	298	0	**	0/25
Lactobacillus casei	2	2355	959	**	0/25	593	5	**	0/25
Lactobacillus delbrueckii	2	1372	959	**	0/25	222	5	**	0/25
Lactobacillus reuteri	2	1402	959	**	0/25	120	5	**	0/25
Mycobacterium bovis	2	3822	2577	**	1/25	36	38	n.s.	3/25
Mycobacterium tuberculosis	3	3724	2118	**	0/25	26	17	n.s.	3/25
Neisseria gonorrhoeae	2	1795	1560	**	0/8	229	3	**	0/8
Neisseria meningitidis	4	1547	1426	**	0/14	75	4	**	0/14

Column headings are: S, species; N_I, number of sequenced isolates of species S; $N_{C}^{A}$ , core proteome size of the sequenced isolates of S; $N_{C}^{R}$ , average core proteome size of the randomly-generated sets; P_C, probability that the average core proteome size of the randomly-generated sets is different than the core proteome size of the sequenced isolates of S; $N_{C}^{>}$ , fraction of random sets having a core proteome larger than S. $N_{U}^{A}$ , $N_{U}^{R}$ , P_Uand $N_{U}^{>}$ are analogous to $N_{C}^{A}$ , $N_{C}^{R}$ , P_C, and $N_{C}^{>}$ , respectively, and refer to the comparisons involving the number of proteins found in all sequenced isolates of S, but no other isolates from the same genus ("unique proteomes"). In some cases, all of the random sets corresponding to a particular species had zero unique proteins. No P-value could be computed for these because the standard deviation of these values was zero. In these situations, the P_Ucolumn contains a dash character (-). The averages in both column $N_{C}^{R}$ and column $N_{U}^{R}$ are rounded to the nearest whole number. For certain rows, column $N_{U}^{R}$ shows a value of 0; in some cases, this value is exact, while in other situations, it is due to rounding. If due to rounding, then the standard deviation of the random sets is non-zero, and column P_Ucontains a P-value. For columns P_Cand P_U, "n.s." means "not significant", a single asterisk indicates a P-value of less than 0.05, and a double asterisk indicates a P-value of less than 0.001. See Table 4 for the continuation of this table.

Back to article page