Skip to main content

Table 1 The training data

From: A complete map of potential pathogenicity markers of avian influenza virus subtype H5 predicted from 11 expressed proteins

Protein

H5N1

Non-H5N1

Total

All Features

Significant Features

HP

LP

HP

LP

HP

LP

HA

1377

54

48

512

1425

566

616

82

NA

551

32

23

264

574

296

593

114

M1

161

9

13

52

174

61

329

16

M2

186

9

14

63

200

72

98

18

NS1

425

16

22

148

447

164

249

71

NS2

202

3

14

53

216

56

129

25

NP

294

12

22

113

316

125

511

22

PA

465

22

25

235

490

257

730

57

PB1

405

26

25

223

430

249

775

44

PB2

446

26

23

247

469

273

783

62

PB1-F2

135

16

15

114

150

130

101

40

  1. The HP and LP columns represent the number of highly pathogenic and low pathogenic sequences in each of the proteins, respectively. The ‘All features’ column is the total number of features (i.e. AA’s) from which significant features are selected with Monte Carlo Feature Selection