
Table 1 Counts of unique taxa identified in at least one sample of the Boston pilot 16S amplicon and shotgun metagenomics datasets. Percent overlaps from the shotgun perspective are reported in parentheses.

From: Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data

 

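Overall Taxa Counts

| Technology (tool) | Taxa | species | genus | family | order | class | phylum |
| --- | --- | --- | --- | --- | --- | --- | --- |
| Amplicon 16S | Bacteria | 143 | 328 | 173 | 83 | 52 | 18 |
| Shotgun (Kraken2 + Bracken) | All | 1630 | 523 | 201 | 89 | 37 | 20 |
| Shotgun (Kraken2 + Bracken) | Bacteria Only | 1516 | 500 | 186 | 81 | 33 | 16 |
| Shotgun (Kraken2 + Bracken) | Overlap | 75 (5%) | 197 (39%) | 117 (63%) | 46 (57%) | 23 (70%) | 12 (75%) |
| Shotgun (MetaPhlAn2) | All | 342 | 239 | 116 | 50 | 28 | 16 |
| Shotgun (MetaPhlAn2) | Bacteria Only | 322 | 211 | 102 | 44 | 23 | 12 |
| Shotgun (MetaPhlAn2) | Overlap | 61 (19%) | 128 (61%) | 84 (82%) | 36 (82%) | 17 (74%) | 9 (75%) |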
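
The percentages in parentheses can be checked with a few lines of arithmetic. The sketch below assumes that "from the shotgun perspective" means the overlap count divided by the shotgun bacteria-only count at the same rank, rounded to the nearest whole percent. That reading is inferred from the reported numbers and is not stated explicitly in the caption. The names `overlap_percent` and `overlap_table` are illustrative only and do not come from the source.

```python
# Counts copied from Table 1, ordered from species to phylum.
RANKS = ["species", "genus", "family", "order", "class", "phylum"]

# Shotgun "Bacteria Only" counts per tool (assumed denominator).
SHOTGUN_BACTERIA = {
    "Kraken2 + Bracken": [1516, 500, 186, 81, 33, 16],
    "MetaPhlAn2": [322, 211, 102, 44, 23, 12],
}

# Taxa shared between the 16S amplicon data and each shotgun tool.
OVERLAP = {
    "Kraken2 + Bracken": [75, 197, 117, 46, 23, 12],
    "MetaPhlAn2": [61, 128, 84, 36, 17, 9],
}


def overlap_percent(overlap: int, shotgun_total: int) -> int:
    """Overlap as a whole-number percentage of the shotgun count."""
    return round(100 * overlap / shotgun_total)


def overlap_table() -> dict:
    """Return {tool: {rank: percent}} for every tool and rank."""
    return {
        tool: {
            rank: overlap_percent(shared, total)
            for rank, shared, total in zip(
                RANKS, OVERLAP[tool], SHOTGUN_BACTERIA[tool]
            )
        }
        for tool in OVERLAP
    }


if __name__ == "__main__":
    for tool, row in overlap_table().items():
        print(tool, row)
```

Under this assumption the computed values agree with every percentage reported in the table, for example 75 of 1516 species (5%) for Kraken2 + Bracken and 61 of 322 species (19%) for MetaPhlAn2.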