Fig. 5 | Biology Direct

Fig. 5

From: Profiling microbial strains in urban environments using metagenomic sequencing data

Normalised phylogenetic distance versus genomic-content distance across samples for six representative species of the MetaSub dataset. Each data point corresponds to a pair of strains of the same species found in different samples. The genomic distance is defined as the normalised Hamming distance between the binary presence-absence vectors reported by PanPhlAn. The phylogenetic distance is defined as the branch-length distance between the two leaves in the StrainPhlAn phylogenetic tree, normalised by the total branch length of the tree. Pearson’s correlation coefficients are A. pittii: 0.57, E. cloacae: 0.85, E. coli: 0.75, P. acnes: 0.79, A. radioresistens: 0.34 and P. stutzeri: 0.41. All p-values are lower than 1e-5.
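The two distances and the correlation named in the caption can be illustrated with a minimal sketch. It is not the authors' code: the function names, the toy presence-absence vectors and the toy branch lengths are all hypothetical, and it assumes the presence-absence profiles have already been exported as equal-length lists of 0/1 values and that the leaf-to-leaf path length and total tree length have already been read from the tree.

```python
import math


def normalised_hamming(a, b):
    """Fraction of positions at which two equal-length binary vectors differ."""
    if len(a) != len(b):
        raise ValueError("presence-absence vectors must have equal length")
    if not a:
        raise ValueError("vectors must be non-empty")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def normalised_branch_distance(path_length, total_branch_length):
    """Leaf-to-leaf path length divided by the total branch length of the tree."""
    if total_branch_length <= 0:
        raise ValueError("total branch length must be positive")
    return path_length / total_branch_length


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    norm_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    norm_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (norm_x * norm_y)


# Hypothetical toy profiles for two strains (1 = present, 0 = absent).
profile_a = [1, 1, 0, 1, 0, 0, 1, 1]
profile_b = [1, 0, 0, 1, 1, 0, 1, 0]
genomic_distance = normalised_hamming(profile_a, profile_b)  # 3 of 8 differ

# Hypothetical path length between two leaves and total tree length.
phylogenetic_distance = normalised_branch_distance(0.12, 1.5)
```

Computing both distances for every pair of strains of a species and passing the two resulting lists to `pearson` would give one coefficient per species, which is the kind of value the caption reports.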
