Skip to main content
Fig. 4 | Biology Direct

Fig. 4

From: Systematic evaluation of supervised machine learning for sample origin prediction using metagenomic sequencing data

Fig. 4

Nested 10-fold cross validation predictions from Lasso-regularized regression and classification compared to true locations. Latitude (a) and longitude (b) predictions from the multivariate regression model on species abundance data are plotted against the true geographic coordinates on the x-axis. Each data point represents a sample from the corresponding city, as indicated in the legend. The dashed line shows where predictions would be exactly correct. c Predictions from the classification model are illustrated in comparison to true sources. Each entry shows the number of samples predicted to be the corresponding city (row) and originally from the corresponding reference (column)

Back to article page