PM tutorial: Taxonomic classification results are the same before and after retraining the classifier
misialq opened this issue · 2 comments
Bug Description
In the course of going through the Parkinson’s Mouse Tutorial, we noticed that when looking at the taxonomic classification results using the classifier retrained with the information about the typical stool sample composition, the species mentioned in the tutorial (B. ovatus) can be found in both sets of results in equal counts. In other words, with the data provided in the tutorial, retraining the classifier does not really improve the classification results with regard to B. ovatus. It seems as if the data originally used to train the first classifier changed in the meantime giving rise to similar results. In this context the tutorial question about the presence of B. ovatus in both results is potentially outdated.
Steps to reproduce the behavior
- Open the
taxonomy.qzv
andbespoke_taxonomy.qzv
visualizations from the PM tutorial - Filter the taxon list for "ovatus"
- Compare results obtained in both
Expected behavior
Not sure, but supposedly the original taxonomy result should have less taxons identified as ovatus
?
Actual behavior
Both results show the same number of ovatus taxa.
Screenshots
from taxonomy.qzv:
from bespoke_taxonomy.qzv:
Comments
- This is under the assumption that retraining the classifier should improve identification results.
@BenKaehler, did you write that part of the PD Mice tutorial? If so, care to comment?