qiime2/docs

PM tutorial: Taxonomic classification results are the same before and after retraining the classifier

misialq opened this issue · 2 comments

Bug Description
In the course of going through the Parkinson’s Mouse Tutorial, we noticed that when looking at the taxonomic classification results using the classifier retrained with the information about the typical stool sample composition, the species mentioned in the tutorial (B. ovatus) can be found in both sets of results in equal counts. In other words, with the data provided in the tutorial, retraining the classifier does not really improve the classification results with regard to B. ovatus. It seems as if the data originally used to train the first classifier changed in the meantime giving rise to similar results. In this context the tutorial question about the presence of B. ovatus in both results is potentially outdated.

Steps to reproduce the behavior

  1. Open the taxonomy.qzv and bespoke_taxonomy.qzv visualizations from the PM tutorial
  2. Filter the taxon list for "ovatus"
  3. Compare results obtained in both

Expected behavior
Not sure, but supposedly the original taxonomy result should have less taxons identified as ovatus?

Actual behavior
Both results show the same number of ovatus taxa.

Screenshots
from taxonomy.qzv:
taxonomy
from bespoke_taxonomy.qzv:
bespoke_taxonomy

Comments

  1. This is under the assumption that retraining the classifier should improve identification results.

@BenKaehler, did you write that part of the PD Mice tutorial? If so, care to comment?