gjospin/PhyloSift

Misclassifications

Opened this issue · 0 comments

Hi I have a few metagenomic sequences that seem to give misclassifications.

The sequences should be Actinobacteria - everything I've done (draw trees, blast, etc) puts them in the Actinobacteria.

Here's a small example of a 30S ribosomal protein S2 from one of my contigs:

7758.s2
ATGAGTACAAACATGAAAGAACTCTTAGAAGCAGGAGTTCATTTTGGTCACCAGACCCAACGTTGGAACCCTAAAATGGATAACTTCATTTATGGAGATAAAAGCGGAATACATATCTTAGATTTAAGAATTACTTATGAGGCAATTGCTCAAGCAGAAGATTTTGTTCAGAAAATTGTAGCAAATGGTGGAAAAGTATTATTCGTAGGAACCAAACCACAAGCTCAAAATGTTATTCAGGAACAAGCAGAAGCATCAGGGATGCCTTTTGTTAATCACAGGTGGTTGGGTGGTATGCTAACAAATTTCAAAACTATTATTAAAAGAGTTATTTATTTAAAAGAATTAATTTCTTTAGAAGATTCTGGTGAAATTAATGCATATCCTAAACCAGAAAGACTTAGAATTCGAAGAGAGATTACGAAATTAACACGTTCAATTGGTGGCATAGTTAATTTGAGTAAAATACCAGATGCTATATTTATTGTTGATTTAATGAATGAATCAACTGCACTTACCGAAGCCAATAAACTGGGTATACCTGTAATTGGTCTAGCAGATTCCAATGTCGACCCAACGGGCGTTGATATCGTTATACCTGGCAATGATGATGCAATCAGATCTATCGAAGTAGTTACTTCAGCAATCGCTGAAGCGTGCGCAAAAGGTGCTGGACTTGAAGCAGTTATTGAAGAAAAAGAATCGAACTAA

Phylosift seems to think it's from the phylum "Parcubacteria". I know automated methods are never going to be perfect but being off at the phylum makes me think something is wrong with how Phylosift is parsing on my end.

Thanks a bunch!