chadlaing/Panseq

the result

liuyongming0612 opened this issue · 1 comments

why the result of panGenome.fasta have so many sequence of "NNNN......."

Hi @liuyongming0612,

The pan-genome having so many Ns would be due to the input sequences having so many Ns. The Ns will show up as not-matching anything, and therefore be included in the pan-genome because they are "novel".

I would suggest either removing them from your input genomes, or the generated pan-genome.

I hope that helps,
Chad