mafExtractor would split MSA even they should be together
zy041225 opened this issue · 3 comments
Hi,
I'm dealing with my MSA using mafExtractor, but some output is split into two or more alignments when they should be one.
For example, when running the following command
mafExtractor --maf chrX.nonoverlap.maf --seq "hsap.chrX" --start 100067347 --stop 100067714 > split/chrX.nonoverlap.maf.100067347-100067714.maf
one of the MSA
origin.txt
are split into multiple parts
split.txt
and if I add --soft
, it output the same alignment as input without extraction
Please check.
I'm pretty sure it's doing the right thing here. The sequence you asked for, hsap.chrX, has gaps in it. mafExtractor only cares about that sequence so any insertions are left out.
Thanks for the explanation. BTW, will mafExtractor check if there's any gap in reference only?