calacademy-research/minibar

Problem with the -F option: the names of the output files get messed up

thecorz opened this issue · 2 comments

Hello, thank you for developing the tool, really useful

When I try to demultiplex my reads into different sample files, I get files with the names of multiple samples concatenated

When I run the below line in Ubuntu 20.04 using Python 3.8:
python3 minibar.py minibar_barcode.txt FAT74402_pass_fc9452d2_1.fasta -F

If I name my samples '1', '2', '3' and '4' I get files like the following:
`sample1.fasta
sample1_2.fasta
sample1_2_3.fasta
sample2
sample2_4'

What I tried:

  • Changing the names of the files to something very simple
  • Play with the options -M {1,2} -C -CC -e -l -n
  • Changing from fastq to fasta
  • Running the test files: When I run minibar with the test files of the repo, the names of the output files are fine

Is it a python version issue?

Thanks you very much

I can see the problem happening now in python3.8 and python2.7 so I don't think it's a matter of the python version