lh3/minimap2

How to accelerate the running process by adjust parameters

wk8910 opened this issue · 6 comments

Hi Heng,
I'm currently running minimap2 for very large genome. However, the running time is unbearable. I want to trade mapping quality with running time. Which parameters can be adjusted to get faster speed? Do you have any suggestions?
Thank you very much!
Best,
Kun

The obvious options are -k and -w to control seeding. Increase those to be less sensitive, but faster.

lh3 commented

What's the size of your genome? What're the current parameters in use? How slow is "unbearable"?

The obvious options are -k and -w to control seeding. Increase those to be less sensitive, but faster.

Thanks for your reply!

What's the size of your genome? What're the current parameters in use? How slow is "unbearable"?

Hi Heng,
The genome size is about 40 Gigabase. I'm now trying to perform reads to reads alignment (The total amount of reads is about 2Tb). I'm currently using "-x ava-ont -t 8 -l 5000" and it need about 50 compute machine for more than one month.
Best,
Kun

lh3 commented

Get a machine with 256GB RAM and use -I50g. Increase the number of threads -t. Larger -k and -w will help as @armintoepfer said.

Get a machine with 256GB RAM and use -I50g. Increase the number of threads -t. Larger -k and -w will help as @armintoepfer said.

Thanks very much!
I'm trying to test these new parameters.