Controlling CPU usage for the DRAM.py distill pipeline for DRAM v.1.5.0
Opened this issue · 1 comments
I am trying to control CPU usage for the DRAM.py distill pipeline on DRAM v.1.5.0 that I am using on DRAM.py annotate .tsv output from metagenome assembled genomes. As I am using a shared lab server with 128 available CPU threads and over 20 users I need to set a max CPU usage and provide a run-time estimate. However, there does not appear to be any flag to control the max number of threads that the DRAM.py distill pipeline uses. When I ran the following command and monitored CPU use DRAM.py distill used up to 20 CPU's when I tested the command on a single metagenome assembled genome DRAM.py annotate output .tsv file:
DRAM.py distill -i annotations.tsv -o DRAM_results_distilled --trna_path trnas.tsv --rrna_path rrnas.tsv
Is there any way to control CPU usage in the DRAM.py distill pipeline?
Thank you so much for your time and assistance in advance!
@nataliesarno DRAM v2 just got its first public beta release, and there are options to control the number of threads, memory allocation, SLURM queue process size (number of simultaneous SLURM jobs), all from the config or from CLI. The documentation is light right now with the first public beta release. We would love for you to give it a try, but if you want to wait, the documentation updates will be coming in the next couple weeks.