Concurrent running of cells for step 2.

Question

Concurrent running of cells for step 2.

JakeJackson22 opened this issue a year ago · 2 comments

Hi, I am a bioinformatician in the Haniffa Lab and I am trying to use SComatic to detect mutations on single cell data.

I have 122 cell types and notice that for step 2 of your example workflow you state this:

"In our experience, when running SComatic in an HPC cluster it is most efficient to compute the base count information for all cell types at once, specially when submitting each python line independently and at the same time to the cluster, and using multiple processors (p.e. --nprocs 16)"

I am using the Sanger FARM and just wondered if you had advice on how best to run SComatic for 122 cell types, as it seems to take a very long time as I am doing now, and step 2 seems to run sequentially rather than concurrently (how would I amend to code to run concurrently?).

Thank you very much

Answer 1 · 2024-01-29T22:31:16.000Z

Dear user,
How are you submitting the jobs to the FARM?

Could you please paste the command (ideally "for" loop) for submitting the jobs to the cluster?

Thanks,
Fran

Answer 2 · 2024-02-05T10:18:07.000Z

Hi Fran,

Thanks for the reply. I believe I have fixed the issue, using a pipeline developed by the Teichmann lab.

Thanks,
Jake