pyiron/executorlib

[feature] control "submit"-level resource distribution


Hi @jan-janssen - is there a way to control the distribution of cores/threads/gpus at executor.submit() time (e.g. per-"job")? I was looking over the more recent versions and it seems like the interface has moved toward initializing the executors with this information - but I very likely could have missed something.

Hi @mgt16-LANL , that is correct: typically we define the resources once for the executor, and each function submitted to a given executor then uses the same pre-defined set of resources. The background for this is that the executor gets a set of reserved resources. In principle it would be technically possible to assign resources at the submit level, but that is currently not implemented.
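
For illustration, the executor-level pattern looks roughly like this (a minimal sketch; the constructor keywords max_workers and cores_per_worker are meant in the pympipool style and may differ between versions):

    from pympipool import Executor

    # Resources are declared once, when the executor is created; every
    # function submitted afterwards runs with this same allocation.
    # (Sketch: keyword names may differ between pympipool versions.)
    with Executor(max_workers=2, cores_per_worker=4) as exe:
        future = exe.submit(sum, [1, 2, 3])  # inherits the executor's resources
        print(future.result())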

I guess I'm pretty interested in having the resource allocation be dynamically available, especially from flux/slurm backends, for more dynamic/load-balancing workflows! I'll tag this as a request. Is there a reason we couldn't just add these to *args or **kwargs in the BaseExecutor class submit function, to be handled differentially by the FluxPythonInterface bootup function, for example?

There are two reasons:

  • it could lead to confusion with the function arguments, for example if the function has an argument cores and pympipool also uses an argument cores (see the sketch after this list).
  • it requires us to start a new python process for each task; in the current implementation we start one python process per executor per slot and then reuse these. An executor can execute multiple slots in parallel, depending on how max_workers is set.
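
The first point can be demonstrated with the standard-library submit() signature, where every keyword is forwarded to the submitted function (the resource-keyword scenario is hypothetical, not an implemented API):

    from concurrent.futures import ThreadPoolExecutor

    def my_function(cores):
        # user function that happens to have its own argument named "cores"
        return list(range(cores))

    with ThreadPoolExecutor() as exe:
        # submit() forwards keyword arguments to the function. If submit()
        # also accepted a resource keyword named "cores", this call would be
        # ambiguous: input for my_function, or a resource request for the
        # scheduler?
        print(exe.submit(my_function, cores=2).result())  # [0, 1]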

> it could lead to confusion with the function arguments, for example if the function has an argument cores and pympipool also uses an argument cores.

My suggestion would be to use something like "runtime_cores" or a different nomenclature for the submit-time resource assignment.
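
Something along these lines (hypothetical keyword, purely to illustrate the naming idea; not an implemented API):

    # A clearly namespaced keyword would avoid clashing with the submitted
    # function's own arguments: cores=2 goes to my_function, runtime_cores=4
    # to the scheduler. (Hypothetical, not implemented.)
    future = exe.submit(my_function, cores=2, runtime_cores=4)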

> it requires us to start a new python process for each task; in the current implementation we start one python process per executor per slot and then reuse these. An executor can execute multiple slots in parallel, depending on how max_workers is set.

I'm not sure I understand this one - looking at the code in https://github.com/pyiron/pympipool/blob/main/pympipool/flux/executor.py:

    def bootup(self, command_lst):
        if self._oversubscribe:
            raise ValueError(
                "Oversubscribing is currently not supported for the Flux adapter."
            )
        # Create the FluxExecutor lazily on first bootup.
        if self._executor is None:
            self._executor = flux.job.FluxExecutor()
        # The resource request is built from the interface's own attributes,
        # which were fixed when the executor was created.
        jobspec = flux.job.JobspecV1.from_command(
            command=command_lst,
            num_tasks=self._cores,
            cores_per_task=self._threads_per_core,
            gpus_per_task=self._gpus_per_core,
            num_nodes=None,
            exclusive=False,
        )
        jobspec.environment = dict(os.environ)
        if self._cwd is not None:
            jobspec.cwd = self._cwd
        self._future = self._executor.submit(jobspec)

It would seem like, within the single python process, you could expose the underlying Jobspec to the user at submission time without requiring an additional python process?

About the second part: bootup() happens when the executor is created, and the python process then remains active until the executor is closed. Meaning: you create an executor, it creates N workers (defined by max_workers), each with a specific set of resources as defined by the executor. Functions are then submitted to the executor, and the executor internally distributes them to its workers.
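
The standard-library analogy may be helpful here; pympipool follows the same pattern, except that each worker is started with its own fixed resource set:

    from concurrent.futures import ProcessPoolExecutor

    # The pool starts max_workers long-lived processes once; submitted
    # functions are distributed to those existing workers instead of a new
    # process being spawned per task.
    with ProcessPoolExecutor(max_workers=2) as exe:
        futures = [exe.submit(pow, 2, n) for n in range(10)]
        print([f.result() for f in futures])  # 10 tasks, only 2 processes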

Ah! That makes sense. Is the preferred method for getting this type of functionality with pympipool to define a set of executors to use as "queues" with more/less resources?

> Ah! That makes sense. Is the preferred method for getting this type of functionality with pympipool to define a set of executors to use as "queues" with more/less resources?

Yes, at least that is how I have been using it so far. This allows pympipool to reuse the python processes it started, meaning it no longer has to go through flux for every function you submit.
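
For example, something like this (a minimal sketch, again assuming pympipool-style constructor keywords, which may differ between versions):

    from pympipool import Executor

    def small_task(x):
        return x + 1

    def heavy_task(x):
        # placeholder for a function that benefits from more cores
        return x * 2

    # Two executors act as "queues" with different per-worker resources;
    # each keeps its python processes alive and reuses them across
    # submissions, so flux is not contacted for every individual task.
    with Executor(max_workers=4, cores_per_worker=1) as small_exe:
        with Executor(max_workers=1, cores_per_worker=8) as big_exe:
            f1 = small_exe.submit(small_task, 1)
            f2 = big_exe.submit(heavy_task, 2)
            print(f1.result(), f2.result())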

@mgt16-LANL I have an initial draft for this interface available in #293; it would be very interesting to see if this also solves your needs.