[CLI-10] Overhaul job spec to resemble a DAG
Closed this issue · 0 comments
nravic commented
Caltech could use this too, we need to build repeatable, composable and retryable jobs that can be defined as DAGs.
This way, if a step in a DAG fails, the Cedana server retries it on another machine. The composability leads way to some really interesting use cases for users.
We can leverage some of the retryability already written to help build this out.
From SyncLinear.com | CLI-10