NVIDIA/NeMo-Curator

[FEA] Add support for Multiple Model Quality Classification

sarahyurick opened this issue · 1 comments

In previous versions of NeMo Curator, we supported multiple model quality classification with a combination of Slurm and Python scripts. These scripts were designed to allow the user to pass in multiple model paths at once for running multi-node multi-GPU data classification.

Now, we are moving away from Slurm scripts in favor of a Python API. I think we should eventually create a Python API (ideally using Crossfit) to support multiple model classification.