Opened this issue 3 months ago · 1 comment
Several tasks benefit from an initial supervised fine-tuning (SFT) step before RL. It would be good for ART to support this directly, so that folks can build programmatic pipelines that easily stitch the two together.
My bad, typo. Working on this.