/minRLHF

A minimal PyTorch re-implementation of RLHF

GNU General Public License v3.0GPL-3.0

Watchers