Reproduction of SLiC-HF: Sequence Likelihood Calibration with Human Feedback
(Work in progress)
# Install conda
wget https://repo.anaconda.com/miniconda/Miniconda3-py39_23.3.1-0-Linux-x86_64.sh
sh Miniconda3-py39_23.3.1-0-Linux-x86_64.sh
# Install this
source devops/install.sh
# Local development
sh src/do_everything_debug.sh
# Full training
sh src/do_everything_large.sh
- Long short experiment - Comparing
src/train/train_slic.py::slic_loss
function vssrc/train/train_slic.py::slic_loss_logits
function ont5-base
.