eperrier/Stable-Alignment
Multi-agent social simulation plus an efficient, effective, and stable alternative to RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
Python · NOASSERTION