nanoChatGPT

A crude RLHF layer on top of nanoGPT, using the Gumbel-Softmax trick

Primary language: Python
License: MIT
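
For readers unfamiliar with the Gumbel-Softmax trick named in the description, here is a minimal standard-library sketch of the sampling step. It is an illustration only and is not taken from this repository's code. The function name `gumbel_softmax` and the parameters `tau` and `rng` are assumptions chosen for the example. The idea is to add Gumbel(0, 1) noise to the logits and apply a temperature-scaled softmax, which gives a differentiable relaxation of sampling a discrete token.

```python
import math
import random


def gumbel_softmax(logits, tau=1.0, rng=random):
    """Return one relaxed (soft) one-hot sample for the given logits.

    Hypothetical helper for illustration; not this repository's API.
    """
    if tau <= 0:
        raise ValueError("tau must be positive")

    noisy = []
    for logit in logits:
        u = rng.random()
        # Clamp away from 0 and 1 so both logarithms stay finite.
        u = min(max(u, 1e-12), 1.0 - 1e-12)
        # Gumbel(0, 1) noise via the inverse CDF: g = -log(-log(u)).
        g = -math.log(-math.log(u))
        noisy.append((logit + g) / tau)

    # Numerically stable softmax over the perturbed, temperature-scaled logits.
    m = max(noisy)
    exps = [math.exp(x - m) for x in noisy]
    total = sum(exps)
    return [e / total for e in exps]


if __name__ == "__main__":
    sample = gumbel_softmax([2.0, 0.5, -1.0], tau=0.5, rng=random.Random(0))
    print(sample)
```

As `tau` approaches 0 the output approaches a one-hot vector, and larger values give smoother distributions. In a PyTorch project such as one built on nanoGPT, the built-in `torch.nn.functional.gumbel_softmax` would likely be used instead, since it operates on tensors and supports backpropagation.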
