Pinned Repositories
Algorithm-Distillation-RLHF
briefly
cookiecutter-research
Starting point for python + jax research repos
cubster
DeepSpeedExamples
Example models using DeepSpeed
dei
Fake-News
Private repo for 2018 group design practical
minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
mr_nlp
treeQuadrature
thomfoster's Repositories
thomfoster/minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
thomfoster/Algorithm-Distillation-RLHF
thomfoster/mr_nlp
thomfoster/treeQuadrature
thomfoster/briefly
thomfoster/cookiecutter-research
Starting point for python + jax research repos
thomfoster/cubster
thomfoster/DeepSpeedExamples
Example models using DeepSpeed
thomfoster/dei
thomfoster/Fake-News
Private repo for 2018 group design practical
thomfoster/former
Simple transformer implementation from scratch in pytorch.
thomfoster/GMOD1
thomfoster/integration
Repository for my 4th year dissertation on marginal likelihood computation using descision trees.
thomfoster/jaxued
thomfoster/meta-representations
thomfoster/public-video-seg-data
thomfoster/thomfoster.github.io
thomfoster/treeMod8000
thomfoster/trlx
A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)
thomfoster/votify
Voting app for the Oxford Law and CS course
thomfoster/XMem
[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model