Pinned Repositories
awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
ColossalAI
Making large AI models cheaper, faster and more accessible
PPOCoder
Code for "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
TPSR
[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"
ffmott's Repositories
ffmott/ColossalAI
Making large AI models cheaper, faster and more accessible
ffmott/TPSR
[NeurIPS 2023] This is the official code for the paper "TPSR: Transformer-based Planning for Symbolic Regression"
ffmott/awesome-RLHF
A curated list of reinforcement learning from human feedback (RLHF) resources (continually updated)
ffmott/CodeGen
CodeGen is an open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.
ffmott/PPOCoder
Code for "PPOCoder: Execution-based Code Generation using Deep Reinforcement Learning"
ffmott/toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI