Pinned Repositories
AC-GAN
Auxiliary Classifier Generative Adversarial Network in Torch7
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
cbc-emecom
Code repository for Capacity, Bandwidth, and Compositionality in Emergent Language Learning (AAMAS 2020)
dae-gan-pytorch
DAE-GAN-Pytorch
DCGAN
Deep Convolution Generative Adversarial Network in Torch7
DNN-Activation-Brain
Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016)
kaldi
This is now the official location of the Kaldi project.
s2p
Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)
stat-nlp-fall2017
Homework assignments for Statistical Natural Language Processing - NYU - Fall 2017
backpropper's Repositories
backpropper/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
backpropper/babelcode
backpropper/distrax
backpropper/dm_robotics
Libraries, tools and tasks created and used at DeepMind Robotics.
backpropper/ede
Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".
backpropper/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
backpropper/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
backpropper/inference
Reference implementations of MLPerf™ inference benchmarks
backpropper/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
backpropper/llama
Inference code for LLaMA models
backpropper/llama-recipes
Examples and recipes for Llama model
backpropper/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
backpropper/math
The MATH Dataset (NeurIPS 2021)
backpropper/mlx
MLX: An array framework for Apple silicon
backpropper/mlx-examples
Examples in the MLX framework
backpropper/openai-cookbook
Examples and guides for using the OpenAI API
backpropper/openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial
backpropper/OpenDevin
🐚 OpenDevin: Code Less, Make More
backpropper/optax
Optax is a gradient processing and optimization library for JAX.
backpropper/PromptPG
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
backpropper/reward-bench
RewardBench: the first evaluation tool for reward models.
backpropper/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
backpropper/Stable-Alignment
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
backpropper/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
backpropper/summarize-from-feedback
Code for "Learning to summarize from human feedback"
backpropper/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
backpropper/tiktoken
backpropper/training
Reference implementations of MLPerf™ training benchmarks
backpropper/trl
Train transformer language models with reinforcement learning.
backpropper/website