backpropper

research scientist @google-deepmind

Google DeepMindLondon, United Kingdom

Pinned Repositories

AC-GAN
Auxiliary Classifier Generative Adversarial Network in Torch7
Language:Lua5 3 10
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript1 1 00
baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python1 2 00
cbc-emecom
Code repository for Capacity, Bandwidth, and Compositionality in Emergent Language Learning (AAMAS 2020)
Language:Python6 3 31
dae-gan-pytorch
DAE-GAN-Pytorch
Language:Python3 2 00
DCGAN
Deep Convolution Generative Adversarial Network in Torch7
Language:OpenEdge ABL1 2 01
DNN-Activation-Brain
Code repository for Dissecting the DNN Brain for a Better Insight (ICASSP 2016)
Language:Python4 2 11
kaldi
This is now the official location of the Kaldi project.
Language:C++1 2 00
s2p
Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)
Language:Python16 3 12
stat-nlp-fall2017
Homework assignments for Statistical Natural Language Processing - NYU - Fall 2017
1 3 030

backpropper's Repositories

backpropper/academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript1 1 00
backpropper/babelcode
Language:Python0 0
backpropper/distrax
Language:Python0 0
backpropper/dm_robotics
Libraries, tools and tasks created and used at DeepMind Robotics.
Language:Python1 0
backpropper/ede
Code for the paper "Uncertainty-Driven Exploration for Generalization in Reinforcement Learning".
Language:Python1 0
backpropper/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Language:Python0 0
backpropper/human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
Language:Python0 0
backpropper/inference
Reference implementations of MLPerf™ inference benchmarks
Language:Python0 0
backpropper/LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
Language:Python0 0
backpropper/llama
Inference code for LLaMA models
Language:Python0 0
backpropper/llama-recipes
Examples and recipes for Llama model
Language:Python0 0
backpropper/LMFlow
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
Language:Python0 0
backpropper/math
The MATH Dataset (NeurIPS 2021)
Language:Python0 0
backpropper/mlx
MLX: An array framework for Apple silicon
Language:C++0 0
backpropper/mlx-examples
Examples in the MLX framework
Language:Python0 0
backpropper/openai-cookbook
Examples and guides for using the OpenAI API
Language:Python1 0
backpropper/openai-quickstart-python
Python example app from the OpenAI API quickstart tutorial
Language:CSS1 0
backpropper/OpenDevin
🐚 OpenDevin: Code Less, Make More
Language:Python0 0
backpropper/optax
Optax is a gradient processing and optimization library for JAX.
Language:Python0 0
backpropper/PromptPG
Data and code for the ICLR 2023 paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
Language:Python0 0
backpropper/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python0 0
backpropper/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
Language:Jupyter Notebook1 0
backpropper/Stable-Alignment
Multi-agent Social Simulation + Efficient, Effective, and Stable alternative of RLHF. Code for the paper "Training Socially Aligned Language Models in Simulated Human Society".
Language:Python0 0
backpropper/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python0 0
backpropper/summarize-from-feedback
Code for "Learning to summarize from human feedback"
Language:Python1 0
backpropper/SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
Language:Python0 0
backpropper/tiktoken
Language:Python1 0
backpropper/training
Reference implementations of MLPerf™ training benchmarks
Language:Python0 0
backpropper/trl
Train transformer language models with reinforcement learning.
Language:Python0 0
backpropper/website
Language:HTML0 0