Pinned Repositories
homework_fall2020
Assignments for Berkeley CS 285: Deep Reinforcement Learning (Fall 2020)
ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
face-intrinsics
gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
intrinsics-network
Code for the paper "Self-Supervised Intrinsic Image Decomposition"
mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
o2p2
spatial-reasoning
Code for the paper "Representation Learning for Grounded Spatial Reasoning"
trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
jannerm's Repositories
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
jannerm/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
jannerm/ddpo
Code for the paper "Training Diffusion Models with Reinforcement Learning"
jannerm/intrinsics-network
Code for the paper "Self-Supervised Intrinsic Image Decomposition"
jannerm/spatial-reasoning
Code for the paper "Representation Learning for Grounded Spatial Reasoning"
jannerm/gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
jannerm/o2p2
jannerm/face-intrinsics
jannerm/torch-colormap
MATLAB-style colormaps for Torch
jannerm/bullet-manipulation
Robotic manipulation in PyBullet
jannerm/slice-sampling
jannerm/d4rl
A benchmark for offline reinforcement learning.
jannerm/bairblog.github.io
jannerm/bullet3
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning, etc.
jannerm/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
jannerm/doodad
jannerm/gamma-blog
jannerm/Gumbel_Softmax_VAE
PyTorch implementation of a Variational Autoencoder with Gumbel-Softmax Distribution
jannerm/gym_minigrid
jannerm/mbpo-www
jannerm/mujoco-py
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
jannerm/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
jannerm/rlkit
Collection of reinforcement learning algorithms
jannerm/SimpleSAC
A simple and easy-to-use implementation of the soft actor-critic algorithm.
jannerm/viskit
rllab's viskit with some added features