Pinned Repositories
1d-overparam
bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
biobert-pretrained
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Coursework
Dynamic3DGaussians
open_flamingo
An open-source framework for training large multimodal models.
real2code
robot-collab
Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
Stat-157
For Berkeley Deep Learning Course Stat 157
MandiZhao's Repositories
MandiZhao/robot-collab
Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
MandiZhao/real2code
MandiZhao/Dynamic3DGaussians
MandiZhao/open_flamingo
An open-source framework for training large multimodal models.
MandiZhao/1d-overparam
MandiZhao/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
MandiZhao/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
MandiZhao/cs285_fall19
MandiZhao/dextairity
[RSS 2022, Best System Paper Finalist] DextAIRity: Deformable Manipulation Can be a Breeze
MandiZhao/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
MandiZhao/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
MandiZhao/dqnPer
MandiZhao/librealsense
Intel® RealSense™ SDK
MandiZhao/LoS
MandiZhao/mj_envs
A collection of MuJoCo based environments.
MandiZhao/mjenv_kitchen
MandiZhao/mjrl
Reinforcement learning algorithms for MuJoCo tasks
MandiZhao/mvp
Masked Visual Pre-training for Motor Control
MandiZhao/obj2mjcf
A CLI for processing composite Wavefront OBJ files for use in MuJoCo.
MandiZhao/ocfweb
The main ocf website
MandiZhao/puppet
Puppet config for OCF servers and lab machines
MandiZhao/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
MandiZhao/r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
MandiZhao/rlpyt
Reinforcement Learning in PyTorch
MandiZhao/train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
MandiZhao/vil_site
MandiZhao/website
MandiZhao/website-1
MandiZhao/WiLoR
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
MandiZhao/YARR
Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.