MandiZhao

PhDing at Stanford

Palo Alto

Pinned Repositories

1d-overparam
0 2 00
bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
Language: Python | 0 0 00
biobert-pretrained
BioBERT: a pre-trained biomedical language representation model for biomedical text mining
0 1 00
brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Language: Jupyter Notebook | 0 1 00
Coursework
Language: Java | 1 1 00
Dynamic3DGaussians
Language: Python | 2 0 00
open_flamingo
An open-source framework for training large multimodal models.
Language: Python | 1 0 00
real2code
Language: Python | 69 1 33
robot-collab
Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
Language: Python | 169 5 1328
Stat-157
For Berkeley Deep Learning Course Stat 157
Language: Jupyter Notebook | 2 2 00

MandiZhao's Repositories

MandiZhao/robot-collab
Codebase for paper: RoCo: Dialectic Multi-Robot Collaboration with Large Language Models
Language: Python | 169 5 1328
MandiZhao/real2code
Language: Python | 69 1 33
MandiZhao/Dynamic3DGaussians
Language: Python | 2 0 00
MandiZhao/open_flamingo
An open-source framework for training large multimodal models.
Language: Python | 1 0 00
MandiZhao/1d-overparam
0 2 00
MandiZhao/bet
Code and website for Behavior Transformers: Cloning k modes with one stone.
Language: Python | 0 0 00
MandiZhao/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Language: Jupyter Notebook | 0 1 00
MandiZhao/cs285_fall19
Language: Python | 0 2 00
MandiZhao/dextairity
[RSS 2022, Best System Paper Finalist] DextAIRity: Deformable Manipulation Can be a Breeze
Language: C++ | 0 0 00
MandiZhao/Rainbow
Rainbow: Combining Improvements in Deep Reinforcement Learning
Language: Python | 0 1 00
MandiZhao/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language: Python | 0 0
MandiZhao/dqnPer
Language: Python | 2 1
MandiZhao/librealsense
Intel® RealSense™ SDK
Language: C++ | 1 0
MandiZhao/LoS
Language: Jupyter Notebook | 1 0
MandiZhao/mj_envs
A collection of MuJoCo based environments.
Language: Python | 1 0
MandiZhao/mjenv_kitchen
Language: Python | 1 0
MandiZhao/mjrl
Reinforcement learning algorithms for MuJoCo tasks
Language: Python | 0 0
MandiZhao/mvp
Masked Visual Pre-training for Motor Control
Language: Python | 1 0
MandiZhao/obj2mjcf
A CLI for processing composite Wavefront OBJ files for use in MuJoCo.
Language: Python | 0 0
MandiZhao/ocfweb
The main ocf website
Language: Python | 1 0
MandiZhao/puppet
Puppet config for OCF servers and lab machines
Language: Puppet | 1 0
MandiZhao/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language: Python | 1 0
MandiZhao/r3m
Pre-training Reusable Representations for Robotic Manipulation Using Diverse Human Video Data
Language: Jupyter Notebook | 1 0
MandiZhao/rlpyt
Reinforcement Learning in PyTorch
Language: Python | 1 0
MandiZhao/train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
Language: Jupyter Notebook | 1 0
MandiZhao/vil_site
Language: SCSS | 1 0
MandiZhao/website
Language: HTML | 1 0
MandiZhao/website-1
Language: HTML | 1 0
MandiZhao/WiLoR
WiLoR: End-to-end 3D hand localization and reconstruction in-the-wild
MandiZhao/YARR
Yet Another Robotics and Reinforcement (YARR) learning framework for PyTorch.
Language: Python | 1 0