Pinned Repositories
SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
codejam
Practice C++ and VIM on https://code.google.com/codejam/
dorie_mike_recipe
A simple Pandoc-powered static site generator for your recipe collection – it effortlessly turns a set of Markdown-formatted recipes into a lightweight, responsive, searchable website.
DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
EWC_extension
To reimplement EWC and extend it to block approximation and RNN network
fine-tuning-locomotion
HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
mpc_rrl
rrl_usr
mikezhang95's Repositories
mikezhang95/HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
mikezhang95/EWC_extension
To reimplement EWC and extend it to block approximation and RNN network
mikezhang95/rrl_usr
mikezhang95/dorie_mike_recipe
A simple Pandoc-powered static site generator for your recipe collection – it effortlessly turns a set of Markdown-formatted recipes into a lightweight, responsive, searchable website.
mikezhang95/mpc_rrl
mikezhang95/codejam
Practice C++ and VIM on https://code.google.com/codejam/
mikezhang95/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
mikezhang95/fine-tuning-locomotion
mikezhang95/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
mikezhang95/kaggle-football
codes for google research football competition on kaggle
mikezhang95/mbrl-lib
Library for Model Based RL
mikezhang95/MDMRC
A General Framework for Multi-Document Machine Reading Comprehension Problem
mikezhang95/mikezhang95.github.io
mikezhang95/MyFiles
mikezhang95/MyImages
As a image bed
mikezhang95/RAG
mikezhang95/robot_io
Controller for Franka Emika Panda, Teleop and RGB-D Cameras
mikezhang95/rollout_mpc
Apply Rollout Policy to Increase the Horizon of MPC
mikezhang95/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
mikezhang95/trifinger_data