mikezhang95

Pinned Repositories

SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
Language:Python117 4 1043
codejam
Practice C++ and VIM on https://code.google.com/codejam/
Language:C++0 1 00
dorie_mike_recipe
A simple Pandoc-powered static site generator for your recipe collection – it effortlessly turns a set of Markdown-formatted recipes into a lightweight, responsive, searchable website.
Language:HTML1 0 00
DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Language:Python0 1 00
EWC_extension
To reimplement EWC and extend it to block approximation and RNN network
Language:Python2 0 00
fine-tuning-locomotion
Language:Python0 1 00
HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
Language:Python18 2 01
IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
Language:Python00
mpc_rrl
Language:Python1 1 00
rrl_usr
Language:Python2 1 00

mikezhang95's Repositories

mikezhang95/HDNO
This is the source code for HDNO: a hierarchical model for task-oriented dialogue system.
Language:Python18 2 01
mikezhang95/EWC_extension
To reimplement EWC and extend it to block approximation and RNN network
Language:Python2 0 00
mikezhang95/rrl_usr
Language:Python2 1 00
mikezhang95/dorie_mike_recipe
A simple Pandoc-powered static site generator for your recipe collection – it effortlessly turns a set of Markdown-formatted recipes into a lightweight, responsive, searchable website.
Language:HTML1 0 00
mikezhang95/mpc_rrl
Language:Python1 1 00
mikezhang95/codejam
Practice C++ and VIM on https://code.google.com/codejam/
Language:C++0 1 00
mikezhang95/DPR
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.
Language:Python0 1 00
mikezhang95/fine-tuning-locomotion
Language:Python0 1 00
mikezhang95/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
Language:Python00
mikezhang95/kaggle-football
codes for google research football competition on kaggle
Language:Python0 2 01
mikezhang95/mbrl-lib
Library for Model Based RL
Language:Python0 1 00
mikezhang95/MDMRC
A General Framework for Multi-Document Machine Reading Comprehension Problem
Language:Python4 0
mikezhang95/mikezhang95.github.io
Language:HTML1 0
mikezhang95/MyFiles
Language:Jupyter Notebook2 0
mikezhang95/MyImages
As a image bed
2 0
mikezhang95/RAG
Language:Python2 0
mikezhang95/robot_io
Controller for Franka Emika Panda, Teleop and RGB-D Cameras
Language:C0 0
mikezhang95/rollout_mpc
Apply Rollout Policy to Increase the Horizon of MPC
Language:Jupyter Notebook2 0
mikezhang95/tdmpc2
Code for "TD-MPC2: Scalable, Robust World Models for Continuous Control"
Language:Jupyter Notebook
mikezhang95/trifinger_data
Language:Python1 0