JasonMa2016
PhD student at University of Pennsylvania. I research in reinforcement learning and robot learning.
Philadelphia, PA
Pinned Repositories
DrEureka
Official Repository for "DrEureka: Language Model Guided Sim-To-Real Transfer" (RSS 2024)
Eureka
Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)
vip
Official repository for "VIP: Towards Universal Visual Reward and Representation via Value-Implicit Pre-Training"
2021
Website for the Offline RL Workshop at NeurIPS 2020.
BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
CODAC
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
GoFAR
Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)
LDS
Official repository for paper "Likelihood-Based Diverse Sampling for Trajectory Forecasting" (ICCV 2021)
SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML 2022)
LIV
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
JasonMa2016's Repositories
JasonMa2016/GoFAR
Official repository for Paper "Offline Goal-Conditioned Reinforcement Learning via f-Advantage Regression" (NeurIPS 2022)
JasonMa2016/SMODICE
Official repository for paper "Versatile Offline Imitation from Observations and Examples via Regularized State-Occupancy Matching" (ICML 2022)
JasonMa2016/LDS
Official repository for paper "Likelihood-Based Diverse Sampling for Trajectory Forecasting" (ICCV 2021)
JasonMa2016/CODAC
Official repository for paper "Conservative Offline Distributional Reinforcement Learning" (NeurIPS 2021)
JasonMa2016/2021
Website for the Offline RL Workshop at NeurIPS 2020.
JasonMa2016/BCQ
PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
JasonMa2016/conv-social-pooling
Code for model proposed in: Nachiket Deo and Mohan M. Trivedi,"Convolutional Social Pooling for Vehicle Trajectory Prediction." CVPRW, 2018
JasonMa2016/D-eck
A deck of Cards, written in D
JasonMa2016/dqn-pytorch
DQN to play Atari Pong
JasonMa2016/droid_policy_learning
DROID Policy Learning and Evaluation
JasonMa2016/embodied-clip
Official codebase for EmbCLIP
JasonMa2016/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
JasonMa2016/General_projects
JasonMa2016/human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
JasonMa2016/learn2learn
PyTorch Meta-learning Framework for Researchers
JasonMa2016/meta-rl-bandits
A simple RNN meta-learner
JasonMa2016/mj_envs
A collection of MuJoCo based environments.
JasonMa2016/mjrl
Reinforcement learning algorithms for MuJoCo tasks
JasonMa2016/mushroom-rl
Python library for Reinforcement Learning experiments.
JasonMa2016/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
JasonMa2016/RC_RL
JasonMa2016/Reinforcement-learning
Modular implementations of reinforcement learning algorithms with PyTorch
JasonMa2016/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
JasonMa2016/rlkit
Collection of reinforcement learning algorithms
JasonMa2016/robomimic
robomimic: A Modular Framework for Robot Learning from Demonstration
JasonMa2016/spinningup
An educational resource to help anyone learn deep reinforcement learning.
JasonMa2016/tensorflow
Computation using data flow graphs for scalable machine learning
JasonMa2016/website