Pinned Repositories
EmbodiedBench
Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
2048_env
2048 environment for Reinforcement Learning and DQN algorithm
AWGCSL
Code for ICLR 2022 paper "Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL".
Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
GOAT
Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".
Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
RiC
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
RIQL
Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
RORL
Code for NeurIPS 2022 paper "Robust Offline Reinforcement Learning via Conservative Smoothing"
Sparse-Reward-Algorithms
Implementations of many sparse-reward algorithms in the Gym Fetch environment
YangRui2015's Repositories
YangRui2015/Sparse-Reward-Algorithms
Implementations of many sparse-reward algorithms in the Gym Fetch environment
YangRui2015/RiC
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
YangRui2015/2048_env
2048 environment for Reinforcement Learning and DQN algorithm
YangRui2015/AWGCSL
Code for ICLR 2022 paper "Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL".
YangRui2015/Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
YangRui2015/RORL
Code for NeurIPS 2022 paper "Robust Offline Reinforcement Learning via Conservative Smoothing"
YangRui2015/Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
YangRui2015/RIQL
Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
YangRui2015/GOAT
Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".
YangRui2015/Model-basedHER
Model-based Hindsight Experience Replay
YangRui2015/HERO
Code for "Combining Hindsight and Imagination in Multi-goal Reinforcement Learning"
YangRui2015/UWMSG
YangRui2015/d3rlpy
An offline deep reinforcement learning library
YangRui2015/reward-bench
RewardBench: the first evaluation tool for reward models.
YangRui2015/rlkit_pro
Improving original rlkit forked from https://github.com/vitchyr/rlkit
YangRui2015/starter-academic
YangRui2015/yangrui2015.github.io
YangRui2015/COMP4901Y_Course_HKUST
Course Material for the UG Course COMP4901Y
YangRui2015/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
YangRui2015/icml-nips-iclr-dataset
Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2021
YangRui2015/Meta-Envs
Meta environments package for reinforcement learning
YangRui2015/minimalRL
Implementations of basic RL algorithms with minimal lines of code (PyTorch-based)
YangRui2015/PPE
YangRui2015/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
YangRui2015/rlkit
Collection of reinforcement learning algorithms
YangRui2015/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
YangRui2015/SocialRobot
YangRui2015/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
YangRui2015/starter-hugo-academic
YangRui2015/YangRui2015
Hi there