YangRui2015

Do less and do better.

UIUC IL, USA

Pinned Repositories

EmbodiedBench
Official repo of EmbodiedBench, a comprehensive benchmark designed to evaluate MLLMs as embodied agents.
Language:Python462
2048_env
2048 environment for Reinforcement Learning and DQN algorithm
Language:Python39 3 111
AWGCSL
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
Language:Python26 2 02
Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
Language:Python221
GOAT
Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".
Language:Python9 2 01
Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
Language:Python16 3 22
RiC
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
Language:Python55 2 164
RIQL
Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
Language:Python13 2 00
RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
Language:Python18 2 34
Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
Language:Python86 2 121

YangRui2015's Repositories

YangRui2015/Sparse-Reward-Algorithms
Implement many Sparse Reward algorithms in Gym Fetch environment
Language:Python86 2 121
YangRui2015/RiC
Code for the ICML 2024 paper "Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment"
Language:Python55 2 164
YangRui2015/2048_env
2048 environment for Reinforcement Learning and DQN algorithm
Language:Python39 3 111
YangRui2015/AWGCSL
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
Language:Python26 2 02
YangRui2015/Generalizable-Reward-Model
Code for NeurIPS 2024 paper "Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs"
Language:Python221
YangRui2015/RORL
Code for NeurIPS 2022 paper "Robust offline Reinforcement Learning via Conservative Smoothing"
Language:Python18 2 34
YangRui2015/Modular_HER
Modular-HER is revised from OpenAI baselines and supports many improvements for Hindsight Experience Replay as modules.
Language:Python16 3 22
YangRui2015/RIQL
Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
Language:Python13 2 00
YangRui2015/GOAT
Code for the ICML 2023 paper "What is Essential for Unseen Goal Generalization of Offline Goal-conditioned RL?".
Language:Python9 2 01
YangRui2015/Model-basedHER
Model-based Hindsight Experience Replay
Language:Python9 2 24
YangRui2015/HERO
code for Combining Hindsight and Imagination in Multi-goal ReinforcementLearning
Language:Python2 2 00
YangRui2015/UWMSG
Language:Python2 1 02
YangRui2015/d3rlpy
An offline deep reinforcement learning library
Language:Python1 1 01
YangRui2015/reward-bench
RewardBench: the first evaluation tool for reward models.
Language:Python1 0 0
YangRui2015/rlkit_pro
Improving original rlkit forked from https://github.com/vitchyr/rlkit
Language:Python1 2 0
YangRui2015/starter-academic
Language:Jupyter Notebook1 2 0
YangRui2015/yangrui2015.github.io
Language:JavaScript1 2 0
YangRui2015/COMP4901Y_Course_HKUST
Course Material for the UG Course COMP4901Y
YangRui2015/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Language:Python1 0
YangRui2015/icml-nips-iclr-dataset
Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2021
Language:Python1 0
YangRui2015/Meta-Envs
Meta environments package for reinforcement learning
Language:Python2 0
YangRui2015/minimalRL
Implementations of basic RL algorithms with minimal lines of codes! (pytorch based)
Language:Python1 0
YangRui2015/PPE
Language:Jupyter Notebook
YangRui2015/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language:Python1 0
YangRui2015/rlkit
Collection of reinforcement learning algorithms
Language:Python1 0
YangRui2015/sequential_social_dilemma_games
Repo for reproduction of sequential social dilemmas
Language:Python1 0
YangRui2015/SocialRobot
Language:Python1 0
YangRui2015/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language:Python1 0
YangRui2015/starter-hugo-academic
Language:TeX2 0
YangRui2015/YangRui2015
Hi there
2 0