Pinned Repositories
D4RL
A collection of reference environments for offline reinforcement learning
NES-HT
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
TD3BCpp
Robust Offline Reinforcement Learning from Contaminated Demonstrations
OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Object-Detection-and-Tracking-on-Jetson-Nano
Group 06’s Project for ML701@MBZUAI
glorgao's Repositories
glorgao/NES-HT
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
glorgao/TD3BCpp
Robust Offline Reinforcement Learning from Contaminated Demonstrations
glorgao/D4RL
A collection of reference environments for offline reinforcement learning