Pinned Repositories
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
jat
General multi-task deep RL Agent
trl
Train transformer language models with reinforcement learning.
openrlbenchmark
deep_rl
Single-file truly minimal implementation of state-of-the-art reinforcement learning algorithms.
franka_panda_description
Franka Panda robot description, modified for providing gazebo simulation. Also includes more accurate dynamics values for KDL dynamics parameter estimations.
gym-continuous-maze
Continuous maze environment integrated with OpenAI/Gym
lge
panda-gym
Set of robotic environments based on PyBullet physics engine and gymnasium.
personal-website
Personal website
qgallouedec's Repositories
qgallouedec/panda-gym
Set of robotic environments based on PyBullet physics engine and gymnasium.
qgallouedec/lge
qgallouedec/personal-website
Personal website
qgallouedec/stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
qgallouedec/blog
Public repo for HF blog posts
qgallouedec/open-rl-leaderboard-utils
qgallouedec/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
qgallouedec/trl-monitoring
qgallouedec/unsloth
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
qgallouedec/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
qgallouedec/bitsandbytes
Accessible large language models via k-bit quantization for PyTorch.
qgallouedec/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
qgallouedec/course
The Hugging Face course on Transformers
qgallouedec/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
qgallouedec/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
qgallouedec/garage
A toolkit for reproducible reinforcement learning research.
qgallouedec/Gymnasium
A standard API for reinforcement learning and a diverse set of reference environments (formerly Gym)
qgallouedec/HighwayEnv
A minimalist environment for decision-making in autonomous driving
qgallouedec/huggingface_sb3
Additional code for Stable-baselines3 to load and upload models from the Hub.
qgallouedec/jat
qgallouedec/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
qgallouedec/pink-noise-rl
qgallouedec/qgallouedec
qgallouedec/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
qgallouedec/rliable
[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.
qgallouedec/sample-factory
High throughput synchronous and asynchronous reinforcement learning
qgallouedec/smol-vision
Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜
qgallouedec/super-happiness
qgallouedec/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
qgallouedec/trl
Train transformer language models with reinforcement learning.