zhang-zi-hao's Stars
dso-org/deep-symbolic-optimization
A deep learning framework for symbolic optimization.
Thinklab-SJTU/awesome-ml4co
Awesome machine learning for combinatorial optimization papers.
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
kindredresearch/SenseAct
SenseAct: A computational framework for developing real-world robot learning tasks
modelbased/minirllab
Mini RL Lab
giovannidispoto/awesome-rl-internships
List of companies and university labs that might offer internships in the reinforcement learning field
quark0/darts
Differentiable architecture search for convolutional and recurrent networks
danfenghong/IEEE_TPAMI_SpectralGPT
Hong, D., Zhang, B., Li, X., Li, Y., Li, C., Yao, J., Yokoya, N., Li, H., Ghamisi, P., Jia, X., Plaza, A., Gamba, P., Benediktsson, J., and Chanussot, J. (2024). SpectralGPT: Spectral remote sensing foundation model. IEEE Transactions on Pattern Analysis and Machine Intelligence. DOI:10.1109/TPAMI.2024.3362475.
denisgriaznov/CustomMuJoCoEnviromentForRL
A very simple example of creating your own MuJoCo environment and training it with RL algorithms through Gymnasium.
EzgiKorkmaz/adversarial-reinforcement-learning
Reading list for adversarial perspective and robustness in deep reinforcement learning.
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
araffin/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
adityab/CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
ok-robot/ok-robot
An open, modular framework for zero-shot, language-conditioned pick-and-drop tasks in arbitrary homes.
KindXiaoming/pykan
Kolmogorov Arnold Networks
riiswa/kanrl
Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments
facebookresearch/BenchMARL
A collection of MARL benchmarks based on TorchRL
simondlevy/gym-copter
Gymnasium environment for reinforcement learning with multicopters
Wenxuan-Zhou/PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
facebookresearch/searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
uoe-agents/PO-GPL
Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"
katerakelly/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
krashkov/Belief-Propagation
Overview and implementation of Belief Propagation and Loopy Belief Propagation algorithms: sum-product, max-product, max-sum
hehonghui/awesome-english-ebooks
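Free downloads of English-language magazines such as The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic, in epub, mobi, and pdf formats, updated weekly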
MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
rail-berkeley/serl
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
rail-berkeley/serl_franka_controllers
Cartesian impedance controller with reference limiting for Franka Emika Robot
Cornell-RL/drpo
Dataset Reset Policy Optimization
pjreddie/darknet
Convolutional Neural Networks
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations