zhang-zi-hao

zhang-zi-hao's Stars

dso-org/deep-symbolic-optimization
A deep learning framework for symbolic optimization.
Language:Python542119
Thinklab-SJTU/awesome-ml4co
Awesome machine learning for combinatorial optimization papers.
Language:Python1.6k189
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
Language:Python64026
kindredresearch/SenseAct
SenseAct: A computational framework for developing real-world robot learning tasks
Language:Python21141
modelbased/minirllab
Mini RL Lab
Language:Python11
giovannidispoto/awesome-rl-internships
List of company and university labs that might offer internships in the reinforcement learning field
192
quark0/darts
Differentiable architecture search for convolutional and recurrent networks
Language:Python3.9k842
danfenghong/IEEE_TPAMI_SpectralGPT
Hong, D., Zhang, B., Li, X., Li, Y., Li, C., Yao, J., Yokoya, N., Li, H., Ghamisi, P., Jia, X., Plaza, A., Gamba, P., Benediktsson, J. and Chanussot, J. (2024). SpectralGPT: Spectral remote sensing foundation model. IEEE Transactions on Pattern Analysis and Machine Intelligence. DOI:10.1109/TPAMI.2024.3362475.
Language:Python13413
denisgriaznov/CustomMuJoCoEnviromentForRL
A very simple example of creating your own MuJoCo environment and training it with RL algorithms through Gymnasium.
Language:Python223
EzgiKorkmaz/adversarial-reinforcement-learning
Reading list for adversarial perspective and robustness in deep reinforcement learning.
795
Farama-Foundation/Metaworld
Collections of robotics environments geared towards benchmarking multi-task and meta reinforcement learning
Language:Python1.2k259
araffin/sbx
SBX: Stable Baselines Jax (SB3 + Jax)
Language:Python29130
adityab/CrossQ
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
Language:Python503
ok-robot/ok-robot
An open, modular framework for zero-shot, language conditioned pick-and-drop tasks in arbitrary homes.
Language:Python40728
KindXiaoming/pykan
Kolmogorov Arnold Networks
Language:Jupyter Notebook13.8k1.2k
riiswa/kanrl
Kolmogorov-Arnold Network for Reinforcement Learning, initial experiments
Language:Python23428
facebookresearch/BenchMARL
A collection of MARL benchmarks based on TorchRL
Language:Python19423
simondlevy/gym-copter
Gymnasium environment for reinforcement learning with multicopters
Language:Python275
Wenxuan-Zhou/PLAS
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
Language:Python4611
facebookresearch/searchformer
Official codebase for the paper "Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping".
Language:Jupyter Notebook27913
uoe-agents/PO-GPL
Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"
Language:Python122
katerakelly/oyster
Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)
Language:Python469123
krashkov/Belief-Propagation
Overview and implementation of Belief Propagation and Loopy Belief Propagation algorithms: sum-product, max-product, max-sum
Language:Jupyter Notebook14545
hehonghui/awesome-english-ebooks
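Free downloads of English-language magazines including The Economist (with audio), The New Yorker, The Guardian, Wired, and The Atlantic; supports epub, mobi, and pdf formats; updated weekly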
Language:HTML19.9k1.5k
MarcoMeter/recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
Language:Jupyter Notebook11415
rail-berkeley/serl
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
Language:Python28924
rail-berkeley/serl_franka_controllers
Cartesian impedance controller with reference limiting for Franka Emika Robot
Language:C++559
Cornell-RL/drpo
Dataset Reset Policy Optimization
Language:Python25
pjreddie/darknet
Convolutional Neural Networks
Language:C25.6k21.3k
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python63054