panyxy's Stars
opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
opendilab/InterFuser
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
opendilab/DI-smartcross
Decision Intelligence platform for Traffic Crossing Signal Control
opendilab/GoBigger
[ICLR 2023] Come & try the Decision-Intelligence version of "Agar"! GoBigger can also help you with multi-agent decision intelligence studies.
opendilab/DI-drive
Decision Intelligence Platform for Autonomous Driving simulation.
opendilab/DI-star
An artificial intelligence platform for StarCraft II with large-scale distributed training and grand-master agents.
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
facebookresearch/CausalSkillLearning
Codebase for project about unsupervised skill learning via variational inference and causality.
SxJyJay/Transformer-backbone
A reproduction of the Transformer architecture from the paper "Attention Is All You Need"
vub-ai-lab/bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
TJU-DRL-LAB/self-supervised-rl
FF93/Parameter-based-Value-Functions
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
NLP-LOVE/ML-NLP
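This project covers the knowledge points and code implementations commonly tested in Machine Learning, Deep Learning, and NLP interviews; it is also the essential theoretical foundation for an algorithm engineer.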
921kiyo/symbolic-rl
Symbolic Reinforcement Learning using Inductive Logic Programming
alexfrom0815/Online-3D-BPP-PCT
Code implementation of "Learning Efficient Online 3D Bin Packing on Packing Configuration Trees". We propose to enhance the practical applicability of the online 3D Bin Packing Problem (BPP) by learning on a hierarchical packing configuration tree, which makes it easy for the deep reinforcement learning (DRL) model to handle practical constraints and to perform well even with a continuous solution space.
changgyhub/leetcode_101
LeetCode 101: a guide to working through LeetCode problems
soumye/bwmodel
Implementation of Recall Traces
soumye/recalltraces
Implementation of Recall Traces for Atari
florensacc/rllab-curriculum
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
zhm-real/MotionPlanning
Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)
RoboStack/ros-noetic
vinca configuration files for ros-noetic
SheffieldML/GPy
Gaussian processes framework in Python
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
gaoxiang12/slambook-en
The English version of 14 lectures on visual SLAM.
salesforce/warp-drive
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)