panyxy

PhD student at HKUST

HKUSTHong Kong

panyxy's Stars

opendilab/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
73631
opendilab/awesome-exploration-rl
A curated list of awesome exploration RL resources (continually updated)
42612
opendilab/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
97652
opendilab/InterFuser
[CoRL 2022] InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Language:Python55648
opendilab/DI-smartcross
Decision Intelligence platform for Traffic Crossing Signal Control
Language:Python2354
opendilab/GoBigger
[ICLR 2023] Come & try Decision-Intelligence version of "Agar"! Gobigger could also help you with multi-agent decision intelligence study.
Language:Python46334
opendilab/DI-drive
Decision Intelligence Platform for Autonomous Driving simulation.
Language:Python58457
opendilab/DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
Language:Python1.2k115
opendilab/DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
Language:Python3.2k381
facebookresearch/CausalSkillLearning
Codebase for project about unsupervised skill learning via variational inference and causality.
Language:Python4116
SxJyJay/Transformer-backbone
The reproduce of Transformer architecture in paper "Attention is all your need"
Language:Python182
vub-ai-lab/bdpi
Sample-Efficient Reinforcement Learning with Bootstrapped Dual Policy Iteration
Language:Python255
TJU-DRL-LAB/self-supervised-rl
Language:Python354
FF93/Parameter-based-Value-Functions
Language:Python6
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.7k830
NLP-LOVE/ML-NLP
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现，也是作为一个算法工程师必会的理论基础知识。
Language:Jupyter Notebook16.2k4.6k
921kiyo/symbolic-rl
Symbolic Reinforcement Learning using Inductive Logic Programming
Language:Lasso6110
alexfrom0815/Online-3D-BPP-PCT
Code implementation of "Learning Efficient Online 3D Bin Packing on Packing Configuration Trees". We propose to enhance the practical applicability of online 3D Bin Packing Problem (BPP) via learning on a hierarchical packing configuration tree which makes the deep reinforcement learning (DRL) model easy to deal with practical constraints and well-performing even with continuous solution space.
Language:Python25849
changgyhub/leetcode_101
LeetCode 101：力扣刷题指南
8.9k1.2k
soumye/bwmodel
Implementation of Recall Traces
Language:Python1
soumye/recalltraces
Implementation of Recall Traces for Atari
Language:Python4
florensacc/rllab-curriculum
Language:Python13143
hijkzzz/pymarl2
Fine-tuned MARL algorithms on SMAC (100% win rates on most scenarios)
Language:Python643126
jax-ml/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python30.9k2.8k
zhm-real/MotionPlanning
Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)
Language:Python2.3k581
RoboStack/ros-noetic
vinca configuration files for ros-noetic
Language:Shell46973
SheffieldML/GPy
Gaussian processes framework in python
Language:Python2.1k566
albumentations-team/albumentations
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Language:Python14.5k1.7k
gaoxiang12/slambook-en
The English version of 14 lectures on visual SLAM.
Language:TeX1.5k267
salesforce/warp-drive
Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning Framework on a GPU (JMLR 2022)
Language:Python46778

panyxy

panyxy's Stars

opendilab/awesome-decision-transformer

opendilab/awesome-exploration-rl

opendilab/awesome-model-based-RL

opendilab/InterFuser

opendilab/DI-smartcross

opendilab/GoBigger

opendilab/DI-drive

opendilab/DI-star

opendilab/DI-engine

facebookresearch/CausalSkillLearning

SxJyJay/Transformer-backbone

vub-ai-lab/bdpi

TJU-DRL-LAB/self-supervised-rl

FF93/Parameter-based-Value-Functions

ikostrikov/pytorch-a2c-ppo-acktr-gail

NLP-LOVE/ML-NLP

921kiyo/symbolic-rl

alexfrom0815/Online-3D-BPP-PCT

changgyhub/leetcode_101

soumye/bwmodel

soumye/recalltraces

florensacc/rllab-curriculum

hijkzzz/pymarl2

jax-ml/jax

zhm-real/MotionPlanning

RoboStack/ros-noetic

SheffieldML/GPy

albumentations-team/albumentations

gaoxiang12/slambook-en

salesforce/warp-drive