Pinned Repositories
AlexNet-Prod
Reproduction process of AlexNet
AlgoXY
Book of Elementary Algorithms and Data structures
algs4
Algorithms, 4th edition textbook code and libraries
amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
AutoCrawler
Official implementation of the paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"
Autoformer
2021-NeurIPS-Autoformer-LTSF: "Autoformer: Decomposition Transformers with Auto-Correlation for Long-Term Series Forecasting" (NeurIPS 2021), https://arxiv.org/abs/2106.13008
awesome-cs-books
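A comprehensive collection of classic programming books, covering computer systems and networks, system architecture, algorithms and data structures, front-end development, back-end development, mobile development, databases, testing, projects and teams, programmer career development, job interviews, and more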
awesome-DeepLearning
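Introductory, advanced, and specialized deep learning courses, academic case studies, industry practice cases, a deep learning knowledge encyclopedia, and an interview question bank. Courses, cases, and knowledge of Deep Learning and AI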
awesome-reinforcement-learning-lib
GitHub's code repository is all you need
nlpcab
xiaoyangyang2's Repositories
xiaoyangyang2/AlgoXY
Book of Elementary Algorithms and Data structures
xiaoyangyang2/AutoCrawler
Official implementation of the paper "AutoCrawler: A Progressive Understanding Web Agent for Web Crawler Generation"
xiaoyangyang2/daily_arxiv
Uses GitHub Actions to collect a daily list of arXiv papers with publicly available source code
xiaoyangyang2/Deep-Reinforcement-Learning-Hands-On-Second-Edition
Deep-Reinforcement-Learning-Hands-On-Second-Edition, published by Packt
xiaoyangyang2/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Class.
xiaoyangyang2/DunkCityDynasty
xiaoyangyang2/ElegantRL
Scalable and Elastic Deep Reinforcement Learning Using PyTorch. Please star. 🔥
xiaoyangyang2/epymarl
An extension of the PyMARL codebase that includes additional algorithms and environment support
xiaoyangyang2/gtrick
Bag of Tricks for Graph Neural Networks.
xiaoyangyang2/Hands-on-RL
https://hrl.boyuai.com/
xiaoyangyang2/InforMARL
Code for our paper: Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation
xiaoyangyang2/invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
xiaoyangyang2/mader
Trajectory Planner in Multi-Agent and Dynamic Environments
xiaoyangyang2/marl_transfer
Code for paper 'Learning transferable cooperative behaviors in multi-agent teams' (ICML 2019)
xiaoyangyang2/Multi-Agent-Constrained-Policy-Optimisation
Multi-Agent Constrained Policy Optimisation (MACPO; MAPPO-L).
xiaoyangyang2/omnisafe
OmniSafe is an infrastructural framework for accelerating SafeRL research.
xiaoyangyang2/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
xiaoyangyang2/panther
Perception-Aware Trajectory Planner in Dynamic Environments
xiaoyangyang2/PARL
A high-performance distributed training framework for Reinforcement Learning
xiaoyangyang2/PGL
Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle
xiaoyangyang2/planning
List of planning algorithms developed at MIT-ACL
xiaoyangyang2/PPOxFamily
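PPO x Family DRL Tutorial Course (an introductory open course on decision intelligence: 8 lessons to help you sort out algorithm theory, straighten out code logic, and master practical decision AI applications)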
xiaoyangyang2/Practical_RL
A course in reinforcement learning in the wild
xiaoyangyang2/privateGPT
Interact privately with your documents using the power of GPT, 100% privately, no data leaks
xiaoyangyang2/RACE
(ICML 2023) The official code for RACE: Improve Multi-Agent Reinforcement Learning with Representation Asymmetry and Collaborative Evolution
xiaoyangyang2/Safe-Reinforcement-Learning-Baselines
The repository is for safe reinforcement learning baselines.
xiaoyangyang2/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
xiaoyangyang2/transferlearning
Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.
xiaoyangyang2/uav_bs_ctrl
Code implementation of "Cooperative Trajectory Design of Multiple UAV Base Stations with Heterogeneous Graph Neural Networks".
xiaoyangyang2/WZU-machine-learning-course
Course materials for Wenzhou University's "Machine Learning" course (code, slides, etc.)