Pinned Repositories
ACE
[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
Andrew-Ng-Machine-Learning-Notes
The offical notes of Andrew Ng Machine Learning in Stanford University
AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
awesome-deep-rl
For deep RL and the future of AI.
DI-engine
OpenDILab Decision AI Engine
DI-sheep
羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)
genius-invokation-simulator
An unofficial simulator for Genius Invokation TCG in Genshin Impact; 七圣召唤模拟器
LightZero
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
PPOxFamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
rocket-recycling
Rocket-recycling with Reinforcement Learning
nighood's Repositories
nighood/ACE
[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
nighood/Andrew-Ng-Machine-Learning-Notes
The offical notes of Andrew Ng Machine Learning in Stanford University
nighood/AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
nighood/awesome-deep-rl
For deep RL and the future of AI.
nighood/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
nighood/cs229-2018-autumn
All notes and materials for the CS229: Machine Learning course by Stanford University
nighood/DI-engine
OpenDILab Decision AI Engine
nighood/DI-engine-docs
DI-engine docs (Chinese and English)
nighood/DI-sheep
羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)
nighood/DI-toolkit
A simple toolkit package for opendilab
nighood/easy-scraping-tutorial
Simple but useful Python web scraping tutorial code.
nighood/genius-invokation-simulator
An unofficial simulator for Genius Invokation TCG in Genshin Impact; 七圣召唤模拟器
nighood/LightZero
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
nighood/PPOxFamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
nighood/rocket-recycling
Rocket-recycling with Reinforcement Learning
nighood/CrowdSim
nighood/csharp_practice
nighood/DI-card
nighood/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
nighood/freedom
just a try
nighood/GCRL-min-AoI
[INFOCOM 2022] AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning
nighood/genius-invokation-gym
原神七圣召唤模拟环境 Simulator of Genius Invocation
nighood/gobang
javascript gobang AI,JS五子棋AI,源码+教程,基于Alpha-Beta剪枝算法(不是神经网络)
nighood/myDITest
nighood/notes
札记
nighood/rag_wiki
nighood/Xia_Env