nighood

RL blabla

Tsinghua UniversityShenzhen/Beijing

Pinned Repositories

ACE
[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
Language:Python0 0 00
Andrew-Ng-Machine-Learning-Notes
The offical notes of Andrew Ng Machine Learning in Stanford University
0 0 00
AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Language:Python0 0 00
awesome-deep-rl
For deep RL and the future of AI.
Language:HTML0 0 00
DI-engine
OpenDILab Decision AI Engine
Language:Python0 0 00
DI-sheep
羊了个羊 + 深度强化学习（Deep Reinforcement Learning + 3 Tiles Game)
Language:Python0 0 00
genius-invokation-simulator
An unofficial simulator for Genius Invokation TCG in Genshin Impact; 七圣召唤模拟器
Language:TypeScript0 0 00
LightZero
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
Language:Python0 0 00
PPOxFamily
PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）
Language:Python0 0 00
rocket-recycling
Rocket-recycling with Reinforcement Learning
Language:Python0 0 01

nighood's Repositories

nighood/ACE
[AAAI 2023] Official PyTorch implementation of paper "ACE: Cooperative Multi-agent Q-learning with Bidirectional Action-Dependency".
Language:Python0 0 00
nighood/Andrew-Ng-Machine-Learning-Notes
The offical notes of Andrew Ng Machine Learning in Stanford University
0 0 00
nighood/AttrPrompt
[NeurIPS 2023] This is the code for the paper `Large Language Model as Attributed Training Data Generator: A Tale of Diversity and Bias`.
Language:Python0 0 00
nighood/awesome-deep-rl
For deep RL and the future of AI.
Language:HTML0 0 00
nighood/awesome-model-based-RL
A curated list of awesome model based RL resources (continually updated)
0 0 00
nighood/cs229-2018-autumn
All notes and materials for the CS229: Machine Learning course by Stanford University
Language:Jupyter Notebook0 0 00
nighood/DI-engine
OpenDILab Decision AI Engine
Language:Python0 0 00
nighood/DI-engine-docs
DI-engine docs (Chinese and English)
Language:Python0 0 00
nighood/DI-sheep
羊了个羊 + 深度强化学习（Deep Reinforcement Learning + 3 Tiles Game)
Language:Python0 0 00
nighood/DI-toolkit
A simple toolkit package for opendilab
Language:Python0 0 00
nighood/easy-scraping-tutorial
Simple but useful Python web scraping tutorial code.
Language:Jupyter Notebook0 0 00
nighood/genius-invokation-simulator
An unofficial simulator for Genius Invokation TCG in Genshin Impact; 七圣召唤模拟器
Language:TypeScript0 0 00
nighood/LightZero
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
Language:Python0 0 00
nighood/PPOxFamily
PPO x Family DRL Tutorial Course（决策智能入门级公开课：8节课帮你盘清算法理论，理顺代码逻辑，玩转决策AI应用实践）
Language:Python0 0 00
nighood/rocket-recycling
Rocket-recycling with Reinforcement Learning
Language:Python0 0 01
nighood/CrowdSim
Language:Python1 0
nighood/csharp_practice
nighood/DI-card
Language:Python0 0
nighood/direct-preference-optimization
Reference implementation for DPO (Direct Preference Optimization)
Language:Python0 0
nighood/freedom
just a try
Language:Assembly1 0
nighood/GCRL-min-AoI
[INFOCOM 2022] AoI-minimal UAV Crowdsensing by Model-based Graph Convolutional Reinforcement Learning
Language:Python0 0
nighood/genius-invokation-gym
原神七圣召唤模拟环境 Simulator of Genius Invocation
Language:Python0 0
nighood/gobang
javascript gobang AI，JS五子棋AI，源码+教程，基于Alpha-Beta剪枝算法（不是神经网络）
Language:JavaScript0 0
nighood/myDITest
Language:Python1 0
nighood/notes
札记
nighood/rag_wiki
Language:Jupyter Notebook1 0
nighood/Xia_Env

nighood

Pinned Repositories

ACE

Andrew-Ng-Machine-Learning-Notes

AttrPrompt

awesome-deep-rl

DI-engine

DI-sheep

genius-invokation-simulator

LightZero

PPOxFamily

rocket-recycling

nighood's Repositories

nighood/ACE

nighood/Andrew-Ng-Machine-Learning-Notes

nighood/AttrPrompt

nighood/awesome-deep-rl

nighood/awesome-model-based-RL

nighood/cs229-2018-autumn

nighood/DI-engine

nighood/DI-engine-docs

nighood/DI-sheep

nighood/DI-toolkit

nighood/easy-scraping-tutorial

nighood/genius-invokation-simulator

nighood/LightZero

nighood/PPOxFamily

nighood/rocket-recycling

nighood/CrowdSim

nighood/csharp_practice

nighood/DI-card

nighood/direct-preference-optimization

nighood/freedom

nighood/GCRL-min-AoI

nighood/genius-invokation-gym

nighood/gobang

nighood/myDITest

nighood/notes

nighood/rag_wiki

nighood/Xia_Env