Pinned Repositories
DI-engine
OpenDILab Decision AI Engine. The Most Comprehensive Reinforcement Learning Framework B.P.
DI-sheep
羊了个羊 + 深度强化学习(Deep Reinforcement Learning + 3 Tiles Game)
DI-star
An artificial intelligence platform for the StarCraft II with large-scale distributed training and grand-master agents.
LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
PPOxFamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
PsyDI
PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements. (e.g. MBTI Measurement Agent)
CARAFE_pytorch
naive CARAFE implementation in Pytorch
CariGANs
pytorch implementation for CariGANS
DI-sheep
TP-GAN
pytorch implemention for TP-GAN
PaParaZz1's Repositories
PaParaZz1/annonated_code_viz
PaParaZz1/awesome-decision-transformer
A curated list of Decision Transformer resources (continually updated)
PaParaZz1/DI-card
PaParaZz1/fastapi-weibo
PaParaZz1/ML-tutorial
PaParaZz1/PPOxFamily
PPO x Family DRL Tutorial Course(决策智能入门级公开课:8节课帮你盘清算法理论,理顺代码逻辑,玩转决策AI应用实践 )
PaParaZz1/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
PaParaZz1/CleanS2S
High-quality and streaming Speech-to-Speech interactive agent in a single file. 只用一个文件实现的流式全双工语音交互原型智能体!
PaParaZz1/CodeMorpheus
CodeMorpheus: Generate code self-portraits with one click(一键生成代码自画像,决策型 AI + 生成式 AI)
PaParaZz1/csharp_practice
PaParaZz1/D4RL
A collection of reference environments for offline reinforcement learning
PaParaZz1/data_generation
PaParaZz1/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
PaParaZz1/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
PaParaZz1/ds_comm_bench
PaParaZz1/empathic-voice-interface-starter
PaParaZz1/evogym
A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.
PaParaZz1/fastapi-vercel
A FastAPI example app deployed on Vercel
PaParaZz1/GenerativeRL
Python library for solving reinforcement learning (RL) problems using generative models (e.g. Diffusion Models).
PaParaZz1/genius-invokation-gym
原神七圣召唤模拟环境 Simulator of Genius Invocation
PaParaZz1/LightZero
LightZero: A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit.
PaParaZz1/LLMRiddles
Open-Source Reproduction/Demo of the LLM Riddles Game
PaParaZz1/nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
PaParaZz1/nextjs-dashboard
PaParaZz1/OpenAOE
LLM Group Chat Framework: chat with multiple LLMs at the same time. 大模型群聊框架:同时与多个大语言模型聊天。
PaParaZz1/PsyDI
PaParaZz1/rag_examples
PaParaZz1/SO2
[AAAI2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
PaParaZz1/tbdata
PaParaZz1/tracing