shuoyang2000's Stars
microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Z3Prover/z3
The Z3 Theorem Prover
lucidrains/denoising-diffusion-pytorch
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
Farama-Foundation/Gymnasium
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
hojonathanho/diffusion
Denoising Diffusion Probabilistic Models
google-deepmind/dm_control
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
kzl/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
real-stanford/diffusion_policy
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Farama-Foundation/D4RL
A collection of reference environments for offline reinforcement learning
utiasDSL/gym-pybullet-drones
PyBullet Gymnasium environments for single and multi-agent reinforcement learning of quadcopter control
jannerm/diffuser
Code for the paper "Planning with Diffusion for Flexible Behavior Synthesis"
tencent-ailab/hok_env
Honor of Kings AI Open Environment of Tencent
gambitproject/gambit
Gambit: The package for computation in game theory
semitable/robotic-warehouse
Multi-Robot Warehouse (RWARE): A multi-agent reinforcement learning environment
anuragajay/decision-diffuser
int8/counterfactual-regret-minimization
Counterfactual regret minimization algorithm for Kuhn poker
dreal/dreal4
Automated Reasoning in Nonlinear Theories of Reals
Gongsta/Poker-AI
Developing a Poker AI as a personal project. Poker is an interesting target for AI because it is an imperfect-information game: unlike perfect-information games such as chess, the opponent's hand is hidden, and that uncertainty allows plays like bluffing.
KumarRobotics/kr_mp_design
Guidance for the design and evaluation of motion planners for quadrotors in environments of varying complexity
vincekurtz/stlpy
A python library for control from Signal Temporal Logic (STL) specifications
oxford-oxcav/fossil
shuoyang2000/neural_hybrid_cbf
Code for "Learning Local Control Barrier Functions for Safety Control of Hybrid Systems"
shuoyang2000/STLgame
Official Implementation of STLGame (L4DC'25)
shuoyang2000/nash_equilibrium_solver
A simple Nash Equilibrium solver for two-player zero-sum games
Apollo-Lab-Yale/spaces_comparative_study