glue25's Stars
OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
lucidrains/improving-transformers-world-model-for-rl
Implementation of the new SOTA for model based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in Pytorch
KwaiVGI/VideoAlign
Improving Video Generation with Human Feedback
tsinghua-fib-lab/agentsociety
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
XuehaiPan/mate
MATE: the Multi-Agent Tracking Environment.
USC-GVL/PhysBench
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding>
OpenRobotLab/Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
CraftJarvis/MineStudio
MineStudio: A Streamlined Package for Minecraft AI Agent Development
vLAR-group/NVFi
NVFi in PyTorch (NeurIPS 2023)
LuoUndergradXJTU/TwiBot-22
Offical repository of TwiBot-22 @ NeurIPS 2022, Datasets and Benchmarks Track.
topazape/ST-ResNet
Implementation of the paper - Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction
CILAB-MA/Machine_ToM
The Implementation of "Machine Theory of Mind", ICML 2018
joonspk-research/genagents
tsinghua-fib-lab/ACL24-EconAgent
pybox2d/pybox2d
2D Game Physics for Python
bytedance/IRASim
cts198859/deeprl_network
multi-agent deep reinforcement learning for networked system control.
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
uoe-agents/MVD
Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Wenyueh/game_theory
How to create rational LLM-based agents? Using game-theoretic workflows!
sotopia-lab/awesome-social-agents
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
NKAI-Decision-Team/LLM-PySC2
LLM-PySC2 is NKAI Decision Team and NUDT Decision Team's Python component of the StarCraft II LLM Decision Environment. It exposes Deepmind's PySC2 Learning Environment API as a Python LLM Environment.
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
fhbzc/CSS_program
A list of computational social science (CSS) program, people and groups
PKU-RL/DGN
DGN Code
RifleZhang/LLaVA-Hound-DPO
phyworld/phyworld
tsinghua-fib-lab/EmbodiedCity
jbr-ai-labs/mamba
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
ZifanWu/MAG
Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning".