glue25

glue25's Stars

OpenRLHF/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Language: Python · 5.5k stars · 543 forks
lucidrains/improving-transformers-world-model-for-rl
Implementation of the new SOTA for model-based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in PyTorch
Language:Python772
KwaiVGI/VideoAlign
Improving Video Generation with Human Feedback
Language:Python125
tsinghua-fib-lab/agentsociety
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
Language:Python15825
XuehaiPan/mate
MATE: the Multi-Agent Tracking Environment.
Language:Python3722
USC-GVL/PhysBench
[ICLR 2025] Official implementation and benchmark evaluation repository of <PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding>
Language:Python381
OpenRobotLab/Seer
[ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation
Language:Python1034
CraftJarvis/MineStudio
MineStudio: A Streamlined Package for Minecraft AI Agent Development
Language:Python1998
vLAR-group/NVFi
NVFi in PyTorch (NeurIPS 2023)
Language:Python421
LuoUndergradXJTU/TwiBot-22
Official repository of TwiBot-22 @ NeurIPS 2022, Datasets and Benchmarks Track.
Language:Python19444
topazape/ST-ResNet
Implementation of the paper - Deep Spatio-Temporal Residual Networks for Citywide Crowd Flows Prediction
Language:Python365
CILAB-MA/Machine_ToM
The Implementation of "Machine Theory of Mind", ICML 2018
Language:Python222
joonspk-research/genagents
Language:Python32786
tsinghua-fib-lab/ACL24-EconAgent
Language:Python7214
pybox2d/pybox2d
2D Game Physics for Python
Language:Python48893
bytedance/IRASim
Language:Python925
cts198859/deeprl_network
Multi-agent deep reinforcement learning for networked system control.
Language:Python40490
rlworkgroup/garage
A toolkit for reproducible reinforcement learning research.
Language: Python · 1.9k stars · 312 forks
uoe-agents/MVD
Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
Language:Python71
Wenyueh/game_theory
How to create rational LLM-based agents? Using game-theoretic workflows!
Language:Python566
sotopia-lab/awesome-social-agents
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
Language:TypeScript8123
NKAI-Decision-Team/LLM-PySC2
LLM-PySC2 is the Python component of the StarCraft II LLM Decision Environment, developed by the NKAI Decision Team and the NUDT Decision Team. It exposes DeepMind's PySC2 Learning Environment API as a Python LLM environment.
Language:Python11010
ashleve/lightning-hydra-template
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Language: Python · 4.5k stars · 681 forks
fhbzc/CSS_program
A list of computational social science (CSS) programs, people, and groups
16820
PKU-RL/DGN
DGN Code
Language:Python34288
RifleZhang/LLaVA-Hound-DPO
Language:Python14122
phyworld/phyworld
Language:Jupyter Notebook1176
tsinghua-fib-lab/EmbodiedCity
Language:Python20710
jbr-ai-labs/mamba
This code accompanies the paper "Scalable Multi-Agent Model-Based Reinforcement Learning".
Language:Python5310
ZifanWu/MAG
Code accompanying paper "Models as Agents: Optimizing Multi-Step Predictions of Interactive Local Models in Model-Based Multi-Agent Reinforcement Learning".
Language:Python153