Pinned Repositories
Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
agent-studio
[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
CDS
coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
CurriculumMARL
Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray
data-privacy
Preserve data privacy with k-anonymity (samarati & mondrian), differential privacy, federated learning, paillier homomorphic encryption, etc.
mappo-football
Multi-Agent PPO (MAPPO) with the Google Research Football environment.
pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
Synapse
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
memo
Memory-Guided Diffusion for Expressive Talking Video Generation
ltzheng's Repositories
ltzheng/agent-studio
[ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents
ltzheng/data-privacy
Preserve data privacy with k-anonymity (samarati & mondrian), differential privacy, federated learning, paillier homomorphic encryption, etc.
ltzheng/Synapse
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
ltzheng/pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
ltzheng/CurriculumMARL
Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray
ltzheng/mappo-football
Multi-Agent PPO (MAPPO) with the Google Research Football environment.
ltzheng/CDS
ltzheng/coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
ltzheng/football
Google football with more wrappers, scenarios and flexible task parameters
ltzheng/pymarl2
Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning
ltzheng/ltzheng.github.io
ltzheng/muzero-general
MuZero