ltzheng

CS PhD student @ NTU Singapore. BS @ USTC.

Nanyang Technological UniversitySingapore

Pinned Repositories

Cradle
The Cradle framework is a first attempt at General Computer Control (GCC). Cradle supports agents to ace any computer task by enabling strong reasoning abilities, self-improvment, and skill curation, in a standardized general environment with minimal requirements.
Language:Python1.8k 23 31154
agent-studio
Environments, tools, and benchmarks for general computer agents
Language:Python16014
alphafold
Open source code for AlphaFold.
Language:Python1 1 00
CurriculumMARL
Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray
Language:Python11 2 11
data-privacy
Preserve data privacy with k-anonymity (samarati & mondrian), differential privacy, federated learning, paillier homomorphic encryption, etc.
Language:Python55 2 37
mappo-football
Multi-Agent PPO (MAPPO) with the Google Research Football environment.
Language:Python2 1 00
parallel-computing-ustc
Experiments for the Parallel Computing course
Language:C++2 2 00
pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
Language:Python31 2 36
Replica-Currency-Estimation
Python implementation of Replica Currency Estimation
Language:Python2 2 00
Synapse
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
Language:HTML48 4 78

ltzheng's Repositories

ltzheng/data-privacy
Preserve data privacy with k-anonymity (samarati & mondrian), differential privacy, federated learning, paillier homomorphic encryption, etc.
Language:Python55 2 37
ltzheng/Synapse
[ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control
Language:HTML48 4 78
ltzheng/pddpg-hfo
Half Field Offense in Robocup 2D Soccer with reinforcement learning
Language:Python31 2 36
ltzheng/CurriculumMARL
Code of "Towards Skilled Population Curriculum for MARL" + Implementation of Curriculum MARL algorithms based on Ray
Language:Python11 2 11
ltzheng/mappo-football
Multi-Agent PPO (MAPPO) with the Google Research Football environment.
Language:Python2 1 00
ltzheng/parallel-computing-ustc
Experiments for the Parallel Computing course
Language:C++2 2 00
ltzheng/Replica-Currency-Estimation
Python implementation of Replica Currency Estimation
Language:Python2 2 00
ltzheng/alphafold
Open source code for AlphaFold.
Language:Python1 1 00
ltzheng/CDS
Language:Python1 1 00
ltzheng/coinrun
Code for the paper "Quantifying Transfer in Reinforcement Learning"
Language:Python1 0 0
ltzheng/football
Google football with more wrappers, scenarios and flexible task parameters
Language:Python1 1 00
ltzheng/formal-methods-ustc
Experiments for the Formal Methods course
Language:Python1 2 01
ltzheng/numerical-analysis
Assignments for the Numerical Methods course at USTC
Language:MATLAB1 2 0
ltzheng/OFDClean
Contextual Data Cleaning with Ontological Functional Dependencies
Language:Java1 2 02
ltzheng/pymarl2
Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning
Language:Python1 1 0
ltzheng/DexterousHands
This is a library that provides dual dexterous hand manipulation tasks through Isaac Gym
Language:Python0 0
ltzheng/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python0 0
ltzheng/dmc2gym
OpenAI Gym wrapper for the DeepMind Control Suite
Language:Python0 0
ltzheng/EfficientZero
Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.
Language:Python0 0
ltzheng/jafar
JAX reimplementation of the DeepMind paper "Genie: Generative Interactive Environments"
Language:Python0 0
ltzheng/ltzheng.github.io
Language:HTML1 0
ltzheng/muzero-general
MuZero
Language:Python0 0
ltzheng/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language:C++0 0
ltzheng/overcooked_ai
A benchmark environment for fully cooperative human-AI performance.
Language:Jupyter Notebook0 0
ltzheng/pix2act
Language:Python0 0
ltzheng/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Language:Python1 0
ltzheng/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
Language:Python0 0
ltzheng/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language:Python0 0