Pinned Repositories
navix
Accelerated minigrid environments with JAX
purejaxrl
Really Fast End-to-End Jax RL Implementations
MultiTQ
MULTITQ is a large-scale dataset featuring ample relevant facts and multiple temporal granularities.
Grounding_LLMs_with_online_RL
We perform functional grounding of LLMs' knowledge in BabyAI-Text
saycanpay
Official code release of AAAI 2024 paper SayCanPay.
CoTASP
Official code for the paper: Continual Task Allocation in Meta-Policy Network via Sparse Prompting
Research-about-Deepfake
It contains research about deepfake of 3 students from NTU.
ArchyZheng's Repositories
ArchyZheng/navix
Accelerated minigrid environments with JAX
ArchyZheng/purejaxrl
Really Fast End-to-End Jax RL Implementations