ethanluoyc
PhD student at UCL AI Center. Former intern at @deepmind and @secondmind-labs.
University College London, London, United Kingdom
ethanluoyc's Stars
trimstray/the-book-of-secret-knowledge
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
jwyang/faster-rcnn.pytorch
A faster PyTorch implementation of Faster R-CNN
skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
google-deepmind/alphageometry
copier-org/copier
Library and command-line utility for rendering project templates.
google-deepmind/penzai
A JAX research toolkit for building, editing, and visualizing neural networks.
PufferAI/PufferLib
Simplifying reinforcement learning for complex game environments
danijar/dreamerv3
Mastering Diverse Domains through World Models
ARISE-Initiative/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
waymo-research/waymax
A JAX-based simulator for autonomous driving research.
vimalabs/VIMA
Official Algorithm Implementation of ICML'23 Paper "VIMA: General Robot Manipulation with Multimodal Prompts"
google-deepmind/concordia
A library for generative social simulation
lcswillems/rl-starter-files
RL starter files to immediately train, visualize, and evaluate an agent without writing a single line of code
vikashplus/robohive
A unified framework for robot learning
kevinzakka/mjctrl
Minimal, clean, single-file implementations of common robotics controllers in MuJoCo.
ZhengyiLuo/SMPLSim
Simulating SMPL humanoid, supporting PHC/PHC-MJX/PULSE/SimXR code bases.
rwightman/efficientnet-jax
EfficientNet, MobileNetV3, MobileNetV2, MixNet, etc. in JAX w/ Flax Linen and Objax
vwxyzjn/cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
waterhorse1/ChessGPT
(NeurIPS 2023) ChessGPT - Bridging Policy Learning and Language Modeling
seohongpark/HILP
Foundation Policies with Hilbert Representations (ICML 2024)
instadeepai/sebulba
🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX
hsvgbkhgbv/shapley-q-learning
This repo is the implementation of the paper "SHAQ: Incorporating Shapley Value Theory into Multi-Agent Q-Learning".
ethanluoyc/corax
Corax: Core RL in JAX
Asap7772/PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Multitask Data via Offline Reinforcement Learning.
RosettaWYzhang/Roam
This repository contains code and data instructions for ROAM, 3DV 2024. Authors: Wanyue Zhang, Rishabh Dabral, Thomas Leimkühler, Vladislav Golyanik†, Marc Habermann†, Christian Theobalt.
Difio3333/slaythetext
A text-based copy of Slay the Spire played entirely in the shell.
kevinzakka/dm_env_wrappers
Standalone library of frequently-used wrappers for dm_env environments.
davidbrandfonbrener/imitation_pretraining
ethanluoyc/lxm3
LXM3: XManager launch backend for HPC clusters
dtch1997/d4rl-slim-benchmark