minqi's Stars
schmidtdominik/LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
jennyzzt/awesome-open-ended
Awesome Open-ended AI
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
danijar/ninjax
General Modules for JAX
danijar/dreamerv3
Mastering Diverse Domains through World Models
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
vadimdemedes/ink
🌈 React for interactive command-line apps
CarperAI/OpenELM
Evolution Through Large Models
CompVis/stable-diffusion
A latent text-to-image diffusion model
facebookresearch/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
JackHopkins/PaperclipMaximiser
A Paperclip Maximiser in Factorio to evaluate instrumental convergence in LLMs.
voletiv/mcvd-pytorch
Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/2205.09853)
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
facebookresearch/moolib
A library for distributed ML training with PyTorch
ucl-dark/paired
PAIRED in PyTorch 🔥
aqlaboratory/openfold
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
danijar/dreamerv2
Mastering Atari with Discrete World Models
idiap/fast-transformers
Pytorch library for fast transformer implementations
facebookresearch/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
alex-petrenko/sample-factory
High throughput synchronous and asynchronous reinforcement learning
alex-petrenko/megaverse
High-throughput simulation platform for Artificial Intelligence reseach
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Aperocky/cellular-automata
ARISE-Initiative/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning