minqi

Lucida Labs, Oxford, UK

minqi's Stars

schmidtdominik/LAPO
Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)
Language:Python776
FLAIROx/JaxMARL
Multi-Agent Reinforcement Learning with JAX
Language:Python44381
jennyzzt/awesome-open-ended
Awesome Open-ended AI
18619
princeton-nlp/tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
Language: Python, 4.9k stars, 454 forks
openai/prm800k
800,000 step-level correctness labels on LLM solutions to MATH problems
Language: Python, 1.7k stars, 103 forks
srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Language: Jupyter Notebook, 10k stars, 863 forks
luchris429/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python74162
danijar/ninjax
General Modules for JAX
Language:Python602
danijar/dreamerv3
Mastering Diverse Domains through World Models
Language: Python, 1.4k stars, 233 forks
apple/ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
Language: Python, 16.9k stars, 948 forks
norvig/pytudes
Python programs, usually short, of considerable difficulty, to perfect particular skills.
Language: Jupyter Notebook, 23.2k stars, 2.4k forks
vadimdemedes/ink
🌈 React for interactive command-line apps
Language: TypeScript, 27.3k stars, 613 forks
CarperAI/OpenELM
Evolution Through Large Models
Language:Python69686
CompVis/stable-diffusion
A latent text-to-image diffusion model
Language: Jupyter Notebook, 68.6k stars, 10.2k forks
facebookresearch/dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
Language:Python12625
JackHopkins/PaperclipMaximiser
A Paperclip Maximiser in Factorio to evaluate instrumental convergence in LLMs.
Language:Python3
voletiv/mcvd-pytorch
Official implementation of MCVD: Masked Conditional Video Diffusion for Prediction, Generation, and Interpolation (https://arxiv.org/abs/2205.09853)
Language:Python33226
lucidrains/DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
Language: Python, 11.2k stars, 1.1k forks
facebookresearch/moolib
A library for distributed ML training with PyTorch
Language:C++36621
ucl-dark/paired
PAIRED in PyTorch 🔥
Language:Python5620
aqlaboratory/openfold
Trainable, memory-efficient, and GPU-friendly PyTorch reproduction of AlphaFold 2
Language: Python, 2.8k stars, 551 forks
danijar/dreamerv2
Mastering Atari with Discrete World Models
Language:Python902194
idiap/fast-transformers
Pytorch library for fast transformer implementations
Language: Python, 1.6k stars, 179 forks
facebookresearch/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Language:Python48459
alex-petrenko/sample-factory
High throughput synchronous and asynchronous reinforcement learning
Language:Python833113
alex-petrenko/megaverse
High-throughput simulation platform for Artificial Intelligence research
Language:C++22020
ikostrikov/jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Language:Jupyter Notebook63669
EleutherAI/gpt-neo
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Language: Python, 8.2k stars, 953 forks
Aperocky/cellular-automata
Language:TypeScript462
ARISE-Initiative/robosuite
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Language: Python, 1.4k stars, 429 forks
