xiaomengy's Stars
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
electronicarts/CnC_Remastered_Collection
facebookarchive/caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
facebookresearch/mae
PyTorch implementation of MAE https://arxiv.org/abs/2111.06377
google-deepmind/alphageometry
google-deepmind/mctx
Monte Carlo tree search in JAX
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
google/coding-competitions-archive
Google Coding Competitions problem archive
facebookresearch/nle
The NetHack Learning Environment
NVlabs/curobo
CUDA Accelerated Robot Library
google-deepmind/reverb
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research
XuezheMax/megalodon
Reference implementation of Megalodon 7B model
pytorch-labs/LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts are optimized for performance using compile and cudagraphs.
google-deepmind/alphastar
facebookresearch/moolib
A library for distributed ML training with PyTorch
facebookresearch/nocturne
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
MichaelTMatthews/Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
sotopia-lab/sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
adamkarvonen/chess_gpt_eval
A repo to evaluate various LLMs' chess-playing abilities.
facebookresearch/jps
Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"
AntoineRichard/OmniLRS
SpaceR and SRL Lunar simulation
BricksRL/bricksrl
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO