xiaomengy

Artificial Intelligence, Reinforcement Learning

Google DeepMind

xiaomengy's Stars

pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language:Python83.5k 1.7k 46.1k22.5k
electronicarts/CnC_Remastered_Collection
Language:C++18.3k 524 994.7k
facebookarchive/caffe2
Caffe2 is a lightweight, modular, and scalable deep learning framework.
Language:Shell8.4k 528 1.3k1.9k
facebookresearch/mae
PyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Language:Python7.3k 56 1921.2k
google-deepmind/alphageometry
Language:Python4.1k 53 122465
google-deepmind/mctx
Monte Carlo tree search in JAX
Language:Python2.3k 28 48190
pytorch/rl
A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
Language:Python2.3k 41 629305
google/coding-competitions-archive
Google Coding Competitions problem archive
Language:HTML975 31 3265
facebookresearch/nle
The NetHack Learning Environment
Language:C940 30 113113
NVlabs/curobo
CUDA Accelerated Robot Library
Language:Python774 17 196120
google-deepmind/reverb
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research
Language:C++702 25 12493
XuezheMax/megalodon
Reference implementation of Megalodon 7B model
Language:Cuda504 14 752
pytorch-labs/LeanRL
LeanRL is a fork of CleanRL, where selected PyTorch scripts optimized for performance using compile and cudagraphs.
Language:Python420 8 717
google-deepmind/alphastar
Language:Python409 11 652
facebookresearch/moolib
A library for distributed ML training with PyTorch
Language:C++366 12 1920
facebookresearch/nocturne
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
Language:Python263 13 4529
MichaelTMatthews/Craftax
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
Language:Python197 3 1321
sotopia-lab/sotopia
Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)
Language:Python159 2 7019
adamkarvonen/chess_gpt_eval
A repo to evaluate various LLM's chess playing abilities.
Language:Python64 2 715
facebookresearch/jps
Code for "Joint Policy Search for Collaborative Multi-agent Incomplete Information Games"
Language:C++50 10 28
AntoineRichard/OmniLRS
SpaceR and SRL Lunar simulation
Language:Python47 3 1313
BricksRL/bricksrl
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
Language:Python46 1 02

xiaomengy

xiaomengy's Stars

pytorch/pytorch

electronicarts/CnC_Remastered_Collection

facebookarchive/caffe2

facebookresearch/mae

google-deepmind/alphageometry

google-deepmind/mctx

pytorch/rl

google/coding-competitions-archive

facebookresearch/nle

NVlabs/curobo

google-deepmind/reverb

XuezheMax/megalodon

pytorch-labs/LeanRL

google-deepmind/alphastar

facebookresearch/moolib

facebookresearch/nocturne

MichaelTMatthews/Craftax

sotopia-lab/sotopia

adamkarvonen/chess_gpt_eval

facebookresearch/jps

AntoineRichard/OmniLRS

BricksRL/bricksrl