hmhyau's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
amiratag/ACE
Towards Automatic Concept-based Explanations
google-research/pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Farama-Foundation/Miniworld
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
hardmaru/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
junhyukoh/value-prediction-network
NIPS 2017 Value Prediction Network
maraghuram/I-DQN
Towards Better Interpretability in Deep Q-Networks (Codebase)
pkumusic/O-DRL
Object Sensitive Deep Reinforcement Learning
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
obastani/viper
Henrygwb/Explaining-DL
AcutronicRobotics/gym-gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
whoenig/libMultiRobotPlanning
Library with search algorithms for task and path planning for multi robot/agent systems
merschformann/RAWSim-O
A simulation framework for Robotic Mobile Fulfillment Systems
ConnorJL/GPT2
An implementation of training for GPT2, supports TPUs
microsoft/terminal
The new Windows Terminal and the original Windows console host, all in the same place!
gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
aemkei/jsfuck
Write any JavaScript with 6 Characters: []()!+
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
philc/vimium
The hacker's browser.
openai/neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
titu1994/tf-eager-examples
A set of simple examples ported from PyTorch for Tensorflow Eager Execution
google-deepmind/graph_nets
Build Graph Nets in Tensorflow
tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
tqdm/tqdm
:zap: A Fast, Extensible Progress Bar for Python and CLI