hmhyau

A machine learning and deep learning enthusiast.

hmhyau's Stars

karpathy/llm.c
LLM training in simple, raw C/CUDA
Language: Cuda | Stars: 23.3k | Forks: 2.6k
amiratag/ACE
Towards Automatic Concept-based Explanations
Language: Python | 15439
google-research/pisac
Tensorflow 2 source code for the PI-SAC agent from "Predictive Information Accelerates Learning in RL" (NeurIPS 2020)
Language: Python | 4010
py-why/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
Language: Python | Stars: 7k | Forks: 922
Farama-Foundation/Miniworld
Simple and easily configurable 3D FPS-game-like environments for reinforcement learning
Language: Python | 694129
tinygrad/tinygrad
You like pytorch? You like micrograd? You love tinygrad! ❤️
Language: Python | Stars: 26.2k | Forks: 2.9k
hardmaru/slimevolleygym
A simple OpenAI Gym environment for single and multi-agent reinforcement learning
Language: Python | 714108
junhyukoh/value-prediction-network
NIPS 2017 Value Prediction Network
Language: Python | 16540
maraghuram/I-DQN
Towards Better Interpretability in Deep Q-Networks (Codebase)
Language: Jupyter Notebook | 8
pkumusic/O-DRL
Object Sensitive Deep Reinforcement Learning
Language: Python | 92
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language: Python | Stars: 4.1k | Forks: 723
obastani/viper
Language: Python | 262
Henrygwb/Explaining-DL
Language: Python | 5517
AcutronicRobotics/gym-gazebo2
gym-gazebo2 is a toolkit for developing and comparing reinforcement learning algorithms using ROS 2 and Gazebo
Language: Python | 416107
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
Language: C++ | Stars: 4.2k | Forks: 922
whoenig/libMultiRobotPlanning
Library with search algorithms for task and path planning for multi robot/agent systems
Language: C++ | 791218
merschformann/RAWSim-O
A simulation framework for Robotic Mobile Fulfillment Systems
Language: C# | 19366
ConnorJL/GPT2
An implementation of training for GPT2, supports TPUs
Language: Python | Stars: 1.4k | Forks: 334
microsoft/terminal
The new Windows Terminal and the original Windows console host, all in the same place!
Language: C++ | Stars: 95k | Forks: 8.2k
gsartoretti/PRIMAL
PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning -- Distributed RL/IL code for Multi-Agent Path Finding (MAPF)
Language: Python | 29478
aemkei/jsfuck
Write any JavaScript with 6 Characters: []()!+
Language: JavaScript | Stars: 8.1k | Forks: 672
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
Language: Python | Stars: 82.2k | Forks: 22.1k
cvat-ai/cvat
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Language: TypeScript | Stars: 12.3k | Forks: 3k
philc/vimium
The hacker's browser.
Language: JavaScript | Stars: 23k | Forks: 2.5k
openai/neural-mmo
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
Language: Python | Stars: 1.6k | Forks: 261
openai/gpt-2
Code for the paper "Language Models are Unsupervised Multitask Learners"
Language: Python | Stars: 22.3k | Forks: 5.5k
titu1994/tf-eager-examples
A set of simple examples ported from PyTorch for Tensorflow Eager Execution
Language: Jupyter Notebook | 7310
google-deepmind/graph_nets
Build Graph Nets in Tensorflow
Language: Python | Stars: 5.3k | Forks: 782
tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
Language: Python | Stars: 7.3k | Forks: 1.6k
tqdm/tqdm
:zap: A Fast, Extensible Progress Bar for Python and CLI
Language: Python | Stars: 28.4k | Forks: 1.3k
