dblakely's Stars
huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
ankitects/anki
Anki's shared backend and web components, and the Qt frontend
yandex/YaLM-100B
Pretrained language model with 100B parameters
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
THUDM/CogView2
Official code repo for the paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
marp-team/marp
The entrance repository of Markdown presentation ecosystem
bhanushalimahesh3/node-website
Build Simple Website with NodeJS, Express & EJS view engine
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
gorhill/uBlock
uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
quenhus/uBlock-Origin-dev-filter
Filters to block and remove copycat-websites from DuckDuckGo, Google and other search engines. Specific to dev websites like StackOverflow or GitHub.
rfeinman/tictactoe-reinforcement-learning
Train a tic-tac-toe agent using reinforcement learning.
rajcscw/nlp-gym
NLPGym - A toolkit to develop RL agents to solve NLP tasks.
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
huggingface/trl
Train transformer language models with reinforcement learning.
openai/lm-human-preferences
Code for the paper "Fine-Tuning Language Models from Human Preferences"
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
patrick-kidger/torchtyping
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
vwxyzjn/cleanrl
High-quality single-file implementations of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
bigscience-workshop/t-zero
Reproduce results and replicate training for T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
TorchCraft/TorchCraftAI
A platform that lets you build agents to learn to play StarCraft: Brood War.
EvanHahn/flood
my take on a Video Game
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
lebrice/SimpleParsing
Simple, Elegant, Typed Argument Parsing with argparse
gothinkster/node-express-realworld-example-app