dblakely

Yogi
Washington DC

dblakely's Stars

huggingface/accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
Language: Python · 8.1k · 1k
ankitects/anki
Anki's shared backend and web components, and the Qt frontend
Language: Rust · 19.5k · 2.2k
yandex/YaLM-100B
Pretrained language model with 100B parameters
Language: Python · 3.7k · 300
huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.
Language: Python · 27k · 5.5k
THUDM/CogView2
official code repo for paper "CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers"
Language: Python · 95277
jannerm/trajectory-transformer
Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
Language: Python · 46864
marp-team/marp
The entrance repository of Markdown presentation ecosystem
Language: TypeScript · 8.1k · 150
bhanushalimahesh3/node-website
Build Simple Website with NodeJS, Express & EJS view engine
Language: HTML · 6895
borisdayma/dalle-mini
DALL·E Mini - Generate images from a text prompt
Language: Python · 14.8k · 1.2k
gorhill/uBlock
uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
Language: JavaScript · 48.9k · 3.2k
quenhus/uBlock-Origin-dev-filter
Filters to block and remove copycat-websites from DuckDuckGo, Google and other search engines. Specific to dev websites like StackOverflow or GitHub.
Language: Python · 2.3k · 45
rfeinman/tictactoe-reinforcement-learning
Train a tic-tac-toe agent using reinforcement learning.
Language: Python · 5723
rajcscw/nlp-gym
NLPGym - A toolkit to develop RL agents to solve NLP tasks.
Language: Python · 19619
openai/summarize-from-feedback
Code for "Learning to summarize from human feedback"
Language: Python · 998145
huggingface/trl
Train transformer language models with reinforcement learning.
Language: Python · 10.5k · 1.4k
openai/lm-human-preferences
Code for the paper Fine-Tuning Language Models from Human Preferences
Language: Python · 1.3k · 164
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language: Python · 35.1k · 8.6k
patrick-kidger/torchtyping
Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.
Language: Python · 1.4k · 34
srush/Tensor-Puzzles
Solve puzzles. Improve your pytorch.
Language: Jupyter Notebook · 3.4k · 295
Zeta36/chess-alpha-zero
Chess reinforcement learning by AlphaGo Zero methods.
Language: Jupyter Notebook · 2.1k · 481
vwxyzjn/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language: Python · 6k · 681
DLR-RM/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Language: Python · 2.2k · 526
DLR-RM/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Language: Python · 9.5k · 1.7k
huggingface/deep-rl-class
This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.
Language: MDX · 4k · 610
bigscience-workshop/t-zero
Reproduce results and replicate training of T0 (Multitask Prompted Training Enables Zero-Shot Task Generalization)
Language: Python · 46053
TorchCraft/TorchCraftAI
A platform that lets you build agents to learn to play StarCraft: Brood War.
Language: C++ · 651124
EvanHahn/flood
my take on a Video Game
Language: JavaScript · 41
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
Language: Jupyter Notebook · 2.1k · 468
lebrice/SimpleParsing
Simple, Elegant, Typed Argument Parsing with argparse
Language: Python · 44154
gothinkster/node-express-realworld-example-app
Language: TypeScript · 3.7k · 1.6k
