windweller

An RLer, NLPer, functional programmer. CS PhD at Stanford.

Stanford, CA

windweller's Stars

godotengine/godot
Godot Engine – Multi-platform 2D and 3D game engine
Language: C++ 92.2k 1.5k 54k 21.4k
acmesh-official/acme.sh
A pure Unix shell script implementing ACME client protocol
Language: Shell 40.4k 487 3.1k 5.1k
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language: Python 36.4k 417 2.2k 5.3k
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
Language: Python 28.7k 252 7.2k 3.4k
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Language: Python 5.5k 34 55335
pytorch/captum
Model interpretability and understanding for PyTorch
Language: Python 5k 280 553503
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language: Python 4.2k 63 949724
pgmpy/pgmpy
Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.
Language: Python 2.8k 76 911721
google/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Language: Jupyter Notebook 2.4k 33 360259
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language: Python 2.1k 39 193614
tristandeleu/pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
Language: Python 2k 44 141256
touilleMan/godot-python
Python support for Godot 🐍🐍🐍
Language: Python 1.9k 90 294144
ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
Language: Python 998 20 279277
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
Language: Python 743 16 37113
princeton-nlp/LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
Language: Python 722 29 50132
microsoft/Trace
End-to-end Generative Optimization for AI Agents
Language: Python 402 10 826
WilsonWangTHU/mbbl
Language: Python 389 16 868
gsbDBI/ExperimentData
Language: HTML 230 25 0118
brendanator/atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
Language: Python 135 9 631
chuangg/CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
Language: Python 113 6 1226
kmario23/KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
112 6 721
jlin816/dialop
DialOp: Decision-oriented dialogue environments for collaborative language agents
Language: Python 103 1 06
salaniz/pytorch-gve-lrcn
PyTorch implementations for "Generating Visual Explanations" (GVE) and "Long-term Recurrent Convolutional Networks" (LRCN)
Language: Python 92 4 1822
CausalAIBook/MetricsMLNotebooks
Notebooks for Applied Causal Inference Powered by ML and AI
Language: Jupyter Notebook 90 6 940
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback
Language: Python 61 6 613
qipeng/stay-hungry-stay-focused
This repository hosts the authors' implementation of the paper "Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations", published in Findings of EMNLP 2020.
Language: Python 26 3 05
cicl-stanford/moca
Language model evaluation for morality and causality
Language: Python 16 2 00
CuriousCat-7/Graph-Structure-of-Neural-Networks
An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020
Language: Python 10 2 03
kyunghyuncho/map_plan_backprop
Language: Jupyter Notebook 9 1 00
yaoliucs/BCQ
Author's PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"
Language: Python 2 1 0
