windweller's Stars
godotengine/godot
Godot Engine – Multi-platform 2D and 3D game engine
acmesh-official/acme.sh
A pure Unix shell script implementing ACME client protocol
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Lightning-AI/pytorch-lightning
Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes.
google-research/arxiv-latex-cleaner
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
pytorch/captum
Model interpretability and understanding for PyTorch
hill-a/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
pgmpy/pgmpy
Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.
google/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
tristandeleu/pytorch-meta
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch
touilleMan/godot-python
Python support for Godot 🐍🐍🐍
ruotianluo/self-critical.pytorch
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
facebookresearch/torchbeast
A PyTorch Platform for Distributed RL
princeton-nlp/LM-BFF
[ACL 2021] LM-BFF: Better Few-shot Fine-tuning of Language Models https://arxiv.org/abs/2012.15723
microsoft/Trace
End-to-end Generative Optimization for AI Agents
WilsonWangTHU/mbbl
gsbDBI/ExperimentData
brendanator/atari-rl
Atari - Deep Reinforcement Learning algorithms in TensorFlow
chuangg/CLEVRER
PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"
kmario23/KenLM-training
Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2
jlin816/dialop
DialOp: Decision-oriented dialogue environments for collaborative language agents
salaniz/pytorch-gve-lrcn
PyTorch implementations for "Generating Visual Explanations" (GVE) and "Long-term Recurrent Convolutional Networks" (LRCN)
CausalAIBook/MetricsMLNotebooks
Notebooks for Applied Causal Inference Powered by ML and AI
microsoft/LLF-Bench
A benchmark for evaluating learning agents based on just language feedback
qipeng/stay-hungry-stay-focused
This repository hosts the authors' implementation of the paper "Stay Hungry, Stay Focused: Generating Informative and Specific Questions in Information-Seeking Conversations", published in Findings of EMNLP 2020.
cicl-stanford/moca
Language model evaluation for morality and causality
CuriousCat-7/Graph-Structure-of-Neural-Networks
An unofficial re-implementation of Graph Structure of Neural Networks (Jiaxuan You · Kaiming He · Jure Leskovec · Saining Xie) ICML 2020
kyunghyuncho/map_plan_backprop
yaoliucs/BCQ
Author's PyTorch implementation of BCQ for "Off-Policy Deep Reinforcement Learning without Exploration"