Pinned Repositories
3DChess
ACSL-2012-Blair-Practice-Code
ai-deadlines
⏰ AI conference deadline countdowns
arxiv-sanity-preserver
Web interface for browsing, search and filtering recent arxiv submissions
baby-rlhf
Simple conceptual implementation of reinforcement learning from human preferences.
Bing-Maps-V8-TypeScript-Definitions
This project contains the TypeScript definitions for the Bing Maps V8 Web Control.
crossword
Multiplayer crossword puzzle solver that accepts the NYT .puz format
FlowProject
input-filter
Model code for "A Noisy Channel Model for Systematizing Unpredictable Input Variation" presented at BUCLD 44
jordan-schneider's Repositories
jordan-schneider/baby-rlhf
Simple conceptual implementation of reinforcement learning from human preferences.
jordan-schneider/FlowProject
jordan-schneider/ai-deadlines
⏰ AI conference deadline countdowns
jordan-schneider/input-filter
Model code for "A Noisy Channel Model for Systematizing Unpredictable Input Variation" presented at BUCLD 44
jordan-schneider/commit-on-run
jordan-schneider/common
ML templates and core algorithms
jordan-schneider/dist-transformer-sim
Simulate distributed transformer training runtimes
jordan-schneider/driver-env
Gym implementation of Sadigh's driver environment for reinforcement learning.
jordan-schneider/driving-preferences
jordan-schneider/experiment-server
jordan-schneider/fallen-london-sims
Simulators for grinds in Fallen London
jordan-schneider/git-setup
Git subcommand to do common setup tasks for python projects
jordan-schneider/google-birthdays
Google killed the feature where it would email reminders to you about people's birthdays. This script attempts to replicate that.
jordan-schneider/linear-procgen
jordan-schneider/multimodal-reward-learning
Learn a Bayesian posterior distribution over reward functions from different kinds of human feedback
jordan-schneider/ODE2VAE
ODE2VAE: Deep generative second order ODEs with Bayesian neural networks
jordan-schneider/omegaconf
Flexible Python configuration system. The last one you will ever need.
jordan-schneider/OpenLock
OpenLock Environment for OpenAI Gym
jordan-schneider/OpenLockLearner-AAAI20
Repo for AAAI20 paper "Theory-based Causal Transfer: Integrating Instance-level Induction and Abstract-level Structure Learning"
jordan-schneider/phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
jordan-schneider/procgen
Procgen Benchmark: Procedurally-Generated Game-Like Gym-Environments
jordan-schneider/procgen-experiment
jordan-schneider/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
jordan-schneider/pytorch-qrnn
PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM
jordan-schneider/rlpyt
Reinforcement Learning in PyTorch
jordan-schneider/task
CLI task tracker with rich features
jordan-schneider/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
jordan-schneider/titan_utils
Utility to select the most available machine and GPU in the Titan cluster
jordan-schneider/torch-tensor-eq
Provides a subclass of torch.Tensor whose equal function returns a bool.
jordan-schneider/value-alignment-verification