wenkesj's Stars
fomorians/contextual_rnn
This repository contains the code for the paper "Contextual Recurrent Neural Networks"
Mayer123/LASER
Code for the paper "LASER: LLM Agent with State-Space Exploration for Web Navigation"
johnma2006/mamba-minimal
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
state-spaces/mamba
Mamba SSM architecture
wenkesj/practical-pendulum
Pendulum Simulation and Deep Reinforcement Learning Algorithms
google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
allenai/ir_datasets
Provides a common interface to many IR ranking datasets.
beir-cellar/beir
A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.
nmslib/nmslib
Non-Metric Space Library (NMSLIB): An efficient similarity search library and a toolkit for evaluation of k-NN methods for generic non-metric spaces.
SeldonIO/MLServer
An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more
utilForever/baba-is-auto
Baba Is You simulator using C++ with some reinforcement learning
fomorians-oss/snake
Implementation of a Snake OpenAI Gym environment and solutions thereof.
google-research/seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
fomorians-oss/awr
Reference implementation of Advantage Weighted Regression.
google-deepmind/open_spiel
OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.
fomorians/tool-use
Baselines for tool use environments.
fomorians/gym_pycolab
Gym interface for custom pycolab games.
fomorians/gym_tool_use
Gym tool use environments.
google-deepmind/hanabi-learning-environment
hanabi_learning_environment is a research platform for Hanabi experiments.
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
fomorians/counting-mnist
A simple synthetic dataset and baseline model for visual counting.
fomorians/vae
Variational autoencoder with TF Eager and Probability.
fomorians-oss/pyoneer
Tensor utilities, reinforcement learning, and more!
fomorians/ppo
Implementation of PPO with TF 2.0 and Pyoneer.
fomorians/forward-models
A tutorial on forward models for model-based reinforcement learning.
google-deepmind/trfl
TensorFlow Reinforcement Learning
unixpickle/camera-hijack
A chrome extension to mess with the webcam
Farama-Foundation/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
GPflow/GPflow
Gaussian processes in TensorFlow
Kautenja/gym-super-mario-bros
An OpenAI Gym interface to Super Mario Bros. & Super Mario Bros. 2 (Lost Levels) on The NES