self-supervisor

(Augustine Mavor-Parker). PhD student at UCL Centre for Artificial Intelligence with Lewis Griffin and Caswell Barry.

University College LondonLondon

Pinned Repositories

Escaping-Stochastic-Traps-With-Aleatoric-Mapping-Agents
Language:Python1 1 00
gruvbox-gantt-charts-with-pgfgantt
Language:TeX0 1 00
gym-minigrid_personal
Minimalistic gridworld package for OpenAI Gym
Language:Python1 0 00
How_to_stay_curious_while_avoiding_noisy_TVs
Language:Python5 1 01
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python0 0 00
SARSA-Mountain-Car-Sutton-and-Barto
Implementation of Sutton and Barto SARSA mountain car algorithm, with their tile coding implementation used as features.
Language:Jupyter Notebook0 1 01
self-supervisor.github.io
A minimalist Jekyll theme, ideally designed for your academic site.
Language:SCSS0 0 00

self-supervisor's Repositories

self-supervisor/How_to_stay_curious_while_avoiding_noisy_TVs
Language:Python5 1 01
self-supervisor/Escaping-Stochastic-Traps-With-Aleatoric-Mapping-Agents
Language:Python1 1 00
self-supervisor/gym-minigrid_personal
Minimalistic gridworld package for OpenAI Gym
Language:Python1 0 00
self-supervisor/allocentric-scene-perception
This repo hosts both the Allocentric Scene Perception (ASP) benchmark and a biologically plausible model for unsupervised segmentation of objects
Language:Python0 0 00
self-supervisor/cule
CuLE: A CUDA port of the Atari Learning Environment (ALE)
Language:C++0 0 01
self-supervisor/gruvbox-gantt-charts-with-pgfgantt
Language:TeX0 1 00
self-supervisor/jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Language:Python0 0 00
self-supervisor/SARSA-Mountain-Car-Sutton-and-Barto
Implementation of Sutton and Barto SARSA mountain car algorithm, with their tile coding implementation used as features.
Language:Jupyter Notebook0 1 01
self-supervisor/self-supervisor.github.io
A minimalist Jekyll theme, ideally designed for your academic site.
Language:SCSS0 0 00
self-supervisor/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
Language:Jupyter Notebook0 0
self-supervisor/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python0 0
self-supervisor/doula-ai-website
Language:HTML
self-supervisor/genomic_bottleneck
Language:Python1 0
self-supervisor/genomic_bottleneck_v2
Language:Python2 1
self-supervisor/gymnax-blines
Baselines for gymnax 🤖
Language:Jupyter Notebook0 0
self-supervisor/implementations-nfq
Neural Fitted Q Iteration - First Experiences with a Data Efficient Neural Reinforcement Learning Method
Language:Python0 0
self-supervisor/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
Language:Python0 0
self-supervisor/minihack
MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research
Language:Python0 0
self-supervisor/noreward-rl
[ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning
Language:Python0 0
self-supervisor/purejaxrl
Really Fast End-to-End Jax RL Implementations
Language:Python0 0
self-supervisor/python-rl
Some Reinforcement Learning in Python
Language:Python0 0
self-supervisor/PyTorch-CIFAR-10-autoencoder
This is a reimplementation of the blog post "Building Autoencoders in Keras". Instead of using MNIST, this project uses CIFAR10.
Language:Python0 0
self-supervisor/random-network-distillation
Code for the paper "Exploration by Random Network Distillation"
Language:Python0 0
self-supervisor/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
Language:Python0 0
self-supervisor/SR-Learning-Resources
Some notebooks and code to help people get started with understanding successor representations using both discrete states and continuous features
Language:Jupyter Notebook0 0
self-supervisor/streamlit-agent
Reference implementations of several LangChain agents as Streamlit apps
Language:Python
self-supervisor/subnetwork-probing
Language:Jupyter Notebook1 0
self-supervisor/vizdoomgym
OpenAI Gym wrapper for ViZDoom enviroments
Language:Python0 0
self-supervisor/wandb_pickle
Wrapper around wandb that makes it easier to do custom python plots.
Language:Python1 0
self-supervisor/wandb_scraper
Language:Jupyter Notebook1

self-supervisor

Pinned Repositories

Escaping-Stochastic-Traps-With-Aleatoric-Mapping-Agents

gruvbox-gantt-charts-with-pgfgantt

gym-minigrid_personal

How_to_stay_curious_while_avoiding_noisy_TVs

jax

SARSA-Mountain-Car-Sutton-and-Barto

self-supervisor.github.io

self-supervisor's Repositories

self-supervisor/How_to_stay_curious_while_avoiding_noisy_TVs

self-supervisor/Escaping-Stochastic-Traps-With-Aleatoric-Mapping-Agents

self-supervisor/gym-minigrid_personal

self-supervisor/allocentric-scene-perception

self-supervisor/cule

self-supervisor/gruvbox-gantt-charts-with-pgfgantt

self-supervisor/jax

self-supervisor/SARSA-Mountain-Car-Sutton-and-Barto

self-supervisor/self-supervisor.github.io

self-supervisor/brax

self-supervisor/cleanrl

self-supervisor/doula-ai-website

self-supervisor/genomic_bottleneck

self-supervisor/genomic_bottleneck_v2

self-supervisor/gymnax-blines

self-supervisor/implementations-nfq

self-supervisor/IsaacGymEnvs

self-supervisor/minihack

self-supervisor/noreward-rl

self-supervisor/purejaxrl

self-supervisor/python-rl

self-supervisor/PyTorch-CIFAR-10-autoencoder

self-supervisor/random-network-distillation

self-supervisor/ray

self-supervisor/SR-Learning-Resources

self-supervisor/streamlit-agent

self-supervisor/subnetwork-probing

self-supervisor/vizdoomgym

self-supervisor/wandb_pickle

self-supervisor/wandb_scraper