Pinned Repositories
anomaly-detection-in-cpp
My implementation in c++ of an anomaly detection project from the ML course by Andrew Ng on Coursera.
baba-is-gym
Gym environments for "baba is you" https://store.steampowered.com/app/736260/Baba_Is_You/
gym-alttp-gridworld
A gym environment for Stuart Armstrong's model of a treacherous turn.
harlow
Tutorial & scripts to run a meta-rl model on DeepMind Lab's Harlow task environment.
meta_rl
The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.
quantilizers
Code from "How useful is quantilization for mitigating specification-gaming?"
rl-book-challenge
self-studying the Sutton & Barto the hard way
spinning-up-a-Pong-AI-with-deep-RL
Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.
talk-to-paul
two-step-task
Implementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforcement Learn".
mtrazzi's Repositories
mtrazzi/rl-book-challenge
self-studying the Sutton & Barto the hard way
mtrazzi/two-step-task
Implementation of the two-step-task as described in "Prefrontal cortex as a meta-reinforcement learning system" and "Learning to Reinforcement Learn".
mtrazzi/spinning-up-a-Pong-AI-with-deep-RL
Code for "Spinning Up a Pong AI With Deep RL" on FloydHub.
mtrazzi/meta_rl
The Tensorflow code and a DeepMind Lab wrapper for my article "Meta-Reinforcement Learning" on FloydHub.
mtrazzi/harlow
Tutorial & scripts to run a meta-rl model on DeepMind Lab's Harlow task environment.
mtrazzi/baba-is-gym
Gym environments for "baba is you" https://store.steampowered.com/app/736260/Baba_Is_You/
mtrazzi/talk-to-paul
mtrazzi/quantilizers
Code from "How useful is quantilization for mitigating specification-gaming?"
mtrazzi/twitter-spam
tweetstorm 100 characters every 100 seconds in a thread
mtrazzi/gomoku
mtrazzi/42-projects
My projects from project-based learning education 42
mtrazzi/cs231n
mtrazzi/FIRE
Reproducing the results from the paper: FIRE: An Integrated Trust and Reputation Model for Open Multi-Agent Systems. European Conference on Artificial Intelligence
mtrazzi/treacherous-turn-simulations-bci
Open-sourcing my research proposals, hoping to find potential research collaborators.
mtrazzi/tw-ch
mtrazzi/under_the_dirt
mtrazzi/corporate-culture
mtrazzi/eegi-backend
mtrazzi/gmail-to-google-contacts
I was tired of manually adding mailing lists to google contacts/LinkedIn, so I wrote a python script.
mtrazzi/go
mtrazzi/krpsim
mtrazzi/libft
mtrazzi/libft-1
A customizable cross-platform standard library for C
mtrazzi/ML-roots-you
mtrazzi/mlab
mtrazzi/mtrazzi.github.io
mtrazzi/no-style-please
A (nearly) no-CSS, fast, minimalist Jekyll theme.
mtrazzi/npuzzle
mtrazzi/pom
mtrazzi/reply-gang