Miffyli
Researcher at Meta Fundamental Artificial Intelligence Research (FAIR), working on reinforcement learning.
@facebookresearchLondon, UK
Pinned Repositories
stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
gan-aimbots
Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"
im2latex-dataset
Python tools for creating suitable dataset for OpenAI's im2latex task: https://openai.com/requests-for-research/#im2latex
minecraft-bc-2020
Behavioural cloning solution to MineRL2020 competition
nle-sample-factory-baseline
rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
sym
Website for detailed game mechanics
ToriLLE
Toribash Learning Environment
minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos
Miffyli's Repositories
Miffyli/ToriLLE
Toribash Learning Environment
Miffyli/sym
Website for detailed game mechanics
Miffyli/rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
Miffyli/gan-aimbots
Code for the experiments done in the paper "GAN-Aimbots: Using Machine Learning for Cheating in First Person Shooters"
Miffyli/nle-sample-factory-baseline
Miffyli/policy-supervectors
Creating fixed-length vectors to describe RL/GA policies
Miffyli/mastering-chutes-and-ladders
The source code for mastering the game of Chutes and Ladders
Miffyli/rl-human-prior-tricks
Evaluating different engineering tricks that make RL work
Miffyli/minecraft-bc-2020
Behavioural cloning solution to MineRL2020 competition
Miffyli/asv-cm-reinforce
Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE
Miffyli/minecraft-bc
Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)
Miffyli/Conditional_Diffusion_MNIST
Conditional diffusion model to generate MNIST. Minimal script. Based on 'Classifier-Free Diffusion Guidance'.
Miffyli/TerrariaCustomMediumcore
A Terraria TSchock server plugin that allows customizing which items are dropped upon death
Miffyli/Agents_that_Listen
Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory
Miffyli/AIDungeon
Infinite adventures await!
Miffyli/basalt_competition_baseline_submissions
Miffyli/brain-tokyo-workshop
🧠🗼
Miffyli/catastrophic-forgetting
Source code for the experiments in my MSc thesis titled Understanding Forgetting in Artificial Neural Networks.
Miffyli/competition_submission_starter_template
The submission template for the MineRL Competition on Sample Efficicent RL @ NeurIPS 2019. Clone this to make a new submission!
Miffyli/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Miffyli/gym-docs
Code for Gym documentation website
Miffyli/incubator
Collection of in-progress libraries for entity neural networks.
Miffyli/minerl
MineRL Competition for Sample Efficient Reinforcement Learning - Python Package
Miffyli/OpenRA
Open Source real-time strategy game engine for early Westwood games such as Command & Conquer: Red Alert written in C# using SDL and OpenGL. Runs on Windows, Linux, *BSD and Mac OS X.
Miffyli/Plugins
🧪☕️⚡️A list of TShock for Terraria plugins.
Miffyli/project-NN-Pytorch-scripts
Miffyli/PyJSONViewer
A JSON viewer using pure python
Miffyli/stable-baselines3
PyTorch version of Stable Baselines, improved implementations of reinforcement learning algorithms.
Miffyli/Toybox
The Machine Learning Toybox for testing the behavior of autonomous agents.
Miffyli/Video-Pre-Training
Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos