timoklein
Reinforcement Learning PhD @ Probabilistic Machine Learning Group
University of Vienna
Vienna, Austria
Pinned Repositories
breaking-the-reclustering-barrier
Code and pre-trained models for the paper "Breaking the Reclustering Barrier in Centroid-based Deep Clustering"
alphazero-gym
AlphaZero for continuous control tasks
car_racer
Deep reinforcement learning in autonomous driving
crelu-pytorch
CReLU activation function from the paper "Loss of Plasticity in Continual Deep Reinforcement Learning"
implicit_underparameterization
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning (pytorch)
infer
InFeR: Understanding and Preventing Capacity Loss in Reinforcement Learning (pytorch)
ma_thesis
Combining Reinforcement Learning and Search for Cooperative Trajectory Planning
neural_citation
Context-aware citation recommendation
plasticity-injection-torch
Deep Reinforcement Learning with Plasticity Injection (pytorch)
redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
timoklein's Repositories
timoklein/redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
timoklein/alphazero-gym
AlphaZero for continuous control tasks
timoklein/car_racer
Deep reinforcement learning in autonomous driving
timoklein/plasticity-injection-torch
Deep Reinforcement Learning with Plasticity Injection (pytorch)
timoklein/infer
InFeR: Understanding and Preventing Capacity Loss in Reinforcement Learning (pytorch)
timoklein/neural_citation
Context-aware citation recommendation
timoklein/crelu-pytorch
CReLU activation function from the paper "Loss of Plasticity in Continual Deep Reinforcement Learning"
timoklein/implicit_underparameterization
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning (pytorch)
timoklein/ma_thesis
Combining Reinforcement Learning and Search for Cooperative Trajectory Planning
timoklein/bandit_algos
Some common algorithms for multi-armed bandit problems
timoklein/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
timoklein/ClustPy
A Python library for advanced clustering algorithms
timoklein/cpp_optim
Nonlinear optimization examples in C++
timoklein/dmcgym
timoklein/garage
A toolkit for reproducible reinforcement learning research.
timoklein/markov-abstractions-ablations
DM control Markov component ablations
timoklein/outlier_detection
Class-based Python implementations of outlier detection algorithms.
timoklein/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
timoklein/navix
Accelerated minigrid environments with JAX
timoklein/purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
timoklein/rl_graph_breaks
An example of torch.compile graph breaks in RL code, using SAC-discrete
timoklein/Stoix
🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
timoklein/udemy_cpp
C++ Course
timoklein/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
timoklein/wandb_tutorial
Code example for some basic wandb functionality