timoklein

Reinforcement Learning PhD @ Probabilistic Machine Learning Group

University of ViennaVienna, Austria

Pinned Repositories

breaking-the-reclustering-barrier
Code and pre-trained models for the paper "Breaking the Reclustering Barrier in Centroid-based Deep Clustering"
Language:Python6 3 00
alphazero-gym
AlphaZero for continuous control tasks
Language:Python23 3 74
car_racer
Deep reinforcement learning in autonomous driving
Language:Python8 3 35
crelu-pytorch
CReLU activation function from the paper "Loss of Plasticity in Continual Deep Reinforcement Learning"
Language:Python3 1 00
implicit_underparameterization
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning (pytorch)
Language:Python2 1 00
infer
InFeR: Understanding and Preventing Capacity Loss in Reinforcement Learning (pytorch)
Language:Python4 3 00
ma_thesis
Combining Reinforcement Learning and Search for Cooperative Trajectory Planning
Language:TeX1 2 00
neural_citation
Context aware citation recommendation
Language:Jupyter Notebook4 2 44
plasticity-injection-torch
Deep Reinforcement Learning with Plasticity Injection (pytorch)
Language:Python7 2 00
redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
Language:Python25 4 84

timoklein's Repositories

timoklein/redo
ReDo: The Dormant Neuron Phenomenon in Deep Reinforcement Learning (pytorch)
Language:Python25 4 84
timoklein/alphazero-gym
AlphaZero for continuous control tasks
Language:Python23 3 74
timoklein/car_racer
Deep reinforcement learning in autonomous driving
Language:Python8 3 35
timoklein/plasticity-injection-torch
Deep Reinforcement Learning with Plasticity Injection (pytorch)
Language:Python7 2 00
timoklein/infer
InFeR: Understanding and Preventing Capacity Loss in Reinforcement Learning (pytorch)
Language:Python4 3 00
timoklein/neural_citation
Context aware citation recommendation
Language:Jupyter Notebook4 2 44
timoklein/crelu-pytorch
CReLU activation function from the paper "Loss of Plasticity in Continual Deep Reinforcement Learning"
Language:Python3 1 00
timoklein/implicit_underparameterization
Implicit Under-Parameterization Inhibits Data-Efficient Deep Reinforcement Learning (pytorch)
Language:Python2 1 00
timoklein/ma_thesis
Combining Reinforcement Learning and Search for Cooperative Trajectory Planning
Language:TeX1 2 00
timoklein/bandit_algos
Some common algorithms for multi-armed bandit problems
Language:Jupyter Notebook0 2 00
timoklein/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
Language:Python0 1 01
timoklein/ClustPy
A Python library for advanced clustering algorithms
Language:Python0 0 00
timoklein/cpp_optim
Nonlinear optimization examples in C++
Language:C++0 2 00
timoklein/dmcgym
Language:Python0 1 00
timoklein/garage
A toolkit for reproducible reinforcement learning research.
Language:Python0 0 00
timoklein/markov-abstractions-ablations
DM control Markov component ablations
Language:Python0 1 00
timoklein/outlier_detection
Class based Python implementations of outlier detection algorithms.
Language:Scilab0 1 00
timoklein/JAX-in-Action
Notebooks for the "JAX in Action" book
Language:Jupyter Notebook
timoklein/Minigrid
Simple and easily configurable grid world environments for reinforcement learning
Language:Python0 0
timoklein/navix
Accelerated minigrid environments with JAX
timoklein/purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
Language:Python0 0
timoklein/rl_graph_breaks
An example of torch.compile graph breaks in RL code using SAC-discrete as an example
Language:Python
timoklein/Stoix
🏛️A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
timoklein/udemy_cpp
C++ Course
Language:Makefile2 01
timoklein/uvadlc_notebooks
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Language:Jupyter Notebook0 0
timoklein/wandb_tutorial
Code example for some basic wandb functionality
Language:Python1 0

timoklein

Pinned Repositories

breaking-the-reclustering-barrier

alphazero-gym

car_racer

crelu-pytorch

implicit_underparameterization

infer

ma_thesis

neural_citation

plasticity-injection-torch

redo

timoklein's Repositories

timoklein/redo

timoklein/alphazero-gym

timoklein/car_racer

timoklein/plasticity-injection-torch

timoklein/infer

timoklein/neural_citation

timoklein/crelu-pytorch

timoklein/implicit_underparameterization

timoklein/ma_thesis

timoklein/bandit_algos

timoklein/cleanrl

timoklein/ClustPy

timoklein/cpp_optim

timoklein/dmcgym

timoklein/garage

timoklein/markov-abstractions-ablations

timoklein/outlier_detection

timoklein/JAX-in-Action

timoklein/Minigrid

timoklein/navix

timoklein/purejaxql

timoklein/rl_graph_breaks

timoklein/Stoix

timoklein/udemy_cpp

timoklein/uvadlc_notebooks

timoklein/wandb_tutorial