kschweig

Institute of Machine Learning - JKU LinzLinz

Pinned Repositories

babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Language:Python00
BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python00
CCC
Code for CCC
Language:Python00
d4rl
A benchmark for offline reinforcement learning.
Language:Python00
DDU
Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty
Language:Jupyter Notebook00
disparate-benefits
The Disparate Benefits of Deep Ensembles
Language:Jupyter Notebook00
DissectOfflineRL
Dissect Offline Reinforcement Learning, what do we need wrt. datasets and buffer strategies to succeed in this setting.
Language:Python0 1 00
entropybaseduq
Language:Python00
error-parity
Achieve error-rate fairness between societal groups for any score-based classifier.
Language:Python00
OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Language:Jupyter Notebook24 1 16

kschweig's Repositories

kschweig/OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Language:Jupyter Notebook24 1 16
kschweig/babyai
BabyAI platform. A testbed for training agents to understand and execute language commands.
Language:Python00
kschweig/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python00
kschweig/CCC
Code for CCC
Language:Python00
kschweig/d4rl
A benchmark for offline reinforcement learning.
Language:Python00
kschweig/DDU
Code for Deterministic Neural Networks with Appropriate Inductive Biases Capture Epistemic and Aleatoric Uncertainty
Language:Jupyter Notebook00
kschweig/disparate-benefits
The Disparate Benefits of Deep Ensembles
Language:Jupyter Notebook00
kschweig/DissectOfflineRL
Dissect Offline Reinforcement Learning, what do we need wrt. datasets and buffer strategies to succeed in this setting.
Language:Python0 1 00
kschweig/entropybaseduq
Language:Python00
kschweig/error-parity
Achieve error-rate fairness between societal groups for any score-based classifier.
Language:Python00
kschweig/GPT2-Volley
Language:Jupyter Notebook00
kschweig/gym-games
A gym version of various games for reinforcenment learning.
Language:Python00
kschweig/hopfield-layers
Hopfield Networks is All You Need
kschweig/MinAtar
Language:Python
kschweig/offline-rl.github.io
kschweig/ProjectOfflineRL
Project work in the domain of offline RL
Language:Python
kschweig/python-zwoasi
Python binding for the ZWO ASI library. Control ZWO ASI cameras from python.
kschweig/shrinkbench-models
kschweig/SNNs
Tutorials and implementations for "Self-normalizing networks"
Language:Jupyter Notebook
kschweig/spe2py
Loads Princeton Instruments LightField (SPE 3.0) files into a python environment.
kschweig/take_it_easy
Homebrew stochastic Reinforcement Learning environment to test various DRL algorithms on
Language:Python
kschweig/torch-sgld
SGLD and cSGLD as a PyTorch Optimizer
kschweig/understandingbdl

kschweig

Pinned Repositories

babyai

BCQ

CCC

d4rl

DDU

disparate-benefits

DissectOfflineRL

entropybaseduq

error-parity

OfflineRL

kschweig's Repositories

kschweig/OfflineRL

kschweig/babyai

kschweig/BCQ

kschweig/CCC

kschweig/d4rl

kschweig/DDU

kschweig/disparate-benefits

kschweig/DissectOfflineRL

kschweig/entropybaseduq

kschweig/error-parity

kschweig/GPT2-Volley

kschweig/gym-games

kschweig/hopfield-layers

kschweig/MinAtar

kschweig/offline-rl.github.io

kschweig/ProjectOfflineRL

kschweig/python-zwoasi

kschweig/shrinkbench-models

kschweig/SNNs

kschweig/spe2py

kschweig/take_it_easy

kschweig/torch-sgld

kschweig/understandingbdl