kosmitive

Pinned Repositories

abstract_rl
A modular Python implementation of various policy gradient algorithms for control problems on experimental Quanser robots. This repository includes implementations of Maximum A Posteriori Policy Optimization, Trust Region Policy Optimization, and a draft of Soft Actor-Critic.
Language: Python
approximate-signal-cancellation
A small framework for simulating signal processing algorithms. It includes a GUI and simple transformations, ranging from direct inversion and Fourier transform techniques to regression techniques.
Language: C++
boed-pytorch
A simple project that explores the variational estimators of Foster et al. (https://arxiv.org/abs/1903.05480) in a Bayesian linear regression setting. Using nested Monte Carlo estimators, the exact (convex) information gain is calculated for the regression.
Language: Python
bootstrapped-dqn
An implementation of bootstrapped DQN (https://arxiv.org/abs/1602.04621). It was created during my bachelor thesis at TU Darmstadt; the thesis is available at http://www.ias.tu-darmstadt.de/uploads/Theses/Abschlussarbeiten/markus_semmler_bsc.pdf.
Language: Python
burrolib
Burrolib is a library for multi-agent Markov games, aimed at researchers. It considers Markov games from an economic perspective. The modular agent design allows different agent implementations for a single game. The user can choose between designing an expert system and a free learning approach.
Language: Python
fmpsdk
SDK for Financial Modeling Prep's (FMP) API
Language: Python
gitignore
A collection of useful .gitignore templates
pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Language: Python
qlearn
This repository contains scripts for running Q-learning algorithms on different environments. It uses TensorFlow and features several discrete state/action environments. It can display different plots, such as the value function or a comparison between agents. The focus lies on exploration efficiency.
Language: Python
sticky-hdp-slds-hmm
An implementation of a hierarchical Dirichlet process (HDP) combined with a switching linear dynamical system (SLDS) from https://arxiv.org/abs/1003.3829. It is a rather complex model and thus computationally expensive. Note that the hyperparameters have to be adjusted.
Language: Python

kosmitive's Repositories

kosmitive/burrolib
Burrolib is a library for multi-agent Markov games, aimed at researchers. It considers Markov games from an economic perspective. The modular agent design allows different agent implementations for a single game. The user can choose between designing an expert system and a free learning approach.
Language: Python
kosmitive/abstract_rl
A modular Python implementation of various policy gradient algorithms for control problems on experimental Quanser robots. This repository includes implementations of Maximum A Posteriori Policy Optimization, Trust Region Policy Optimization, and a draft of Soft Actor-Critic.
Language: Python
kosmitive/boed-pytorch
A simple project that explores the variational estimators of Foster et al. (https://arxiv.org/abs/1903.05480) in a Bayesian linear regression setting. Using nested Monte Carlo estimators, the exact (convex) information gain is calculated for the regression.
Language: Python
kosmitive/bootstrapped-dqn
An implementation of bootstrapped DQN (https://arxiv.org/abs/1602.04621). It was created during my bachelor thesis at TU Darmstadt; the thesis is available at http://www.ias.tu-darmstadt.de/uploads/Theses/Abschlussarbeiten/markus_semmler_bsc.pdf.
Language: Python
kosmitive/fmpsdk
SDK for Financial Modeling Prep's (FMP) API
Language: Python
kosmitive/gitignore
A collection of useful .gitignore templates
kosmitive/pandas
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
Language: Python
kosmitive/sticky-hdp-slds-hmm
An implementation of a hierarchical Dirichlet process (HDP) combined with a switching linear dynamical system (SLDS) from https://arxiv.org/abs/1003.3829. It is a rather complex model and thus computationally expensive. Note that the hyperparameters have to be adjusted.
Language: Python
kosmitive/approximate-signal-cancellation
A small framework for simulating signal processing algorithms. It includes a GUI and simple transformations, ranging from direct inversion and Fourier transform techniques to regression techniques.
Language: C++
kosmitive/qlearn
This repository contains scripts for running Q-learning algorithms on different environments. It uses TensorFlow and features several discrete state/action environments. It can display different plots, such as the value function or a comparison between agents. The focus lies on exploration efficiency.
Language: Python
kosmitive/rnn-tetherball-dynamics
Bachelor project that uses recurrent neural networks to predict the 3-dimensional dynamics of a tetherball. It implements highway networks and gated recurrent units. The implementation is highly modular, based on TensorFlow, and can be adapted accordingly.
Language: Python
kosmitive/sampleproject
A sample project that exists for PyPUG's "Tutorial on Packaging and Distributing Projects"
Language: Python
kosmitive/univariate-distributions
This repository contains several continuous and discrete univariate distributions. It uses the MRG32k3a generator to create uniform samples. These uniform samples are then transformed to yield a sample from any supported distribution. A space system is used to represent the domain of the samples.
Language: Python
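
The idea of transforming uniform samples into samples from another distribution can be sketched as follows. This is a minimal illustration, not code from the repository: the function names are hypothetical, and Python's built-in `random.Random` (Mersenne Twister) stands in for the MRG32k3a generator as the uniform source.

```python
import math
import random


def sample_exponential(rate, uniform):
    """Map a uniform sample in [0, 1) to an Exponential(rate) sample."""
    # Inverse CDF of the exponential distribution: F^{-1}(u) = -ln(1 - u) / rate
    return -math.log(1.0 - uniform) / rate


def sample_bernoulli(p, uniform):
    """Map a uniform sample in [0, 1) to a Bernoulli(p) sample (0 or 1)."""
    # The event {uniform < p} has probability p.
    return 1 if uniform < p else 0


# Stand-in uniform source (assumption: NOT MRG32k3a, just the standard library).
rng = random.Random(0)
uniforms = [rng.random() for _ in range(10000)]

# Transform the same uniform stream into exponential samples with rate 2.
exp_samples = [sample_exponential(2.0, u) for u in uniforms]
exp_mean = sum(exp_samples) / len(exp_samples)  # should be close to 1 / rate
```

Keeping the uniform source separate from the transformation means the generator can be swapped without touching the distribution code.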