janmaltel

Pinned Repositories

alpaca-lora
Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware
Language:Jupyter Notebook0 0 00
bogaaak.github.io
Language:HTML00
deepspeed-sagemaker-example
Language:Jupyter Notebook0 0 00
dominance-filters
filter actions using (cumulative) dominance
Language:Python00
examples
TensorFlow examples
Language:Jupyter Notebook00
GazeHeuristic
Explore effectiveness and failure modes of the Gaze Heuristic
Language:JavaScript00
gradio
Create UIs for your machine learning model in Python in 3 minutes
Language:HTML00
pytetris
Language:Python3 1 00
q-learning-introduction
Code for the talk "A gentle introduction to Q-learning in Python" held at the PyData Bristol Meetup on July 18, 2019.
Language:Jupyter Notebook1 0 00
stew-tetris
Shrinkage Toward Equal Weights in Tetris
Language:Python2 1 01

janmaltel's Repositories

janmaltel/pytetris
Language:Python3 1 00
janmaltel/stew-tetris
Shrinkage Toward Equal Weights in Tetris
Language:Python2 1 01
janmaltel/q-learning-introduction
Code for the talk "A gentle introduction to Q-learning in Python" held at the PyData Bristol Meetup on July 18, 2019.
Language:Jupyter Notebook1 0 00
janmaltel/alpaca-lora
Code for reproducing the Stanford Alpaca InstructLLaMA result on consumer hardware
Language:Jupyter Notebook0 0 00
janmaltel/bogaaak.github.io
Language:HTML00
janmaltel/deepspeed-sagemaker-example
Language:Jupyter Notebook0 0 00
janmaltel/dominance-filters
filter actions using (cumulative) dominance
Language:Python00
janmaltel/examples
TensorFlow examples
Language:Jupyter Notebook00
janmaltel/GazeHeuristic
Explore effectiveness and failure modes of the Gaze Heuristic
Language:JavaScript00
janmaltel/gradio
Create UIs for your machine learning model in Python in 3 minutes
Language:HTML00
janmaltel/gym
A toolkit for developing and comparing reinforcement learning algorithms.
janmaltel/gym-feature-gridworld
A simple gridworld environment where state-action pairs have a simple feature representation. Uses OpenAI gym format.
Language:Python1 0
janmaltel/models
Models and examples built with TensorFlow
Language:Python0 0
janmaltel/ProMP
ProMP: Proximal Meta-Policy Search
Language:Python0 0
janmaltel/pytorch-maml-rl
Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch
Language:Python
janmaltel/rand_param_envs
Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7
Language:Python1
janmaltel/rl-baselines-zoo
A collection of 100+ pre-trained RL agents using Stable Baselines, training and hyperparameter optimization included.
Language:Python0 0
janmaltel/rl-visualizations
Simple visualizations of basic RL algorithms in a simple gridworld. Written in Python, the visualizations can be seen directly in the respective jupyter notebooks.
1 0
janmaltel/stable-baselines
A fork of OpenAI Baselines, implementations of reinforcement learning algorithms
Language:Python0 0
janmaltel/stew
Shrinkage Toward Equal Weights
Language:Python1 0
janmaltel/td-gammon
Implementation of TD-Gammon in TensorFlow.
Language:Python
janmaltel/template
This is the repository for the distill web framework
Language:JavaScript0 0
janmaltel/tetris
A Tetris implementation tailored for use in reinforcement learning applications.
Language:Python1
janmaltel/toy-data
Create toy data for linear or deep machine learning models
Language:Python2 0
janmaltel/v97
Proceedings of ICML 2019
Language:TeX0 0
janmaltel/weightagnostic.github.io
repo for interactive article
Language:JavaScript