mike-gimelfarb

Researcher in artificial intelligence and reinforcement learning.

University of Toronto, Toronto

Pinned Repositories

bayesian-epsilon-greedy
Public repository for a paper in UAI 2019 describing adaptive epsilon-greedy exploration using Bayesian ensembles for deep reinforcement learning.
Language: Python
bayesian-reward-shaping
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
Language: Python
bboptpy
Powerful and scalable black-box optimization algorithms for Python and C++.
Language: C++
cascade-correlation-neural-networks
A general framework for cascade correlation architectures in Python with wrappers to keras, tensorflow and sklearn
Language: Python
contextual-policy-reuse-deep-rl
Framework for Contextually Transferring Knowledge from Multiple Source Policies in Deep Reinforcement Learning
deep-successor-features-for-transfer
A reusable framework for successor features for transfer in deep reinforcement learning using keras.
Language: Python
numerical-integration
A curated collection of Java algorithms for high-precision numerical integration of black-box functions and for estimating limits of series and sequences.
Language: Java
optim4j
Library for numerical optimization of functions written in pure Java.
Language: Java
pyRDDLGym
A toolkit for auto-generation of OpenAI Gym environments from RDDL description files.
Language: Python
pyRDDLGym-jax
JAX compilation of RDDL description files, and a differentiable planner in JAX.
Language: Python

mike-gimelfarb's Repositories

mike-gimelfarb/deep-successor-features-for-transfer
A reusable framework for successor features for transfer in deep reinforcement learning using keras.
Language: Python
mike-gimelfarb/bayesian-reward-shaping
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
Language: Python
mike-gimelfarb/cascade-correlation-neural-networks
A general framework for cascade correlation architectures in Python with wrappers to keras, tensorflow and sklearn
Language: Python
mike-gimelfarb/bayesian-epsilon-greedy
Public repository for a paper in UAI 2019 describing adaptive epsilon-greedy exploration using Bayesian ensembles for deep reinforcement learning.
Language: Python
mike-gimelfarb/bboptpy
Powerful and scalable black-box optimization algorithms for Python and C++.
Language: C++
mike-gimelfarb/numerical-integration
A curated collection of Java algorithms for high-precision numerical integration of black-box functions and for estimating limits of series and sequences.
Language: Java
mike-gimelfarb/contextual-policy-reuse-deep-rl
Framework for Contextually Transferring Knowledge from Multiple Source Policies in Deep Reinforcement Learning
mike-gimelfarb/optim4j
Library for numerical optimization of functions written in pure Java.
Language: Java
mike-gimelfarb/mfpy
A very simple framework for solving MDPs using model-free reinforcement learning.
Language: Python
mike-gimelfarb/bayesian-experience-reuse
Appendix for the IJCAI 2021 submission entitled "Bayesian Experience Reuse for Learning from Multiple Demonstrators"
mike-gimelfarb/gamma-models
Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"
mike-gimelfarb/JaxPlan-GurobiPlan-ICAPS-2024
Experiments for the ICAPS 2024 paper "JaxPlan and GurobiPlan: Optimization Baselines for Replanning in Discrete and Mixed Discrete-Continuous Probabilistic Domains"
Language: Python
mike-gimelfarb/LineSearches.jl
Line search methods for optimization and root-finding
Language: Julia
mike-gimelfarb/mike-gimelfarb
mike-gimelfarb/mike-gimelfarb.github.io
Personal website based on a Jekyll theme.
Language: SCSS
mike-gimelfarb/Recession-Predictor
Project description: https://medium.com/p/recession-prediction-using-machine-learning-de6eee16ca94?source=email-2adc3d3cd2ed--writer.postDistributed&sk=2f1dab9738769f9658634e61576a08bd
Language: Python
