markov-decision-processes

There are 349 repositories under the markov-decision-processes topic.

afshinea/stanford-cs-221-artificial-intelligence
VIP cheatsheets for Stanford's CS 221 Artificial Intelligence
2.5k 86 1486
sudharsan13296/Hands-On-Reinforcement-Learning-With-Python
Master Reinforcement and Deep Reinforcement Learning using OpenAI Gym and TensorFlow
Language: Jupyter Notebook
827 44 3325
JuliaPOMDP/POMDPs.jl
MDPs and POMDPs in Julia - An interface for defining, solving, and simulating fully and partially observable Markov decision processes on discrete and continuous spaces.
Language: Julia
649 44 34298
Svalorzen/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
Language: C++
641 34 5999
joanby/curso-algebra-lineal
Linear Algebra Course
Language: HTML
439 200 01k
ds4dm/ecole
Extensible Combinatorial Optimization Learning Environments
Language: C++
311 8 15467
odow/SDDP.jl
Stochastic Dual Dynamic Programming in Julia
Language: Julia
282 17 30259
h2r/pomdp-py
A framework to build and solve POMDP problems. Documentation: https://h2r.github.io/pomdp-py/
Language: Python
199 11 3648
colinskow/move37
Coding Demos from the School of AI's Move37 Course
Language: Python
180 15 3115
DES-Lab/AALpy
An Automata Learning Library Written in Python
Language: Python
154 6 2920
Limmen/csle
A research platform to develop automated security policies using quantitative methods, e.g., optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.
Language: Python
104 5 10219
florist-notes/CS228_PGM
🌀 Stanford CS 228 - Probabilistic Graphical Models
Language: Python
92 2 129
sachinbiradar9/Markov-Decision-Processes
Implementation of the value iteration algorithm for calculating an optimal MDP policy
Language: Python
91 3 142
wrighteagle2d/wrighteaglebase
WrightEagle Base Code for RoboCup Soccer Simulation 2D
Language: C++
88 13 137
OpenSourceEconomics/respy
Framework for the simulation and estimation of some finite-horizon discrete choice dynamic programming models.
Language: Python
75 6 19631
lsunsi/markovjs
Reinforcement Learning in JavaScript
Language: JavaScript
74 7 04
italohdc/LearnSnake
🐍 AI that learns to play Snake using Q-Learning (Reinforcement Learning)
Language: JavaScript
70 4 019
amflorio/dvrp-stochastic-requests
Online algorithms for solving large-scale dynamic vehicle routing problems with stochastic requests
Language: Makefile
65 2 118
ImanRHT/QECO
A QoE-Oriented Computation Offloading Algorithm based on Deep Reinforcement Learning for Mobile Edge Computing
Language: Python
63 6 713
rllab-snu/tsallis_actor_critic_mujoco
Implementation of Tsallis Actor Critic method
Language: Jupyter Notebook
61 11 010
masouduut94/MCTS-agent-python
Monte Carlo Tree Search (MCTS) is a method for finding optimal decisions in a given domain by taking random samples in the decision space and building a search tree accordingly. It has already had a profound impact on Artificial Intelligence (AI) approaches for domains that can be represented as trees of sequential decisions, particularly games and planning problems. In this project I used a board game called "HEX" as a platform to test different simulation strategies in the MCTS field.
Language: Python
60 3 19
chauvinSimon/Hierarchical-Decision-Making-for-Autonomous-Driving
Rich literature review and discussion on the implementation of "Hierarchical Decision-Making for Autonomous Driving"
56 4 014
thiagopbueno/awesome-probabilistic-planning
A curated list of online resources for probabilistic planning: papers, software and research groups around the world!
54 2 012
aws-samples/amazon-sagemaker-amazon-routing-challenge-sol
AWS Last Mile Route Sequence Optimization
Language: Python
53 8 112
sshkhr/Practical_RL
My solutions to the Yandex Practical Reinforcement Learning course in PyTorch and TensorFlow
Language: Jupyter Notebook
53 5 125
callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook
47 3 015
zafarali/emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
Language: Python
47 5 814
alexge233/relearn
A Reinforcement Learning Library for C++11/14
Language: C++
42 4 213
dsietz/test-data-generation
Test Data Generation
Language: Rust
36 8 133
iisys-hof/map-matching-2
High Performance Map Matching with Markov Decision Processes (MDPs) and Hidden Markov Models (HMMs).
Language: C++
34 5 37
nasa/pymdptoolbox
Markov Decision Process (MDP) Toolbox for Python
Language: Python
32 15 031
JuliaPOMDP/QuickPOMDPs.jl
Concise and friendly interfaces for defining MDP and POMDP models for use with POMDPs.jl solvers
Language: Julia
29 5 196
shehio/Everything-Financial-Engineering
Links for the most relevant topics
28 2 02
kevin-hanselman/grid-world-rl
Value iteration, policy iteration, and Q-Learning in a grid-world MDP.
Language: Python
24 4 012
makokal/MDPN
Unified notation for Markov Decision Processes PO(MDP)s
Language: TeX
24 2 32
yudhisteer/Reinforcement-Learning-for-Supply-Chain-Management
The goal of the project was to design the logistic model of autonomous robots that would supply garment parts from the Cutting Dept to the Makeup Dept in the shortest time possible and using the most optimized path.
Language: Python
24 1 06
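
Several of the repositories listed above mention value iteration for computing an optimal MDP policy. As a rough illustration only, and not taken from any of those repositories, here is a minimal sketch in plain Python. The toy MDP, the state and action names, and the function name `value_iteration` are all hypothetical, chosen just for this example.

```python
from typing import Dict, List, Tuple

# Hypothetical representation (an assumption for this sketch):
# transitions[state][action] = list of (probability, next_state, reward).
# A state with no actions is treated as terminal.
Outcome = Tuple[float, str, float]
Transitions = Dict[str, Dict[str, List[Outcome]]]


def q_value(outcomes: List[Outcome], values: Dict[str, float], gamma: float) -> float:
    """Expected return of one action: sum of p * (reward + gamma * V(next))."""
    return sum(p * (r + gamma * values[s2]) for p, s2, r in outcomes)


def value_iteration(transitions: Transitions, gamma: float = 0.9, tol: float = 1e-8):
    """Repeat Bellman optimality backups until the largest change is below tol."""
    values = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for state, actions in transitions.items():
            if not actions:  # terminal state keeps value 0
                continue
            best = max(q_value(o, values, gamma) for o in actions.values())
            delta = max(delta, abs(best - values[state]))
            values[state] = best  # in-place update
        if delta < tol:
            break
    # Greedy policy with respect to the converged values.
    policy = {
        state: max(actions, key=lambda a: q_value(actions[a], values, gamma))
        for state, actions in transitions.items()
        if actions
    }
    return values, policy


# Hypothetical toy MDP: a sure reward of 1, or a gamble that pays 4 half
# the time and otherwise returns to the start with no reward.
toy_mdp: Transitions = {
    "start": {
        "safe": [(1.0, "end", 1.0)],
        "risky": [(0.5, "end", 4.0), (0.5, "start", 0.0)],
    },
    "end": {},
}

values, policy = value_iteration(toy_mdp)
```

In this toy problem the gamble satisfies V = 2 + 0.45 V under a discount of 0.9, so its value is 2 / 0.55, which exceeds the sure reward of 1 and makes `risky` the greedy choice. The libraries listed above expose their own interfaces, which differ from this sketch.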
