reinforcement-learning-F22

Repository that contains assignments for RL-F22 course work in Innopolis University

Removing Cargo Agent:

An Agent based on DQN that moves several cargos across the grid world into a common desirable r ectangle area. There are several cargos, each cargo can be moved either horizontally or vertically by one cell up or down. The size of the overall world may vary, as well as the placement of the cargo and desirable area. The game ends when all the cargos are in the desirable area and do not overlap.

Implementation of Different Deep Learning Architectures:

Implemention of building blocks for VGG, GoogleNet, ResNet, ResNeXt and SENet.

Multi Armed Bandit:

Implementation of multi armed bandit and exploring different policies, greedy, epsilon-greedy, softmax and softmax with temprature.

SARSA and SARSA(λ) :

Implementation of SARSA and SARSA(λ) learning algorithms on a windy gridworld environment.

mariammaher550/reinforcement-learning-F22

reinforcement-learning-F22

Removing Cargo Agent:

Implementation of Different Deep Learning Architectures:

Multi Armed Bandit:

SARSA and SARSA(λ) :