Pinned Repositories
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
alphazero_singleplayer
Single player Alpha Zero implementation
AWT2017
AWT course project. A crowdsourcing web application where masters can publish jobs and workers can subscribe to and complete them. Built with a model-driven approach.
bayesianQLearning
Implementation of the Bayesian Q-Learning RL algorithm
Briscola
Project for a Mobile Development class. A native Android implementation of the classic Italian card game
como-center-prediction
A data pipeline used to build a predictive model of the number of people visiting the center of the city of Como, Northern Italy, based on data collected by sensors in the city center and on weather data
oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
PRL_2021_Open_Loop_Planning_F1_Strategy
Reccomender-System
Recommender systems course project. Contains collaborative user-based, item-based, and content-based recommenders
weightedDQN
amarildolikmeta's Repositories
amarildolikmeta/bayesianQLearning
Implementation of the Bayesian Q-Learning RL algorithm
amarildolikmeta/oac-explore
Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)
amarildolikmeta/PRL_2021_Open_Loop_Planning_F1_Strategy
amarildolikmeta/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4
amarildolikmeta/alphazero_singleplayer
Single player Alpha Zero implementation
amarildolikmeta/AWT2017
AWT course project. A crowdsourcing web application where masters can publish jobs and workers can subscribe to and complete them. Built with a model-driven approach.
amarildolikmeta/Briscola
Project for a Mobile Development class. A native Android implementation of the classic Italian card game
amarildolikmeta/como-center-prediction
A data pipeline used to build a predictive model of the number of people visiting the center of the city of Como, Northern Italy, based on data collected by sensors in the city center and on weather data
amarildolikmeta/Reccomender-System
Recommender systems course project. Contains collaborative user-based, item-based, and content-based recommenders
amarildolikmeta/weightedDQN
amarildolikmeta/AML_project
amarildolikmeta/Disertation
Dissertation for my Master's thesis: Driving Exploration Through Particle Q Distributions
amarildolikmeta/ewrl2022
Website for the European Workshop on Reinforcement Learning 2022
amarildolikmeta/irl_real_life
amarildolikmeta/M2L_Poster
Poster and presentation for 'Handling Non-Stationary Experts in Inverse Reinforcement Learning', presented at M2L 2020
amarildolikmeta/models
Models and examples built with TensorFlow
amarildolikmeta/small-mbrl
amarildolikmeta/torch_workers_gpu
amarildolikmeta/wac_explore
Code reproducing the results of the paper "Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control"
amarildolikmeta/wasserstein_actor_critic
Code for the paper "Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control"