Pinned Repositories
alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
amortized-dag-gflownet
Code for "Bayesian Structure Learning with Generative Flow Networks"
argos3
A parallel, multi-engine simulator for heterogeneous swarm robotics
awesome-autonomous-vehicles
Curated List of Self-Driving Cars and Autonomous Vehicles Resources
comp-551-machine-learning
Applied Machine Learning coursework
comp-6771-image-processing
Image Processing coursework
lane-slam
SLAM using line
life-long-learning
Useful resources to learn lifelong learning
pol-job-finder
An interface for connecting employees with employers
TCGA_Benchmark
TCGA Benchmark Tasks for Clinical Attribute Prediction based on Genome
mandanasmi's Repositories
mandanasmi/lane-slam
SLAM using line
mandanasmi/life-long-learning
Useful resources to learn lifelong learning
mandanasmi/TCGA_Benchmark
TCGA Benchmark Tasks for Clinical Attribute Prediction based on Genome
mandanasmi/comp-551-machine-learning
Applied Machine Learning coursework
mandanasmi/comp-6771-image-processing
Image Processing coursework
mandanasmi/pol-job-finder
An interface for connecting employees with employers
mandanasmi/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
mandanasmi/amortized-dag-gflownet
Code for "Bayesian Structure Learning with Generative Flow Networks"
mandanasmi/argos3
A parallel, multi-engine simulator for heterogeneous swarm robotics
mandanasmi/awesome-autonomous-vehicles
Curated List of Self-Driving Cars and Autonomous Vehicles Resources
mandanasmi/COMP579-Project-Template
mandanasmi/concordia-thesis-template
Concordia Thesis LaTeX Template
mandanasmi/ConSpec
mandanasmi/ContrastiveRL
Coding up the ConSpec paper from scratch, plus extensions
mandanasmi/EC
Episodic Control
mandanasmi/fastargs
Python library for argument and configuration management
mandanasmi/gym-duckietown-agent
This is the template for the gym agent.
mandanasmi/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
mandanasmi/MEMRL
PhD thesis work: a computational model of learning and memory in decision making in reinforcement learning tasks
mandanasmi/mushroom-rl
Python library for Reinforcement Learning.
mandanasmi/Neural-Episodic-Control
Implementation of DeepMind's Neural Episodic Control
mandanasmi/nma_rl_games
mandanasmi/personal-webpage
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
mandanasmi/Software
This repository contains all the software that runs on the Duckiebot, as well as support files (e.g. maps), plus hardware schematics.
mandanasmi/summary-of-my-online-courses
A summary of my favorite online courses
mandanasmi/torch-rl
A recurrent, multi-process and readable PyTorch implementation of the deep reinforcement algorithms A2C and PPO