mandanasmi

PhD student interested in reinforcement learning and the brain

@mila_iqia Montreal, Canada

Pinned Repositories

alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Python00
amortized-dag-gflownet
Code for "Bayesian Structure Learning with Generative Flow Networks"
Language:Python00
argos3
A parallel, multi-engine simulator for heterogeneous swarm robotics
Language:C++00
awesome-autonomous-vehicles
Curated List of Self-Driving Cars and Autonomous Vehicles Resources
00
comp-551-machine-learning
Applied Machine Learning Courseworks
Language:Jupyter Notebook1 2 00
comp-6771-image-processing
Image Processing courseworks
Language:Matlab1 3 01
lane-slam
SLAM using line
Language:Python26 7 112
life-long-learning
Useful resources to learn lifelong learning
23 3 14
pol-job-finder
An interface for connecting employees with employers
Language:HTML1 4 32
TCGA_Benchmark
TCGA Benchmark Tasks for Clinical Attribute Prediction based on Genome
Language:Jupyter Notebook12 1 15

mandanasmi's Repositories

mandanasmi/lane-slam
SLAM using line
Language:Python26 7 112
mandanasmi/life-long-learning
Useful resources to learn lifelong learning
23 3 14
mandanasmi/TCGA_Benchmark
TCGA Benchmark Tasks for Clinical Attribute Prediction based on Genome
Language:Jupyter Notebook12 1 15
mandanasmi/comp-551-machine-learning
Applied Machine Learning Courseworks
Language:Jupyter Notebook1 2 00
mandanasmi/comp-6771-image-processing
Image Processing courseworks
Language:Matlab1 3 01
mandanasmi/pol-job-finder
An interface for connecting employees with employers
Language:HTML1 4 32
mandanasmi/alpha-zero-general
A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more
Language:Python00
mandanasmi/amortized-dag-gflownet
Code for "Bayesian Structure Learning with Generative Flow Networks"
Language:Python00
mandanasmi/argos3
A parallel, multi-engine simulator for heterogeneous swarm robotics
Language:C++00
mandanasmi/awesome-autonomous-vehicles
Curated List of Self-Driving Cars and Autonomous Vehicles Resources
00
mandanasmi/COMP579-Project-Template
Language:Python00
mandanasmi/concordia-thesis-template
Concordia Thesis Latex Template
Language:TeX
mandanasmi/ConSpec
Language:Python0 0
mandanasmi/ContrastiveRL
coding up conspec paper from scratch plus extensions
Language:Python
mandanasmi/EC
Episodic Control
mandanasmi/fastargs
Python library for argument and configuration management
mandanasmi/gym-duckietown-agent
This is the template for the gym agent.
Language:Python2 0
mandanasmi/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python
mandanasmi/MEMRL
PhD Thesis work -- computational model of learning and memory in decision making in reinforcement learning tasks
mandanasmi/mushroom-rl
Python library for Reinforcement Learning.
Language:Python1 0
mandanasmi/Neural-Episodic-Control
Implementation of Deepmind's Neural Episodic Control
Language:Python1 0
mandanasmi/nma_rl_games
Language:Python1 0
mandanasmi/personal-webpage
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript1 0
mandanasmi/Software
This repository contains all the software that runs on the Duckiebot, as well as support files (e.g. maps), plus hardware schematics.
Language:Python
mandanasmi/summary-of-my-online-courses
A summary of my favorite online courses
Language:Jupyter Notebook2 0
mandanasmi/torch-rl
A recurrent, multi-process and readable PyTorch implementation of the deep reinforcement algorithms A2C and PPO
Language:Jupyter Notebook

mandanasmi

Pinned Repositories

alpha-zero-general

amortized-dag-gflownet

argos3

awesome-autonomous-vehicles

comp-551-machine-learning

comp-6771-image-processing

lane-slam

life-long-learning

pol-job-finder

TCGA_Benchmark

mandanasmi's Repositories

mandanasmi/lane-slam

mandanasmi/life-long-learning

mandanasmi/TCGA_Benchmark

mandanasmi/comp-551-machine-learning

mandanasmi/comp-6771-image-processing

mandanasmi/pol-job-finder

mandanasmi/alpha-zero-general

mandanasmi/amortized-dag-gflownet

mandanasmi/argos3

mandanasmi/awesome-autonomous-vehicles

mandanasmi/COMP579-Project-Template

mandanasmi/concordia-thesis-template

mandanasmi/ConSpec

mandanasmi/ContrastiveRL

mandanasmi/EC

mandanasmi/fastargs

mandanasmi/gym-duckietown-agent

mandanasmi/maml

mandanasmi/MEMRL

mandanasmi/mushroom-rl

mandanasmi/Neural-Episodic-Control

mandanasmi/nma_rl_games

mandanasmi/personal-webpage

mandanasmi/Software

mandanasmi/summary-of-my-online-courses

mandanasmi/torch-rl