kvas7andy
AI engineer | Deep Reinforcement Learning & Generative AI researcher
The HKUSTHong Kong, Israel
Pinned Repositories
bm3il
Bayesian Multi-type Mean Field Multi-agent Imitation Learning
CyberBattleSim_Web
Version of CyberBattleSim https://github.com/microsoft/CyberBattleSim with extended funcitonality for training RL agents attacks on web applications
Kaggle
Kaggle competitions
kdd_project_2018
Team 20 KDD course project 2018 "Fine-Tuning strategy for classification based on transfer & active learning"
maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
OHR_SmoothSVM
Online Handwriting Recognition with Smoothed SVM
unreal
Reinforcement learning with unsupervised auxiliary tasks
kvas7andy's Repositories
kvas7andy/kdd_project_2018
Team 20 KDD course project 2018 "Fine-Tuning strategy for classification based on transfer & active learning"
kvas7andy/bm3il
Bayesian Multi-type Mean Field Multi-agent Imitation Learning
kvas7andy/unreal
Reinforcement learning with unsupervised auxiliary tasks
kvas7andy/CyberBattleSim_Web
Version of CyberBattleSim https://github.com/microsoft/CyberBattleSim with extended funcitonality for training RL agents attacks on web applications
kvas7andy/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
kvas7andy/ai-deadlines
:alarm_clock: AI conference deadline countdowns
kvas7andy/aron_assign
kvas7andy/CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
kvas7andy/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
kvas7andy/DirectedInfo-GAIL
kvas7andy/drl_berkley_course
Lecture notes & Assignments of the CS294-112 course on Deep Reinforcement Learning in UC Berkley
kvas7andy/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
kvas7andy/gym
A toolkit for developing and comparing reinforcement learning algorithms.
kvas7andy/HowToTrainYourMAMLPytorch
The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) paper in Pytorch.
kvas7andy/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
kvas7andy/MAGAIL
Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning
kvas7andy/magail_felixykliu
kvas7andy/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
kvas7andy/minos
MINOS: Multimodal Indoor Simulator
kvas7andy/multiagent-gail
kvas7andy/multiagent-gail_wsjeon
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
kvas7andy/multiagent-particle-envs
kvas7andy/ngsim_env
Learning human driver models from NGSIM data with imitation learning.
kvas7andy/olympics2021
Updated with World Bank Data Olympics dataset and NEW PCP coordinates plot notebook
kvas7andy/Option-GAIL
kvas7andy/papers
Research papers outline and analysis
kvas7andy/polygon-pascalvoc-writer
For generating Pascal VOC XML image annotation files. Supports polygon & bounding-boxes.
kvas7andy/Practical_RL
A course in reinforcement learning in the wild
kvas7andy/ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
kvas7andy/tensorboard
TensorFlow's Visualization Toolkit