kvas7andy

AI engineer | Deep Reinforcement Learning & Generative AI researcher

The HKUSTHong Kong, Israel

Pinned Repositories

bm3il
Bayesian Multi-type Mean Field Multi-agent Imitation Learning
Language:Python3 3 00
CyberBattleSim_Web
Version of CyberBattleSim https://github.com/microsoft/CyberBattleSim with extended funcitonality for training RL agents attacks on web applications
Language:Jupyter Notebook1 2 00
Kaggle
Kaggle competitions
Language:Python0 2 00
kdd_project_2018
Team 20 KDD course project 2018 "Fine-Tuning strategy for classification based on transfer & active learning"
Language:Python6 4 02
maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python0 3 00
OHR_SmoothSVM
Online Handwriting Recognition with Smoothed SVM
Language:TeX3 2 00
unreal
Reinforcement learning with unsupervised auxiliary tasks
Language:Python2 3 00

kvas7andy's Repositories

kvas7andy/kdd_project_2018
Team 20 KDD course project 2018 "Fine-Tuning strategy for classification based on transfer & active learning"
Language:Python6 4 02
kvas7andy/bm3il
Bayesian Multi-type Mean Field Multi-agent Imitation Learning
Language:Python3 3 00
kvas7andy/unreal
Reinforcement learning with unsupervised auxiliary tasks
Language:Python2 3 00
kvas7andy/CyberBattleSim_Web
Version of CyberBattleSim https://github.com/microsoft/CyberBattleSim with extended funcitonality for training RL agents attacks on web applications
Language:Jupyter Notebook1 2 00
kvas7andy/maml_rl
Code for RL experiments in "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python0 3 00
kvas7andy/ai-deadlines
:alarm_clock: AI conference deadline countdowns
Language:HTML1 0
kvas7andy/aron_assign
Language:Jupyter Notebook2 0
kvas7andy/CyberBattleSim
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.
kvas7andy/deep-reinforcement-learning
Repo for the Deep Reinforcement Learning Nanodegree program
Language:Jupyter Notebook1 0
kvas7andy/DirectedInfo-GAIL
Language:Python1 0
kvas7andy/drl_berkley_course
Lecture notes & Assignments of the CS294-112 course on Deep Reinforcement Learning in UC Berkley
Language:Python2 0
kvas7andy/examples
A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.
Language:Python2 0
kvas7andy/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python1 0
kvas7andy/HowToTrainYourMAMLPytorch
The original code for the paper "How to train your MAML" along with a replication of the original "Model Agnostic Meta Learning" (MAML) paper in Pytorch.
Language:Python3 0
kvas7andy/MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
Language:Python1 0
kvas7andy/MAGAIL
Pytorch implementation of Multi-Agent Generative Adversarial Imitation Learning
Language:Python1 0
kvas7andy/magail_felixykliu
Language:Python1 0
kvas7andy/maml
Code for "Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks"
Language:Python3 0
kvas7andy/minos
MINOS: Multimodal Indoor Simulator
Language:Python3 0
kvas7andy/multiagent-gail
Language:Python2 0
kvas7andy/multiagent-gail_wsjeon
multiagent-gail working with multiagent-particle-env-v2 (which was modified by magail authors)
Language:Python1 0
kvas7andy/multiagent-particle-envs
Language:Python1 0
kvas7andy/ngsim_env
Learning human driver models from NGSIM data with imitation learning.
Language:Jupyter Notebook2 0
kvas7andy/olympics2021
Updated with World Bank Data Olympics dataset and NEW PCP coordinates plot notebook
Language:Jupyter Notebook
kvas7andy/Option-GAIL
Language:Python1 0
kvas7andy/papers
Research papers outline and analysis
2 0
kvas7andy/polygon-pascalvoc-writer
For generating Pascal VOC XML image annotation files. Supports polygon & bounding-boxes.
Language:Python1 0
kvas7andy/Practical_RL
A course in reinforcement learning in the wild
Language:Jupyter Notebook2 0
kvas7andy/ROMA
Codes accompanying the paper "ROMA: Multi-Agent Reinforcement Learning with Emergent Roles" (ICML 2020 https://arxiv.org/abs/2003.08039)
Language:Python1 0
kvas7andy/tensorboard
TensorFlow's Visualization Toolkit
Language:Python2 0

kvas7andy

Pinned Repositories

bm3il

CyberBattleSim_Web

Kaggle

kdd_project_2018

maml_rl

OHR_SmoothSVM

unreal

kvas7andy's Repositories

kvas7andy/kdd_project_2018

kvas7andy/bm3il

kvas7andy/unreal

kvas7andy/CyberBattleSim_Web

kvas7andy/maml_rl

kvas7andy/ai-deadlines

kvas7andy/aron_assign

kvas7andy/CyberBattleSim

kvas7andy/deep-reinforcement-learning

kvas7andy/DirectedInfo-GAIL

kvas7andy/drl_berkley_course

kvas7andy/examples

kvas7andy/gym

kvas7andy/HowToTrainYourMAMLPytorch

kvas7andy/MA-AIRL

kvas7andy/MAGAIL

kvas7andy/magail_felixykliu

kvas7andy/maml

kvas7andy/minos

kvas7andy/multiagent-gail

kvas7andy/multiagent-gail_wsjeon

kvas7andy/multiagent-particle-envs

kvas7andy/ngsim_env

kvas7andy/olympics2021

kvas7andy/Option-GAIL

kvas7andy/papers

kvas7andy/polygon-pascalvoc-writer

kvas7andy/Practical_RL

kvas7andy/ROMA

kvas7andy/tensorboard