yangyi0318

Pinned Repositories

adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks.
Language:Python0 1 00
awesome-latex-drawing
Drawing Bayesian networks, graphical models, tensors, and technical frameworks and illustrations in LaTeX.
Language:TeX00
Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
0 1 00
awesome-self-supervised-learning
A curated list of awesome self-supervised methods
0 1 00
badnets-pytorch
Simple PyTorch implementations of Badnets on MNIST and CIFAR10.
Language:Python0 0 00
boolean_composition
Code for the paper "A Boolean Task Algebra For Reinforcement Learning"
Language:Python0 1 00
cla_demo
Demo code for a clustering-based label-aware autoencoder
Language:Python0 1 00
composition
Code for the paper "Composing Value Functions in Reinforcement Learning"
Language:HTML0 1 00
CoNAL
Code for AAAI 2021 long paper Learning from Crowds by Modeling Common Confusions.
Language:Python0 1 00
cpu
《自己动手写CPU》
Language:Verilog0 1 00

yangyi0318's Repositories

yangyi0318/adversarial-attacks-pytorch
PyTorch implementation of adversarial attacks.
Language:Python0 1 00
yangyi0318/awesome-latex-drawing
Drawing Bayesian networks, graphical models, tensors, and technical frameworks and illustrations in LaTeX.
Language:TeX00
yangyi0318/Awesome-Learning-with-Label-Noise
A curated list of resources for Learning with Noisy Labels
0 1 00
yangyi0318/awesome-self-supervised-learning
A curated list of awesome self-supervised methods
0 1 00
yangyi0318/badnets-pytorch
Simple PyTorch implementations of Badnets on MNIST and CIFAR10.
Language:Python0 0 00
yangyi0318/boolean_composition
Code for the paper "A Boolean Task Algebra For Reinforcement Learning"
Language:Python0 1 00
yangyi0318/cla_demo
Demo code for a clustering-based label-aware autoencoder
Language:Python0 1 00
yangyi0318/composition
Code for the paper "Composing Value Functions in Reinforcement Learning"
Language:HTML0 1 00
yangyi0318/CoNAL
Code for AAAI 2021 long paper Learning from Crowds by Modeling Common Confusions.
Language:Python0 1 00
yangyi0318/cpu
《自己动手写CPU》
Language:Verilog0 1 00
yangyi0318/dads
Code for 'Dynamics-Aware Unsupervised Discovery of Skills' (DADS). Enables skill discovery without supervision, which can be combined with model-based control.
Language:Python1 0
yangyi0318/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Language:Python1 0
yangyi0318/deep_laa
Language:Python1 0
yangyi0318/dm_control
DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
Language:Python1 0
yangyi0318/fuzzy-data-fusion
yangyi0318/garage
A toolkit for reproducible reinforcement learning research.
Language:Python1 0
yangyi0318/hindsight-experience-replay
This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.
Language:Python1 0
yangyi0318/Learning-Independent-SKills
Task dependent skill transformation is challenging due to the ignorance of the relationships between primitive skills. In this project, we propose a skill decomposition algorithm to learn independent skills, which are more suitable than primitive skills for task dependent skill transformation.
Language:Python1 0
yangyi0318/NSFC-LaTex
Language:TeX0 0
yangyi0318/paper-reading
比做算法的懂工程落地，比做工程的懂算法模型。
yangyi0318/ptan
PyTorch Agent Net: reinforcement learning toolkit for pytorch
Language:Python1 0
yangyi0318/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
Language:Python1 0
yangyi0318/raylab
Reinforcement learning algorithms in RLlib
Language:Python1 0
yangyi0318/rllab-curriculum
yangyi0318/softlearning
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Language:Python1 0
yangyi0318/SoftQLearning
SoftQ Implementation
yangyi0318/spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
Language:Python1 0
yangyi0318/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
yangyi0318/Stein-Variational-Gradient-Descent
code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"
Language:Python1 0
yangyi0318/Tabular-RL-with-Python
Tabular Reinforcement Learning Algorithms with NumPy & Visualizations with Seaborn
Language:Jupyter Notebook1 0