Pinned Repositories
aamas_19
Source code for the paper "Online Abstraction with MDP Homomorphisms for Deep Learning".
adversarial
Code and hyperparameters for the paper "Generative Adversarial Networks"
ai-deadlines
:alarm_clock: AI conference deadline countdowns
AlphaGOZero-python-tensorflow
Congratulation to DeepMind! This is a reengineering implementation (on behalf of many other git repo in /support/) of DeepMind's Oct19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository has single purpose of education only)
anonymous_github
Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.
Arcade-Learning-Environment
The Arcade Learning Environment (ALE) -- a platform for AI research.
ArraySortAlgorithm
各个排序算法
atari-py
An `openai/atari-py` fork with Windows support and removed zlib/libpng dependencies. Binaries (wheels) are on "Releases" tab.
Synthetic-PandoraHearts-Jack
TStarBots
Lamperougeyxy's Repositories
Lamperougeyxy/TStarBots
Lamperougeyxy/cellpylib
A library for working with Cellular Automata, for Python.
Lamperougeyxy/compare_gan
Compare GAN code.
Lamperougeyxy/cuml
cuML - RAPIDS Machine Learning Library
Lamperougeyxy/DeepMimic
Motion imitation with deep reinforcement learning.
Lamperougeyxy/dopamine
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
Lamperougeyxy/feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
Lamperougeyxy/game-of-life
Simple Python implementation of Conway's game of life and other cellular automata, computed using numpy.fft
Lamperougeyxy/ganhacks
starter from "How to Train a GAN?" at NIPS2016
Lamperougeyxy/imitation
Code for the paper "Generative Adversarial Imitation Learning"
Lamperougeyxy/imitation-learning
Repository to store conditional imitation learning based AI that runs on CARLA.
Lamperougeyxy/irl-imitation
Implementation of Inverse Reinforcement Learning (IRL) algorithms in python/Tensorflow. Deep MaxEnt, MaxEnt, LPIRL
Lamperougeyxy/Lamperougeyxy.github.io
Open Blog of Yu Xiaoyang
Lamperougeyxy/mcts
Board game AI implementations using Monte Carlo Tree Search
Lamperougeyxy/mind-SC2
Efficient Reinforcement Learning with a Mind-Game for Full-Length StarCraft II
Lamperougeyxy/minigo
An open-source implementation of the AlphaGoZero algorithm
Lamperougeyxy/morph-net
Fast & Simple Resource-Constrained Learning of Deep Network Structure
Lamperougeyxy/OODP
Object-Oriented Dynamics Predictor
Lamperougeyxy/PhoenixGo
Go AI program which implements the AlphaGo Zero paper
Lamperougeyxy/planet
Deep Planning Network: Control from pixels by latent planning with learned dynamics
Lamperougeyxy/RandWireNN
Implementation of: "Exploring Randomly Wired Neural Networks for Image Recognition"
Lamperougeyxy/Spiking-Neural-Network
Pure python implementation of SNN
Lamperougeyxy/Tempotron-1
an implemet of supervised algorithm Tempotron in SNN
Lamperougeyxy/tempotron-classifier
Python implementation of a tempotron classifier for the MNIST database.
Lamperougeyxy/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Lamperougeyxy/Tensorflow-Tutorial-1
Tensorflow tutorial from basic to hard
Lamperougeyxy/the-gan-zoo
A list of all named GANs!
Lamperougeyxy/TStarBot1
Lamperougeyxy/Variational-Discriminator-Bottleneck-Tensorflow-Implementation
Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Implementation
Lamperougeyxy/vgan
Code for image generation of Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow