Pinned Repositories
3d-shapes
This repository contains the 3D shapes dataset, used to assess the disentanglement properties of unsupervised learning methods in Kim, Hyunjik and Mnih, Andriy. "Disentangling by Factorising." In Proceedings of the 35th International Conference on Machine Learning (ICML). 2018.
3D_CNN_tensorflow
KITTI data processing and 3D CNN for Vehicle Detection
3DObjectTracking
Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking
3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ACER
Actor-critic with experience replay
DoubleQLearning
A comparison of Q-Learning with Double Q-Learning in Reinforcement Learning problems.
Practical_RL
A course in reinforcement learning in the wild
srgd
Implementation of stochastic relativistic gradient descent from https://arxiv.org/pdf/1903.04100.pdf
liuqi8827's Repositories
liuqi8827/ai-economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
liuqi8827/ai2thor
An open-source platform for Visual AI.
liuqi8827/asa
Code for paper "Adversarial Support Alignment"
liuqi8827/cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
liuqi8827/CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
liuqi8827/commonsense-rl
Knowledge-Aware RL agents with Commonsense Reasoning
liuqi8827/Contour-Stochastic-Gradient-Langevin-Dynamics
An elegant adaptive importance sampling algorithm for simulations of multi-modal distributions (NeurIPS'20)
liuqi8827/Counter-Strike_Behavioural_Cloning
NeurIPS workshop paper 'Counter-Strike Deathmatch with Large-Scale Behavioural Cloning'
liuqi8827/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
liuqi8827/hddrl
Hierarchical Decentralized Deep Reinforcement Learning
liuqi8827/hermiter
Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Spearman's Correlation (Bivariate)
liuqi8827/ILSwiss
ILSwiss is an easy-to-run framework (template) in PyTorch for Imitation Learning (IL, or Learning from Demonstration, LfD) and Reinforcement Learning (RL).
liuqi8827/LeetCodeAnimation
Demonstrates the ideas behind solving LeetCode problems in the form of animations.
liuqi8827/lookahead
Implementation for the Lookahead Optimizer.
liuqi8827/LWDRLD
Lightweight deep RL library for discrete control.
liuqi8827/MixMatch-pytorch
Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"
liuqi8827/mushroom-rl
Python library for Reinforcement Learning.
liuqi8827/muzero-pytorch
PyTorch implementation of MuZero
liuqi8827/offpolicy_selection_eslb
liuqi8827/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
liuqi8827/pyts
A Python package for time series classification
liuqi8827/reinforcement-learning-an-introduction-1
Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
liuqi8827/Reinforcement_Learning_in_Python
Implementation of reinforcement learning, namely the Q-learning and Sarsa algorithms, for global path planning of a mobile robot in an unknown environment with obstacles, with a comparative analysis of Q-learning and Sarsa.
liuqi8827/risk-sensitive-rl
Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.
liuqi8827/rlpy3
RLPy Reinforcement Learning Framework: Python3 fork
liuqi8827/spinningup
An educational resource to help anyone learn deep reinforcement learning.
liuqi8827/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
liuqi8827/tonic
Tonic RL library
liuqi8827/Visual_MARL
Visualization of a multi-agent reinforcement learning (MARL)-based strategy with efficient exploration.
liuqi8827/WassersteinTSNE