liuqi8827

student at Harbin Institute of Technology

Harbin Institute of TechnologyShenzhen, China

Pinned Repositories

3d-shapes
This repository contains the 3D shapes dataset, used in Kim, Hyunjik and Mnih, Andriy. "Disentangling by Factorising." In Proceedings of the 35th International Conference on Machine Learning (ICML). 2018. to assess the disentanglement properties of unsupervised learning methods.
Language:Jupyter Notebook0 1 00
3D_CNN_tensorflow
KITTI data processing and 3D CNN for Vehicle Detection
Language:Python0 2 00
3DObjectTracking
Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking
Language:C++0 1 00
3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
Language:Python00
AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
Language:Jupyter Notebook0 0 00
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript00
ACER
Actor-critic with experience replay
Language:Python00
DoubleQLearning
A comparison of Q-Learning with Double Q-Learning in Reinforcement Learning problems.
Language:Python10
Practical_RL
A course in reinforcement learning in the wild
Language:Jupyter Notebook10
srgd
Implementation of stochastic relativistic gradient descent from https://arxiv.org/pdf/1903.04100.pdf
Language:Python10

liuqi8827's Repositories

liuqi8827/ai-economist
Foundation is a flexible, modular, and composable framework to model socio-economic behaviors and dynamics with both agents and governments. This framework can be used in conjunction with reinforcement learning to learn optimal economic policies, as done by the AI Economist (https://www.einstein.ai/the-ai-economist).
Language:Python1 0
liuqi8827/ai2thor
An open-source platform for Visual AI.
liuqi8827/asa
Code for paper "Adversarial Support Alignment"
liuqi8827/cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
liuqi8827/CLUB
Code for ICML2020 paper - CLUB: A Contrastive Log-ratio Upper Bound of Mutual Information
liuqi8827/commonsense-rl
Knowledge-Aware RL agents with Commonsense Reasoning
Language:Inform 71 0
liuqi8827/Contour-Stochastic-Gradient-Langevin-Dynamics
An elegant adaptive importance sampling algorithms for simulations of multi-modal distributions (NeurIPS'20)
Language:Jupyter Notebook0 0
liuqi8827/Counter-Strike_Behavioural_Cloning
NeurIPS workshop paper 'Counter-Strike Deathmatch with Large-Scale Behavioural Cloning'
Language:Python1 0
liuqi8827/DeepRL
Modularized Implementation of Deep RL Algorithms in PyTorch
Language:Python0 0
liuqi8827/hddrl
Hierarchical Decentralized Deep Reinforcement Learning
liuqi8827/hermiter
Efficient Sequential and Batch Estimation of Univariate and Bivariate Probability Density Functions and Cumulative Distribution Functions along with Quantiles (Univariate) and Spearman's Correlation (Bivariate)
liuqi8827/ILSwiss
ILSwiss is an Easy-to-run Imitation Learning (IL, or Learning from Demonstration, LfD) and also Reinforcement Learning (RL) framework (template) in PyTorch.
liuqi8827/LeetCodeAnimation
Demonstrate all the questions on LeetCode in the form of animation.（用动画的形式呈现解LeetCode题目的思路）
liuqi8827/lookahead
Implementation for the Lookahead Optimizer.
liuqi8827/LWDRLD
Lightweight deep RL Libraray for discrete control.
liuqi8827/MixMatch-pytorch
Code for "MixMatch - A Holistic Approach to Semi-Supervised Learning"
Language:Python1 0
liuqi8827/mushroom-rl
Python library for Reinforcement Learning.
liuqi8827/muzero-pytorch
Pytorch Implementation of MuZero
liuqi8827/offpolicy_selection_eslb
liuqi8827/ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
liuqi8827/pyts
A Python package for time series classification
Language:Python1 0
liuqi8827/reinforcement-learning-an-introduction-1
Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
Language:Jupyter Notebook1 0
liuqi8827/Reinforcement_Learning_in_Python
Implementing Reinforcement Learning, namely Q-learning and Sarsa algorithms, for global path planning of mobile robot in unknown environment with obstacles. Comparison analysis of Q-learning and Sarsa
liuqi8827/risk-sensitive-rl
Adaptive Risk Tendency Implicit Quantile Network for Drone Navigation under Partial Observability.
Language:Python1 0
liuqi8827/rlpy3
RLPy Reinforcement Learning Framework: Python3 fork
Language:Python1 0
liuqi8827/spinningup
An educational resource to help anyone learn deep reinforcement learning.
liuqi8827/tdmpc
Code for "Temporal Difference Learning for Model Predictive Control"
liuqi8827/tonic
Tonic RL library
liuqi8827/Visual_MARL
The visualization of a multi-agent reinforcement learning (MARL)-based strategy with efficient exploration strategy.
liuqi8827/WassersteinTSNE