liuqi8827

student at Harbin Institute of Technology

Harbin Institute of TechnologyShenzhen, China

Pinned Repositories

3d-shapes
This repository contains the 3D shapes dataset, used in Kim, Hyunjik and Mnih, Andriy. "Disentangling by Factorising." In Proceedings of the 35th International Conference on Machine Learning (ICML). 2018. to assess the disentanglement properties of unsupervised learning methods.
Language:Jupyter Notebook0 1 00
3D_CNN_tensorflow
KITTI data processing and 3D CNN for Vehicle Detection
Language:Python0 2 00
3DObjectTracking
Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking
Language:C++0 1 00
3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
Language:Python0 2 00
AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
Language:Jupyter Notebook0 0 00
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Language:JavaScript0 1 00
ACER
Actor-critic with experience replay
Language:Python0 1 00
DoubleQLearning
A comparison of Q-Learning with Double Q-Learning in Reinforcement Learning problems.
Language:Python1 1 00
Practical_RL
A course in reinforcement learning in the wild
Language:Jupyter Notebook1 1 00
srgd
Implementation of stochastic relativistic gradient descent from https://arxiv.org/pdf/1903.04100.pdf
Language:Python10

liuqi8827's Repositories

liuqi8827/AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
Language:Jupyter Notebook0 0 00
liuqi8827/Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
Language:TeX0 0
liuqi8827/awesome-rl-for-cybersecurity
A curated list of resources dedicated to reinforcement learning applied to cyber security.
0 0
liuqi8827/BlenderProc
A procedural Blender pipeline for photorealistic training image generation
Language:Python0 0
liuqi8827/ChatPaper
Use ChatGPT to summary the Arxiv papers.
Language:Python0 0
liuqi8827/Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)
Language:TeX0 0
liuqi8827/deep-symbolic-optimization
Source code for deep symbolic optimization.
Language:Python0 0
liuqi8827/envlogger
A tool for recording RL trajectories.
Language:Python0 0
liuqi8827/explainable-minichess
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
Language:PureBasic0 0
liuqi8827/FGD-trading
An implementation of a `fictitious gradient descent' algorithm to find the mean field Nash equilibrium for a an example trading problem.
Language:Python0 0
liuqi8827/fiss_planner
[RA-L 2022] FISS: A Trajectory Planning Framework using Fast Iterative Search and Sampling Strategy for Autonomous Driving
Language:C++0 0
liuqi8827/godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
Language:GDScript0 0
liuqi8827/Griddly
A grid-world game engine for game AI research
Language:C++0 0
liuqi8827/leetcode
Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.
0 0
liuqi8827/leetcode-1
推荐刷题网站：https://www.lintcode.com/?utm_source=tf-github-lucifer2022 LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解，记录自己的leetcode解题之路。)
Language:JavaScript0 0
liuqi8827/leetcode-2
Python & JAVA Solutions for Leetcode
Language:Python0 0
liuqi8827/LeetCode-Py
⛽️「算法通关手册」，超详细的「算法与数据结构」基础讲解教程，700+ 道「LeetCode 题目」详细解析。通过「算法理论学习」和「编程实战练习」相结合的方式，从零基础到彻底掌握算法知识。
Language:Python0 0
liuqi8827/levenberg-marquardt-method
Python implementation of Levenberg-Marquardt algorithm built from scratch using NumPy.
Language:Python0 0
liuqi8827/levenberg-marquardt-method-1
Language:Python0 0
liuqi8827/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
Language:Python0 0
liuqi8827/MinAtar-Faster
Optimized version of the MinAtar (testbed for AI agents) codebase along with benchmarks for standard Reinforcement Learning agents on various environments.
liuqi8827/Mini-batch-SGD-large-dynamic-networks
large dynamic network latent space inference via mini-batch stochastic gradient descent. Variational approach for lower bound marginal maximization.
Language:Jupyter Notebook0 0
liuqi8827/optuna
A hyperparameter optimization framework
Language:Python0 0
liuqi8827/PaS_CrowdNav
Occlusion-Aware Crowd Navigation Using People as Sensors: ICRA2023
Language:Python0 0
liuqi8827/PhySO
Physical Symbolic Optimization
Language:Python0 0
liuqi8827/QDax
Accelerated Quality-Diversity
Language:Python0 0
liuqi8827/relod
An efficient remote-onboard architecture for real-time Reinforcement Learning
Language:Python0 0
liuqi8827/Top-AI-Conferences-Paper-with-Code
MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code
0 0
liuqi8827/torchimize
numerical optimization algorithms in pytorch
Language:Python0 0
liuqi8827/unscalable-heuristic-approximator
Deep learning/Reinforcement Learning methods for A*
Language:Python0 0