Pinned Repositories
3d-shapes
This repository contains the 3D shapes dataset, used in Kim, Hyunjik and Mnih, Andriy. "Disentangling by Factorising." In Proceedings of the 35th International Conference on Machine Learning (ICML). 2018. to assess the disentanglement properties of unsupervised learning methods.
3D_CNN_tensorflow
KITTI data processing and 3D CNN for Vehicle Detection
3DObjectTracking
Official Code: A Sparse Gaussian Approach to Region-Based 6DoF Object Tracking
3DOD_thesis
3D Object Detection for Autonomous Driving in PyTorch, trained on the KITTI dataset.
AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
academicpages.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
ACER
Actor-critic with experience replay
DoubleQLearning
A comparison of Q-Learning with Double Q-Learning in Reinforcement Learning problems.
Practical_RL
A course in reinforcement learning in the wild
srgd
Implementation of stochastic relativistic gradient descent from https://arxiv.org/pdf/1903.04100.pdf
liuqi8827's Repositories
liuqi8827/AA-AEGD
This repository is the implementation of Anderson acceleration for "adaptive gradient descent with energy" (AEGD).
liuqi8827/Awesome-CV
:page_facing_up: Awesome CV is LaTeX template for your outstanding job application
liuqi8827/awesome-rl-for-cybersecurity
A curated list of resources dedicated to reinforcement learning applied to cyber security.
liuqi8827/BlenderProc
A procedural Blender pipeline for photorealistic training image generation
liuqi8827/ChatPaper
Use ChatGPT to summary the Arxiv papers.
liuqi8827/Deep-RL-Notes
A collection of comprehensive notes on Deep Reinforcement Learning, customized for UC Berkeley's CS 285 (prev. CS 294-112)
liuqi8827/deep-symbolic-optimization
Source code for deep symbolic optimization.
liuqi8827/envlogger
A tool for recording RL trajectories.
liuqi8827/explainable-minichess
Chess environment for smaller chess variants, AlphaZero-like MCTS-learning, and Concept Detection
liuqi8827/FGD-trading
An implementation of a `fictitious gradient descent' algorithm to find the mean field Nash equilibrium for a an example trading problem.
liuqi8827/fiss_planner
[RA-L 2022] FISS: A Trajectory Planning Framework using Fast Iterative Search and Sampling Strategy for Autonomous Driving
liuqi8827/godot_rl_agents
An Open Source package that allows video game creators, AI researchers and hobbyists the opportunity to learn complex behaviors for their Non Player Characters or agents
liuqi8827/Griddly
A grid-world game engine for game AI research
liuqi8827/leetcode
Provide all my solutions and explanations in Chinese for all the Leetcode coding problems.
liuqi8827/leetcode-1
推荐刷题网站:https://www.lintcode.com/?utm_source=tf-github-lucifer2022 LeetCode Solutions: A Record of My Problem Solving Journey.( leetcode题解,记录自己的leetcode解题之路。)
liuqi8827/leetcode-2
Python & JAVA Solutions for Leetcode
liuqi8827/LeetCode-Py
⛽️「算法通关手册」,超详细的「算法与数据结构」基础讲解教程,700+ 道「LeetCode 题目」详细解析。通过「算法理论学习」和「编程实战练习」相结合的方式,从零基础到彻底掌握算法知识。
liuqi8827/levenberg-marquardt-method
Python implementation of Levenberg-Marquardt algorithm built from scratch using NumPy.
liuqi8827/levenberg-marquardt-method-1
liuqi8827/memory-maze
Evaluating long-term memory of reinforcement learning algorithms
liuqi8827/MinAtar-Faster
Optimized version of the MinAtar (testbed for AI agents) codebase along with benchmarks for standard Reinforcement Learning agents on various environments.
liuqi8827/Mini-batch-SGD-large-dynamic-networks
large dynamic network latent space inference via mini-batch stochastic gradient descent. Variational approach for lower bound marginal maximization.
liuqi8827/optuna
A hyperparameter optimization framework
liuqi8827/PaS_CrowdNav
Occlusion-Aware Crowd Navigation Using People as Sensors: ICRA2023
liuqi8827/PhySO
Physical Symbolic Optimization
liuqi8827/QDax
Accelerated Quality-Diversity
liuqi8827/relod
An efficient remote-onboard architecture for real-time Reinforcement Learning
liuqi8827/Top-AI-Conferences-Paper-with-Code
MLNLP: This repository is a collection of AI top conferences papers (e.g. ACL, EMNLP, NAACL, COLING, AAAI, IJCAI, ICLR, NeurIPS, and ICML) with open resource code
liuqi8827/torchimize
numerical optimization algorithms in pytorch
liuqi8827/unscalable-heuristic-approximator
Deep learning/Reinforcement Learning methods for A*