Mingtzge

be humble, stay hungry!

Mingtzge's Stars

satwikkansal/wtfpython
What the f*ck Python? 😱
Language:Python35.8k 726 2062.7k
openai/gym
A toolkit for developing and comparing reinforcement learning algorithms.
Language:Python34.8k 1.1k 1.8k8.6k
nndl/nndl.github.io
《神经网络与深度学习》邱锡鹏著 Neural Network and Deep Learning
Language:HTML17.5k 754 6283.6k
Unity-Technologies/ml-agents
The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
Language:C#17.2k 555 2.9k4.2k
openai/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
Language:Python15.8k 646 8504.9k
leisurelicht/wtfpython-cn
wtfpython的中文翻译/施工结束/ 能力有限，欢迎帮我改进翻译
Language:Jupyter Notebook12.6k 519 332.1k
openai/spinningup
An educational resource to help anyone learn deep reinforcement learning.
Language:Python10.2k 227 2832.2k
shenweichen/DeepCTR
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
Language:Python7.6k 178 3702.2k
STVIR/pysot
SenseTime Research platform for single object tracking, implementing algorithms like SiamRPN and SiamMask.
Language:Python4.4k 162 3841.1k
ikostrikov/pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
Language:Python3.6k 66 229829
wzhe06/SparrowRecSys
A Deep Learning Recommender System
Language:Python2.4k 57 33841
zju3dv/LoFTR
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021, T-PAMI 2022
Language:Jupyter Notebook2.3k 45 221362
LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
Language:Jupyter Notebook2k 35 85466
hila-chefer/Transformer-Explainability
[CVPR 2021] Official PyTorch implementation for Transformer Interpretability Beyond Attention Visualization, a novel method to visualize classifications by Transformer based networks.
Language:Jupyter Notebook1.8k 21 63241
sfujim/TD3
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Language:Python1.7k 19 41437
openai/maddpg
Code for the MADDPG algorithm from the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"
Language:Python1.6k 150 67492
Tencent/FeatherCNN
FeatherCNN is a high performance inference engine for convolutional neural networks.
Language:C++1.2k 100 44284
huawei-noah/SMARTS
Scalable Multi-Agent RL Training School for Autonomous Driving
Language:Python952 13 1k190
zhangchuheng123/Reinforcement-Implementation
Implementation of benchmark RL algorithms
Language:Python459 14 381
StrangerZhang/pysot-toolkit
Python Single Object Tracking Evaluation
Language:Python416 11 3768
RunzheYang/MORL
Multi-Objective Reinforcement Learning
Language:Python253 8 1851
kniost/BUPT-Resources
北邮研究生毕业论文模板以及各种校内信息
138 1 116
KernelErr/realtime-object-detector
Flutter real-time object detection App with Paddle-Lite and YOLO v3.
Language:Java99 3 220
aim-uofa/RGM
69 14 24
uber-research/MARVIN
Uber's Multi-Agent Routing Value Iteration Network
Language:Python58 5 015
cardwing/Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
Language:Python46 0 516
huipengly/MFAC
Model Free Adaptive Control
Language:Matlab43 4 219
Mingtzge/PVE-MCC_for_unsignalized_intersection
Aiming at the problem of the traffic efficiency of intelligent networked vehicles passing through unsignalized-intersection in the future smart cities, this project proposed a Progressive Value-expectation Estimation Multi-agent Cooperative Control (PVE-MCC) algorithm based on reinforcement learning. The algorithm takes the intelligent networked vehicles as the research object and designed the reward function for the optimization objective from the three aspects of traffic efficiency, safety, and comfort.
Language:Python43 2 19
Mingtzge/MiVeCC_with_DRL
This is a Multi-intersection Vehicular Cooperative Control (MiVeCC) scheme to enable cooperation among vehicles in a 3*3 unsignalized intersections. we proposed a algorithm combined heuristic-rule and two-stage deep reinforcement learning. The heuristic-rule achieves vehicles across the intersections without collisions. Based on the heuristic-rule, DDPG is used to optimize the collaborative control of vehicles and improve the traffic efficiency. Simulation results show that the proposed algorithm can improve travel efficiency at multiple intersections by up to 4.59 times without collision compared with existing methods.
Language:Python28 1 18
tasx0823/tasx0823.github.io
Sun Xiao
Language:HTML2 1 00

Mingtzge

Mingtzge's Stars

satwikkansal/wtfpython

openai/gym

nndl/nndl.github.io

Unity-Technologies/ml-agents

openai/baselines

leisurelicht/wtfpython-cn

openai/spinningup

shenweichen/DeepCTR

STVIR/pysot

ikostrikov/pytorch-a2c-ppo-acktr-gail

wzhe06/SparrowRecSys

zju3dv/LoFTR

LyWangPX/Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

hila-chefer/Transformer-Explainability

sfujim/TD3

openai/maddpg

Tencent/FeatherCNN

huawei-noah/SMARTS

zhangchuheng123/Reinforcement-Implementation

StrangerZhang/pysot-toolkit

RunzheYang/MORL

kniost/BUPT-Resources

KernelErr/realtime-object-detector

aim-uofa/RGM

uber-research/MARVIN

cardwing/Codes-for-RL-PER

huipengly/MFAC

Mingtzge/PVE-MCC_for_unsignalized_intersection

Mingtzge/MiVeCC_with_DRL

tasx0823/tasx0823.github.io