Pinned Repositories
bert_tfv1
BERT, tensorflow, v1
bert_torch
BERT, pytorch
DQN_GridWorld
Dueling-Double-Deep-Q-Network. Reinforcement Learning.
lazyxml
Simple xml parse and build lib. https://www.cnblogs.com/lazyboy/p/3830104.html
OCR-CRNN-CTC
ocr, cnn+lstm+ctc, crnn, recognition model, tensorflow
OCR-DETECTION-CTPN
OCR, CNN+LSTM (CTPN) for image text detection, tensorflow
ResNet20-CIFAR-10
ResNet20, CIFAR-10,
text_classification
Text classification models: cnn, self-attention, cnn-rnf, rnn-att, capsule-net. TensorFlow. Single GPU or multi GPU
text_classification_hierarchical
Text classification (hierarchical) models: HCSM, HRNN, HAN, ...
zhiqiang
之强, Become Strong. A platform for reinforcement learning. PyTorch.
Li-Ming-Fan's Repositories
Li-Ming-Fan/zhiqiang
之强, Become Strong. A platform for reinforcement learning. PyTorch.
Li-Ming-Fan/Deep-Reinforcement-Learning-Algorithms-with-PyTorch
PyTorch implementations of deep reinforcement learning algorithms and environments
Li-Ming-Fan/Deep-reinforcement-learning-with-pytorch
PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....
Li-Ming-Fan/DeepRL_PyTorch
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
Li-Ming-Fan/fqf-iqn-qrdqn.pytorch
PyTorch implementation of FQF, IQN and QR-DQN.
Li-Ming-Fan/go-explore
Code for Go-Explore: a New Approach for Hard-Exploration Problems
Li-Ming-Fan/GoBigger-Challenge-2021
Basic code and description for GoBigger challenge 2021.
Li-Ming-Fan/Gobigger-Explore
Li-Ming-Fan/Hands-on-RL
https://hrl.boyuai.com/
Li-Ming-Fan/InterFuser
InterFuser: Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Li-Ming-Fan/jsbsim
An open source flight dynamics & control software library
Li-Ming-Fan/KataGo
GTP engine and self-play learning in Go
Li-Ming-Fan/lizzieyzy
LizzieYzy - GUI for Game of Go
Li-Ming-Fan/on-policy
This is the official implementation of Multi-Agent PPO (MAPPO).
Li-Ming-Fan/online-go.com
Source code for the Online-Go.com web interface
Li-Ming-Fan/pyDecision
A python MCDA Library - AHP; Fuzzy AHP; ARAS; Borda; BWM; CODAS; COPRAS; CRITIC; DEMATEL; Fuzzy DEMATEL; EDAS; Fuzzy EDAS; ELECTRE (I, I_s, I_v, II, III, IV, Tri-B); GRA; IDOCRIW; MABAC; MOORA; MOOSRA; MULTIMOORA; PROMETHEE (I, II, III, IV, V, VI, Gaia); SAW; SMART; TOPSIS; Fuzzy TOPSIS; VIKOR; Fuzzy VIKOR; WINGS; WSM; WPM; WASPAS
Li-Ming-Fan/pymarl
Python Multi-Agent Reinforcement Learning framework
Li-Ming-Fan/retro_branching
Learning to branch with reinforcement learning using retrospective trajectories for exact combinatorial optimisation.
Li-Ming-Fan/RL-Algorithm
Hello😜
Li-Ming-Fan/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
Li-Ming-Fan/Sabaki
An elegant Go board and SGF editor for a more civilized age.
Li-Ming-Fan/sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
Li-Ming-Fan/SAGE
No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models (ICLR 2022)
Li-Ming-Fan/smac
SMAC: The StarCraft Multi-Agent Challenge
Li-Ming-Fan/stable-baselines3
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
Li-Ming-Fan/TD3-BipedalWalkerHardcore-v2
Solve BipedalWalkerHardcore-v2 with TD3
Li-Ming-Fan/TD3-Pytorch
A clean and robust Pytorch implementation of TD3 on continuous action space
Li-Ming-Fan/TRPO-in-MARL
Li-Ming-Fan/weiqi
使用Python编写的围棋游戏
Li-Ming-Fan/wx-charts
微信小程序图表charts组件,Charts for WeChat Mini Program