dmksjfl

Hi, I am Jiafei Lyu. I am currently a Ph.D. candidate in Tsinghua University. I have a broad interest in different fields of reinforcement learning (RL).

Pinned Repositories

CABI
code for Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination (NeurIPS 2022)
Language:Python60
DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
Language:Python19 1 21
GD3
This is the repository of Generalized-activated Deep Deouble Deterministic Policy Gradients (GD3).
Language:Python4 2 00
Job_Shop_Scheduling_Problem_with_Reinforcement_Learning
This is the implemention of JSSP with RL. The framework used for RL is actor critic and the dataset comes from Tianchi competition.
Language:Python8724
MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
Language:Python51 4 37
PAR
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
Language:Python90
SACMPC
SAC+TDMPC
Language:Python3 1 00
SAW
Code for State Advantage Weighting for Offline RL
Language:Python2 2 00
SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
Language:Python9 1 00
SMR
sample multiple reuse
Language:Python11

dmksjfl's Repositories

dmksjfl/Job_Shop_Scheduling_Problem_with_Reinforcement_Learning
This is the implemention of JSSP with RL. The framework used for RL is actor critic and the dataset comes from Tianchi competition.
Language:Python8724
dmksjfl/MCQ
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
Language:Python51 4 37
dmksjfl/DARC
Code for Efficient Continuous Control with Double Actors and Regularized Critics, AAAI 2022.
Language:Python19 1 21
dmksjfl/PAR
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
Language:Python90
dmksjfl/SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
Language:Python9 1 00
dmksjfl/CABI
code for Double Check Your State Before Trusting It: Confidence-Aware Bidirectional Offline Model-Based Imagination (NeurIPS 2022)
Language:Python60
dmksjfl/GD3
This is the repository of Generalized-activated Deep Deouble Deterministic Policy Gradients (GD3).
Language:Python4 2 00
dmksjfl/SACMPC
SAC+TDMPC
Language:Python3 1 00
dmksjfl/SAW
Code for State Advantage Weighting for Offline RL
Language:Python2 2 00
dmksjfl/SMR
sample multiple reuse
Language:Python11
dmksjfl/THU-Homework-LaTex-Template
This is a LaTex template for THU homework which is suitable for math/physics/statistics/computer science and other related majors. This template is simple and may not meet all of your needs where you should improve it yourself.
Language:TeX1 1 00
dmksjfl/BCQ
Author's PyTorch implementation of BCQ for continuous and discrete actions
Language:Python00
dmksjfl/ChainsawManGunDemon
This is a complement of Gun Demon in manga Chainsaw Man by Fujimoto Hiroshi. One could find the source of manga in JUMP.
0 1 01
dmksjfl/digit_num_recog
None
Language:PureBasic00
dmksjfl/dmksjfl.github.io
Personal academic website
Language:JavaScript
dmksjfl/O2O-coupon-use-prediction
This is an implement of o2o coupon use prediction competition in Tianchi. You could find more infomation there. We get a score of 0.6377 with pure offline dataset.
Language:Python0 0
dmksjfl/RL_for_video_hash
Language:Python

dmksjfl

Pinned Repositories

CABI

DARC

GD3

Job_Shop_Scheduling_Problem_with_Reinforcement_Learning

MCQ

PAR

SACMPC

SAW

SEABO

SMR

dmksjfl's Repositories

dmksjfl/Job_Shop_Scheduling_Problem_with_Reinforcement_Learning

dmksjfl/MCQ

dmksjfl/DARC

dmksjfl/PAR

dmksjfl/SEABO

dmksjfl/CABI

dmksjfl/GD3

dmksjfl/SACMPC

dmksjfl/SAW

dmksjfl/SMR

dmksjfl/THU-Homework-LaTex-Template

dmksjfl/BCQ

dmksjfl/ChainsawManGunDemon

dmksjfl/digit_num_recog

dmksjfl/dmksjfl.github.io

dmksjfl/O2O-coupon-use-prediction

dmksjfl/RL_for_video_hash