off-policy-evaluation
There are 22 repositories under the off-policy-evaluation topic.
hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
st-tech/zr-obp
Open Bandit Pipeline: a Python library for bandit algorithms and off-policy evaluation
banditml/offline-policy-evaluation
Implementations and examples of common offline policy evaluation methods in Python.
hakuhodo-technologies/scope-rl
SCOPE-RL: A Python library for offline reinforcement learning, off-policy evaluation, and selection
callmespring/RL-short-course
Reinforcement Learning Short Course
aiueola/wsdm2022-cascade-dr
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
aiueola/kdd2023-aips
(KDD2023) "Off-Policy Evaluation of Ranking Policies under Diverse User Behavior"
CausalML/bcrl
Representation Learning for OPE
Mamba413/cope
Off-Policy Interval Estimation with Confounded Markov Decision Process
aiueola/neurips2023-future-dependent-ope
(NeurIPS2023) "Future-Dependent Value-Based Off-Policy Evaluation in POMDPs"
Mamba413/ROOM
Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
callmespring/D2OPE
Implementation of "Deeply-Debiased Off-Policy Interval Estimation" (ICML, 2021) in Python
joshuaspear/offline_rl_ope
Stateful implementations of OPE algorithms, designed for use in the development of offline RL models
yingchengyang/BIRIS
On the Reuse Bias in Off-Policy Reinforcement Learning (IJCAI 2023)
callmespring/DJL
Implementation of Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings (NeurIPS, 2021) in Python
dtak/osiris
Omitting-States-Irrelevant-to-Return Importance Sampling estimator for off-policy evaluation
MLD3/CounterfactualAnnot-SemiOPE
[NeurIPS 2023] Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation. https://arxiv.org/abs/2310.17146
airboxlab/hopes
HOPES: HVAC optimization with Off-Policy Evaluation and Selection
callmespring/Confounded-POMDP-OPE
Implementation of "A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes" (ICML)
callmespring/cope
Implementation of "Off-Policy Interval Estimation with Confounded Markov Decision Process" (JASA, 2022+)
callmespring/COPP
Conformal Off-policy Prediction
callmespring/MediationRL
Implementation of "A Reinforcement Learning Framework for Dynamic Mediation Analysis" (ICML 2023) in Python.