off-policy-evaluation

There are 22 repositories under the off-policy-evaluation topic.

hanjuku-kaso/awesome-offline-rl
An index of algorithms for offline reinforcement learning (offline-rl)
904 45 187
st-tech/zr-obp
Open Bandit Pipeline: a python library for bandit algorithms and off-policy evaluation
Language: Python 635 87 4287
banditml/offline-policy-evaluation
Implementations and examples of common offline policy evaluation methods in Python.
Language: Python 219 7 624
hakuhodo-technologies/scope-rl
SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection
Language: Python 110 5 911
callmespring/RL-short-course
Reinforcement Learning Short Course
Language: Jupyter Notebook 47 3 015
aiueola/wsdm2022-cascade-dr
(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"
Language: Python 13 1 03
aiueola/kdd2023-aips
(KDD2023) "Off-Policy Evaluation of Ranking Policies under Diverse User Behavior"
Language: Python 8 1 00
CausalML/bcrl
Representation Learning for OPE
Language: Python 8 1 00
Mamba413/cope
Off-Policy Interval Estimation with Confounded Markov Decision Process
Language: Python 5 2 03
aiueola/neurips2023-future-dependent-ope
(NeurIPS2023) "Future-Dependent Value-Based Off-Policy Evaluation in POMDPs"
Language: Python 4 1 00
Mamba413/ROOM
Robust Offline Reinforcement Learning with Heavy-Tailed Rewards
Language: Python 4 2 01
callmespring/D2OPE
Implementation of "Deeply-Debiased Off-Policy Interval Estimation" (ICML, 2021) in Python
Language: Python 2 0 00
joshuaspear/offline_rl_ope
Stateful implementations of OPE algorithms, designed for use in the development of offline RL models
Language: Python 2 2 40
yingchengyang/BIRIS
On the Reuse Bias in Off-Policy Reinforcement Learning (IJCAI 2023)
Language: Python 2 2 00
callmespring/DJL
Implementation of Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings (NeurIPS, 2021) in Python
Language: Python 1 0 00
dtak/osiris
Omitting-States-Irrelevant-to-Return Importance Sampling estimator for off-policy evaluation
Language: Python 1 2 00
MLD3/CounterfactualAnnot-SemiOPE
[NeurIPS 2023] Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation. https://arxiv.org/abs/2310.17146
Language: Jupyter Notebook 1 2 01
airboxlab/hopes
HOPES: HVAC optimization with Off-Policy Evaluation and Selection
Language: Python 00
callmespring/Confounded-POMDP-OPE
Implementation of "A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes" (ICML)
Language: Python 0 0 00
callmespring/cope
Implementation of "Off-Policy Interval Estimation with Confounded Markov Decision Process" (JASA, 2022+)
Language: Python 0 0
callmespring/COPP
Conformal Off-policy Prediction
Language: R 0 0
callmespring/MediationRL
Implementation of "A Reinforcement Learning Framework for Dynamic Mediation Analysis" (ICML 2023) in Python.
Language: Jupyter Notebook 0 0
