off-policy-evaluation

Star

Here are 22 public repositories matching this topic...

callmespring / COPP

Star

Conformal Off-policy Prediction

reinforcement-learning conformal-prediction off-policy-evaluation

Updated Feb 9, 2023
R

callmespring / Confounded-POMDP-OPE

Star

Implementation of "A Minimax Learning Approach to Off-Policy Evaluation in Confounded Partially Observable Markov Decision Processes" (ICML)

reinforcement-learning partially-observable-environment off-policy-evaluation unmeasured-confounding

Updated Jun 18, 2022
Python

callmespring / cope

Star

Implementation of "Off-Policy Interval Estimation with Confounded Markov Decision Process" (JASA, 2022+)

reinforcement-learning mediation-analysis off-policy-evaluation unmeasured-confounding

Updated Mar 13, 2024
Python

callmespring / MediationRL

Star

Implementation of "A Reinforcement Learning Framework for Dynamic Mediation Analysis" (ICML 2023) in Python.

reinforcement-learning mediation-analysis off-policy-evaluation

Updated May 18, 2023
Jupyter Notebook

airboxlab / hopes

Star

HOPES: HVAC optimization with Off-Policy Evaluation and Selection

reinforcement-learning off-policy-evaluation

Updated Jul 15, 2024
Python

callmespring / DJL

Star

Implementation of Deep Jump Learning for Off-Policy Evaluation in Continuous Treatment Settings (NeurIPS, 2021) in Python

change-point-detection continuous-action-space off-policy-evaluation

Updated Jun 1, 2022
Python

dtak / osiris

Star

Omitting-States-Irrelevant-to-Return Importance Sampling estimator for off-policy evaluation

reinforcement-learning importance-sampling off-policy-evaluation

Updated Jun 11, 2021
Python

MLD3 / CounterfactualAnnot-SemiOPE

Star

[NeurIPS 2023] Counterfactual-Augmented Importance Sampling for Semi-Offline Policy Evaluation. https://arxiv.org/abs/2310.17146

off-policy-evaluation offline-reinforcement-learning counterfactual-reasoning neurips-2023

Updated Nov 2, 2023
Jupyter Notebook

yingchengyang / BIRIS

Star

On the Reuse Bias in Off-Policy Reinforcement Learning (IJCAI 2023)

reinforcement-learning off-policy-evaluation off-policy-reinforcement-learning

Updated May 25, 2023
Python

callmespring / D2OPE

Star

Implementation of "Deeply-Debiased Off-Policy Interval Estimation" (ICML, 2021) in Python

reinforcement-learning confidence-intervals off-policy-evaluation

Updated Aug 4, 2021
Python

joshuaspear / offline_rl_ope

Star

Stateful implementations of OPE algorithms, designed for use in the development of offline RL models

rl off-policy-evaluation offlinerl

Updated Sep 18, 2024
Python

aiueola / neurips2023-future-dependent-ope

Star

(NeurIPS2023) "Future-Dependent Value-Based Off-Policy Evaluation in POMDPs"

research reinforcement-learning off-policy-evaluation

Updated Oct 24, 2023
Python

Mamba413 / ROOM

Star

Robust Offline Reinforcement Learning with Heavy-Tailed Rewards

robust-statistics heavy-tailed-distributions off-policy-evaluation offline-reinforcement-learning

Updated Aug 2, 2024
Python

Mamba413 / cope

Star

Off-Policy Interval Estimation withConfounded Markov Decision Process

reinforcement-learning statistical-inference confidence-intervals causal-inference off-policy-evaluation

Updated Sep 13, 2023
Python

CausalML / bcrl

Star

Representation Learning for OPE

machine-learning deep-learning off-policy-evaluation

Updated Jul 12, 2022
Python

aiueola / kdd2023-aips

Star

(KDD2023) "Off-Policy Evaluation of Ranking Policies under Diverse User Behavior"

research ranking recommender-system off-policy-evaluation

Updated Sep 28, 2023
Python

aiueola / wsdm2022-cascade-dr

Star

(WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"

research ranking recommender-system off-policy-evaluation

Updated Jul 16, 2023
Python

callmespring / RL-short-course

Star

Reinforcement Learning Short Course

reinforcement-learning q-learning ridesharing policy-gradient dynamic-programming deep-q-network markov-decision-processes policy-iteration value-iteration monte-carlo-methods temporal-differencing-learning model-based-rl policy-based-method fitted-q-iteration off-policy-evaluation offline-rl order-dispatch-recommendation

Updated May 23, 2024
Jupyter Notebook

hakuhodo-technologies / scope-rl

Star

SCOPE-RL: A python library for offline reinforcement learning, off-policy evaluation, and selection

research reinforcement-learning risk-assessment off-policy-evaluation offline-rl

Updated Mar 18, 2024
Python

banditml / offline-policy-evaluation

Star

Implementations and examples of common offline policy evaluation methods in Python.

importance-sampling counterfactual-learning off-policy-evaluation doubly-robust offline-policy-evaluation counterfactual-policy-evaluation

Updated Feb 11, 2023
Python

Improve this page

Add a description, image, and links to the off-policy-evaluation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the off-policy-evaluation topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

off-policy-evaluation

Here are 22 public repositories matching this topic...

callmespring / COPP

callmespring / Confounded-POMDP-OPE

callmespring / cope

callmespring / MediationRL

airboxlab / hopes

callmespring / DJL

dtak / osiris

MLD3 / CounterfactualAnnot-SemiOPE

yingchengyang / BIRIS

callmespring / D2OPE

joshuaspear / offline_rl_ope

aiueola / neurips2023-future-dependent-ope

Mamba413 / ROOM

Mamba413 / cope

CausalML / bcrl

aiueola / kdd2023-aips

aiueola / wsdm2022-cascade-dr

callmespring / RL-short-course

hakuhodo-technologies / scope-rl

banditml / offline-policy-evaluation

Improve this page

Add this topic to your repo