Pinned Repositories
CSRL
ed-expert-simulator
ICLR2019_evaluating_discrete_temporal_structure
Reproduce results from the paper "Learning Procedural Abstractions and Evaluating Discrete Latent Temporal Structure", Karan Goel and Emma Brunskill. ICLR 2019.
ICLR2019_prism
Code for Prism, the hierarchical Bayesian model from the paper "Learning Procedural Abstractions and Evaluating Discrete Latent Temporal Structure", Karan Goel and Emma Brunskill. ICLR 2019.
learning-compatible-performance-support
Code for "Fake It Till You Make It: Learning-Compatible Performance Support." Jonathan Bragg and Emma Brunskill. In 2019 Conference on Uncertainty in Artificial Intelligence (UAI '19).
off_policy_confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
PLOTS
poela
POELA: Policy Optimization with ELigible Actions
RepBM
Representation Balancing MDPs for Off-Policy Policy Evaluation
waypoint-transformer
StanfordAI4HI's Repositories
StanfordAI4HI/RepBM
Representation Balancing MDPs for Off-Policy Policy Evaluation
StanfordAI4HI/ed-expert-simulator
StanfordAI4HI/CSRL
StanfordAI4HI/waypoint-transformer
StanfordAI4HI/ICLR2019_evaluating_discrete_temporal_structure
Reproduce results from the paper "Learning Procedural Abstractions and Evaluating Discrete Latent Temporal Structure", Karan Goel and Emma Brunskill. ICLR 2019.
StanfordAI4HI/off_policy_confounding
Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
StanfordAI4HI/adaptive-interventions-with-goals
StanfordAI4HI/ICLR2019_prism
Code for Prism, the hierarchical Bayesian model from the paper "Learning Procedural Abstractions and Evaluating Discrete Latent Temporal Structure", Karan Goel and Emma Brunskill. ICLR 2019.
StanfordAI4HI/learning-compatible-performance-support
Code for "Fake It Till You Make It: Learning-Compatible Performance Support." Jonathan Bragg and Emma Brunskill. In 2019 Conference on Uncertainty in Artificial Intelligence (UAI '19).
StanfordAI4HI/poela
POELA: Policy Optimization with ELigible Actions
StanfordAI4HI/PLOTS
StanfordAI4HI/smart-primer-website-public
StanfordAI4HI/tclust-eval
Evaluation criteria for the external evaluation of temporal clusterings.
StanfordAI4HI/Automatic_Curriculum_ZPDES_Memory
StanfordAI4HI/cmab_convex_opt
Replication code for the paper "Beyond Unconstrained Reward Maximization: Contextual Multi-Armed Bandits for General Optimizations" by [Zhu et al. 2022].
StanfordAI4HI/FactoredDRO
Code for the paper Factored DRO: Factored Distributionally Robust Policies for Contextual Bandits (Neurips 2022)
StanfordAI4HI/grf
Generalized Random Forests
StanfordAI4HI/HBBS
Hierarchical Batch Bandit Search
StanfordAI4HI/prolific-eval
StanfordAI4HI/smart_primer_bot_public
StanfordAI4HI/SmartPrimer_Gym
Smart Primer children simulator and offline analysis
StanfordAI4HI/Split-select-retrain