Browsing Computer Science by Subject "exploration"
Now showing items 1-1 of 1
Linearizing Contextual Multi-Armed Bandit Problems with Latent Dynamics
(University of Waterloo, 2022-02-10) In many real-world applications of multi-armed bandit problems, both rewards and observed contexts are often influenced by confounding latent variables which evolve stochastically over time. While the observed contexts and ...