January 2022 – Frans A. Oliehoek

AAMAS’22 paper: Bayesian RL to cooperate with humans

Posted on January 26, 2022 | by frans | Leave a Comment

In our new paper Best-Response Bayesian Reinforcement Learning with BA-POMDPs for Centaurs, we investigate a machine whose actions can be overridden by the human. We show how Bayesian RL might lead to quick adaptation to unknown human preferences, as well as aiding the human to pursue its true goals in case of temporally inconsistent behaviors. All credits to Mert for all the hard work!