That is what we explore in our AAMAS’21 blue sky paper.
The idea is to explicitly model non-stationarity as part of an environmental shift game (ESG). This enables us to predict and even steer the shifts that would occur, while dealing with epistemic uncertainty in a robust manner.
Author: frans
AAMAS’21 camready: AIP loss bounds
Our AAMAS’21 paper on loss bounds for influence-based abstraction is online.
In this paper, we derive conditions under which ‘approximate influence predictors’ (AIPs) incur only a small value loss when used in small (abstracted) MDPs. From these conditions we conclude that learning such AIPs with a cross-entropy loss seems sensible.
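To make the idea concrete, here is a minimal, purely illustrative sketch (not the paper's code): an AIP is just a classifier that, given local-history features, predicts the distribution of an external influence-source variable, and it is trained with a cross-entropy loss. The data and model below are synthetic assumptions.

```python
import numpy as np

# Illustrative sketch: train a tiny 'approximate influence predictor'
# with cross-entropy loss. All names and data are made up for this example.
rng = np.random.default_rng(0)

# Synthetic data: 2-d local-history features, binary influence-source variable.
X = rng.normal(size=(500, 2))
y = (X[:, 0] + 0.5 * X[:, 1] > 0).astype(float)  # "true" influence source

# Logistic-regression AIP trained by gradient descent on mean cross-entropy.
w, b, lr = np.zeros(2), 0.0, 0.5
for _ in range(200):
    p = 1.0 / (1.0 + np.exp(-(X @ w + b)))   # predicted P(source = 1 | history)
    g = p - y                                # gradient of the cross-entropy loss
    w -= lr * (X.T @ g) / len(y)
    b -= lr * g.mean()

p = 1.0 / (1.0 + np.exp(-(X @ w + b)))
ce = -np.mean(y * np.log(p + 1e-12) + (1 - y) * np.log(1 - p + 1e-12))
acc = np.mean((p > 0.5) == y)
print(round(ce, 3), round(acc, 3))
```

The point of the loss bounds is, roughly, that a low cross-entropy here translates into a small value loss when this predictor is plugged into the abstracted MDP.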
3 Year Postdoc
Do you have experience in multiagent reinforcement learning, game theory and/or other forms of interactive learning? Then have a look at this vacancy and contact me!
NeurIPS camready: MDP Homomorphic Networks
In this work we show how symmetries that occur in MDPs can be exploited for more efficient deep reinforcement learning.
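As a toy illustration of the underlying idea (not the paper's architecture): a policy can be made *equivariant* to a state symmetry, so that mirrored states automatically receive mirrored action preferences. Below, the assumed symmetry is a state flip s → −s with a matching swap of two action logits; a linear layer is symmetrized so the equivariance holds by construction.

```python
import numpy as np

# Illustrative sketch: build a linear policy layer that is equivariant to a
# state flip s -> -s, with the matching action symmetry swapping the logits.
rng = np.random.default_rng(1)

P = np.array([[0.0, 1.0], [1.0, 0.0]])   # swaps the two action logits; P @ P = I

# Symmetrize an arbitrary weight matrix so that  W_eq @ (-s) == P @ (W_eq @ s).
W = rng.normal(size=(2, 4))
W_eq = 0.5 * (W - P @ W)

s = rng.normal(size=4)
lhs = W_eq @ (-s)        # logits for the mirrored state
rhs = P @ (W_eq @ s)     # mirrored logits of the original state
print(np.allclose(lhs, rhs))  # → True
```

Weight-tying of this kind shrinks the hypothesis space, which is one intuition for why exploiting MDP symmetries speeds up learning.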
NeurIPS Camready: Multi-agent active perception with prediction rewards
This paper shows that “prediction rewards” for active perception can also be employed in decentralized multiagent settings. (Intuitively, this leads to a type of voting that we try to optimize.)
NeurIPS Camready: Influence-Augmented Online Planning
The camready version of Influence-Augmented Online Planning for Complex Environments is now available.
In this work, we show that by learning approximate representations of influence, we can speed up online planning (POMCP) sufficiently to get better performance when the time for online decision making is constrained.
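A rough, runnable caricature of the idea (everything below is an illustrative stand-in, not the paper's code): instead of stepping an expensive global simulator inside every planning rollout, we step only the local region and sample the external “influence” from a learned predictor.

```python
import random

# Illustrative sketch of influence-augmented simulation for online planning.
def global_step(local_state, external_state, action):
    # Expensive: simulates the whole environment (stub standing in for
    # many external agents / a large external state).
    external_state = (external_state + 1) % 10
    local_state = local_state + action + (external_state % 2)
    return local_state, external_state

def learned_influence_sampler(history):
    # Stands in for a trained approximate influence predictor:
    # a learned distribution over the external influence value.
    return random.choice([0, 1])

def local_step(local_state, history, action):
    # Cheap: only the local region plus one sampled influence value.
    influence = learned_influence_sampler(history)
    return local_state + action + influence

random.seed(0)
s, hist = 0, []
for t in range(5):                      # a short local rollout
    s = local_step(s, hist, action=1)
    hist.append(s)
print(s)
```

Because each simulated step is much cheaper, a planner like POMCP can run many more simulations within the same decision-time budget.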
Three papers at NeurIPS 2020
Three of our papers were accepted at NeurIPS. For short descriptions, see my tweet.
(Updated) arxiv links will follow…
Vacancy: looking for 3-year postdoc
I’m looking for a postdoc to work on learning in interactive settings. Please see https://www.fransoliehoek.net/wp/vacancies/.
AAMAS: Maximizing Information Gain via Prediction Rewards
This paper tackles the problem of active perception: taking actions to minimize one’s uncertainty. It further formalizes the link between information gain and prediction rewards, and uses this to propose a deep-learning approach to optimize active perception from a data set, thus obviating the need for a complex POMDP model.
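The link between prediction rewards and information gain can be sketched in a few lines (an illustrative example, not the paper's derivation in full): if the agent is rewarded with log q(s_true) for announcing a prediction q, its expected reward under the true posterior p is −CrossEntropy(p, q), which is maximized at q = p, where it equals −Entropy(p). Maximizing expected prediction reward therefore amounts to minimizing posterior uncertainty.

```python
import numpy as np

# Illustrative check: expected log-probability prediction reward is maximized
# by truthful prediction, where it equals minus the posterior entropy.
def expected_prediction_reward(p, q):
    return float(np.sum(p * np.log(q)))

p = np.array([0.7, 0.2, 0.1])                # true posterior over hidden states

r_truthful = expected_prediction_reward(p, p)                       # q = p
r_other = expected_prediction_reward(p, np.array([1/3, 1/3, 1/3]))  # some other q

entropy = -float(np.sum(p * np.log(p)))
print(r_truthful >= r_other, np.isclose(r_truthful, -entropy))  # → True True
```

This is why actions that lead to low-entropy (informative) posteriors earn high prediction rewards, letting a deep-learning pipeline optimize active perception directly from data.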
IJCAI paper: Decentralized MCTS via Learned Teammate Models
Aleksander Czechowski got his paper on Decentralized MCTS via Learned Teammate Models accepted at IJCAI 2020.
In this paper, each agent learns models of the other agents and uses them to predict their future behavior. Stay tuned for the camready.
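A minimal sketch of the teammate-modeling ingredient (illustrative only; the names and data are assumptions, and the actual paper embeds this inside MCTS): fit a simple empirical model of a teammate's policy from observed state–action pairs, then sample teammate actions from it during one's own rollouts.

```python
import random
from collections import Counter, defaultdict

# Illustrative sketch: an empirical teammate policy model learned from
# observed (state, action) pairs, used to sample actions in rollouts.
observed = [("s0", "left"), ("s0", "left"), ("s0", "right"), ("s1", "right")]

counts = defaultdict(Counter)
for state, action in observed:
    counts[state][action] += 1

def sample_teammate_action(state, rng):
    actions, weights = zip(*counts[state].items())
    return rng.choices(actions, weights=weights)[0]

rng = random.Random(0)
samples = [sample_teammate_action("s0", rng) for _ in range(1000)]
frac_left = samples.count("left") / 1000
print(round(frac_left, 2))  # roughly 2/3, matching the observed frequencies
```

Each agent can then run its own tree search, predicting the joint future by sampling teammates from such models rather than communicating at planning time.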