This is the question that De Volkskrant asked me to comment on. Find the piece here (in Dutch).
Category: news
Influence-Based Abstraction in Deep Reinforcement Learning
Scaling Bayesian RL for Factored POMDPs
Reinforcement learning is tough. POMDPs are hard. And doing RL in partially observable problems is a huge challenge. With Sammie and Chris Amato, I have been making some progress to get a principled method (based on Monte Carlo tree search) too scale for structured problems. We can learn both how to act, as well as the structure of the problem at the same time. See the paper and bib.
At AAMAS: Deep learning of Coordination…?
Can deep Q-networks etc. brute force their way through tough coordination problems…? Perhaps not. Jacopo’s work, accepted as an extended abstract at AAMAS’19, takes a first step in exploring this in the one-shot setting.
Not so surprising: “joint Q-learner” can be too large/slow and “individual Q-learners” can fail to find good representations.
But good to know: “factored Q-value functions” which represent the Q-function as a random mixture of components involving 2 or 3 agents, can do quite well, even for hard coordination tasks!
About learning machines and AI
science squared on my research project
A popular piece on my ERC research:
Looking for colleagues!
We are looking for faculty in AI!
“Learning from Demonstration in the Wild” on arXiv
“Learning from Demonstration in the Wild” is work that I did with the folks at LatentLogic. It’s pretty cool, check the video on youtube and the paper on arXiv: https://arxiv.org/abs/1811.03516
Wanted: 3-year Postdoc in Influence-based Abstraction, Learning and Coordination
Join the INFLUENCE research team!
I am looking for a 3-year Postdoc in Influence-based Abstraction, Learning and Coordination. Find me at #IJCAI2018 to informally discuss.
More info: vacancies
IJCAI paper: overview of some of my research
My invited IJCAI paper giving an overview of (some of, apologies to some coauthors, could not fit everything there….) my research is now available from my publications page.