1. 9.3 Learning Team-Based Navigation: A Review of Deep Reinforcement Learning Techniques for Multi-Agent Pathfinding
  2. 9.0 Reinforcement Logic Rule Learning for Temporal Point Processes
  3. 8.8 Neural Conversation Models and How to Rein Them in: A Survey of Failures and Fixes
  4. 8.5 Learning Control Policies for Variable Objectives from Offline Data
  5. 8.1 Towards a Causal Probabilistic Framework for Prediction, Action-Selection & Explanations for Robot Block-Stacking Tasks