1. 9.8 Explaining Reinforcement Learning with Shapley Values
  2. 9.7 On the Importance of Exploration for Generalization in Reinforcement Learning
  3. 9.6 Robust Reinforcement Learning via Adversarial Kernel Approximation
  4. 9.4 Detecting Adversarial Directions in Deep Reinforcement Learning to Make Robust Decisions
  5. 9.3 On the Importance of Feature Decorrelation for Unsupervised Representation Learning in Reinforcement Learning
  6. 9.2 The Role of Diverse Replay for Generalisation in Reinforcement Learning
  7. 9.2 TreeDQN: Learning to minimize Branch-and-Bound tree
  8. 9.1 Approximate information state based convergence analysis of recurrent Q-learning
  9. 9.0 In-Sample Policy Iteration for Offline Reinforcement Learning
  10. 8.9 Finite-Time Analysis of Minimax Q-Learning for Two-Player Zero-Sum Markov Games: Switching System Approach