1. 8.7 Constant Stepsize Q-learning: Distributional Convergence, Bias and Extrapolation
  2. 8.5 Reinforcement Learning with Hidden Markov Models for Discovering Decision-Making Dynamics
  3. 8.2 Sample Efficient Reinforcement Learning by Automatically Learning to Compose Subtasks
  4. 8.0 Networked Multiagent Reinforcement Learning for Peer-to-Peer Energy Trading
  5. 7.9 DittoGym: Learning to Control Soft Shape-Shifting Robots