1. 8.6 AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems
  2. 8.3 Closure Discovery for Coarse-Grained Partial Differential Equations using Multi-Agent Reinforcement Learning
  3. 8.1 Near-Optimal Reinforcement Learning with Self-Play under Adaptivity Constraints
  4. 7.9 Efficient Reinforcement Learning for Routing Jobs in Heterogeneous Queueing Systems
  5. 7.7 To the Max: Reinventing Reward in Reinforcement Learning