1. 9.5 Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning
  2. 9.5 Scaling Laws for Imitation Learning in NetHack
  3. 9.3 REX: Rapid Exploration and eXploitation for AI Agents
  4. 9.2 Towards Accelerating Benders Decomposition via Reinforcement Learning Surrogate Models
  5. 9.2 Learning Dynamic Attribute-factored World Models for Efficient Multi-object Reinforcement Learning
  6. 9.1 Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation
  7. 8.9 An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient
  8. 8.9 IxDRL: A Novel Explainable Deep Reinforcement Learning Toolkit based on Analyses of Interestingness
  9. 8.8 Continuous-Time Reinforcement Learning: New Design Algorithms with Theoretical Insights and Performance Guarantees
  10. 8.7 Basal-Bolus Advisor for Type 1 Diabetes (T1D) Patients Using Multi-Agent Reinforcement Learning (RL) Methodology