1. 9.2 Provably Efficient Information-Directed Sampling Algorithms for Multi-Agent Reinforcement Learning
  2. 8.9 Bias Mitigation via Compensation: A Reinforcement Learning Perspective
  3. 8.6 Towards Generalizable Agents in Text-Based Educational Environments: A Study of Integrating RL with LLMs
  4. 7.7 Pessimistic Value Iteration for Multi-Task Data Sharing in Offline Reinforcement Learning
  5. 7.2 Deep Reinforcement Learning for Advanced Longitudinal Control and Collision Avoidance in High-Risk Driving Scenarios