1. 8.6 Deep Reinforcement Learning for Traveling Purchaser Problems
  2. 8.4 Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning
  3. 8.2 Decision Transformer as a Foundation Model for Partially Observable Continuous Control
  4. 8.0 Grid-Mapping Pseudo-Count Constraint for Offline Reinforcement Learning
  5. 7.8 AD4RL: Autonomous Driving Benchmarks for Offline Reinforcement Learning with Value-based Dataset