1. 9.6 Goodhart’s Law in Reinforcement Learning
  2. 9.5 Safe Deep Policy Adaptation
  3. 9.4 Deep Reinforcement Learning for Autonomous Vehicle Intersection Navigation
  4. 9.3 METRA: Scalable Unsupervised RL with Metric-Aware Abstraction
  5. 9.2 ELDEN: Exploration via Local Dependencies
  6. 8.9 A Framework for Few-Shot Policy Transfer through Observation Mapping and Behavior Cloning
  7. 8.9 Community Membership Hiding as Counterfactual Graph Search via Deep Reinforcement Learning
  8. 8.7 Learning RL-Policies for Joint Beamforming Without Exploration: A Batch Constrained Off-Policy Approach
  9. 8.6 Automatic Music Playlist Generation via Simulation-based Reinforcement Learning
  10. 8.1 Optimal Scheduling of Electric Vehicle Charging with Deep Reinforcement Learning considering End Users Flexibility