  1. 9.5 Safe RLHF: Safe Reinforcement Learning from Human Feedback
  2. 9.3 Eureka: Human-Level Reward Design via Coding Large Language Models
  3. 9.2 MARVEL: Multi-Agent Reinforcement-Learning for Large-Scale Variable Speed Limits
  4. 9.1 Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
  5. 8.8 How a student becomes a teacher: learning and forgetting through Spectral methods
  6. 8.8 Hybrid Search for Efficient Planning with Completeness Guarantees
  7. 8.5 CAT: Closed-loop Adversarial Training for Safe End-to-End Driving
  8. 8.5 Generative Flow Networks as Entropy-Regularized RL
  9. 8.4 Safety-Gymnasium: A Unified Safe Reinforcement Learning Benchmark
  10. 8.1 PGA: Personalizing Grasping Agents with Single Human-Robot Interaction