1. 9.4 Harnessing Network Effect for Fake News Mitigation: Selecting Debunkers via Self-Imitation Learning
  2. 9.4 RL-VLM-F: Reinforcement Learning from Vision Language Foundation Model Feedback
  3. 9.3 MADRL-based UAVs Trajectory Design with Anti-Collision Mechanism in Vehicular Networks
  4. 9.2 Reinforcement Learning with Ensemble Model Predictive Safety Certification
  5. 9.1 Curriculum reinforcement learning for quantum architecture search under hardware errors
  6. 9.0 SEABO: A Simple Search-Based Method for Offline Imitation Learning
  7. 8.9 ICED: Zero-Shot Transfer in Reinforcement Learning via In-Context Environment Design
  8. 8.8 Compound Returns Reduce Variance in Reinforcement Learning
  9. 8.6 Deep Reinforcement Learning for Picker Routing Problem in Warehousing
  10. 8.6 Informed Reinforcement Learning for Situation-Aware Traffic Rule Exceptions