1. 9.0 Efficient Reinforcement Learning for Global Decision Making in the Presence of Local Agents at Scale
  2. 8.9 EfficientZero V2: Mastering Discrete and Continuous Control with Limited Data
  3. 8.7 Causal Bandits with General Causal Models and Interventions
  4. 8.7 Robust Deep Reinforcement Learning Through Adversarial Attacks and Training : A Survey
  5. 8.5 Robust Policy Learning via Offline Skill Diffusion
  6. 8.5 Safe Hybrid-Action Reinforcement Learning-Based Decision and Control for Discretionary Lane Change
  7. 8.3 Robustifying a Policy in Multi-Agent RL with Diverse Cooperative Behavior and Adversarial Style Sampling for Assistive Tasks
  8. 8.2 Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control
  9. 8.1 Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency
  10. 7.9 Influencing Bandits: Arm Selection for Preference Shaping