  1. 9.2 SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning
  2. 9.0 GLIDE-RL: Grounded Language Instruction through DEmonstration in RL
  3. 9.0 Decentralized Federated Policy Gradient with Byzantine Fault-Tolerance and Provably Fast Convergence
  4. 8.9 Human as AI Mentor: Enhanced Human-in-the-loop Reinforcement Learning for Safe and Efficient Autonomous Driving
  5. 8.8 Long-term Safe Reinforcement Learning with Binary Feedback
  6. 8.7 Decision Making in Non-Stationary Environments with Policy-Augmented Search
  7. 8.6 LLMs for Robotic Object Disambiguation
  8. 8.5 MOTO: Offline Pre-training to Online Fine-tuning for Model-based Robot Learning
  9. 8.4 A Tensor Network Implementation of Multi Agent Reinforcement Learning
  10. 8.2 Using reinforcement learning to improve drone-based inference of greenhouse gas fluxes