1. 9.4 RL en Markov Games with Independent Function Approximation: Improved Sample Complexity Bound under the Local Access Model
  2. 9.2 The Value of Reward Lookahead in Reinforcement Learning
  3. 9.1 Learning to Watermark LLM-generated Text via Reinforcement Learning
  4. 9.1 Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization
  5. 9.0 State-Separated SARSA: A Practical Sequential Decision-Making Algorithm with Recovering Rewards
  6. 8.9 ViSaRL: Visual Reinforcement Learning Guided by Human Saliency
  7. 8.8 A Scalable and Parallelizable Digital Twin Framework for Sustainable Sim2Real Transition of Multi-Agent Reinforcement Learning Systems
  8. 8.8 Offline Multitask Representation Learning for Reinforcement Learning
  9. 8.7 PERL: Parameter Efficient Reinforcement Learning from Human Feedback
  10. 8.7 A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization
  11. 8.6 Phasic Diversity Optimization for Population-Based Reinforcement Learning
  12. 8.6 Pessimistic Causal Reinforcement Learning with Mediators for Confounded Offline Data
  13. 8.5 Riemannian Flow Matching Policy for Robot Motion Learning
  14. 8.3 Latent Object Characteristics Recognition with Visual to Haptic-Audio Cross-modal Transfer Learning
  15. 7.9 Diffusion-Reinforcement Learning Hierarchical Motion Planning in Adversarial Multi-agent Games