1. 8.7 MAexp: A Generic Platform for RL-based Multi-Agent Exploration
2. 8.6 Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation
3. 8.5 Zero-Shot Stitching in Reinforcement Learning using Relative Representations
4. 8.4 Continuous-time Risk-sensitive Reinforcement Learning via Quadratic Variation Penalty
5. 8.3 Goal Exploration via Adaptive Skill Distribution for Goal-Conditioned Reinforcement Learning
6. 8.2 Single-Task Continual Offline Reinforcement Learning
7. 8.1 Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
8. 8.0 TrajDeleter: Enabling Trajectory Forgetting in Offline Reinforcement Learning Agents
9. 7.9 Analysis of Classifier-Free Guidance Weight Schedulers
10. 7.8 Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation