1. 9.0 Cache-Aware Reinforcement Learning in Large-Scale Recommender Systems
  2. 8.9 Unified ODE Analysis of Smooth Q-Learning Algorithms
  3. 8.7 Reinforcement Learning with Adaptive Control Regularization for Safe Control of Critical Systems
  4. 8.6 Generalizing Multi-Step Inverse Models for Representation Learning to Finite-Memory POMDPs
  5. 8.4 Towards Multi-Morphology Controllers with Diversity and Knowledge Distillation
  6. 8.3 Impedance Matching: Enabling an RL-Based Running Jump in a Quadruped Robot
  7. 8.2 Brain-Inspired Continual Learning-Robust Feature Distillation and Re-Consolidation for Class Incremental Learning
  8. 8.0 Dynamically Anchored Prompting for Task-Imbalanced Continual Learning
  9. 7.9 Compete and Compose: Learning Independent Mechanisms for Modular World Models
  10. 7.6 Hyperparameter Optimization Can Even be Harmful in Off-Policy Learning and How to Deal with It