  1. 9.9 2310.17330-CQM: Curriculum Reinforcement Learning with a Quantized World Model
  2. 9.7 2310.17303-Demonstration-Regularized RL
  3. 9.5 2310.17458-Coalitional Bargaining via Reinforcement Learning: An Application to Collaborative Vehicle Routing
  4. 9.3 Privately Aligning Language Models with Reinforcement Learning
  5. 9.3 2310.17634-Grow Your Limits: Continuous Improvement with Real-World RL for Robotic Locomotion
  6. 9.2 Understanding and Addressing the Pitfalls of Bisimulation-based Representations in Offline Reinforcement Learning
  7. 9.1 2310.17596-MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations
  8. 9.0 Causal Q-Aggregation for CATE Model Selection
  9. 8.7 Controlled Decoding from Language Models
  10. 8.6 Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult