1. 9.5 Deep Reinforcement Learning-based Intelligent Traffic Signal Controls with Optimized CO2 emissions
  2. 9.2 RL-X: A Deep Reinforcement Learning Library (not only) for RoboCup
  3. 9.1 Contrastive Preference Learning: Learning from Human Feedback without RL
  4. 9.0 Progressively Efficient Learning
  5. 8.9 Provable Benefits of Multi-task RL under Non-Markovian Decision Making Processes
  6. 8.8 Optimal Best Arm Identification with Fixed Confidence in Restless Bandits
  7. 8.7 A Deep Learning Analysis of Climate Change, Innovation, and Uncertainty
  8. 8.7 Tree Search in DAG Space with Model-based Reinforcement Learning for Causal Discovery
  9. 8.5 Absolute Policy Optimization
  10. 8.2 ManiCast: Collaborative Manipulation with Cost-Aware Human Forecasting