1. 9.9 Combining Behaviors with the Successor Features Keyboard
2. 9.8 Human-in-the-Loop Task and Motion Planning for Imitation Learning
3. 9.7 Solving large flexible job shop scheduling instances by generating a diverse set of scheduling policies with deep reinforcement learning
4. 9.5 Recurrent Linear Transformers
5. 9.1 A Doubly Robust Approach to Sparse Reinforcement Learning
6. 9.1 COPF: Continual Learning Human Preference through Optimal Policy Fitting
7. 8.9 Active teacher selection for reinforcement learning from human feedback
8. 8.7 Neural Multi-Objective Combinatorial Optimization with Diversity Enhancement
9. 8.5 Efficient Meta Neural Heuristic for Multi-Objective Combinatorial Optimization
10. 8.2 Application of deep and reinforcement learning to boundary control problems