1. 9.4 MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
  2. 9.2 BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay
  3. 9.1 ACE: Off-Policy Actor-Critic with Causality-Aware Entropy Regularization
  4. 9.0 Bayesian Off-Policy Evaluation and Learning for Large Action Spaces
  5. 8.9 Model-Based Reinforcement Learning Control of Reaction-Diffusion Problems
  6. 8.8 Simple and Effective Transfer Learning for Neuro-Symbolic Integration
  7. 8.7 Edge Caching Based on Deep Reinforcement Learning and Transfer Learning
  8. 8.6 PolyNet: Learning Diverse Solution Strategies for Neural Combinatorial Optimization
  9. 8.6 Enhancement of High-definition Map Update Service Through Coverage-aware and Reinforcement Learning
  10. 8.5 Partial Search in a Frozen Network is Enough to Find a Strong Lottery Ticket