1. 9.9 Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control
  2. 9.8 An AI Chatbot for Explaining Deep Reinforcement Learning Decisions of Service-oriented Systems
  3. 9.8 CWCL: Cross-Modal Transfer with Continuously Weighted Contrastive Loss
  4. 9.7 Recurrent Hypernetworks are Surprisingly Strong in Meta-RL
  5. 9.6 Age Minimization in Massive IoT via UAV Swarm: A Multi-agent Reinforcement Learning Approach
  6. 9.5 Implicit Sensing in Traffic Optimization: Advanced Deep Reinforcement Learning Techniques
  7. 9.5 Effective Multi-Agent Deep Reinforcement Learning Control with Relative Entropy Regularization
  8. 9.2 Adapting Double Q-Learning for Continuous Reinforcement Learning
  9. 9.1 Self-Recovery Prompting: Promptable General Purpose Service Robot System with Foundation Models and Self-Recovery
  10. 8.7 DefGoalNet: Contextual Goal Learning from Demonstrations For Deformable Object Manipulation