1. 9.1 Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions
  2. 9.0 Differentiable Quantum Architecture Search for Quantum Reinforcement Learning
  3. 8.9 Task Graph offloading via Deep Reinforcement Learning in Mobile Edge Computing
  4. 8.8 Prominent Roles of Conditionally Invariant Components in Domain Adaptation: Theory and Algorithms
  5. 8.7 Guide Your Agent with Adaptive Multimodal Rewards