1. 8.9 Policy Optimization finds Nash Equilibrium in Regularized General-Sum LQ Games
  2. 8.9 RL-MUL: Multiplier Design Optimization with Deep Reinforcement Learning
  3. 8.7 Efficient Automatic Tuning for Data-driven Model Predictive Control via Meta-Learning
  4. 8.7 Utilizing Maximum Mean Discrepancy Barycenter for Propagating the Uncertainty of Value Functions in Reinforcement Learning
  5. 8.6 Solving the QAP by Two-Stage Graph Pointer Networks and Reinforcement Learning
  6. 8.5 Multiple-policy Evaluation via Density Estimation
  7. 8.4 Variational Autoencoders for exteroceptive perception in reinforcement learning-based collision avoidance
  8. 8.3 Survey on Large Language Model-Enhanced Reinforcement Learning: Concept, Taxonomy, and Methods
  9. 8.2 Learning Off-policy with Model-based Intrinsic Motivation For Active Online Exploration
  10. 8.1 Exploring Adaptive MCTS with TD Learning in miniXCOM