1. 9.3 Actions Speak What You Want: Provably Sample-Efficient Reinforcement Learning of the Quantal Stackelberg Equilibrium from Strategic Feedbacks
  2. 8.9 Offline Reinforcement Learning with On-Policy Q-Function Regularization
  3. 8.5 Controlling the Latent Space of GANs through Reinforcement Learning: A Case Study on Task-based Image-to-Image Translation
  4. 8.2 FedDRL: A Trustworthy Federated Learning Model Fusion Method Based on Staged Reinforcement Learning
  5. 7.9 A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch
  6. 7.7 Reinforcement Learning by Guided Safe Exploration