1. 9.4 Feedback is All You Need: Real-World Reinforcement Learning with Approximate Physics-Based Models
  2. 9.3 Efficient Action Robust Reinforcement Learning with Probabilistic Policy Execution Uncertainty
  3. 9.2 Improved Self-Normalized Concentration in Hilbert Spaces: Sublinear Regret for GP-UCB
  4. 9.2 Discovering User Types: Mapping User Traits by Task-Specific Behaviors in Reinforcement Learning
  5. 9.0 Can Euclidean Symmetry be Leveraged in Reinforcement Learning and Planning?
  6. 8.8 Learning Multiple Coordinated Agents under Directed Acyclic Graph Constraints
  7. 8.7 RAYEN: Imposition of Hard Convex Constraints on Neural Networks
  8. 8.5 Efficient Adversarial Attacks on Online Multi-agent Reinforcement Learning
  9. 8.5 A Multiobjective Reinforcement Learning Framework for Microgrid Energy Management
  10. 8.1 On the Robustness of Epoch-Greedy in Multi-Agent Contextual Bandit Mechanisms