  1. 9.2 Zero-Sum Positional Differential Games as a Framework for Robust Reinforcement Learning: Deep Q-Learning Approach
  2. 9.1 Imitation Learning in Discounted Linear MDPs without exploration assumptions
  3. 8.9 Intelligent Switching for Reset-Free RL
  4. 8.9 Learning Optimal Deterministic Policies with Stochastic Policy Gradients
  5. 8.7 Learning Robust Autonomous Navigation and Locomotion for Wheeled-Legged Robots
  6. 8.7 Dyna-Style Learning with A Macroscopic Model for Vehicle Platooning in Mixed-Autonomy Traffic
  7. 8.6 Simulating the economic impact of rationality through reinforcement learning and agent-based modelling
  8. 8.5 Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation
  9. 8.3 Robust Risk-Sensitive Reinforcement Learning with Conditional Value-at-Risk
  10. 8.1 Reinforcement Learning-Guided Semi-Supervised Learning