1. 9.1 Meta-Learning Linear Quadratic Regulators: A Policy Gradient MAML Approach for the Model-free LQR
  2. 9.0 CaRiNG: Learning Temporal Causal Representation under Non-Invertible Generation Process
  3. 8.9 Reinforcement Learning Interventions on Boundedly Rational Human Agents in Frictionful Tasks
  4. 8.7 Off-Policy Primal-Dual Safe Reinforcement Learning
  5. 8.6 Fully Independent Communication in Multi-Agent Reinforcement Learning