1. 9.5 Landmark Guided Active Exploration with Stable Low-level Policy Learning
  2. 9.1 $λ$-AC: Learning latent decision-aware models for reinforcement learning in continuous state-spaces
  3. 8.7 Probabilistic Constraint for Safety-Critical Reinforcement Learning
  4. 8.4 Learning Environment Models with Continuous Stochastic Dynamics
  5. 8.0 Optimal Execution Using Reinforcement Learning