1. 9.3 Hierarchical Transformers are Efficient Meta-Reinforcement Learners
  2. 9.1 Frugal Actor-Critic: Sample Efficient Off-Policy Deep Reinforcement Learning Using Unique Experiences
  3. 8.9 Decision Theory-Guided Deep Reinforcement Learning for Fast Learning
  4. 8.8 High-Precision Geosteering via Reinforcement Learning and Particle Filters
  5. 8.7 POTEC: Off-Policy Learning for Large Action Spaces via Two-Stage Policy Decomposition