1. 9.5 TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning
  2. 9.4 Active Coverage for PAC Reinforcement Learning
  3. 9.2 Offline Skill Graph (OSG): A Framework for Learning and Planning using Offline Reinforcement Learning Skills
  4. 8.9 Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
  5. 8.7 Correcting discount-factor mismatch in on-policy policy gradient methods