1. 9.5 MACCA: Offline Multi-agent Reinforcement Learning with Causal Credit Assignment
  2. 9.3 SDSRA: A Skill-Driven Skill-Recombination Algorithm for Efficient Policy Learning
  3. 8.9 Generalized Contrastive Divergence: Joint Training of Energy-Based Model and Diffusion Model through Inverse Reinforcement Learning
  4. 8.7 Diffused Task-Agnostic Milestone Planner
  5. 8.6 I-PHYRE: Interactive Physical Reasoning