1. 8.9 Large-scale Reinforcement Learning for Diffusion Models
  2. 8.9 Knowledge Distillation from Language-Oriented to Emergent Communication for Multi-Agent Remote Control
  3. 8.7 Stochastic Dynamic Power Dispatch with High Generalization and Few-Shot Adaption via Contextual Meta Graph Reinforcement Learning
  4. 8.7 Learning Mean Field Games on Sparse Graphs: A Hybrid Graphex Approach
  5. 8.5 Constraint-Generation Policy Optimization (CGPO): Nonlinear Programming for Policy Optimization in Mixed Discrete-Continuous MDPs
  6. 8.5 Dynamic Layer Tying for Parameter-Efficient Transformers
  7. 8.3 Emergent Dominance Hierarchies in Reinforcement Learning Agents
  8. 8.2 Learning safety critics via a non-contractive binary Bellman operator
  9. 8.1 Multi-Agent Dynamic Relational Reasoning for Social Robot Navigation
  10. 8.0 Reward-Relevance-Filtered Linear Offline Reinforcement Learning