1. 9.5 Mixtures of Experts Unlock Parameter Scaling for Deep RL
  2. 9.3 World Model on Million-Length Video And Language With RingAttention
  3. 9.2 Conservative and Risk-Aware Offline Multi-Agent Reinforcement Learning for Digital Twins
  4. 9.0 Provable Traffic Rule Compliance in Safe Reinforcement Learning on the Open Sea
  5. 8.9 SMX: Sequential Monte Carlo Planning for Expert Iteration
  6. 8.7 A Competition Winning Deep Reinforcement Learning Agent in microRTS
  7. 8.7 A Distributional Analogue to the Successor Representation
  8. 8.3 Leveraging Digital Cousins for Ensemble Q-Learning in Large-Scale Wireless Networks
  9. 8.1 Avoiding Catastrophe in Continuous Spaces by Asking for Help
  10. 7.9 Enabling Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation