1. 9.2 Scaling Instructable Agents Across Many Simulated Worlds
  2. 9.0 Model-based Offline Quantum Reinforcement Learning
  3. 9.0 Continuous Control Reinforcement Learning: Distributed Distributional DrQ Algorithms
  4. 8.8 Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay
  5. 8.7 Offline Trajectory Generalization for Offline Reinforcement Learning
  6. 8.6 Randomized Exploration in Cooperative Multi-Agent Reinforcement Learning
  7. 8.5 Warm-Start Variational Quantum Policy Iteration
  8. 8.4 Settling Constant Regrets in Linear Markov Decision Processes
  9. 8.3 EyeFormer: Predicting Personalized Scanpaths with Transformer-Guided Reinforcement Learning
  10. 8.2 TENG: Time-Evolving Natural Gradient for Solving PDEs with Deep Neural Net