1. 9.7 Latent Plan Transformer: Planning as Latent Variable Inference
  2. 9.5 Code as Reward: Empowering Reinforcement Learning with VLMs
  3. 9.3 Learning by Doing: An Online Causal Reinforcement Learning Framework with Causal-Aware Policy
  4. 9.2 Grandmaster-Level Chess Without Search
  5. 9.1 Deep Reinforcement Learning with Dynamic Graphs for Adaptive Informative Path Planning
  6. 8.9 A Primal-Dual Algorithm for Offline Constrained Reinforcement Learning with Low-Rank MDPs
  7. 8.9 NITO: Neural Implicit Fields for Resolution-free Topology Optimization
  8. 8.7 AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
  9. 8.5 Incentivized Truthful Communication for Federated Bandits
  10. 8.3 DySLIM: Dynamics Stable Learning by Invariant Measure for Chaotic Systems