1. 9.7 Iterative Option Discovery for Planning, by Planning
  2. 9.7 Learning quantum Hamiltonians at any temperature in polynomial time
  3. 9.5 Solving the Quadratic Assignment Problem using Deep Reinforcement Learning
  4. 9.5 Generalizable Long-Horizon Manipulations with Large Language Models
  5. 9.3 Sample-Efficiency in Multi-Batch Reinforcement Learning: The Need for Dimension-Dependent Adaptivity
  6. 9.2 Imitation Learning from Observation through Optimal Transport
  7. 9.2 Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances
  8. 9.1 Think before you speak: Training Language Models With Pause Tokens
  9. 9.0 On Representation Complexity of Model-based and Model-free Reinforcement Learning
  10. 9.0 A Neural Scaling Law from Lottery Ticket Ensembling