1. 9.3 Towards Instance-Optimality in Online PAC Reinforcement Learning
  2. 9.1 Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs
  3. 8.9 ADaPT: As-Needed Decomposition and Planning with Language Models
  4. 8.7 Real-time Control of Electric Autonomous Mobility-on-Demand Systems via Graph Reinforcement Learning
  5. 8.5 Clipped-Objective Policy Gradients for Pessimistic Policy Optimization