1. 9.5 Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
  2. 9.1 Model-Based Epistemic Variance of Values for Risk-Aware Policy Optimization
  3. 8.9 Pearl: A Production-ready Reinforcement Learning Agent
  4. 8.7 Generalization to New Sequential Decision Making Tasks with In-Context Learning
  5. 8.7 A Scalable Network-Aware Multi-Agent Reinforcement Learning Framework for Decentralized Inverter-based Voltage Control
  6. 8.5 Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning
  7. 8.3 MICRO: Model-Based Offline Reinforcement Learning with a Conservative Bellman Operator
  8. 8.3 Using Large Language Models for Hyperparameter Optimization
  9. 8.1 FoMo Rewards: Can we cast foundation models as reward functions?
  10. 7.9 Deep Dynamics: Vehicle Dynamics Modeling with a Physics-Informed Neural Network for Autonomous Racing