  1. 9.7 SmartPathfinder: Pushing the Limits of Heuristic Solutions for Vehicle Routing Problem with Drones Using Reinforcement Learning
  2. 9.4 Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning
  3. 9.2 Reducing Redundant Computation in Multi-Agent Coordination through Locally Centralized Execution
  4. 9.2 Multi-view Disentanglement for Reinforcement Learning with Multiple Cameras
  5. 9.0 PIPER: Primitive-Informed Preference-based Hierarchical Reinforcement Learning via Hindsight Relabeling
  6. 9.0 Multi-Agent Hybrid SAC for Joint SS-DSA in CRNs
  7. 8.8 Minimizing Weighted Counterfactual Regret with Optimistic Online Mirror Descent
  8. 8.8 Optimal Design for Human Feedback
  9. 8.5 Towards Robust Trajectory Representations: Isolating Environmental Confounders with Causal Learning
  10. 8.3 Preference Fine-Tuning of LLMs Should Leverage Suboptimal, On-Policy Data