1. 9.6 Design from Policies: Conservative Test-Time Adaptation for Offline Policy Optimization
  2. 9.5 Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games
  3. 9.3 Curvature-enhanced Graph Convolutional Network for Biomolecular Interaction Prediction
  4. 9.3 A General Framework for Sequential Decision-Making under Adaptivity Constraints
  5. 9.1 A First Order Meta Stackelberg Method for Robust Federated Learning
  6. 9.1 PolicyClusterGCN: Identifying Efficient Clusters for Training Graph Convolutional Networks
  7. 9.1 Estimating player completion rate in mobile puzzle games using reinforcement learning
  8. 9.0 Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks
  9. 8.9 Large Sequence Models for Sequential Decision-Making: A Survey
  10. 8.9 STEF-DHNet: Spatiotemporal External Factors Based Deep Hybrid Network for Enhanced Long-Term Taxi Demand Prediction
  11. 8.8 Estimating the Value of Evidence-Based Decision Making
  12. 8.7 Action Q-Transformer: Visual Explanation in Deep Reinforcement Learning with Encoder-Decoder Model using Action Query
  13. 8.6 Near Optimal Heteroscedastic Regression with Symbiotic Learning
  14. 8.6 On Imitation in Mean-field Games
  15. 8.5 Safe Reinforcement Learning with Dead-Ends Avoidance and Recovery
  16. 8.3 Decision-Dependent Distributionally Robust Markov Decision Process Method in Dynamic Epidemic Control
  17. 8.3 Multi-Agent Deep Reinforcement Learning for Dynamic Avatar Migration in AIoT-enabled Vehicular Metaverses with Trajectory Prediction
  18. 8.1 Fighting Uncertainty with Gradients: Offline Reinforcement Learning via Diffusion Score Matching
  19. 7.8 Learning to Modulate pre-trained Models in RL
  20. 7.3 Maximum State Entropy Exploration using Predecessor and Successor Representations