  1. 9.5 Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
  2. 9.2 Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking
  3. 9.1 SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies
  4. 9.0 Efficient Duple Perturbation Robustness in Low-rank MDPs
  5. 8.9 Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
  6. 8.7 Anti-Byzantine Attacks Enabled Vehicle Selection for Asynchronous Federated Learning in Vehicular Edge Computing
  7. 8.6 Agile and versatile bipedal robot tracking control through reinforcement learning
  8. 8.3 Dataset Reset Policy Optimization for RLHF
  9. 7.9 Federated Optimization with Doubly Regularized Drift Correction
  10. 7.6 A backward differential deep learning-based algorithm for solving high-dimensional nonlinear backward stochastic differential equations