1. 9.2 Multi-Objective Optimization Using Adaptive Distributed Reinforcement Learning
2. 9.0 Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis
3. 8.7 SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning
4. 8.5 One-Shot Averaging for Distributed TD($\lambda$) Under Markov Sampling
5. 8.3 A Reinforcement Learning Approach to Dairy Farm Battery Management using Q Learning