1. 8.7 Two-Timescale Q-Learning with Function Approximation in Zero-Sum Stochastic Games
  2. 8.3 Optimizing Distributed Reinforcement Learning with Reactor Model and Lingua Franca
  3. 8.1 Pruning Convolutional Filters via Reinforcement Learning with Entropy Minimization
  4. 7.9 Canaries and Whistles: Resilient Drone Communication Networks with (or without) Deep Reinforcement Learning
  5. 7.6 Reinforcement Learning-Based Bionic Reflex Control for Anthropomorphic Robotic Grasping exploiting Domain Randomization