1. 9.5 Towards A Unified Agent with Foundation Models
2. 9.4 STRAPPER: Preference-based Reinforcement Learning via Self-training Augmentation and Peer Regularization
3. 9.3 XSkill: Cross Embodiment Skill Discovery
4. 9.2 Reinforcement Learning for Credit Index Option Hedging
5. 9.1 Deep Reinforcement Learning for ESG financial portfolio management