1. 9.5 Dynamic Knowledge Injection for AIXI Agents
  2. 9.2 Foundations of Reinforcement Learning and Interactive Decision Making
  3. 9.0 Inverse Reinforcement Learning with Unknown Reward Model based on Structural Risk Minimization
  4. 8.9 OpenRL: A Unified Reinforcement Learning Framework
  5. 8.7 Ensemble-based Interactive Imitation Learning
  6. 8.6 XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library
  7. 8.5 Adaptive trajectory-constrained exploration strategy for deep reinforcement learning
  8. 8.3 Maximizing the Success Probability of Policy Allocations in Online Systems
  9. 8.3 Active Third-Person Imitation Learning
  10. 8.1 Harnessing the Power of Federated Learning in Federated Contextual Bandits