1. 9.2 Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
  2. 9.1 Active flow control for three-dimensional cylinders through deep reinforcement learning
  3. 9.1 Natural and Robust Walking using Reinforcement Learning without Demonstrations in High-Dimensional Musculoskeletal Models
  4. 9.0 Marketing Budget Allocation with Offline Constrained Deep Reinforcement Learning
  5. 8.9 A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
  6. 8.9 ORL-AUDITOR: Dataset Auditing in Offline Deep Reinforcement Learning
  7. 8.8 Subgraph Attention Networks for Molecular Graph Property Prediction and Feature Interpretation
  8. 8.5 Pure Monte Carlo Counterfactual Regret Minimization
  9. 8.2 On Reducing Undesirable Behavior in Deep Reinforcement Learning Models
  10. 8.0 Rethinking Momentum Knowledge Distillation in Online Continual Learning