1. 9.9 AlphaZero Gomoku
  2. 9.5 Hundreds Guide Millions: Adaptive Offline Reinforcement Learning with Expert Guidance
  3. 9.2 An Ensemble Method of Deep Reinforcement Learning for Automated Cryptocurrency Trading
  4. 9.2 Leveraging Reward Consistency for Interpretable Feature Discovery in Reinforcement Learning
  5. 9.1 Physics Informed Reinforcement Learning: Review and Open Problems
  6. 9.0 Learning-Aware Safety for Interactive Autonomy
  7. 9.0 RoboAgent: Generalization and Efficiency in Robot Manipulation via Semantic Augmentations and Action Chunking
  8. 8.9 Generative AI for End-to-End Limit Order Book Modelling: A Token-Level Autoregressive Generative Model of Message Flow Using a Deep State Space Network
  9. 8.9 Explaining grokking through circuit efficiency
  10. 8.8 Hawkeye: Change-targeted Testing for Android Apps based on Deep Reinforcement Learning
  11. 8.7 Neurosymbolic Reinforcement Learning and Planning: A Survey
  12. 8.7 Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach
  13. 8.6 Efficient RL via Disentangled Environment and Agent Representations
  14. 8.5 Stabilize to Act: Learning to Coordinate for Bimanual Manipulation
  15. 8.1 Multimodal Contrastive Learning with Hard Negative Sampling for Human Activity Recognition