  1. 9.0 Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble
  2. 8.8 Norm Enforcement with a Soft Touch: Faster Emergence, Happier Agents
  3. 8.5 AI in Energy Digital Twining: A Reinforcement Learning-based Adaptive Digital Twin Model for Green Cities
  4. 8.3 CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning
  5. 8.2 Hybrid Transformer and Spatial-Temporal Self-Supervised Learning for Long-term Traffic Prediction
  6. 7.9 Autoencoder-Based Domain Learning for Semantic Communication with Conceptual Spaces
  7. 7.9 M2CURL: Sample-Efficient Multimodal Reinforcement Learning via Self-Supervised Representation Learning for Robotic Manipulation
  8. 7.5 Checkmating One, by Using Many: Combining Mixture of Experts with MCTS to Improve in Chess
  9. 7.1 Zero-Shot Reinforcement Learning via Function Encoders
  10. 6.7 Heterogeneous treatment effect estimation with subpopulation identification for personalized medicine in opioid use disorder