1. 8.7 Policy Mirror Descent with Lookahead
  2. 8.6 DecompOpt: Controllable and Decomposed Diffusion Models for Structure-based Molecular Optimization
  3. 8.5 Co-Optimization of Environment and Policies for Decentralized Multi-Agent Navigation
  4. 8.4 Self-Supervised Path Planning in UAV-aided Wireless Networks based on Active Inference
  5. 8.2 Control of Medical Digital Twins with Artificial Neural Networks
  6. 8.2 Distilling Reinforcement Learning Policies for Interpretable Robot Locomotion: Gradient Boosting Machines and Symbolic Regression
  7. 8.0 Learning-based Multi-continuum Model for Multiscale Flow Problems
  8. 8.0 Rethinking Adversarial Inverse Reinforcement Learning: From the Angles of Policy Imitation and Transferable Reward Recovery
  9. 7.8 Uncertainty Driven Active Learning for Image Segmentation in Underwater Inspection
  10. 7.8 Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion