  1. 9.6 Deep Backtracking Counterfactuals for Causally Compliant Explanations
  2. 9.3 MatFormer: Nested Transformer for Elastic Inference
  3. 9.1 Revisiting Plasticity in Visual Reinforcement Learning: Data, Modules and Training Stages
  4. 8.9 Imitation Learning from Purified Demonstrations
  5. 8.9 Exploiting Causal Graph Priors with Posterior Sampling for Reinforcement Learning
  6. 8.7 Robust Safe Reinforcement Learning under Adversarial Disturbances
  7. 8.7 Score Regularized Policy Optimization through Diffusion Behavior
  8. 8.4 Bridging the Gap between Newton-Raphson Method and Regularized Policy Iteration
  9. 8.2 Off-Policy Evaluation for Human Feedback
  10. 8.0 COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL