  1. 9.3 Sample Efficient Reinforcement Learning with Partial Dynamics Knowledge
  2. 9.2 Leading the Pack: N-player Opponent Shaping
  3. 9.0 Model-Based Control with Sparse Neural Dynamics
  4. 8.9 BadRL: Sparse Targeted Backdoor Attack Against Reinforcement Learning
  5. 8.7 Robustly Improving Bandit Algorithms with Confounded and Selection Biased Offline Data: A Causal Approach