- 9.2 The Power of Resets in Online Reinforcement Learning
- Authors: Zakaria Mhammedi, Dylan J. Foster, Alexander Rakhlin
- Reason: Breakthrough in utilizing simulators with local planning, providing new statistical guarantees and a more efficient algorithm, RVFS, which improves upon real-world RL applications.
- 9.0 GRSN: Gated Recurrent Spiking Neurons for POMDPs and MARL
- Authors: Lang Qin, Ziming Wang, Runhao Jiang, Rui Yan, Huajin Tang
- Reason: Novel approach in aligning SNNs with RL temporal demands, showing promise in POMDPs and MARL domains while maintaining computational efficiency.
- 8.9 DPO: Differential reinforcement learning with application to optimal configuration search
- Authors: Chandrajit Bajaj, Minh Nguyen
- Reason: Novel framework of Differential Policy Optimization promising efficient handling of continuous spaces in RL, with applications in complex problem-solving and configuration search.
- 8.9 Recursive Backwards Q-Learning in Deterministic Environments
- Authors: Jan Diekhoff, Jörn Fischer
- Reason: Introduces a novel reinforcement learning algorithm aimed at solving deterministic problems more effectively, which is a direct contribution to reinforcement learning advancements.
- 8.7 ViViDex: Learning Vision-based Dexterous Manipulation from Human Videos
- Authors: Zerui Chen, Shizhe Chen, Cordelia Schmid, Ivan Laptev
- Reason: Offers a practical approach to learning from human videos for robot manipulation, handling limitations of previous methods, and improving upon the ability of robots to perform dexterous tasks.
- 8.7 An Element-Wise Weights Aggregation Method for Federated Learning
- Authors: Yi Hu, Hanchi Ren, Chen Hu, Jingjing Deng, Xianghua Xie
- Reason: The paper focuses on a central challenge in federated learning, which is closely related to reinforcement learning in terms of decentralized learning processes and possible significant improvement on learning performance.
- 8.6 FedSI: Federated Subnetwork Inference for Efficient Uncertainty Quantification
- Authors: Hui Chen, Hengyu Liu, Zhangkai Wu, Xuhui Fan, Longbing Cao
- Reason: Introduces an innovative approach to personalize federated learning, addressing uncertainty and systematizing efficiency across distributed networks.
- 8.3 ST-MambaSync: The Confluence of Mamba Structure and Spatio-Temporal Transformers for Precipitous Traffic Prediction
- Authors: Zhiqi Shao, Xusheng Yao, Ze Wang, Junbin Gao
- Reason: It introduces an innovative framework potentially beneficial for real-time scenarios in reinforcement learning universe, specifically for applications considering spatial-temporal prediction tasks.
- 8.1 Explainable AI models for predicting liquefaction-induced lateral spreading
- Authors: Cheng-Hsi Hsiao, Krishna Kumar, Ellen Rathje
- Reason: While not directly linked to reinforcement learning, the use of explainable AI is critical to understanding model decisions, which could have implications on reinforcement learning approaches that need interpretability.
- 7.9 Fast Ensembling with Diffusion Schrödinger Bridge
- Authors: Hyunsu Kim, Jongmin Yoon, Juho Lee
- Reason: Proposes a novel ensembling technique that can be used to enhance performance of deep neural networks, relevant for reinforcement learning contexts where ensemble learning may offer improvements.