9.5 Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis
- Authors: Guangchen Lan, Dong-Jun Han, Abolfazl Hashemi, Vaneet Aggarwal, Christopher G. Brinton
- Reason: Proposes a novel and theoretically grounded RL framework with significant improvements in sample and time complexity, authored by researchers with influential publications in the area.
9.2 Learning Efficient and Fair Policies for Uncertainty-Aware Collaborative Human-Robot Order Picking
- Authors: Igor G. Smit, Zaharah Bukhsh, Mykola Pechenizkiy, Kostas Alogariastos, Kasper Hendriks, Yingqian Zhang
- Reason: Introduces a multi-objective DRL approach integrating efficiency and fairness in human-robot collaboration, promising impactful applications in warehousing and logistics.
9.1 SIR-RL: Reinforcement Learning for Optimized Policy Control during Epidemiological Outbreaks in Emerging Market and Developing Economies
- Authors: Maeghal Jain, Ziya Uddin, Wubshet Ibrahim
- Reason: The paper introduces a novel reinforcement learning (RL) framework for optimizing policy decisions during pandemics, addressing a timely and impactful global challenge. Supported by a substantial number of pages and figures, it reflects thorough research that can significantly influence public health policy applications of RL. Additionally, given the unprecedented impact of COVID-19, research in this area is highly relevant and has the potential for considerable real-world influence.
9.0 Efficient Duple Perturbation Robustness in Low-rank MDPs
- Authors: Yang Hu, Haitong Ma, Bo Dai, Na Li
- Reason: Addresses robustness in RL with a focus on efficiency, offering theoretical guarantees that are significant for practical RL implementations; submitted to a top-tier conference (ICML).
8.9 Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning
- Authors: Hui Bai, Ran Cheng
- Reason: Develops a refinement to the influential Population-Based Training method, with empirical evaluation showing outperformance over traditional approaches.
8.7 Anti-Byzantine Attacks Enabled Vehicle Selection for Asynchronous Federated Learning in Vehicular Edge Computing
- Authors: Cui Zhang, Xiao Xu, Qiong Wu, Pingyi Fan, Qiang Fan, Huiling Zhu, Jiangzhou Wang
- Reason: The paper addresses a current and practical challenge within the vehicular edge computing (VEC) and presents a deep reinforcement learning (DRL) based vehicle selection scheme to improve safety and accuracy. The fact that it includes a released source code and orientation towards real-world applications increases its potential impact on the field of distributed machine learning, specifically for asynchronous federated learning settings.
8.6 Agile and versatile bipedal robot tracking control through reinforcement learning
- Authors: Jiayi Li, Linqi Ye, Yi Cheng, Houde Liu, Bin Liang
- Reason: Contributes to robotic control via RL, focusing on versatile and robust bipedal locomotion that is highly relevant for robotics research and industry applications.
8.3 Dataset Reset Policy Optimization for RLHF
- Authors: Jonathan D. Chang, Wenhao Shan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun
- Reason: This paper presents new algorithmic contributions to the field of reinforcement learning from human feedback (RLHF), offering provable guarantees and showing empirical improvement over state-of-the-art methods. Given the importance of RLHF in the development of generative models, and its direct comparison and demonstrated improvements to existing approaches, this paper holds substantial potential impact, especially with RL methods that involve human-in-the-loop systems.
7.9 Federated Optimization with Doubly Regularized Drift Correction
- Authors: Xiaowen Jiang, Anton Rodomanov, Sebastian U. Stich
- Reason: The paper’s exploration of distributed optimization strategies, with applications to federated learning, addresses drift problems in this area. It approaches the communication-computation trade-offs dilemma, which is crucial for efficient distributed training of machine learning models. Their proposed method FedRed could influence future studies and applications in federated learning environments by improving computational efficiency.
7.6 A backward differential deep learning-based algorithm for solving high-dimensional nonlinear backward stochastic differential equations
- Authors: Lorenc Kapllani, Long Teng
- Reason: While primarily a numerical analysis paper, it applies machine learning techniques to solve high-dimensional nonlinear backward stochastic differential equations (BSDEs), which can have important implications in financial mathematics and other fields where BSDEs are relevant. The paper provides both theoretical and numerical demonstrations of the proposed methodology’s efficiency, which could drive future research and applications within mathematical finance using machine learning.