- 9.3 Feedback Efficient Online Fine-Tuning of Diffusion Models
- Authors: Masatoshi Uehara, Yulai Zhao, Kevin Black, Ehsan Hajiramezanali, Gabriele Scalia, Nathaniel Lee Diamant, Alex M Tseng, Sergey Levine, Tommaso Biancalani
- Reason: Sergey Levine is a recognized authority in reinforcement learning, which contributes to the influence score. The paper integrates reinforcement learning with diffusion models, offering a novel approach and applicable theory with potentially broad impact on complex data domains.
- 9.2 How Can LLM Guide RL? A Value-Based Approach
- Authors: Shenao Zhang, Sirui Zheng, Shuqi Ke, Zhihan Liu, Wanxin Jin, Jianbo Yuan, Yingxiang Yang, Hongxia Yang, Zhaoran Wang
- Reason: Integrates large language models (LLMs) with reinforcement learning (RL), a novel combination that could enhance the sample efficiency of RL algorithms. The practical approach and successful demonstrations across diverse environments make this a potentially influential paper. The author list includes researchers from established institutions who are likely to carry authority in the field.
- 9.1 Multi-Constraint Safe RL with Objective Suppression for Safety-Critical Applications
- Authors: Zihan Zhou, Jonathan Booher, Wei Liu, Aleksandr Petiushko, Animesh Garg
- Reason: This paper presents a novel method for safely managing multiple constraints in reinforcement learning, a capability that is critical for domains such as autonomous driving. Given the practical implications and author Animesh Garg's authority in the field, the paper shows high potential influence.
- 9.1 DynaMITE-RL: A Dynamic Model for Improved Temporal Meta-Reinforcement Learning
- Authors: Anthony Liang, Guy Tennenholtz, Chih-wei Hsu, Yinlam Chow, Erdem Bıyık, Craig Boutilier
- Reason: Introduces a significant advance in meta-reinforcement learning, with modifications that improve performance across a variety of environments. The authorship includes Craig Boutilier, a well-respected figure in the field, which adds to the credibility and potential impact of the paper.
- 9.0 Model-based deep reinforcement learning for accelerated learning from flow simulations
- Authors: Andre Weiner, Janis Geise
- Reason: Model-based reinforcement learning is critical for many real-world applications, including flow simulations, which carry significant practical implications. This paper demonstrates substantial computational savings, indicating notable potential influence.
- 8.9 HiMAP: Learning Heuristics-Informed Policies for Large-Scale Multi-Agent Pathfinding
- Authors: Huijie Tang, Federico Berto, Zihan Ma, Chuanbo Hua, Kyuree Ahn, Jinkyoo Park
- Reason: Accepted at AAMAS 2024, this paper addresses scalability challenges in multi-agent pathfinding with heuristics-informed policies (HiMAP), which could have a substantial impact on areas requiring large-scale coordination.
- 8.9 Graph Diffusion Policy Optimization
- Authors: Yijing Liu, Chao Du, Tianyu Pang, Chongxuan Li, Wei Chen, Min Lin
- Reason: Presents a new method for optimizing graph diffusion models using reinforcement learning, which is a novel approach with potential for significant impact in generating graphs for complex objectives. Empirical results supporting state-of-the-art performance add to the potential influence of this paper.
- 8.8 Combining Transformer based Deep Reinforcement Learning with Black-Litterman Model for Portfolio Optimization
- Authors: Ruoyu Sun, Angelos Stefanidis, Zhengyong Jiang, Jionglong Su
- Reason: Portfolio optimization is a high-impact area in finance, and the paper's novel integration of DRL with the Black-Litterman model shows promise for improved performance, indicating high potential influence in financial applications (see the sketch after this item).
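For readers unfamiliar with the Black-Litterman model, the minimal sketch below shows the standard posterior-return computation that the paper builds on. It is background only, not the paper's DRL integration, and all inputs (`Sigma`, `w_mkt`, the view matrices) are illustrative assumptions.

```python
import numpy as np

# Standard Black-Litterman posterior expected returns (illustrative values only;
# background for the model the paper combines with DRL, not the paper's method).
Sigma = np.array([[0.04, 0.01],
                  [0.01, 0.09]])        # asset-return covariance
w_mkt = np.array([0.6, 0.4])            # market-capitalization weights
delta, tau = 2.5, 0.05                  # risk aversion, prior-uncertainty scalar

Pi = delta * Sigma @ w_mkt              # implied equilibrium returns (reverse optimization)

P = np.array([[1.0, -1.0]])             # one relative view: asset 1 outperforms asset 2
Q = np.array([0.02])                    # ... by 2%
Omega = np.array([[0.0004]])            # uncertainty of that view

# Posterior returns blend the equilibrium prior with the investor views.
A = np.linalg.inv(tau * Sigma) + P.T @ np.linalg.inv(Omega) @ P
b = np.linalg.inv(tau * Sigma) @ Pi + P.T @ np.linalg.inv(Omega) @ Q
mu_bl = np.linalg.solve(A, b)
print("Black-Litterman posterior returns:", mu_bl)
```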
- 8.7 Foundation Policies with Hilbert Representations
- Authors: Seohong Park, Tobias Kreiman, Sergey Levine
- Reason: Co-authored by Sergey Levine, a well-known figure in reinforcement learning, the paper introduces a novel approach for pre-training generalist policies, representing a significant step in unsupervised reinforcement learning.
- 8.7 Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning
- Authors: Lunet Yifru, Ali Baheri
- Reason: Addresses the critical issue of safety in reinforcement learning, proposing an approach for concurrent learning of policy and safety constraints. The relevance of safety and adaptability in real-world applications heightens the importance of this work.
- 8.6 Behavioral Refinement via Interpolant-based Policy Diffusion
- Authors: Kaiqi Chen, Eugene Lim, Kelvin Lin, Yiyang Chen, Harold Soh
- Reason: Offers an advance in imitation learning through interpolant-based policy diffusion. Given the authors' credibility and the combination of theoretical and empirical contributions, this paper stands out for its innovative approach to a relevant problem in imitation learning.
- 8.5 Truly No-Regret Learning in Constrained MDPs
- Authors: Adrian Müller, Pragnya Alatur, Volkan Cevher, Giorgia Ramponi, Niao He
- Reason: The paper confronts a vital challenge in reinforcement learning with constrained MDPs (CMDPs): providing algorithms that keep the agent safe during the learning process itself (the standard CMDP objective is sketched below). The authors' backgrounds suggest this work will be influential in promoting safe learning.
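As background, and not specific to this paper's algorithm, a constrained MDP asks the policy to maximize expected return while keeping each expected cumulative cost below a budget. A standard statement of the objective is:

```latex
% Standard constrained-MDP objective: maximize discounted return subject to
% per-constraint budgets d_i on the expected discounted costs c_i.
\[
  \max_{\pi} \; \mathbb{E}_{\pi}\!\Big[\sum_{t=0}^{\infty} \gamma^{t}\, r(s_t, a_t)\Big]
  \quad \text{s.t.} \quad
  \mathbb{E}_{\pi}\!\Big[\sum_{t=0}^{\infty} \gamma^{t}\, c_i(s_t, a_t)\Big] \le d_i,
  \qquad i = 1, \dots, m .
\]
```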
- 8.5 Q-FOX Learning: Breaking Tradition in Reinforcement Learning
- Authors: Mahmood Alqaseer, Yossra H. Ali, Tarik A. Rashid
- Reason: Proposes an innovative technique for hyperparameter tuning in RL, which is a critical aspect of model performance. The paper presents strong empirical results, suggesting it could be influential for researchers looking to optimize RL algorithms.
- 8.3 Teacher-Student Learning on Complexity in Intelligent Routing
- Authors: Shu-Ting Pi, Michael Yang, Yuying Zhu, Qun Liu
- Reason: Addressing e-commerce customer service, an important commercial domain, this paper presents a machine learning framework to improve customer routing efficiency. Its acceptance at a KDD 2023 workshop signals its relevance and potential influence in applied machine learning.
- 8.2 Language-guided Skill Learning with Temporal Variational Inference
- Authors: Haotian Fu, Pratyusha Sharma, Elias Stengel-Eskin, George Konidaris, Nicolas Le Roux, Marc-Alexandre Côté, Xingdi Yuan
- Reason: Bridging natural language processing with reinforcement learning, this research could substantially contribute to the development of more intuitive human-robot interactions, though the specificity to skill learning may slightly narrow its scope of influence.