Arxiv RL Today CV Research Interest

I made GPT-4 service to generates a daily compilation of recent, notable papers from arXiv 😉

Number before title corresponds to importance score

Tue, 7 May 2024

  1. 9.2 Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
    • Authors: Tianchen Zhou, FNU Hairi, Haibo Yang, Jia Liu, Tian Tong, Fan Yang, Michinari Momma, Yan Gao
    • Reason: The paper is accepted at a top-tier conference (ICML 2024) and tackles the under-explored multi-objective reinforcement learning (MORL), providing theoretical contributions including finite-time convergence analysis and sample complexity, which can significantly influence the understanding and application of MORL in various domains.
  2. 9.0 Federated Reinforcement Learning with Constraint Heterogeneity
    • Authors: Hao Jin, Liangyu Zhang, Zhihua Zhang
    • Reason: In the trending field of federated learning, this paper adds the important dimension of reinforcement learning with multiple constraints, suited for practical applications like healthcare, which is highly relevant and can drive future research in federated RL.
  3. 8.9 Deep Reinforcement Learning for Modelling Protein Complexes
    • Authors: Tao Feng, Ziqi Gao, Jiaxuan You, Chenyi Zi, Yan Zhou, Chen Zhang, Jia Li
    • Reason: Addresses a challenging problem in bioinformatics with potential high impact, presented at a reputable conference (ICLR 2024), significant accuracy and efficiency improvements demonstrated, and relevance to current high-interest topics in AI such as AlphaFold.
  4. 8.7 Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
    • Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg
    • Reason: Addresses multi-task learning, an area of growing interest, with solutions for both centralized and decentralized settings. The paper presents theoretical convergence results and practical algorithmic contributions.
  5. 8.7 Policy Learning for Balancing Short-Term and Long-Term Rewards
    • Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng
    • Reason: The paper presents a new policy learning framework that balances short-term and long-term rewards, with sound theoretical foundations including identifiability, efficiency bounds, and algorithms with proven regret convergence rates. This addresses a core challenge in reinforcement learning and could be influential for both theory and practice.
  6. 8.5 Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
    • Authors: Georgios Tzannetos, Parameswaran Kamalaruban, Adish Singla
    • Reason: Introduces a novel curriculum strategy that could potentially speed up the training of deep RL agents, backed by theoretical and empirical results, contributing to the advancement of learning curricula in RL.
  7. 8.5 Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning
    • Authors: Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su
    • Reason: Accepted at ICLR 2024, this paper proposes an innovative curriculum learning approach for reinforcement learning, demonstrating significant improvements in sample and demonstration efficiency. It offers a solution to one of RL’s biggest hurdles: data efficiency.
  8. 8.4 Enhancing Q-Learning with Large Language Model Heuristics
    • Authors: Xiefeng Wu
    • Reason: The paper proposes an interesting blend of reinforcement learning and natural language processing by integrating large language model heuristics into Q-learning. Despite the lower impact due to potential text overlap with previous work, the approach could be influential for future work at the intersection of RL and NLP.
  9. 8.3 Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
    • Authors: Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan
    • Reason: Proposes a method to decrease the high variance issue in off-policy policy gradient methods, which could be influential in making these methods more practical and effective.
  10. 8.1 CTD4 - A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
    • Authors: David Valencia, Henry Williams, Trevor Gee, Bruce A MacDonaland, Minas Liarokapis
    • Reason: Offers a new distributional RL algorithm tailored for continuous action spaces, addressing practical challenges of CDRL and can be more sample-efficient, which could influence continuous control problems in RL.

Mon, 6 May 2024

Fri, 3 May 2024

Thu, 2 May 2024

Wed, 1 May 2024

Tue, 30 Apr 2024

Mon, 29 Apr 2024

Fri, 26 Apr 2024

Thu, 25 Apr 2024

Wed, 24 Apr 2024

Tue, 23 Apr 2024

Mon, 22 Apr 2024

Fri, 19 Apr 2024

Thu, 18 Apr 2024

Wed, 17 Apr 2024

Tue, 16 Apr 2024

Mon, 15 Apr 2024

Fri, 12 Apr 2024

Thu, 11 Apr 2024

Wed, 10 Apr 2024

Tue, 9 Apr 2024

Mon, 8 Apr 2024

Fri, 5 Apr 2024

Thu, 4 Apr 2024

Wed, 3 Apr 2024

Tue, 2 Apr 2024

Mon, 1 Apr 2024

Fri, 29 Mar 2024

Thu, 28 Mar 2024

Wed, 27 Mar 2024

Tue, 26 Mar 2024

Mon, 25 Mar 2024

Fri, 22 Mar 2024

Thu, 21 Mar 2024

Wed, 20 Mar 2024

Tue, 19 Mar 2024

Mon, 18 Mar 2024

Fri, 15 Mar 2024

Thu, 14 Mar 2024

Wed, 13 Mar 2024

Tue, 12 Mar 2024

Mon, 11 Mar 2024

Fri, 8 Mar 2024

Thu, 7 Mar 2024

Wed, 6 Mar 2024

Tue, 5 Mar 2024

Mon, 4 Mar 2024

Fri, 1 Mar 2024

Thu, 29 Feb 2024

Wed, 28 Feb 2024

Tue, 27 Feb 2024

Mon, 26 Feb 2024

Fri, 23 Feb 2024

Thu, 22 Feb 2024

Wed, 21 Feb 2024

Tue, 20 Feb 2024

Mon, 19 Feb 2024

Fri, 16 Feb 2024

Thu, 15 Feb 2024

Wed, 14 Feb 2024

Tue, 13 Feb 2024

Mon, 12 Feb 2024

Fri, 9 Feb 2024

Thu, 8 Feb 2024

Wed, 7 Feb 2024

Tue, 6 Feb 2024

Mon, 5 Feb 2024

Fri, 2 Feb 2024

Thu, 1 Feb 2024

Wed, 31 Jan 2024

Tue, 30 Jan 2024

Mon, 29 Jan 2024

Fri, 26 Jan 2024

Thu, 25 Jan 2024

Wed, 24 Jan 2024

Tue, 23 Jan 2024

Mon, 22 Jan 2024

Fri, 19 Jan 2024

Thu, 18 Jan 2024

Wed, 17 Jan 2024

Mon, 15 Jan 2024

Fri, 12 Jan 2024

Thu, 11 Jan 2024

Wed, 10 Jan 2024

Tue, 9 Jan 2024

Mon, 8 Jan 2024

Fri, 5 Jan 2024

Thu, 4 Jan 2024

Wed, 3 Jan 2024

Tue, 2 Jan 2024

Mon, 1 Jan 2024

Fri, 29 Dec 2023

Wed, 27 Dec 2023

Mon, 25 Dec 2023

Fri, 22 Dec 2023

Thu, 21 Dec 2023

Wed, 20 Dec 2023

Tue, 19 Dec 2023

Mon, 18 Dec 2023

Fri, 15 Dec 2023

Thu, 14 Dec 2023

Wed, 13 Dec 2023

Tue, 12 Dec 2023

Mon, 11 Dec 2023

Fri, 8 Dec 2023

Thu, 7 Dec 2023

Wed, 6 Dec 2023

Tue, 5 Dec 2023

Mon, 4 Dec 2023

Fri, 1 Dec 2023

Thu, 30 Nov 2023

Wed, 29 Nov 2023

Tue, 28 Nov 2023

Mon, 27 Nov 2023

Thu, 23 Nov 2023

Wed, 22 Nov 2023

Tue, 21 Nov 2023

Mon, 20 Nov 2023

Fri, 17 Nov 2023

Thu, 16 Nov 2023

Wed, 15 Nov 2023

Tue, 14 Nov 2023

Mon, 13 Nov 2023

Fri, 10 Nov 2023

Thu, 9 Nov 2023

Wed, 8 Nov 2023

Tue, 7 Nov 2023

Mon, 6 Nov 2023

Fri, 3 Nov 2023

Thu, 2 Nov 2023

Wed, 1 Nov 2023

Tue, 31 Oct 2023

Mon, 30 Oct 2023

Fri, 27 Oct 2023

Thu, 26 Oct 2023

Wed, 25 Oct 2023

Tue, 24 Oct 2023

Mon, 23 Oct 2023

Fri, 20 Oct 2023

Thu, 19 Oct 2023

Wed, 18 Oct 2023

Tue, 17 Oct 2023

Mon, 16 Oct 2023

Fri, 13 Oct 2023

Thu, 12 Oct 2023

Wed, 11 Oct 2023

Tue, 10 Oct 2023

Mon, 9 Oct 2023

Fri, 6 Oct 2023

Thu, 5 Oct 2023

Wed, 4 Oct 2023

Tue, 3 Oct 2023

Mon, 2 Oct 2023

Fri, 29 Sep 2023

Thu, 28 Sep 2023

Wed, 27 Sep 2023

Tue, 26 Sep 2023

Mon, 25 Sep 2023

Fri, 22 Sep 2023

Thu, 21 Sep 2023

Wed, 20 Sep 2023

Tue, 19 Sep 2023

Mon, 18 Sep 2023

Fri, 15 Sep 2023

Thu, 14 Sep 2023

Wed, 13 Sep 2023

Tue, 12 Sep 2023

Mon, 11 Sep 2023

Fri, 8 Sep 2023

Thu, 7 Sep 2023

Wed, 6 Sep 2023

Mon, 4 Sep 2023

Fri, 1 Sep 2023

Thu, 31 Aug 2023

Wed, 30 Aug 2023

Tue, 29 Aug 2023

Mon, 28 Aug 2023

Fri, 25 Aug 2023

Thu, 24 Aug 2023

Wed, 23 Aug 2023

Tue, 22 Aug 2023

Mon, 21 Aug 2023

Thu, 17 Aug 2023

Wed, 16 Aug 2023

Tue, 15 Aug 2023

Mon, 14 Aug 2023

Fri, 11 Aug 2023

Thu, 10 Aug 2023

Wed, 9 Aug 2023

Tue, 8 Aug 2023

Mon, 7 Aug 2023

Fri, 4 Aug 2023

Thu, 3 Aug 2023

Wed, 2 Aug 2023

Tue, 1 Aug 2023

Mon, 31 Jul 2023

Fri, 28 Jul 2023

Thu, 27 Jul 2023

Wed, 26 Jul 2023

Tue, 25 Jul 2023

Mon, 24 Jul 2023

Fri, 21 Jul 2023

Thu, 20 Jul 2023

Wed, 19 Jul 2023

Tue, 18 Jul 2023

Mon, 17 Jul 2023

Fri, 14 Jul 2023

Thu, 13 Jul 2023

Wed, 12 Jul 2023

Tue, 11 Jul 2023

Mon, 10 Jul 2023

Fri, 7 Jul 2023

Thu, 6 Jul 2023

Tue, 4 Jul 2023

Mon, 3 Jul 2023

Fri, 30 Jun 2023

Thu, 29 Jun 2023

Wed, 28 Jun 2023

Tue, 27 Jun 2023

Mon, 26 Jun 2023

Fri, 23 Jun 2023

Thu, 22 Jun 2023

Wed, 21 Jun 2023

Mon, 19 Jun 2023

Fri, 16 Jun 2023

Wed, 14 Jun 2023

Tue, 13 Jun 2023

Mon, 12 Jun 2023

Fri, 9 Jun 2023

Thu, 8 Jun 2023