I made a GPT-4 service that generates a daily compilation of recent, notable papers from arXiv 😉

The number before each title is its importance score.

Tue, 7 May 2024

9.2 Finite-Time Convergence and Sample Complexity of Actor-Critic Multi-Objective Reinforcement Learning
- Authors: Tianchen Zhou, FNU Hairi, Haibo Yang, Jia Liu, Tian Tong, Fan Yang, Michinari Momma, Yan Gao
- Reason: The paper is accepted at a top-tier conference (ICML 2024) and tackles the under-explored multi-objective reinforcement learning (MORL), providing theoretical contributions including finite-time convergence analysis and sample complexity, which can significantly influence the understanding and application of MORL in various domains.
9.0 Federated Reinforcement Learning with Constraint Heterogeneity
- Authors: Hao Jin, Liangyu Zhang, Zhihua Zhang
- Reason: In the trending field of federated learning, this paper adds the important dimension of reinforcement learning with multiple constraints, suited for practical applications like healthcare, which is highly relevant and can drive future research in federated RL.
8.9 Deep Reinforcement Learning for Modelling Protein Complexes
- Authors: Tao Feng, Ziqi Gao, Jiaxuan You, Chenyi Zi, Yan Zhou, Chen Zhang, Jia Li
- Reason: Addresses a challenging bioinformatics problem with potentially high impact. Presented at a reputable conference (ICLR 2024), it demonstrates significant accuracy and efficiency improvements and is relevant to current high-interest AI topics such as AlphaFold.
8.7 Natural Policy Gradient and Actor Critic Methods for Constrained Multi-Task Reinforcement Learning
- Authors: Sihan Zeng, Thinh T. Doan, Justin Romberg
- Reason: Addresses multi-task learning, an area of growing interest, with solutions for both centralized and decentralized settings. The paper presents theoretical convergence results and practical algorithmic contributions.
8.7 Policy Learning for Balancing Short-Term and Long-Term Rewards
- Authors: Peng Wu, Ziyu Shen, Feng Xie, Zhongyao Wang, Chunchen Liu, Yan Zeng
- Reason: The paper presents a new policy learning framework that balances short-term and long-term rewards, with sound theoretical foundations including identifiability, efficiency bounds, and algorithms with proven regret convergence rates. This addresses a core challenge in reinforcement learning and could be influential for both theory and practice.
8.5 Proximal Curriculum with Task Correlations for Deep Reinforcement Learning
- Authors: Georgios Tzannetos, Parameswaran Kamalaruban, Adish Singla
- Reason: Introduces a novel curriculum strategy that could potentially speed up the training of deep RL agents, backed by theoretical and empirical results, contributing to the advancement of learning curricula in RL.
8.5 Reverse Forward Curriculum Learning for Extreme Sample and Demonstration Efficiency in Reinforcement Learning
- Authors: Stone Tao, Arth Shukla, Tse-kai Chan, Hao Su
- Reason: Accepted at ICLR 2024, this paper proposes an innovative curriculum learning approach for reinforcement learning, demonstrating significant improvements in sample and demonstration efficiency. It offers a solution to one of RL’s biggest hurdles: data efficiency.
8.4 Enhancing Q-Learning with Large Language Model Heuristics
- Authors: Xiefeng Wu
- Reason: The paper proposes an interesting blend of reinforcement learning and natural language processing by integrating large language model heuristics into Q-learning. Despite the lower impact due to potential text overlap with previous work, the approach could be influential for future work at the intersection of RL and NLP.
8.3 Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline
- Authors: Wenjia Meng, Qian Zheng, Long Yang, Yilong Yin, Gang Pan
- Reason: Proposes a method to reduce the high variance of off-policy policy gradient methods, which could make these methods more practical and effective.
8.1 CTD4 - A Deep Continuous Distributional Actor-Critic Agent with a Kalman Fusion of Multiple Critics
- Authors: David Valencia, Henry Williams, Trevor Gee, Bruce A MacDonald, Minas Liarokapis
- Reason: Offers a new distributional RL algorithm tailored for continuous action spaces that addresses practical challenges of CDRL and can be more sample-efficient, which could influence continuous control in RL.

Work

acrossB

2022. 9 - Present
- Developing machine learning systems (models, infrastructure, etc.) for business products on a global logistics platform
- Researching stochastic forecasting of future retail sales, accounting for promotions, new releases, and cannibalization
- Developing a design pattern broadly applicable to ML/DL
- Developing a cloud-based system for model serving and monitoring
Korea Trade Network

2020. 5 - 2022. 9
- Prototyped decentralized identity (DID) on a Kubernetes cluster
- Developed a connection between a cloud public certificate service and a vanilla JS-based browser certificate service

Education

M.S. in Physics

Seoul National University, Seoul, Korea (2015. 3 - 2020. 8)
- Researched computational approaches for calculating ultracold quantum states, advised by Uwe R. Fischer
- Left the doctoral program and graduated with a master's degree
Thesis: Benchmarking the multiconfigurational Hartree method by the exact wavefunction of two harmonically trapped bosons with contact interaction
B.S. in Physics

Korea Advanced Institute of Science and Technology, Daejeon, Korea (2011. 2 - 2015. 2)
- Studied the dynamics and properties of academic citation networks, advised by Hawoong Jeong

Projects

2019 - 2020. Forecasting stock prices using the A2C algorithm
2021 - 2022. Developing a backbone model for computer vision
2022 - 2023. Developing a deep learning model for time series forecasting of sales
2023 - Present. Investigating recent developments in reinforcement learning

Competition

2017. 2nd place, Korea Super-Computing Challenge

A competition to parallelize existing code for various engineering tasks using OpenMPI to improve performance

Publications

Gwak, Yeongjin, Oleksandr V. Marchukov, and Uwe R. Fischer. "Benchmarking the multiconfigurational Hartree method by the exact wavefunction of two harmonically trapped bosons with contact interaction." Annals of Physics 434 (2021): 168592