- 9.7 Robotic Table Tennis: A Case Study into a High Speed Learning System
- Authors: David B. D’Ambrosio, Jonathan Abelian, Saminda Abeyruwan, Michael Ahn, Alex Bewley, Justin Boyd, Krzysztof Choromanski, Omar Cortes, Erwin Coumans, Tianli Ding, Wenbo Gao, Laura Graesser, Atil Iscen, Navdeep Jaitly, Deepali Jain, Juhana Kangaspunta, Satoshi Kataoka, Gus Kouretas, Yuheng Kuang, Nevena Lazic, Corey Lynch, Reza Mahjourian, Sherry Q. Moore, Thinh Nguyen, Ken Oslund, Barney J Reed, Krista Reymann, Pannag R. Sanketi, Anish Shankar, Pierre Sermanet, Vikas Sindhwani, Avi Singh, Vincent Vanhoucke, Grace Vesom, Peng Xu
- Reason: The paper provides a detailed system description and a collection of studies explaining the importance of various components in a high-speed learning system. This provides valuable insights into designing effective reinforcement learning systems for real-world applications.
- 9.6 DiffusionEngine: Diffusion Model is Scalable Data Engine for Object Detection
- Authors: Manlin Zhang, Jie Wu, Yuxi Ren, Ming Li, Jie Qin, Xuefeng Xiao, Wei Liu, Rui Wang, Min Zheng, Andy J. Ma
- Reason: This paper presents a promising approach for scaling up detection-oriented data. The approach combines a pre-trained diffusion model with a Detection-Adapter that aligns the implicit semantic and location knowledge in off-the-shelf diffusion models with detection-aware signals.
- 9.5 REBOOT: Reuse Data for Bootstrapping Efficient Real-World Dexterous Manipulation
- Authors: Zheyuan Hu, Aaron Rovinsky, Jianlan Luo, Vikash Kumar, Abhishek Gupta, Sergey Levine
- Reason: The paper introduces an efficient system for learning dexterous manipulation skills with RL. It provides practical solutions to the challenges faced in real-world reinforcement learning applications.
- 9.5 ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation
- Authors: Hui Zhang, Sammy Christen, Zicong Fan, Luocheng Zheng, Jemin Hwangbo, Jie Song, Otmar Hilliges
- Reason: This paper presents an innovative method for synthesizing bi-manual hand-object interactions, including grasping and articulation. The approach uses reinforcement learning and physics simulations to control hand pose.
- 9.2 Scalable Learning of Intrusion Responses through Recursive Decomposition
- Authors: Kim Hammar, Rolf Stadler
- Reason: The paper addresses a significant problem in the field of cybersecurity, introducing a novel method for automated intrusion responses. The practical applicability and the use of reinforcement learning techniques enhance its potential influence.
- 9.2 Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning
- Authors: Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey Levine
- Reason: This research describes an algorithm for training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning, which is crucial for adaptive interfaces.
- 9.0 Evaluation of Reinforcement Learning Techniques for Trading on a Diverse Portfolio
- Authors: Ishan S. Khare, Tarun K. Martheswaran, Akshana Dassanaike-Perera, Jonah B. Ezekiel
- Reason: The paper presents key findings on the use of reinforcement learning techniques for trading on the S&P 500 index. The detailed analysis and model results could influence the application of RL in financial market predictions and trading strategies.
- 9.0 A Function Interpretation Benchmark for Evaluating Interpretability Methods
- Authors: Sarah Schwettmann, Tamar Rott Shaham, Joanna Materzynska, Neil Chowdhury, Shuang Li, Jacob Andreas, David Bau, Antonio Torralba
- Reason: An important contribution that introduces a benchmark suite for evaluating the building blocks of automated interpretability methods. It’s useful for assessing the performance of more sophisticated interpretability tools.
- 8.9 DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
- Authors: Yung-Sung Chuang, Yujia Xie, Hongyin Luo, Yoon Kim, James Glass, Pengcheng He
- Reason: The paper introduces a novel decoding strategy that reduces hallucinations in large language models. This method of contrasting layers could fundamentally change how we interpret and utilize large language models.
- 8.6 Large Language Models as Optimizers
- Authors: Chengrun Yang, Xuezhi Wang, Yifeng Lu, Hanxiao Liu, Quoc V. Le, Denny Zhou, Xinyun Chen
- Reason: This research proposes a novel approach to use large language models as optimizers where the task is described in natural language. This can significantly affect the way we interact with machine learning models and expand their scope of application.