Optimizing Robotic Manipulation With Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong LearningSource: Journal of Computing and Information Science in Engineering:;2025:;volume( 025 ):;issue: 003::page 31004-1DOI: 10.1115/1.4067524Publisher: The American Society of Mechanical Engineers (ASME)
Abstract: Models based on the transformer architecture have seen widespread application across fields such as natural language processing (NLP), computer vision, and robotics, with large language models (LLMs) like ChatGPT revolutionizing machine understanding of human language and demonstrating impressive memory capacity and reproduction capabilities. Traditional machine learning algorithms struggle with catastrophic forgetting, detrimental to the diverse and generalized abilities required for robotic deployment. This article investigates the receptance weighted key value (RWKV) framework, known for its advanced capabilities in efficient and effective sequence modeling, integration with the decision transformer (DT), and experience replay architectures. It focuses on potential performance enhancements in sequence decision-making and lifelong robotic learning tasks. We introduce the decision-RWKV (DRWKV) model and conduct extensive experiments using the D4RL database within the OpenAI Gym environment and on the D’Claw platform to assess the DRWKV model’s performance in single-task tests and lifelong learning scenarios, showing its ability to handle multiple subtasks efficiently. The code for all algorithms, training, and image rendering in this study is available online (open source).
|
Show full item record
contributor author | Dong, Yujian | |
contributor author | Wu, Tianyu | |
contributor author | Song, Chaoyang | |
date accessioned | 2025-04-21T10:18:59Z | |
date available | 2025-04-21T10:18:59Z | |
date copyright | 1/27/2025 12:00:00 AM | |
date issued | 2025 | |
identifier issn | 1530-9827 | |
identifier other | jcise_25_3_031004.pdf | |
identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4305925 | |
description abstract | Models based on the transformer architecture have seen widespread application across fields such as natural language processing (NLP), computer vision, and robotics, with large language models (LLMs) like ChatGPT revolutionizing machine understanding of human language and demonstrating impressive memory capacity and reproduction capabilities. Traditional machine learning algorithms struggle with catastrophic forgetting, detrimental to the diverse and generalized abilities required for robotic deployment. This article investigates the receptance weighted key value (RWKV) framework, known for its advanced capabilities in efficient and effective sequence modeling, integration with the decision transformer (DT), and experience replay architectures. It focuses on potential performance enhancements in sequence decision-making and lifelong robotic learning tasks. We introduce the decision-RWKV (DRWKV) model and conduct extensive experiments using the D4RL database within the OpenAI Gym environment and on the D’Claw platform to assess the DRWKV model’s performance in single-task tests and lifelong learning scenarios, showing its ability to handle multiple subtasks efficiently. The code for all algorithms, training, and image rendering in this study is available online (open source). | |
publisher | The American Society of Mechanical Engineers (ASME) | |
title | Optimizing Robotic Manipulation With Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning | |
type | Journal Paper | |
journal volume | 25 | |
journal issue | 3 | |
journal title | Journal of Computing and Information Science in Engineering | |
identifier doi | 10.1115/1.4067524 | |
journal fristpage | 31004-1 | |
journal lastpage | 31004-9 | |
page | 9 | |
tree | Journal of Computing and Information Science in Engineering:;2025:;volume( 025 ):;issue: 003 | |
contenttype | Fulltext |