Optimizing Robotic Manipulation With Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning

Dong, Yujian; Wu, Tianyu; Song, Chaoyang

Source: Journal of Computing and Information Science in Engineering, 2025, volume 25, issue 3, page 31004-1

DOI: 10.1115/1.4067524

Publisher: The American Society of Mechanical Engineers (ASME)

Abstract: Models based on the transformer architecture have seen widespread application across fields such as natural language processing (NLP), computer vision, and robotics, with large language models (LLMs) like ChatGPT revolutionizing machine understanding of human language and demonstrating impressive memory capacity and reproduction capabilities. Traditional machine learning algorithms, however, struggle with catastrophic forgetting, which is detrimental to the diverse and generalized abilities required for robotic deployment. This article investigates the receptance weighted key value (RWKV) framework, known for efficient and effective sequence modeling, and its integration with the decision transformer (DT) and experience replay architectures. It focuses on potential performance enhancements in sequential decision-making and lifelong robotic learning tasks. We introduce the decision-RWKV (DRWKV) model and conduct extensive experiments using the D4RL database within the OpenAI Gym environment and on the D’Claw platform to assess the DRWKV model’s performance in single-task tests and lifelong learning scenarios, showing its ability to handle multiple subtasks efficiently. The code for all algorithms, training, and image rendering in this study is available online as open source.

URI

http://yetl.yabesh.ir/yetl1/handle/yetl/4305925

Collections

Journal of Computing and Information Science in Engineering

contributor author	Dong, Yujian
contributor author	Wu, Tianyu
contributor author	Song, Chaoyang
date accessioned	2025-04-21T10:18:59Z
date available	2025-04-21T10:18:59Z
date copyright	2025-01-27
date issued	2025
identifier issn	1530-9827
identifier other	jcise_25_3_031004.pdf
identifier uri	http://yetl.yabesh.ir/yetl1/handle/yetl/4305925
publisher	The American Society of Mechanical Engineers (ASME)
title	Optimizing Robotic Manipulation With Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning
type	Journal Paper
journal volume	25
journal issue	3
journal title	Journal of Computing and Information Science in Engineering
identifier doi	10.1115/1.4067524
journal firstpage	31004-1
journal lastpage	31004-9
page	9
tree	Journal of Computing and Information Science in Engineering; 2025; volume 25; issue 3
contenttype	Fulltext
