Show simple item record

contributor authorDong, Yujian
contributor authorWu, Tianyu
contributor authorSong, Chaoyang
date accessioned2025-04-21T10:18:59Z
date available2025-04-21T10:18:59Z
date copyright1/27/2025 12:00:00 AM
date issued2025
identifier issn1530-9827
identifier otherjcise_25_3_031004.pdf
identifier urihttp://yetl.yabesh.ir/yetl1/handle/yetl/4305925
description abstractModels based on the transformer architecture have seen widespread application across fields such as natural language processing (NLP), computer vision, and robotics, with large language models (LLMs) like ChatGPT revolutionizing machine understanding of human language and demonstrating impressive memory capacity and reproduction capabilities. Traditional machine learning algorithms struggle with catastrophic forgetting, detrimental to the diverse and generalized abilities required for robotic deployment. This article investigates the receptance weighted key value (RWKV) framework, known for its advanced capabilities in efficient and effective sequence modeling, integration with the decision transformer (DT), and experience replay architectures. It focuses on potential performance enhancements in sequence decision-making and lifelong robotic learning tasks. We introduce the decision-RWKV (DRWKV) model and conduct extensive experiments using the D4RL database within the OpenAI Gym environment and on the D’Claw platform to assess the DRWKV model’s performance in single-task tests and lifelong learning scenarios, showing its ability to handle multiple subtasks efficiently. The code for all algorithms, training, and image rendering in this study is available online (open source).
publisherThe American Society of Mechanical Engineers (ASME)
titleOptimizing Robotic Manipulation With Decision-RWKV: A Recurrent Sequence Modeling Approach for Lifelong Learning
typeJournal Paper
journal volume25
journal issue3
journal titleJournal of Computing and Information Science in Engineering
identifier doi10.1115/1.4067524
journal fristpage31004-1
journal lastpage31004-9
page9
treeJournal of Computing and Information Science in Engineering:;2025:;volume( 025 ):;issue: 003
contenttypeFulltext


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record