Convergence Properties of a Computational Learning Model for Unknown Markov Chains

Andreas A. Malikopoulos

contributor author	Andreas A. Malikopoulos
date accessioned	2017-05-09T00:32:10Z
date available	2017-05-09T00:32:10Z
date copyright	July, 2009
date issued	2009
identifier issn	0022-0434
identifier other	JDSMAA-26497#041011_1.pdf
identifier uri	http://yetl.yabesh.ir/yetl/handle/yetl/140201
description abstract	The increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
publisher	The American Society of Mechanical Engineers (ASME)
title	Convergence Properties of a Computational Learning Model for Unknown Markov Chains
type	Journal Paper
journal volume	131
journal issue	4
journal title	Journal of Dynamic Systems, Measurement, and Control
identifier doi	10.1115/1.3117202
journal fristpage	41011
identifier eissn	1528-9028
tree	Journal of Dynamic Systems, Measurement, and Control:;2009:;volume( 131 ):;issue: 004
contenttype	Fulltext

Files in this item

Name:: JDSMAA-26497#041011_1.pdf
Size:: 143.5Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Journal of Dynamic Systems, Measurement, and Control

Show simple item record

YaBeSH Engineering and Technology Library

Archive

Convergence Properties of a Computational Learning Model for Unknown Markov Chains

Files in this item

This item appears in the following Collection(s)