YaBeSH Engineering and Technology Library

    • Journals
    • PaperQuest
    • YSE Standards
    • YaBeSH
    • Login
    View Item 
    •   YE&T Library
    • ASME
    • Journal of Dynamic Systems, Measurement, and Control
    • View Item
    •   YE&T Library
    • ASME
    • Journal of Dynamic Systems, Measurement, and Control
    • View Item
    • All Fields
    • Source Title
    • Year
    • Publisher
    • Title
    • Subject
    • Author
    • DOI
    • ISBN
    Advanced Search
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Archive

    Convergence Properties of a Computational Learning Model for Unknown Markov Chains

    Source: Journal of Dynamic Systems, Measurement, and Control:;2009:;volume( 131 ):;issue: 004::page 41011
    Author:
    Andreas A. Malikopoulos
    DOI: 10.1115/1.3117202
    Publisher: The American Society of Mechanical Engineers (ASME)
    Abstract: The increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
    • Download: (143.5Kb)
    • Show Full MetaData Hide Full MetaData
    • Get RIS
    • Item Order
    • Go To Publisher
    • Price: 5000 Rial
    • Statistics

      Convergence Properties of a Computational Learning Model for Unknown Markov Chains

    URI
    http://yetl.yabesh.ir/yetl1/handle/yetl/140201
    Collections
    • Journal of Dynamic Systems, Measurement, and Control

    Show full item record

    contributor authorAndreas A. Malikopoulos
    date accessioned2017-05-09T00:32:10Z
    date available2017-05-09T00:32:10Z
    date copyrightJuly, 2009
    date issued2009
    identifier issn0022-0434
    identifier otherJDSMAA-26497#041011_1.pdf
    identifier urihttp://yetl.yabesh.ir/yetl/handle/yetl/140201
    description abstractThe increasing complexity of engineering systems has motivated continuing research on computational learning methods toward making autonomous intelligent systems that can learn how to improve their performance over time while interacting with their environment. These systems need not only to sense their environment, but also to integrate information from the environment into all decision-makings. The evolution of such systems is modeled as an unknown controlled Markov chain. In a previous research, the predictive optimal decision-making (POD) model was developed, aiming to learn in real time the unknown transition probabilities and associated costs over a varying finite time horizon. In this paper, the convergence of the POD to the stationary distribution of a Markov chain is proven, thus establishing the POD as a robust model for making autonomous intelligent systems. This paper provides the conditions that the POD can be valid, and be an interpretation of its underlying structure.
    publisherThe American Society of Mechanical Engineers (ASME)
    titleConvergence Properties of a Computational Learning Model for Unknown Markov Chains
    typeJournal Paper
    journal volume131
    journal issue4
    journal titleJournal of Dynamic Systems, Measurement, and Control
    identifier doi10.1115/1.3117202
    journal fristpage41011
    identifier eissn1528-9028
    treeJournal of Dynamic Systems, Measurement, and Control:;2009:;volume( 131 ):;issue: 004
    contenttypeFulltext
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian
     
    DSpace software copyright © 2002-2015  DuraSpace
    نرم افزار کتابخانه دیجیتال "دی اسپیس" فارسی شده توسط یابش برای کتابخانه های ایرانی | تماس با یابش
    yabeshDSpacePersian