On sets of countable non-negative matrices and Markov decision processes

Douglas P. Kennedy

doi:10.2307/1426638

On sets of countable non-negative matrices and Markov decision processes

Published online by Cambridge University Press: 01 July 2016

Douglas P. Kennedy

Show author details

Douglas P. Kennedy*: Affiliation:
University of Cambridge

Article contents

Abstract
References

Get access

Rights & Permissions

Abstract

Consider a set S of countable non-negative matrices satisfying the property that for any two indices i, j, for some n ≧ 1 there are matrices M1, M2, · · ·, Mn in S with (M1M2 · · · Mn)ij >0. For non-negative vectors x set Tx = supM∈SMx, where the supremum is taken separately in each coordinate. Assume that for each x with Tx finite in each coordinate there is a matrix in S which achieves the supremum simultaneously for all coordinates. With these two assumptions on S, the R-theory for a countable irreducible matrix is extended to the operator T. The results are used to consider the existence of stationary optimal policies for Markov decision processes with multiplicative rewards.

Keywords

NON-NEGATIVE MATRICES R-THEORY PERRON–FROBENIUS THEORY DYNAMIC PROGRAMMING MARKOV DECISION PROCESSES

Type: Research Article
Information: Advances in Applied Probability , Volume 10 , Issue 3 , September 1978 , pp. 633 - 646

DOI: https://doi.org/10.2307/1426638 [Opens in a new window]
Copyright: Copyright © Applied Probability Trust 1978

Access options

Get access to the full version of this content by using one of the access options below. (Log in options will check for institutional or personal access. Content may require purchase if you do not have access.)

References

[1] Bather, J. (1973) Optimal decision procedures for finite Markov chains, II. Adv. Appl. Prob. 5, 521–540.CrossRef Google Scholar

[2] Bellman, R. (1956) On a class of quasi-linear equations. Canadian J. Maths 8, 198–202.CrossRef Google Scholar

[3] Bellman, R. (1957) Dynamic Programming. Princeton University Press, Princeton, N.J. Google Scholar PubMed

[4] Hordijk, A. (1974) Dynamic Programming and Markov Potential Theory. Mathematisch Centrum, Amsterdam.Google Scholar

[5] Howard, P. A. and Matheson, J. E. (1972) Risk-sensitive Markov decision processes. Management Sci. 18, 356–369.CrossRef Google Scholar

[6] Mandl, P. and Seneta, E. (1969) The theory of non-negative matrices in a dynamic programming problem. Austral. J. Statist. 11, 85–96.CrossRef Google Scholar

[7] Seneta, E. (1973) Non-Negative Matrices. Allen and Unwin, London.Google Scholar

[8] Vere-Jones, D. (1967) Ergodic properties of non-negative matrices I. Pacific J. Maths 22, 361–386.CrossRef Google Scholar

Article contents

On sets of countable non-negative matrices and Markov decision processes

Abstract

Keywords

Access options

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests