Markov decision programming–the moment optimal problem for the first-passage model

Liu Jianyong; Liu Ke

doi:10.1017/S0334270000000850

Published online by Cambridge University Press: 17 February 2009

Abstract

In this paper, we discuss the moment optimal problem for the first-passage model in Markov decision programming (MDP). A policy improvement iteration algorithm is given for finding the k-moment optimal stationary policy.

Type: Research Article
Information: The ANZIAM Journal, Volume 38, Issue 4, April 1997, pp. 542–562

DOI: https://doi.org/10.1017/S0334270000000850
Copyright: Copyright © Australian Mathematical Society 1997
