
Optimal stopping in a partially observable binary-valued Markov chain with costly perfect information

Published online by Cambridge University Press:  14 July 2016

George E. Monahan*
Affiliation:
Georgia Institute of Technology
* Postal address: College of Management, Georgia Institute of Technology, Atlanta, GA 30332, U.S.A.

Abstract

The problem of optimal stopping in a Markov chain when there is imperfect state information is formulated as a partially observable Markov decision process. Properties of the optimal value function are developed. It is shown that under mild conditions the optimal policy is well structured. An efficient algorithm, which uses the structural information in the computation of the optimal policy, is presented.
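To make the formulation concrete, the following is a minimal belief-state value-iteration sketch for a generic two-state (binary-valued) partially observed chain in which the decision maker may stop, continue uninformed, or pay for perfect information about the current state. Every quantity below (transition matrix, rewards, costs, discount factor, grid size) is an illustrative assumption, not a value from the paper, and the paper's structural results and specialized algorithm are not reproduced here.

```python
import numpy as np

# Illustrative two-state model (all numbers are placeholder assumptions).
P = np.array([[0.9, 0.1],      # transition matrix of the hidden chain
              [0.0, 1.0]])     # state 1 ("bad") is absorbing in this example
r_stop = np.array([10.0, 0.0]) # terminal reward if we stop in state 0 / state 1
c_cont = 1.0                   # per-period cost of continuing uninformed
c_info = 0.5                   # cost of buying perfect state information
beta = 0.95                    # discount factor

grid = np.linspace(0.0, 1.0, 201)  # belief p = P(hidden state is 1)
V = np.zeros_like(grid)

def interp(V, p):
    """Linear interpolation of the value function on the belief grid."""
    return np.interp(p, grid, V)

for _ in range(500):
    V_new = np.empty_like(V)
    for i, p in enumerate(grid):
        belief = np.array([1.0 - p, p])
        # Stop: collect the expected terminal reward under the current belief.
        v_stop = belief @ r_stop
        # Continue uninformed: pay c_cont, the belief evolves by the chain.
        p_next = belief @ P[:, 1]
        v_cont = -c_cont + beta * interp(V, p_next)
        # Buy perfect information: pay c_info, learn the true state, then the
        # chain moves forward from a degenerate belief (0 or 1).
        v_info = -c_info + beta * (belief[0] * interp(V, P[0, 1])
                                   + belief[1] * interp(V, P[1, 1]))
        V_new[i] = max(v_stop, v_cont, v_info)
    if np.max(np.abs(V_new - V)) < 1e-8:
        V = V_new
        break
    V = V_new
```

In sketches of this kind the maximization over the belief typically yields a simple control-limit form, which is the sort of well-structured policy the abstract refers to; the efficient algorithm of the paper exploits that structure rather than iterating on a grid as above.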

Type
Research Paper
Copyright
Copyright © Applied Probability Trust 1982 

