
COMPUTING AVERAGE OPTIMAL CONSTRAINED POLICIES IN STOCHASTIC DYNAMIC PROGRAMMING

Published online by Cambridge University Press:  07 February 2001

Linn I. Sennott
Affiliation: Department of Mathematics, Illinois State University, Normal, Illinois 61790-4520. E-mail: sennott@math.ilstu.edu

Abstract

A stochastic dynamic program incurs two types of cost: a service cost and a quality-of-service (delay) cost. The objective is to minimize the expected average service cost, subject to a constraint on the average quality-of-service cost. When the state space S is finite, we show how to compute an optimal policy for the general constrained problem under weak conditions. The development uses a Lagrange multiplier approach and value iteration. When S is denumerably infinite, we give a method for computing an optimal policy using a sequence of approximating finite-state problems. The method is illustrated with two computational examples.
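The abstract's finite-state approach (a Lagrange multiplier combined with value iteration) can be illustrated with a minimal numerical sketch. The code below is not the paper's algorithm; it is a generic Lagrangian scheme under illustrative assumptions: a unichain MDP given by transition matrices P[a] and per-stage costs c_serv[a], c_delay[a], relative value iteration on the combined cost c_serv + λ·c_delay, and bisection on λ to drive the resulting policy's average delay cost toward the constraint level α. The function names (relative_value_iteration, average_costs, constrained_policy) and parameters are hypothetical. Note that an optimal constrained policy may in general randomize between two deterministic policies at the critical multiplier; this sketch returns only a deterministic approximation.

```python
import numpy as np

def relative_value_iteration(P, cost, n_iter=2000):
    """Relative value iteration for an average-cost MDP.
    P[a] is an (n, n) transition matrix; cost[a] is a length-n cost vector."""
    n_states = P[0].shape[0]
    h = np.zeros(n_states)
    for _ in range(n_iter):
        # Q[a, s] = one-step cost + expected relative value of the next state
        Q = np.array([cost[a] + P[a] @ h for a in range(len(P))])
        h_new = Q.min(axis=0)
        g = h_new[0]          # gain estimate, using state 0 as reference
        h = h_new - g         # recenter to keep the iterates bounded
    policy = Q.argmin(axis=0)
    return policy, g

def average_costs(P, c_serv, c_delay, policy, n_iter=5000):
    """Long-run average service and delay costs of a stationary policy,
    via the stationary distribution of the induced chain (assumed ergodic)."""
    n = P[0].shape[0]
    P_pi = np.array([P[policy[s]][s] for s in range(n)])
    mu = np.ones(n) / n
    for _ in range(n_iter):
        mu = mu @ P_pi
    serv = sum(mu[s] * c_serv[policy[s]][s] for s in range(n))
    delay = sum(mu[s] * c_delay[policy[s]][s] for s in range(n))
    return serv, delay

def constrained_policy(P, c_serv, c_delay, alpha, lam_hi=100.0, tol=1e-4):
    """Bisection on the Lagrange multiplier: minimize the average of
    c_serv + lam * c_delay, pushing the policy's average delay toward alpha."""
    lam_lo = 0.0
    while lam_hi - lam_lo > tol:
        lam = 0.5 * (lam_lo + lam_hi)
        cost = [c_serv[a] + lam * c_delay[a] for a in range(len(P))]
        policy, _ = relative_value_iteration(P, cost)
        _, delay = average_costs(P, c_serv, c_delay, policy)
        if delay > alpha:
            lam_lo = lam      # constraint violated: penalize delay more heavily
        else:
            lam_hi = lam      # feasible: try a smaller multiplier
    return policy, lam
```

For a denumerably infinite state space, the paper's method would instead be approximated here by truncating the chain to its first N states and rerunning constrained_policy for increasing N; the sketch above only covers the finite-state case.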

Type: Research Article
Copyright: © 2001 Cambridge University Press
