Book contents
- Frontmatter
- Contents
- Preface
- 1 Introduction
- Part I Stochastic Models and Bayesian Filtering
- Part II Partially Observed Markov Decision Processes: Models and Applications
- 6 Fully observed Markov decision processes
- 7 Partially observed Markov decision processes (POMDPs)
- 8 POMDPs in controlled sensing and sensor scheduling
- Part III Partially Observed Markov Decision Processes: Structural Results
- Part IV Stochastic Approximation and Reinforcement Learning
- Appendix A Short primer on stochastic simulation
- Appendix B Continuous-time HMM filters
- Appendix C Markov processes
- Appendix D Some limit theorems
- References
- Index
8 - POMDPs in controlled sensing and sensor scheduling
from Part II - Partially Observed Markov Decision Processes: Models and Applications
Published online by Cambridge University Press: 05 April 2016
Summary
Introduction
Statistical signal processing deals with extracting signals from noisy measurements. Motivated by physical, communication and social constraints, this chapter addresses the deeper issue of how to dynamically control and optimize the signal processing resources that perform this extraction. In such controlled sensing problems, several types of sensors (or sensing modes) are available for measuring a given process. Associated with each sensor is a per-unit-of-time measurement cost, reflecting the fact that measurements that are more costly to make typically contain more reliable information. Which sensor (or sensing mode) should the decision-maker choose at each time instant to provide the next measurement? Such problems are motivated by technological advances in flexible sensor design, such as sophisticated multi-function radars that can be configured to operate in one of many modes for each measurement.
The controlled sensing problem considered in this chapter is also called the sensor scheduling, measurement control or active sensing problem. In the context of signal processing, the phrase sensor-adaptive signal processing is apt, since the sensors adapt (reconfigure) their sensing modes in real time. Controlled sensing arises in numerous applications, including adaptive radar (how many resources to devote to each target [246]), cognitive radio (how to sense the radio spectrum for available channels [353]) and social networks (how social sensors affect marketing and advertising strategies [185]).
This chapter discusses the formulation of controlled sensing problems through three examples. The first example considers state and measurement control of a linear Gaussian state space model; radar scheduling is discussed as an application. The second example deals with POMDPs in controlled sensing. Unlike the formulation in Chapter 7, POMDPs arising in controlled sensing have instantaneous costs that are explicit functions of the belief; as a result, the cost expressed in terms of the belief state is nonlinear. The third example discusses POMDPs in social learning, where the observation probabilities are an explicit function of the belief and local and global decision-makers interact. The chapter concludes with a discussion of risk-averse MDPs and POMDPs, which are of recent interest in mathematical finance.
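To make the second example concrete, the following minimal sketch shows an HMM filter belief update together with an instantaneous cost that is an explicit nonlinear function of the belief. The transition matrix, the per-sensor observation matrices and the choice of entropy as the belief-dependent cost are illustrative assumptions, not the chapter's specific formulation:

```python
import numpy as np

def hmm_filter(belief, P, B, y, u):
    """One step of the HMM filter: predict with transition matrix P,
    then correct with the likelihoods B[u][:, y] of the sensor
    (action) u chosen at this step; return the normalized belief."""
    predicted = P.T @ belief
    unnormalized = B[u][:, y] * predicted
    return unnormalized / unnormalized.sum()

def belief_entropy_cost(belief):
    """A cost that is an explicit, nonlinear function of the belief:
    the entropy of the posterior (larger = less informative belief)."""
    p = belief[belief > 0]
    return -np.sum(p * np.log(p))
```

Because the cost depends on the belief nonlinearly (unlike the linear expected-cost structure of Chapter 7), the value function loses the standard piecewise-linear representation, which is what makes this class of POMDPs distinctive.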
State and sensor control for state space models
Consider state and measurement control for the state space model discussed in Chapter 2.
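For intuition, measurement control of a linear Gaussian model can be sketched as a myopic (one-step) sensor schedule driven by the Kalman filter's Riccati recursion: since the error covariance does not depend on the observed data, the cost of each sensor choice can be evaluated in advance. The greedy trace criterion and all numerical structure below are illustrative assumptions, not the chapter's derivation:

```python
import numpy as np

def riccati_update(Sigma, A, Q, C, R):
    """Kalman filter covariance update: predict with dynamics (A, Q),
    then correct with sensor model (C, R)."""
    pred = A @ Sigma @ A.T + Q
    S = C @ pred @ C.T + R
    K = pred @ C.T @ np.linalg.inv(S)
    return pred - K @ C @ pred

def greedy_sensor(Sigma, A, Q, sensors):
    """Myopic schedule: pick the sensor minimizing its usage cost
    plus the resulting posterior uncertainty (trace of covariance).
    Each sensor is a tuple (C, R, cost)."""
    best_u, best_val, best_Sigma = None, np.inf, None
    for u, (C, R, cost) in enumerate(sensors):
        Sigma_next = riccati_update(Sigma, A, Q, C, R)
        val = cost + np.trace(Sigma_next)
        if val < best_val:
            best_u, best_val, best_Sigma = u, val, Sigma_next
    return best_u, best_Sigma
```

In a radar scheduling interpretation, an expensive high-resolution mode trades its usage cost against a smaller posterior covariance, so the myopic rule selects the cheap noisy mode whenever the accuracy gain does not justify the cost.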
Partially Observed Markov Decision Processes: From Filtering to Controlled Sensing, pp. 179–200. Publisher: Cambridge University Press. Print publication year: 2016.