CHAPTER 3 : Optimal Power Management for Wireless Sensor-Actuator Systems
3.1 Problem Description
We consider the wireless control architecture shown in Fig. 5 deployed to control a discrete-time linear time-invariant plant described by the difference equation
xk+1=Axk+Buk+wk, k≥0, (3.1)
where xk ∈ Rn is the plant’s state with x0 given,uk ∈ Rm the driving input, and{wk,k ≥ 0}is
the process noise composed of independent identically distributed (i.i.d)n-dimensional Gaussian random variableswk∼ N0,W with zero mean and covarianceW. We assume the plant is unstable
(λmax(A)>1) but that(A,B)is stabilizable.
The wireless sensor collects state measurementsxkthat it communicates with powerpk∈[0,pmax]
over a wireless fading channel with coefficienthk. At the other side of the channel the receiver/
controller uses the received information to determine a control inputuk that it feedbacks into the
plant.
The channel fading state hk,k ≥ 0 is modeled as an i.i.d. process (block fading cf. 2.2) with
some known probability distributionφh on the positive realsR+, independent of the plant pro-
cess noise {wk,k ≥ 0}. We make the technical assumption that the distribution of the channel
state is absolutely continuous, i.e., has a probability density function onR+. To allow for trans-
missions adapted to the current channel conditions the transmitter has access to the channel state informationhk before transmitting at timek.
Following the wireless channel modeled introduced in 2.2, the probability of successfully decoding the message at the receiver/controller at timek, denoted hereby byqk, is a function of the current
channel statehk as well as the current transmit powerpk,
qk =q(hk,pk). (3.2)
The typical form of this function is depicted in Fig. 2. We assume that q(h,p) is a known increasing function of the producth p. We denote the success of transmission at each time stepk by an indicator variableγk ∈ {0, 1}. Variablesγk ∼Bern(qk)are Bernoulli distributed with time-
varying success probabilities qk given by (3.2). With this communication model the controller
Plant: xk+1=Axk+Buk+wk Sensor/ Transmitter z−1 Channel: qk=q(hk,pk) γk∼Bern(qk) yk=γkxk Receiver/ Controller xk xk,pk yk,γk uk hk wk γk γk−1
Figure 5: Wireless control system architecture. A sensor measures the plant and wireless fading channel statesxk,hkrespectively and transmits with powerpk. Messages are successfully decoded
at the controller with probability qk that depends on the channel statehk and the powerpk. The
sensor receives acknowledgments with a one-step delay.
assume that the controller also getsγk so that it can distinguish between the cases xk = 0 and γk = 0. Packet receipt acknowledgmentγk is also sent to the sensor as provided by 802.11 and
TCP protocols. We assume lossless acknowledgments, so that the sensor knows what information is received at the controller.
The problem addressed in this chapter is the joint design of the control inputsukand the transmit
powerspk. The control inputukis determined by the received informationy0:k,γ0:k. The powerpk
is determined as a function of the plant state measurementsx0:k, the observed channel realizations
h0:k, and the controller acknowledgments γ0:k. Informally, to conserve power at the sensor side
we want to transmit information only when the statexkdeviates from its desired value or when
the channel realizationhkis favorable. In the first case transmission is necessary to keep the plant
under control. In the latter case the transmission cost is minimal. A formal problem specification is presented in the next section after the following remarks.
To formulate the joint design of plant controller and power management we introduce an equiva- lent architecture. In view of (3.2), choosing pkis equivalent to choosing the desired probability of
successful decodingqk at time kand transmitting with the minimum required power to achieve
thisqk, namely
pk =p(hk,qk):=inf{0≤p≤ pmax:q(hk,p)≥qk}. (3.3)
We can therefore interpretqk as our decision variable withp(hk,qk)denoting the cost of selecting
transmission success probabilityqk. This leads to the equivalent control system architecture shown
in Fig. 6 where a scheduler block responsible for decidingqkreplaces the sensor/transmitter block
of Fig 5. Our formulation generalizes the simple transmit-or-not decision as considered in, e.g., Xu and Hespanha (2004).
We note for future reference that the assumed monotonicity of the functionq(h,p)on the product h pimplies that the power functionp(h,q)is increasing inqand decreasing inh. Using maximum
Plant: xk+1=Axk+Buk+wk Scheduler: decidesqk usesp(hk,qk) z−1 Channel: γk∼Bern(qk) yk=γkxk Receiver/ Controller xk xk,qk yk,γk uk hk wk γk γk−1
Figure 6: Equivalent wireless control system architecture. A scheduler decides the successful decoding probability qk and transmits the state measurement xk with the required power pk =
p(hk,qk). The controller receives the message with probabilityqk.
power pmax, the transmitter can achieve a maximum successful decoding probabilityqmax(h) :=
q(h,pmax)for a given channel state h. Therefore, the decision variablesqk belong in the interval [0,qmax(hk)]. We also make the following assumptions.
Assumption 3.1. The maximum achievable successful decoding probability qmax(h)satisfies
Ehqmax(h)>qcrit:=1−1/λmax(A)2, (3.4)
where expectation is taken over the channel distributionφh.
Assumption 3.2. For any channel realization h, the function p(h,q)in (3.3) is continuous in the successful decoding probability variable q.
Assumption 3.1 is essentially a stability condition, which as we will see in the following section states that transmitter has enough power to keep the plant state bounded in second moment, and it will be used to establish our main Theorems 3.1 and 3.2. Assumption 3.2 is of a technical nature and will be used in Theorem 3.2.
In the architecture of Fig. 6 the communication decisionqk is chosen as a function of the infor-
mation available at the sensor, while the plant control signaluk is a function of the information
available at the controller. These choices are in general allowed to be randomized. The sequence
π :={q0,q1, . . .}, or equivalently the power allocation{p0,p1, . . .}, is termed the communication
policy, whereas the sequenceθ:={u0,u1, . . .}denotes the control policy. With fixed policiesπ,θ,
all random variables are defined on an appropriate product space and have a measure that we denote asPπ,θ. We use Eπ,θ to denote integration with respect toPπ,θ, which we simplify toE
when not leading to confusion. We remark that sensor and controller know each other’s policy. The policy pair(π,θ)incurs a control cost and a communication cost. As a control cost we adopt
the standard linear quadratic regulator cost JLQRN (π,θ):=Eπ,θ N−1
∑
k=0 xTkQxk+uTkRuk, (3.5)for some pair of matricesR>0 andQ≥0, with(A,Q1/2)detectable. The communication cost is given by the expected power consumption
JPWRN (π,θ):=Eπ,θ N−1
∑
k=0p(hk,qk). (3.6)
To quantify the tradeoff between plant performance and power consumption we combine the LQR cost in (3.5) and the power cost in (3.6) into the limit aggregate cost
J(π,θ):=lim sup N→∞ 1 N h JLQRN (π,θ) +λJPWRN (π,θ) i , (3.7)
for some positive constantλ > 0. Our goal is to design plant and power control policies θ and π respectively that minimize the joint cost (3.7). These policies depend on what information is
available to the sensor and controller. The specific information structure considered is introduced next.