CHAPTER 4. Design and Analysis of Bandwidth Reservation Game in
4.8 Bayesian Nash Equilibrium
4.8.2 Analysis of Bayesian Nash Equilibrium
It is well known that an equilibrium point may not exist. In the subsection, we are interested in investigating the existence and uniqueness of a BNE in our resource allocation game.
To show the existence of BNE, we need to show that the strategy set of each player is convex, compact and nonempty (79). Moreover, the utility function is concave on the strategy set. The strategy set of each player in our game is nonempty since every admitted connection allows to request bandwidth. Additionally, bi ⊆ R. Thus, the strategy set is convex and compact. Now we want to show that the utility function is continuous and concave on both bi and b−i. Our utility function comprises SI and P I. It is easy to show that both SI and P I are continuous on bi and b−i. Thus, the utility function is continuous on both bi and b−i. As shown in Section 4.7, P I for all scheduling classes is modeled based on an exponential function. It is clear to conclude that P I is concave for the payoff function.
We show the concavity of SI by the following Lemma.
Lemma 1: In the proposed game, the SI for all scheduling classes is concave.
proof: As shown in equation (4.14), the SI for nrtPS and BE connections is modeled by a
logarithm function. Thus, it is clear to conclude that the SI for nrtPS and BE connections is concave. We now focus on the SI for rtPS connections. As shown in equation (4.10), the SI for rtPS connections is modeled by a sigmoid function. Additionally, the infection point of the sigmoid function is based on the M SR of the connection. It separates SI for SS i with a rtPS connection as below:
SIi(bi) :
convex, 0 ≤ bi≤ MSRirt
concave, M SRrti ≤ bi <∞
According to our admission control policy, each SS should receive the guaranteed bandwidth until its M SR is reached. Because of this policy, in our design, the P I of all scheduling classes is zero before the M SR is reached. Moreover, the objective of each SS tries to maximize its payoff which is SI − P I. Therefore, the SS must request bandwidth which is larger or equal to its M SR. Consequently, we can limit our consideration only to the concave part of the sigmoid function and conclude that the SI for rtPS connections is also concave in the proposed game. In summary, the SI for all scheduling classes is concave in the proposed game.
Based on Lemma 1 and description above, the proof for the existence of BNE is com-pleted. Now, we investigate the uniqueness of BNE. We rely on a sufficient condition: a non-cooperative game has a unique equilibrium if the nonnegative weighted sum of the payoff function is diagonally strictly concave (79).
Definition 2. (Diagonally Strictly Concave) A weighted sum function h(x, r) :=Pn
i=1riζ(x) is called diagonally strictly concave for all vector x ∈ Rn×1 and fixed vector r ∈ Rn×1, if for any two different vectors x0, x1, we have
Λ(x0, x1, r), (x1− x0)Tσ(x0, r) + (x0− x1)Tσ(x1, r) > 0 where σ(x, r) is called pseudo-gradient of f (x, r), defined as:
σ(x,r),
Definition 1 states that the BNE is obtained when all players obtain their best strategies by giving the strategies of other SSs such that their expected payoffs are maximized. Ad-ditionally, as stated in Definition 2, the payoff is calculated as the expected value of utility function with the corresponding event probability. In this paper, the event probability is considered as the scheduling class of the connection running on the SS. According to IEEE 802.16 standard, the scheduling class has to be determine during admission control proce-dure and it cannot be changed after creation of the connection. Consequently, this makes each SS only have one event with probability of 1. Therefore, the description in Definition 1 and 2 match our utility function shown in equation (4.9).
Lemma 2: The weighted nonnegative sum of average payoff ui in the proposed game is diagonally strictly concave.
proof: We present the weighted nonnegative sum of the average payoff as:
hn(b, r),
n]T be the pseudo-gradient of hn((b, r)). Each SS i ∈ N serves a connection belonging to one of scheduling classes. Sup-pose brti , bnrti and bbei are the amount of bandwidth for the connection in rtPS, nrtPS and BE, respectively. Note that each connection only has one scheduling class. Thus, only one of bti, t∈ {rt, nrt, be} can be a positive number and the rest of them must be zero. Suppose SSi serves a rtPS connection, then the average payoff ui can be actually transformed into
a weighted sum function as follows
where γ represents the index for different jointly probability events with corresponding probability wγ. Similarly, the average payoff function can be presented as following if the connection belongs to nrtPS or BE.
ui(bs, bs−) =X
where s ∈ {nrt, be}. Now, we can write the pseudo-gradient σn as:
σn(b, r) =
To check the diagonally strictly concave, we let b0, b1 be two different vectors and define
Λ(b0, b1, r), (b1− b0)Tσn(b0, r) + (b0− b1)Tσn(b1, r) (4.26)
In Lemma 1, we have shown that the SI is concave for both nrtPS and BE connection.
Moreover, for rtPS connection, we also proved that the SS never request bandwidth less than the M SR of the connection. Therefore, only concave part of SI needs to be considered.
Without loss of generality, we assume b1i > b0i. It can lead to (∂SI(b∂b00i)
i −∂SI(b∂b01i)
i ) ≥ 0. P I for all scheduling classes is modeled by the exponential function which is strictly convex to the amount of reserved bandwidth. Thus, we can have (∂P I(b∂b11i)
i − ∂P I(b∂b10i)
i
) > 0. It results λ(b0i, b1i, ri) > 0. Consequently, we can conclud:
Λ(b0, b1, r) =X
∀i
λ(b0i, b1i, ri) > 0 (4.28) It shows that the weighted nonnegative sum of average payoff is diagonally strictly concave.
The proof for uniqueness of BNE is completed.