1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Optimal Gateway Selection in Multi-domain Wireless
Networks: A Potential Game Perspective
Yang Song, Starsky H.Y. Wong, and Kang-Won Lee Wireless Networking Research Group
IBM T. J. Watson Research Center
Mobicom 2011
1
Motivation
2
Gateway Selection Game
3
Equilibrium Selective Learning
4
Performance Evaluation
5
Conclusions
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Coalition Networks with Multiple Domains
Scenario:
- Coalition networks withheterogenousgroups.
- Inter-connected via wireless links, e.g., IEEE 802.11, WiMAX, UAV, satellite, 3G/4G etc.
Example:
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Interoperability Issue
Problems:
Inter-domain communication is non-trivial for heterogenous domains
Different network protocol, security schemes, policies Security and policy
enforcement, traffic analysis
Problems:
Inter-domain communication is non-trivial for heterogenous domains
Different network protocol, security schemes, policies Security and policy
enforcement, traffic analysis
Solution:
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Interoperability Issue
Problems:
Inter-domain communication is non-trivial for heterogenous domains
Different network protocol, security schemes, policies Security and policy
enforcement, traffic analysis
Solution:
Designate
gatewaynodes
Gateways
Domain B Domain A
D1 S1
S2
D2
Gateways
Domain B
Domain C Domain A
Destination
Source
Each pair of nodes has a cost,
e.g., routing metric cost, such as
hop count, RIP, AODV etc.
Euclidean distance ETX, ETT, RTT Energy consumption etc.
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Cost Efficient Gateway Selection
Gateways
Domain B
Domain C Domain A
Destination
Source
Each pair of nodes has a cost,
e.g., routing metric cost, such as
hop count, RIP, AODV etc.
Euclidean distance ETX, ETT, RTT Energy consumption etc.
For a single domain
Intra-domain cost
Gateways
Domain B
Domain C Domain A
Destination
Source
Each pair of nodes has a cost,
e.g., routing metric cost, such as
hop count, RIP, AODV etc.
Euclidean distance ETX, ETT, RTT Energy consumption etc.
For a single domain For the network
Intra-domain cost Inter-domain backbone cost
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Cost Efficient Gateway Selection
Gateways
Domain B
Domain C Domain A
Destination
Source
Each pair of nodes has a cost,
e.g., routing metric cost, such as
hop count, RIP, AODV etc.
Euclidean distance ETX, ETT, RTT Energy consumption etc.
For a single domain For the network
Intra-domain cost + Inter-domain backbone cost
Question:Gateways
Domain B
Domain C Domain A
Destination
Source
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Challenges
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Distributed solution
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Challenges
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Each domain may designate gateway for its own benefit (self-interested / lack of coordination)
Distributed solution
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Each domain may designate gateway for its own benefit (self-interested / lack of coordination)
Distributed solution Equilibrium efficiency
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Challenges
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Each domain may designate gateway for its own benefit (self-interested / lack of coordination)
Reluctance in revealing its own intra-domain topology
Distributed solution Equilibrium efficiency
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Each domain may designate gateway for its own benefit (self-interested / lack of coordination)
Reluctance in revealing its own intra-domain topology
Distributed solution Equilibrium efficiency Local information only
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Challenges
Gateways
Domain B
Domain C Domain A
Destination
Source
Combinatorial nature of solution space
Each domain may designate gateway for its own benefit (self-interested / lack of coordination)
Reluctance in revealing its own intra-domain topology
Distributed solution Equilibrium efficiency Local information only
M : the set of domains in the coalition network
Nm: the set of nodes in the domain
gmi = 1: node i is selected as the gateway node and gmi = 0 o.w. and bim= argmaxi∈Nmgmi be the selected gateway node gm= {gm1, gm2,· · · , gm|Nm|}: the gateway selection strategyof
s = {g1,g2,· · · , g|M|}: the jointgateway selection profileof the network Satellite/UAV/3G/4G link:
cost η (expensive), to enforce always-on connectivity A pair of node i and j:
c(i, j) ≥ 0 is the associated symmetric link cost, c(i, j) = η if out of range
′ , min (c (i, j) , η)
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Gateway Selection Game
For each single domain
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Gateways
Domain B
Domain C Domain A
Destination
Source
Player: each domain m ∈ M Strategy space: Nm
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Gateway Selection Game
For each single domain
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Gateways
Domain B
Domain C Domain A
Destination
Source
Player: each domain m ∈ M Strategy space: Nm
Questions
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Gateways
Domain B
Domain C Domain A
Destination
Source
Player: each domain m ∈ M Strategy space: Nm
Questions
– Agreement? ⇐⇒Existenceof NE
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Gateway Selection Game
For each single domain
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Gateways
Domain B
Domain C Domain A
Destination
Source
Player: each domain m ∈ M Strategy space: Nm
Questions
– Agreement? ⇐⇒Existenceof NE – Performance? ⇐⇒Efficiencyof NE
Minimize (Localinformation and observation only)
Um(gm,g−m) = X
i6= bim,i ∈Nm
c i,ibm
+ X
n6=m,n∈M
c′ ibm, bin
(1)
Gateways
Domain B
Domain C Domain A
Destination
Source
Player: each domain m ∈ M Strategy space: Nm
Questions
– Agreement? ⇐⇒Existenceof NE – Performance? ⇐⇒Efficiencyof NE
For overall network
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Existence of Nash Equilibrium
Theorem
The gateway selection game has a Nash equilibrium, which
minimizes, either locally or globally, the following function
F(s) = X
m
X
i6= bim,i∈Nm
c
i , b i
m+ X
(
ibm, bin)
∈CCG(s)c
′b
i
m, b i
n. (3)
Theorem
The gateway selection game has a Nash equilibrium, which
minimizes, either locally or globally, the following function
F(s) = X
m
X
i6= bim,i∈Nm
c
i , b i
m+ X
(
ibm, bin)
∈CCG(s)c
′b
i
m, b i
n. (3)
Nash equilibrium may not be unique
Multiple Nash equilibria have different performance
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Existence of Nash Equilibrium
Theorem
The gateway selection game has a Nash equilibrium, which
minimizes, either locally or globally, the following function
F(s) = X
m
X
i6= bim,i∈Nm
c
i , b i
m+ X
(
ibm, bin)
∈CCG(s)c
′b
i
m, b i
n. (3)
Nash equilibrium may not be unique
Multiple Nash equilibria have different performance
To capture the (in)efficiency of Nash equilibrium,Price of Anarchy andPrice of Stabilityare introduced
For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Efficiency of Nash Equilibria
For |M| = 2
For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.
For |M| ≥ 3
For |M| ≥ 3, if the link cost metric c(a, b) satisfies the triangle inequality, the price of stability is always 1.
For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.
For |M| ≥ 3
For |M| ≥ 3, if the link cost metric c(a, b) satisfies the triangle inequality, the price of stability is always 1.
All else
If the triangle inequality does not hold, the price of stability of an
|M|-player gateway selection game is at most (1 + δ), where
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
B-logit: Binary Logit Algorithm
B-logit: For every time slot t:
Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
B-logit: Binary Logit Algorithm
B-logit: For every time slot t:
Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.
Denote the current gateway selection of domain m as gm(t).
Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as
Pr (gm(t + 1) =gfm) (5)
= exp−Um(gfm,g−m(t))/τ
exp−Um(gfm,g−m(t))/τ+ exp−Um(gm(t),g−m(t))/τ and
Pr (gm(t + 1) = gm(t)) = 1− Pr (gm(t + 1) =gfm) (6) where τ is a small positive constant, a.k.a., thesmoothing factor
Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.
Denote the current gateway selection of domain m as gm(t).
Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as
Pr (gm(t + 1) =gfm) (5)
= exp−Um(gfm,g−m(t))/τ
exp−Um(gfm,g−m(t))/τ+ exp−Um(gm(t),g−m(t))/τ and
Pr (gm(t + 1) = gm(t)) = 1− Pr (gm(t + 1) =gfm) (6) where τ is a small positive constant, a.k.a., thesmoothing factor
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Proof (sketch)
1,1
x y x y1,2 x y1,3 x y1,c l×
⋯ ⋯
c l,c l
x×y×
,2
xc l×y xc l×,y3
⋯ ⋯
2,1
x y
3,1
x y
,1
xc l×y
2,2
x y x y2,3 ⋯ ⋯ x y2,c l×
Note Pr (s′→ s′′)
1
|M| 1
|Nm|
exp−U(s′′ )/τ
exp−Um(gm ,g−m(t)f )/τ+ exp−Um(gm (t),g−m(t))/τ Verify
π(s′) = exp−F (s′)/τ P
s∈Sexp−F (s)/τ satisfies thedetailed balance equation, i.e., π(s′) Pr (s′→ s′′) = π(s′′) Pr (s′′→ s′) B-logitalgorithm induces a reversible,
irreducible, and aperiodic Markov chain and it is theuniquesteady state distribution.
By taking τ→ 0, we have
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Generalization of B-logit
γ-logit algorithm family (Γ):
γ-logit shares the same structure as B-logit except in (5), where the probability is calculated as
Pr (gm(t + 1) =gfm) = exp−Um(gfm,g−m(t))/τ
γ(s′,s′′) (7) where s′={gm(t), g−m(t)} and s′′={fgm,g−m(t)} are two gateway selection profiles inS, and γ satisfies
1 Symmetry
γ(s′,s′′) = γ(s′′,s′),∀s′∈ S, s′′∈ S,
2 Feasibility γ(s′,s′′)≥ max
exp−Um(s′)/τ,exp−Um(s′′)/τ . B-logitis aspecial caseof γ-logit algorithm with
converging to the global minimizer of the potential function
asymptotically.
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Theorem
Every γ-logit algorithm in Γ is equilibrium selective, i.e.,
converging to the global minimizer of the potential function
asymptotically.
Which isbetter?
converging to the global minimizer of the potential function
asymptotically.
Which isbetter?
Each γ-logit algorithm induces a Markov chain with different transition probability matrix, where
Pi,j(γ), Pr si → sj
= 1
|M|
1
|Nm|
exp−U (sj)/τ γ(si,sj)
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Theorem
Every γ-logit algorithm in Γ is equilibrium selective, i.e.,
converging to the global minimizer of the potential function
asymptotically.
Which isbetter?
Each γ-logit algorithm induces a Markov chain with different transition probability matrix, where
Pi,j(γ), Pr si → sj
= 1
|M|
1
|Nm|
exp−U (sj)/τ γ(si,sj)
The mixing rate of a Markov chain is determined by the second largest eigenvalue modulus (SLEM), i.e.,
For every time slot t:
Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.
Denote the current gateway selection of domain m as gm(t). Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as
Pr (gm(t + 1) =gfm) = exp−Um(gfm,g−m(t))/τ
max (exp−Um(s′)/τ,exp−Um(s′′)/τ).
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Solution: MAX-logit Algorithm
MAX-logit:
For every time slot t:
Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.
Denote the current gateway selection of domain m as gm(t). Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as
Pr (gm(t + 1) =gfm) = exp−Um(gfm,g−m(t))/τ
max (exp−Um(s′)/τ,exp−Um(s′′)/τ). Denote µMAX as the second largest eigenvalue modulus associated with MAX-logit algorithm.
Theorem
|M| domains where each domain has |N | nodes
For each domain, nodes are randomly deployed in a round area with radius 125m, centered at a random point within the square field of 1000× 1000m2
Link cost:
1 Euclidean distance: Network optimum solution is the best Nash (γ-logit algorithms converge to the network optimum solution)
2 Random cost: γ-logit algorithm converges to the approximate 1 + δ solution (Nash equilibrium)
3 Randomly select p% of the links in the network and add random cost offset which is uniformly distributed between 0 and 5% of the original cost
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Euclidean Distance Scenarios
p% = 0%
2, 3, 4 domains where each domain has 20 nodes
0 20 40 60 80 100
3000 3100 3200 3300 3400
Iteration steps
Global network cost
MAX−logit B−logit
OPT
0 50 100 150 200
5000 5500 6000 6500
Iteration steps
Global network cost
MAX−logit B−logit
OPT
0 50 100 150 200
8000 8500 9000 9500 10000 10500
Iteration steps
Global network cost
MAX−logit B−logit
OPT
2, 3, 4 domains where each domain has 20 nodes
0 20 40 60 80 100
3000 3100 3200 3300 3400
Iteration steps
Global network cost
MAX−logit B−logit
OPT
0 50 100 150 200
5000 5500 6000 6500
Iteration steps
Global network cost
MAX−logit B−logit
OPT
0 50 100 150 200
8000 8500 9000 9500 10000 10500
Iteration steps
Global network cost
MAX−logit B−logit
OPT
Nodes per domain 2 domains 3 domains 4 domains
5 nodes 16.06% 24.52% 33.85%
10 nodes 25.00% 29.81% 28.55%
20 nodes 11.96% 20.19% 20.36%
30 nodes 5.87% 16.46% 17.60%
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Random Cost Scenarios
p= 50, i.e., 50% of the links in the network are associated with random link cost
2, 3, 4 domains where each domain has 20 nodes
0 50 100 150 200
2500 3000 3500 4000 4500 5000
Iteration steps
Global network cost
OPT B−logit MAX−logit
0 50 100 150 200
4000 4500 5000 5500 6000
Iteration steps
Global network cost
BOUND
OPT
MAX−logit B−logit
0 50 100 150 200
6000 7000 8000 9000 10000 11000 12000
Iteration steps
Global network cost
BOUND
OPT
MAX−logit
B−logit
cost
2, 3, 4 domains where each domain has 20 nodes
0 50 100 150 200
2500 3000 3500 4000 4500 5000
Iteration steps
Global network cost
OPT B−logit MAX−logit
0 50 100 150 200
4000 4500 5000 5500 6000
Iteration steps
Global network cost
BOUND
OPT
MAX−logit B−logit
0 50 100 150 200
6000 7000 8000 9000 10000 11000 12000
Iteration steps
Global network cost
BOUND
OPT
MAX−logit
B−logit
Nodes per domain 2 domains 3 domains 4 domains
5 nodes 21.84% 24.46% 27.38%
10 nodes 21.00% 21.44% 21.56%
20 nodes 9.54% 9.13% 5.47%
1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions
Conclusions
Interactive gateway selection by multiple domains in coalition networks
In a potential game framework, the existence and inefficiency of Nash equilibria are characterized (two domains, multi-domains) Equilibrium selective learning: generalized B-logit into γ-logit, or Γ
Propose MAX-logit which converges to the best Nash equilibrium at the fastest speed in Γ
Other applications of potential games in power control, channel allocation, spectrum sharing content distribution etc.