• No results found

Optimal Gateway Selection in Multi-domain Wireless Networks: A Potential Game Perspective

N/A
N/A
Protected

Academic year: 2021

Share "Optimal Gateway Selection in Multi-domain Wireless Networks: A Potential Game Perspective"

Copied!
51
0
0

Loading.... (view fulltext now)

Full text

(1)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Optimal Gateway Selection in Multi-domain Wireless

Networks: A Potential Game Perspective

Yang Song, Starsky H.Y. Wong, and Kang-Won Lee Wireless Networking Research Group

IBM T. J. Watson Research Center

Mobicom 2011

(2)

1

Motivation

2

Gateway Selection Game

3

Equilibrium Selective Learning

4

Performance Evaluation

5

Conclusions

(3)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Coalition Networks with Multiple Domains

Scenario:

- Coalition networks withheterogenousgroups.

- Inter-connected via wireless links, e.g., IEEE 802.11, WiMAX, UAV, satellite, 3G/4G etc.

Example:

(4)
(5)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Interoperability Issue

Problems:

Inter-domain communication is non-trivial for heterogenous domains

Different network protocol, security schemes, policies Security and policy

enforcement, traffic analysis

(6)

Problems:

Inter-domain communication is non-trivial for heterogenous domains

Different network protocol, security schemes, policies Security and policy

enforcement, traffic analysis

Solution:

(7)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Interoperability Issue

Problems:

Inter-domain communication is non-trivial for heterogenous domains

Different network protocol, security schemes, policies Security and policy

enforcement, traffic analysis

Solution:

Designate

gateway

nodes

Gateways

Domain B Domain A

D1 S1

S2

D2

(8)

Gateways

Domain B

Domain C Domain A

Destination

Source

Each pair of nodes has a cost,

e.g., routing metric cost, such as

hop count, RIP, AODV etc.

Euclidean distance ETX, ETT, RTT Energy consumption etc.

(9)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Cost Efficient Gateway Selection

Gateways

Domain B

Domain C Domain A

Destination

Source

Each pair of nodes has a cost,

e.g., routing metric cost, such as

hop count, RIP, AODV etc.

Euclidean distance ETX, ETT, RTT Energy consumption etc.

For a single domain

Intra-domain cost

(10)

Gateways

Domain B

Domain C Domain A

Destination

Source

Each pair of nodes has a cost,

e.g., routing metric cost, such as

hop count, RIP, AODV etc.

Euclidean distance ETX, ETT, RTT Energy consumption etc.

For a single domain For the network

Intra-domain cost Inter-domain backbone cost

(11)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Cost Efficient Gateway Selection

Gateways

Domain B

Domain C Domain A

Destination

Source

Each pair of nodes has a cost,

e.g., routing metric cost, such as

hop count, RIP, AODV etc.

Euclidean distance ETX, ETT, RTT Energy consumption etc.

For a single domain For the network

Intra-domain cost + Inter-domain backbone cost

Question:

(12)

Gateways

Domain B

Domain C Domain A

Destination

Source

(13)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Challenges

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

(14)

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Distributed solution

(15)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Challenges

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Each domain may designate gateway for its own benefit (self-interested / lack of coordination)

Distributed solution

(16)

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Each domain may designate gateway for its own benefit (self-interested / lack of coordination)

Distributed solution Equilibrium efficiency

(17)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Challenges

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Each domain may designate gateway for its own benefit (self-interested / lack of coordination)

Reluctance in revealing its own intra-domain topology

Distributed solution Equilibrium efficiency

(18)

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Each domain may designate gateway for its own benefit (self-interested / lack of coordination)

Reluctance in revealing its own intra-domain topology

Distributed solution Equilibrium efficiency Local information only

(19)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Challenges

Gateways

Domain B

Domain C Domain A

Destination

Source

Combinatorial nature of solution space

Each domain may designate gateway for its own benefit (self-interested / lack of coordination)

Reluctance in revealing its own intra-domain topology

Distributed solution Equilibrium efficiency Local information only

(20)

M : the set of domains in the coalition network

Nm: the set of nodes in the domain

gmi = 1: node i is selected as the gateway node and gmi = 0 o.w. and bim= argmaxi∈Nmgmi be the selected gateway node gm= {gm1, gm2,· · · , gm|Nm|}: the gateway selection strategyof

s = {g1,g2,· · · , g|M|}: the jointgateway selection profileof the network Satellite/UAV/3G/4G link:

cost η (expensive), to enforce always-on connectivity A pair of node i and j:

c(i, j) ≥ 0 is the associated symmetric link cost, c(i, j) = η if out of range

, min (c (i, j) , η)

(21)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Gateway Selection Game

For each single domain

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

(22)

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

Gateways

Domain B

Domain C Domain A

Destination

Source

Player: each domain m ∈ M Strategy space: Nm

(23)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Gateway Selection Game

For each single domain

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

Gateways

Domain B

Domain C Domain A

Destination

Source

Player: each domain m ∈ M Strategy space: Nm

Questions

(24)

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

Gateways

Domain B

Domain C Domain A

Destination

Source

Player: each domain m ∈ M Strategy space: Nm

Questions

– Agreement? ⇐⇒Existenceof NE

(25)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Gateway Selection Game

For each single domain

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

Gateways

Domain B

Domain C Domain A

Destination

Source

Player: each domain m ∈ M Strategy space: Nm

Questions

– Agreement? ⇐⇒Existenceof NE – Performance? ⇐⇒Efficiencyof NE

(26)

Minimize (Localinformation and observation only)

Um(gm,g−m) = X

i6= bim,i ∈Nm

c i,ibm

+ X

n6=m,n∈M

c ibm, bin

 (1)

Gateways

Domain B

Domain C Domain A

Destination

Source

Player: each domain m ∈ M Strategy space: Nm

Questions

– Agreement? ⇐⇒Existenceof NE – Performance? ⇐⇒Efficiencyof NE

For overall network

(27)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Existence of Nash Equilibrium

Theorem

The gateway selection game has a Nash equilibrium, which

minimizes, either locally or globally, the following function

F(s) = X

m

X

i6= bim,i∈Nm

c 

i , b i

m



+ X

(

ibm, bin

)

∈CCG(s)

c



b

i

m

, b i

n



. (3)

(28)

Theorem

The gateway selection game has a Nash equilibrium, which

minimizes, either locally or globally, the following function

F(s) = X

m

X

i6= bim,i∈Nm

c 

i , b i

m



+ X

(

ibm, bin

)

∈CCG(s)

c



b

i

m

, b i

n



. (3)

Nash equilibrium may not be unique

Multiple Nash equilibria have different performance

(29)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Existence of Nash Equilibrium

Theorem

The gateway selection game has a Nash equilibrium, which

minimizes, either locally or globally, the following function

F(s) = X

m

X

i6= bim,i∈Nm

c 

i , b i

m



+ X

(

ibm, bin

)

∈CCG(s)

c



b

i

m

, b i

n



. (3)

Nash equilibrium may not be unique

Multiple Nash equilibria have different performance

To capture the (in)efficiency of Nash equilibrium,Price of Anarchy andPrice of Stabilityare introduced

(30)

For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.

(31)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Efficiency of Nash Equilibria

For |M| = 2

For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.

For |M| ≥ 3

For |M| ≥ 3, if the link cost metric c(a, b) satisfies the triangle inequality, the price of stability is always 1.

(32)

For two player gateway selection games, the best Nash Equilibrium is the global network optimum solution, i.e., the price of stability is 1.

For |M| ≥ 3

For |M| ≥ 3, if the link cost metric c(a, b) satisfies the triangle inequality, the price of stability is always 1.

All else

If the triangle inequality does not hold, the price of stability of an

|M|-player gateway selection game is at most (1 + δ), where

 

(33)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

B-logit: Binary Logit Algorithm

B-logit: For every time slot t:

(34)

Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.

(35)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

B-logit: Binary Logit Algorithm

B-logit: For every time slot t:

Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.

Denote the current gateway selection of domain m as gm(t).

Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as

Pr (gm(t + 1) =gfm) (5)

= exp−Um(gfm,gm(t))

exp−Um(gfm,gm(t))+ exp−Um(gm(t),gm(t)) and

Pr (gm(t + 1) = gm(t)) = 1− Pr (gm(t + 1) =gfm) (6) where τ is a small positive constant, a.k.a., thesmoothing factor

(36)

Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.

Denote the current gateway selection of domain m as gm(t).

Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as

Pr (gm(t + 1) =gfm) (5)

= exp−Um(gfm,gm(t))

exp−Um(gfm,gm(t))+ exp−Um(gm(t),gm(t)) and

Pr (gm(t + 1) = gm(t)) = 1− Pr (gm(t + 1) =gfm) (6) where τ is a small positive constant, a.k.a., thesmoothing factor

(37)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Proof (sketch)

1,1

x y x y1,2 x y1,3 x y1,c l×

⋯ ⋯

c l,c l

x×y×

,2

xc l×y xc l×,y3

⋯ ⋯

2,1

x y

3,1

x y

,1

xc l×y

2,2

x y x y2,3 ⋯ ⋯ x y2,c l×

Note Pr (s→ s′′)

1

|M| 1

|Nm|

exp−U(s′′ )/τ

exp−Um(gm ,g−m(t)f )+ exp−Um(gm (t),g−m(t)) Verify

π(s) = exp−F (s)/τ P

s∈Sexp−F (s)/τ satisfies thedetailed balance equation, i.e., π(s) Pr (s→ s′′) = π(s′′) Pr (s′′→ s) B-logitalgorithm induces a reversible,

irreducible, and aperiodic Markov chain and it is theuniquesteady state distribution.

By taking τ→ 0, we have

(38)
(39)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Generalization of B-logit

γ-logit algorithm family (Γ):

γ-logit shares the same structure as B-logit except in (5), where the probability is calculated as

Pr (gm(t + 1) =gfm) = exp−Um(gfm,g−m(t))

γ(s,s′′) (7) where s={gm(t), g−m(t)} and s′′={fgm,g−m(t)} are two gateway selection profiles inS, and γ satisfies

1 Symmetry

γ(s,s′′) = γ(s′′,s),∀s∈ S, s′′∈ S,

2 Feasibility γ(s,s′′)≥ max

exp−Um(s)/τ,exp−Um(s′′)/τ . B-logitis aspecial caseof γ-logit algorithm with

(40)

converging to the global minimizer of the potential function

asymptotically.

(41)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Theorem

Every γ-logit algorithm in Γ is equilibrium selective, i.e.,

converging to the global minimizer of the potential function

asymptotically.

Which isbetter?

(42)

converging to the global minimizer of the potential function

asymptotically.

Which isbetter?

Each γ-logit algorithm induces a Markov chain with different transition probability matrix, where

Pi,j(γ), Pr si → sj

= 1

|M|

1

|Nm|

exp−U (sj)/τ γ(si,sj)

(43)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Theorem

Every γ-logit algorithm in Γ is equilibrium selective, i.e.,

converging to the global minimizer of the potential function

asymptotically.

Which isbetter?

Each γ-logit algorithm induces a Markov chain with different transition probability matrix, where

Pi,j(γ), Pr si → sj

= 1

|M|

1

|Nm|

exp−U (sj)/τ γ(si,sj)

The mixing rate of a Markov chain is determined by the second largest eigenvalue modulus (SLEM), i.e.,



(44)

For every time slot t:

Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.

Denote the current gateway selection of domain m as gm(t). Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as

Pr (gm(t + 1) =gfm) = exp−Um(gfm,gm(t))

max (exp−Um(s)/τ,exp−Um(s′′)/τ).

(45)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Solution: MAX-logit Algorithm

MAX-logit:

For every time slot t:

Randomly select one of the players, say m, to update its gateway selection while other domains remain unchanged.

Denote the current gateway selection of domain m as gm(t). Domain m randomly selects a node in its domain as the gateway candidate. Denote the candidate gateway selection strategy bygfm. Domain m updates as

Pr (gm(t + 1) =gfm) = exp−Um(gfm,gm(t))

max (exp−Um(s)/τ,exp−Um(s′′)/τ). Denote µMAX as the second largest eigenvalue modulus associated with MAX-logit algorithm.

Theorem

(46)

|M| domains where each domain has |N | nodes

For each domain, nodes are randomly deployed in a round area with radius 125m, centered at a random point within the square field of 1000× 1000m2

Link cost:

1 Euclidean distance: Network optimum solution is the best Nash (γ-logit algorithms converge to the network optimum solution)

2 Random cost: γ-logit algorithm converges to the approximate 1 + δ solution (Nash equilibrium)

3 Randomly select p% of the links in the network and add random cost offset which is uniformly distributed between 0 and 5% of the original cost

(47)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Euclidean Distance Scenarios

p% = 0%

2, 3, 4 domains where each domain has 20 nodes

0 20 40 60 80 100

3000 3100 3200 3300 3400

Iteration steps

Global network cost

MAX−logit B−logit

OPT

0 50 100 150 200

5000 5500 6000 6500

Iteration steps

Global network cost

MAX−logit B−logit

OPT

0 50 100 150 200

8000 8500 9000 9500 10000 10500

Iteration steps

Global network cost

MAX−logit B−logit

OPT

(48)

2, 3, 4 domains where each domain has 20 nodes

0 20 40 60 80 100

3000 3100 3200 3300 3400

Iteration steps

Global network cost

MAX−logit B−logit

OPT

0 50 100 150 200

5000 5500 6000 6500

Iteration steps

Global network cost

MAX−logit B−logit

OPT

0 50 100 150 200

8000 8500 9000 9500 10000 10500

Iteration steps

Global network cost

MAX−logit B−logit

OPT

Nodes per domain 2 domains 3 domains 4 domains

5 nodes 16.06% 24.52% 33.85%

10 nodes 25.00% 29.81% 28.55%

20 nodes 11.96% 20.19% 20.36%

30 nodes 5.87% 16.46% 17.60%

(49)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Random Cost Scenarios

p= 50, i.e., 50% of the links in the network are associated with random link cost

2, 3, 4 domains where each domain has 20 nodes

0 50 100 150 200

2500 3000 3500 4000 4500 5000

Iteration steps

Global network cost

OPT B−logit MAX−logit

0 50 100 150 200

4000 4500 5000 5500 6000

Iteration steps

Global network cost

BOUND

OPT

MAX−logit B−logit

0 50 100 150 200

6000 7000 8000 9000 10000 11000 12000

Iteration steps

Global network cost

BOUND

OPT

MAX−logit

B−logit

(50)

cost

2, 3, 4 domains where each domain has 20 nodes

0 50 100 150 200

2500 3000 3500 4000 4500 5000

Iteration steps

Global network cost

OPT B−logit MAX−logit

0 50 100 150 200

4000 4500 5000 5500 6000

Iteration steps

Global network cost

BOUND

OPT

MAX−logit B−logit

0 50 100 150 200

6000 7000 8000 9000 10000 11000 12000

Iteration steps

Global network cost

BOUND

OPT

MAX−logit

B−logit

Nodes per domain 2 domains 3 domains 4 domains

5 nodes 21.84% 24.46% 27.38%

10 nodes 21.00% 21.44% 21.56%

20 nodes 9.54% 9.13% 5.47%

(51)

1. Motivation 2. Gateway Selection Game 3. Equilibrium Selection Learning 4. Evaluation 5. Conclusions

Conclusions

Interactive gateway selection by multiple domains in coalition networks

In a potential game framework, the existence and inefficiency of Nash equilibria are characterized (two domains, multi-domains) Equilibrium selective learning: generalized B-logit into γ-logit, or Γ

Propose MAX-logit which converges to the best Nash equilibrium at the fastest speed in Γ

Other applications of potential games in power control, channel allocation, spectrum sharing content distribution etc.

References

Related documents

(2005) avaliaram o efeito de três ciclos de SRR sobre as médias, as variâncias  genéticas  e  as  correlações  de  várias  características.  Para 

O ideótipo de máxima adaptabilidade geral representa a planta ideal, utilizada para comparação com os acessos do BAG na projeção dos componentes principais, e que

Again, while physical violence shows negative and significant effect on increasing nutritional level among women in UP, Assam, Gujarat, AP, Karnataka, Kerala and Tamil Nadu;

Based on her acute study of social development in Rajasthan, Mather summarizes, “It is fairly obvious that the government only responds to broad social consensus.” 3

In order to address these two issues, we have generated human CTLA-4 gene knock-in mice and used them to compare a panel of anti-human CTLA-4 antibodies for their ability to

“Human Resource Outsourcing: Measuring the Hidden Costs” at the Western Decision Sciences Annual Meeting, Kauai, April, 2009. “Human Resource Outsourcing: Prediction of

Neque porro quisquam est, qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam

“Significant restrictions on access to bank information for tax purposes remain in three OECD countries (Austria, Luxembourg and Switzerland) and in a number of offshore