• No results found

Game Theory

N/A
N/A
Protected

Academic year: 2020

Share "Game Theory"

Copied!
137
0
0

Loading.... (view fulltext now)

Full text

(1)

Elise Bonzon

[email protected]

LIPADE - Universit´e Paris Descartes http://www.math-info.univ-paris5.fr/

v

bonzon/

(2)

Game Theory

M. Osborne, A. Rubinstein, A Course in Game Theory

Game theory is a bag of analytical tools designed to help us understand the phenomena that we observe when decision-makers interact. The basic assumptions that underlie the theory are that decision-makers pursue well defined exogenous objectives (that are rational) and take into account their knowledge or expectations of other decision-makers’

behavior (they reason strategically).

(3)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Belote?

(4)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

⇒ Belote?

(5)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

(6)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

⇒ Belote?

(7)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

(8)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

⇒ Belote?

(9)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

(10)

Terminology: partial taxonomy

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

⇒ Belote?

(11)

2-players games / n-players games Static games / dynamic games

Zero-sum games / Non zero-sum games

Games with perfect information / Games with imperfect information Coalitional games / Non-coalitional games

⇒ Chess?

⇒ Belote?

(12)

Bibliography

J. Von Neumann, O. Morgenstein, Theory of Games and Economic Behavior, Princeton University Press, 1944.

D.Luce, H. Raiffa, Games and Decisions: Introduction and Critical Survey, Wiley, 1957.

D. Fudemberg, J. Tirole, Game Theory, MIT Press, 1991.

M. Osborne, A. Rubinstein, A Course in Game Theory, MIT Press, 1994.

M. Yildizoglu, Introduction ` a la th´ eorie des jeux, Dunod, 2003.

(13)

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(14)

1

Formalisation of a game Definitions

Preferences

Strategic and extensive forms

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(15)

Definitions Preferences

Strategic and extensive forms

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(16)

Formalisation of a game

Formalisation of a game

A game is defined by:

Who? → Players

What? → Actions, strategies How? → Sequence of events

How much? → What is the payoff of each outcome for each player?

(17)

Definitions Preferences

Strategic and extensive forms

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(18)

Formalisation of a game

Utility

The players are rational: they want to obtain the best payoff as possible To do so, they need to know the preferences of each player over the different outcomes of the game

Utility: measurement of the “happiness” of a player for each outcome of the game. Allow us to define the preferences of players.

Example : Battle of the Sexes.

Anne and Charles wish to go together to a football game or the opera.

Their main concern is to go out together, but Anne prefers the opera and

Charles the football game.

(19)

Utility

The players are rational: they want to obtain the best payoff as possible To do so, they need to know the preferences of each player over the different outcomes of the game

Utility: measurement of the “happiness” of a player for each outcome of the game. Allow us to define the preferences of players.

Example : Battle of the Sexes.

Anne and Charles wish to go together to a football game or the opera.

Their main concern is to go out together, but Anne prefers the opera and

Charles the football game.

(20)

Formalisation of a game

Utility

Ordinal utility: ranking of the outcomes of the game Anne’s preferences: O

A

O

C



A

F

A

F

C



A

O

A

F

C



A

F

A

O

C

Cardinal utilities: associate to each outcome a numerical value Charles’ preferences: u

C

(F

A

, F

C

) = 2, u

C

(O

A

, O

C

) = 1, u

C

(O

A

, F

C

) = 0, u

C

(F

A

, O

C

) = 0

Problem: Size of the representation. Compact representation of the

preferences.

(21)

Utility

Ordinal utility: ranking of the outcomes of the game Anne’s preferences: O

A

O

C



A

F

A

F

C



A

O

A

F

C



A

F

A

O

C

Cardinal utilities: associate to each outcome a numerical value Charles’ preferences: u

C

(F

A

, F

C

) = 2, u

C

(O

A

, O

C

) = 1, u

C

(O

A

, F

C

) = 0, u

C

(F

A

, O

C

) = 0

Problem: Size of the representation. Compact representation of the

preferences.

(22)

1

Formalisation of a game Definitions

Preferences

Strategic and extensive forms

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(23)

Formalisation of a game

Strategic games

A strategic game is defined by:

A finite set of players N = {1, . . . n},

For each player i ∈ N a nonempty set of strategies S

i

= {s

i1

, . . . , s

ip

} (the set of actions available to player i )

For each player i ∈ N a utility function u

i

: S

1

× . . . × S

n

→ IR which associates a numerical value to each set of strategies.

H H H

H H 1

2 O

C

F

C

O

A

(2, 1) (0, 0)

F

A

(0, 0) (1, 2)

(24)

Formalisation of a game

Strategic games

A strategic game is defined by:

A finite set of players N = {1, . . . n},

For each player i ∈ N a nonempty set of strategies S

i

= {s

i1

, . . . , s

ip

} (the set of actions available to player i )

For each player i ∈ N a utility function u

i

: S

1

× . . . × S

n

→ IR which associates a numerical value to each set of strategies.

Example : Battle of the Sexes.

H H H

H H 1

2 O

C

F

C

O

A

(2, 1) (0, 0)

F

A

(0, 0) (1, 2)

(25)

Strategic games

s

i

is a pure strategy of the player i , that is an action plan which stipulates an action each times i has to play.

Ex: s

1

= O

A

, s

1

= F

A

s

i

is a winning strategy for the player i if it allows i to win whatever the other players do

s = {s

1

, . . . , s

n

}, where ∀i , s

i

∈ S

i

is a strategy profile. A strategy profile must include one and only one strategy for every player, and is a set of strategies for each player which fully specifies all actions in a game.

Ex: s = {O

A

F

C

}

s

−i

represents the strategy profile s except the strategy of the player i : s

−i

= {s

1

, . . . , s

i −1

, s

i +1

, . . . s

n

},

Ex: s = {F }

(26)

Formalisation of a game

Extensive games

An extensive game is defined by:

A finite set of players N = {1, . . . n}, A finite game-tree made up of

A set of nodes which represent the actions

A set of branches which represent the alternatives at each time Each terminal (leaf) node of the game tree has an n-tuple of payoffs, meaning there is one payoff for each player at the end of every possible play A partition of the non-terminal nodes of the game tree in n + 1 subsets, one for each player. Each player’s subset of nodes is referred to as the

“nodes of the player”.

Each set of nodes of a player is partitioned in information sets, which

make certain choices indistinguishable for the player when making a move

(27)

Extensive games - Battle of the sexes

A

C

(2, 1) O

(0, 0) F O

C

(0, 0) O

(1, 2) F F

Player 1

Player 2 Player 2

(28)

Formalisation of a game

Extensive games - Battle of the sexes

A

C

(2, 1) O

(0, 0) F O

C

(0, 0) O

(1, 2) F F

Player 1

Player 2 Player 2

Information sets: Anne: {A}; Charles: {{C (O)}, {C (F )}}

Sequential actions

(29)

Relation between extensive and strate- gic forms

Each game in extensive form corresponds to a game in strategic form in which players simultaneously choose their strategies

A game in strategic form can correspond to several games in extensive

form

(30)

Formalisation of a game

Relation between extensive and strate- gic forms

Strategic form:

H H H H H 1

2 O

C

F

C

O

A

(2, 1) (0, 0) F

A

(0, 0) (1, 2) Extensive form:

A

C

(2, 1) O

(0, 0) F O

C

(0, 0) O

(1, 2) F F Player 1

Player 2 Player 2

Is there another extensive form of this strategic game?

(31)

Relation between extensive and strate- gic forms

Strategic form:

H H H H H 1

2 O

C

F

C

O

A

(2, 1) (0, 0) F

A

(0, 0) (1, 2) Extensive form:

A

C O

C F Player 1

Player 2 Player 2

(32)

Formalisation of a game

Relation between extensive and strate- gic forms

Extensive form:

A

C

(2, 1) O

(0, 0) F O

C

(0, 0) O

(1, 2) F F Player 1

Player 2 Player 2

s

1

: O

C

(O

A

), O

C

(F

A

) ; s

2

: O

C

(O

A

), F

C

(F

A

) ; s

3

: F

C

(O

A

), O

C

(F

A

) ; s

4

: F

C

(O

A

), F

C

(F

A

).

Strategic form:

H H H

H H 1

2 s

1

s

2

s

3

s

4

O

A

(2, 1) (2, 1) (0, 0) (0, 0)

(33)

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

(34)

1

Formalisation of a game

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(35)

Dominated strategies

H H H H

H 1

2 A B

C (2, 1) (0, 1)

D (3, 1) (4, 2)

(36)

Solution concepts

Dominated strategies

H H H H

H 1

2 A B

C (2, 1) (0, 1)

D (3, 1) (4, 2)

(37)

Dominated strategies

H H H H

H 1

2 A B

C (2, 1) (0, 1) D (3, 1) (4, 2)

A strategy s

i

strictly dominates another strategy s

i0

if s

i

is strictly better than s

i0

for the player i whatever the strategies of the other players are:

∀s

−i

∈ S

−i

, u

i

(s

i0

, s

−i

) < u

i

(s

i

, s

−i

)

(38)

Solution concepts

Dominated strategies

H H H H

H 1

2 A B

C (2, 1) (0, 1) D (3, 1) (4, 2)

A strategy s

i

weakly dominates another strategy s

i0

if s

i

is at least as good than s

i0

for the player i whatever the strategies of the other players are; and strictly better than s

i0

for at least one combination of strategies:

∀s

−i

∈ S

−i

, u

i

(s

i0

, s

−i

) ≤ u

i

(s

i

, s

−i

) and

∃s

−i

∈ S

−i

t.q. u

i

(s

i0

, s

−i

) < u

i

(s

i

, s

−i

)

(39)

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(40)

Solution concepts

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(41)

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(42)

Solution concepts

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(43)

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(44)

Solution concepts

Iterated elimination of dominated strategies

H H H

H H 1

2 G M D

H (4, 3) (5, 1) (6, 2)

M (2, 1) (8, 4) (3, 6)

B (3, 0) (9, 6) (2, 8)

(45)

Iterated elimination of dominated strategies

The order of elimination of strictly dominated strategies does not affect the final result

The order of elimination of weakly dominated strategies can affect the final result

The process of iterated elimination of dominated strategies does not

necessarily lead to a unique solution

(46)

1

Formalisation of a game

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(47)

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3, 6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that

each player’s strategy is an optimal response to the strategies of the other

players

(48)

Solution concepts

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3,6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that

each player’s strategy is an optimal response to the strategies of the other

players

(49)

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3, 6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that

each player’s strategy is an optimal response to the strategies of the other

players

(50)

Solution concepts

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3, 6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that

each player’s strategy is an optimal response to the strategies of the other

players

(51)

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3, 6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that

each player’s strategy is an optimal response to the strategies of the other

players

(52)

Solution concepts

Pure strategy Nash equilibrium

H H H H

H 1

2 G M D

H (4, 3) (5, 1) (6, 2) M (2, 1) (8, 4) (3, 6) B (3, 0) (9, 6) (2, 8)

A pure strategy Nash equilibrium (PNE) is a strategy profile such that each player’s strategy is an optimal response to the strategies of the other players

s = {s

1

, . . . , s

n

} is a PNE iff

∀i ∈ {1, . . . , n}, ∀s

i0

∈ S

i

, u

i

(s) ≥ u

i

(s

−i

, s

i0

)

(53)

Pure strategy Nash equilibrium

A game can have several PNEs, or cannot have any A strictly dominated strategy cannot be in a PNE A weakly dominated strategy can be in a PNE

If the iterated elimination of strictly dominated strategies give a unique

solution, this solution is the unique PNE of the game

(54)

Solution concepts

Mixed strategies

Pure strategy: actions available for a player

Mixed strategy : strategy that defines the probabilities with which players choose each of their pure strategies

A mixed strategy for the player i is a probability distribution over S

i

. Σ

i

is the set of mixed strategy for i .

σ

i

: S

i

→ IR assigns to each pure strategy s

i

its probability to be played

A pure strategy s

i

corresponds to the mixed strategy σ

i

associated with

the probability of 1

(55)

Mixed strategy Nash equilibrium

A mixed strategy Nash equilibrium is a situation where all players choose their mixed strategies in order to maximise they expected utilities.

σ ∈ Σ is a mixed strategy Nash equilibrium iff

∀i ∈ N, ∀σ

0i

∈ Σ

i

, u

i

(σ) ≥ u

i

0i

, σ

−i

)

(56)

Solution concepts

Mixed strategy Nash equilibirum

H H H

H H 1

2 P F

P (15, -15) (-15, 15) F (-15, 15) (15, -15)

No pure strategy Nash equilibrium

Mixed strategy Nash equilibrium(a)?

(57)

Mixed strategy Nash equilibrium

H H H

H H 1

2 H T

H (15, -15) (-15, 15) T (-15, 15) (15, -15)

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and

σ

2

(T ) = 1 − y

(58)

Solution concepts

Mixed strategy Nash equilibrium

H H H

H H 1

2 H T

H (15, -15) (-15, 15) T (-15, 15) (15, -15)

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 :

y (15x − 15(1 − x )) + (1 − y )(−15x + 15(1 − x ))

(59)

Mixed strategy Nash equilibrium

H H H

H H 1

2 H T

H (15, -15) (-15, 15) T (-15, 15) (15, -15)

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

Expected utility of 2 :

(60)

Solution concepts

Mixed strategy Nash equilibrium

H H H

H H 1

2 H T

H (15, -15) (-15, 15) T (-15, 15) (15, -15)

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

(61)

Mixed strategy Nash equilibrium

H H H

H H 1

2 H T

H (15, -15) (-15, 15) T (-15, 15) (15, -15)

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

(62)

Solution concepts

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(63)

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(64)

Solution concepts

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(65)

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(66)

Solution concepts

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(67)

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(68)

Solution concepts

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =?

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =?

(69)

Mixed strategy Nash equilibrium

Probability distribution: σ

1

(H) = x , σ

1

(T ) = 1 − x , σ

2

(H) = y and σ

2

(T ) = 1 − y

Expected utility of 1 : 15(x (4y − 2) − 2y + 1)

→ If 4y − 2 > 0 (y >

12

), then the best interest of 1 is to choose x = 1

→ If 4y − 2 < 0 (y <

12

), then the best interest of 1 is to choose x = 0

→ If 4y − 2 = 0 (y =

12

), then the best interest of 1 is to choose x =

12

Expected utility of 2 : 15(y (2 − 4x ) + 2x − 1)

→ If 2 − 4x > 0 (x <

12

), then the best interest of 2 is to choose y = 1

→ If 2 − 4x < 0 (x >

12

), then the best interest of 2 is to choose y = 0

→ If 2 − 4x = 0 (x =

12

), then the best interest of 2 is to choose y =

12

(70)

Solution concepts

Nash equilibrium

Every pure strategy Nash equilibrium is a mixed strategy Nash equilibrium

Every finite game has at least one mixed strategy Nash equilibrium

(71)

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

(72)

Solution concepts

Pareto criterion

H H H H

H 1

2 A B

C (4, 4) (0, 3)

D (3, 1) (4, 5)

(73)

Pareto criterion

H H H H

H 1

2 A B

C (4, 4) (0, 3)

D (3, 1) (4, 5)

(74)

Solution concepts

Pareto criterion

H H H H

H 1

2 A B

C (4, 4) (0, 3)

D (3, 1) (4, 5)

(75)

Pareto criterion

H H H H

H 1

2 A B

C (4, 4) (0, 3) D (3, 1) (4, 5)

A strategy profile s strongly Pareto dominates s

0

if s is strictly better than s

0

for all the players

∀s

i

∈ s, ∀s

i0

∈ s

0

, u

i

(s) > u

i

(s

0

)

BD strongly Pareto dominates BC and AD

(76)

Solution concepts

Pareto criterion

H H H H

H 1

2 A B

C (4, 4) (0, 3) D (3, 1) (4, 5)

A strategy profile s weakly Pareto dominates s

0

if s is as good as s

0

for all the players, and strictly better than s

0

for at least one player

∀s

i

∈ s, ∀s

i0

∈ s

0

, u

i

(s) ≥ u

i

(s

0

) and

∃s

i

∈ s, ∃s

i0

∈ s

0

such that u

i

(s) > u

i

(s

0

)

BD weakly Pareto dominates AC

(77)

Pareto optimum

Pareto optimum:

State in which it is not possible to increase the well-being of an individual without decreasing the one of another individual

uniformly improvable: it is possible to increase the weel being of some individuals without decreasing the one of the others

non uniformly improvable: it is not possible to increase the weel being of some individuals without decreasing the one of the others

Pareto optimum = strategy profile non (weakly) Pareto dominated

(78)

Solution concepts

Pareto optimum

H H H H

H 1

2 A B

C (4, 4) (0, 3) D (3, 1) (4, 5)

Pure strategy Nash equilibria: CA, DB

Pareto optimum: DB

(79)

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

(80)

Solution concepts

Security levels

H H H

H H 1

2 A B

C (9, 9) (0, 8) D (8, 0) (7, 7)

The security level of a strategy s

i

for the player i is the minimum utility this player can obtain with this strategy whatever are the choices of the other players:

min

s−i

u

i

(s

i

, s

−i

)

The security level of a player i is the maximal security level of all i ’s

strategies

(81)

Security levels

H H H

H H 1

2 A B

C (9, 9) (0, 8) D (8, 0) (7, 7)

The security level of a strategy s

i

for the player i is the minimum utility this player can obtain with this strategy whatever are the choices of the other players:

min

s−i

u

i

(s

i

, s

−i

)

(82)

1

Formalisation of a game

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(83)

Prisoner dilemma

H H H

H H 1

2 C D

C (3, 3) (0, 5) D (5, 0) (1, 1)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

(84)

Solution concepts

Deadlock

H H H

H H 1

2 C D

C (1, 1) (0, 3) D (3, 0) (2, 2)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

Security levels of the players?

(85)

Friend or Foe

H H H

H H 1

2 Friend Foe

Friend (50, 50) (0, 100) Foe (100, 0) (0, 0)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

(86)

Solution concepts

Pure coordination game

H H H

H H 1

2 standard 1 standard 2

standard 1 (5, 5) (0, 0) standard 2 (0, 0) (3, 3)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

Security levels of the players?

(87)

Stag hunt

H H H

H H 1

2 stag hare

stag (10, 10) (0, 8) hare (8, 0) (7, 7)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

(88)

Solution concepts

Battle of the Sexes

H H H

H H 1

2 O F

O (1, 2) (0, 0) F (0, 0) (2, 1)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

Security levels of the players?

(89)

Chicken’s game

H H H

H H 1

2 stay swerve

stay (-100, -100) (1, -1) swerve (-1, 1) (0, 0)

Pure strategy Nash equilibrium(a)?

Dominated strategies?

Pareto optimum?

(90)

1

Formalisation of a game

2

Solution concepts Dominated strategies Nash Equilibrium Pareto criterion Security levels Examples of games

Subgame perfect equilibrium

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(91)

Dynamic games

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2)

F

B

(92)

Solution concepts

Dynamic game and backward induction

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

Backward induction Reasoning backward in time

Optimal choices on the terminal nodes

Using this information, one can then determine what to do at the second-to-last time of decision.

This process continues backwards until one has determined the best action

(93)

Dynamic game and backward induction

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

Backward induction Reasoning backward in time

Optimal choices on the terminal nodes

(94)

Solution concepts

Dynamic game and backward induction

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

Backward induction Reasoning backward in time

Optimal choices on the terminal nodes

Using this information, one can then determine what to do at the second-to-last time of decision.

This process continues backwards until one has determined the best action

(95)

Dynamic game and backward induction

Every (finite) extensive game with perfect information has (at least) one pusre strategy Nash equilibrium

This equilibrium can be find using backward induction

(96)

Solution concepts

Dynamic game and subgames

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

A subgame of an extensive game is made up of A node

All the nodes that are successors of the initial node All the edges between these nodes

The utilities associated to the terminal nodes

Three subgames here

(97)

Dynamic game and subgames

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

A subgame of an extensive game is made up of A node

All the nodes that are successors of the initial node

(98)

Solution concepts

Dynamic game and subgames

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

A subgame of an extensive game is made up of A node

All the nodes that are successors of the initial node All the edges between these nodes

The utilities associated to the terminal nodes

Three subgames here

(99)

Dynamic game and subgames

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

A subgame of an extensive game is made up of A node

All the nodes that are successors of the initial node

(100)

Solution concepts

Subgame perfect equilibrium

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

A strategy profile is a Subgame perfect Nash equilibrium if it represents

a Nash equilibrium of every subgame of the original game

(101)

Subgame perfect equilibrium

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

Strategic form of this subgame

C D

(102)

Solution concepts

Subgame perfect equilibrium

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

Strategic form of this subgame:

E F

(4, 1) (3, 2)

This subgame has one pure strategy Nash equilibrium: F (B).

(103)

Subgame perfect equilibrium

Strategic form of the original game:

H H H H H 1

2 C (A), E (B) C (A), F (B) D(A), E (B) D(A), F (B)

A (3, 0) (3, 0) (5, 1) (5, 1)

B (4, 1) (3, 2) (4, 1) (3, 2)

This subgame has three pure strategy Nash equilibria (B, C (A)F (B))

(A, D(A)E (B))

(A, D(A)F (B))

(104)

Solution concepts

Subgame perfect equilibrium

Strategic form of the original game:

H H H H H 1

2 C (A), E (B) C (A), F (B) D(A), E (B) D(A), F (B)

A (3, 0) (3, 0) (5, 1) (5, 1)

B (4, 1) (3, 2) (4, 1) (3, 2)

This subgame has three pure strategy Nash equilibria

(B, C (A)F (B)) is not a subgame perfect Nash equilibrium: C (A) won’t be played in the subgame starting after A,

(A, D(A)E (B))

(A, D(A)F (B))

(105)

Subgame perfect equilibrium

Strategic form of the original game:

H H H H H 1

2 C (A), E (B) C (A), F (B) D(A), E (B) D(A), F (B)

A (3, 0) (3, 0) (5, 1) (5, 1)

B (4, 1) (3, 2) (4, 1) (3, 2)

This subgame has three pure strategy Nash equilibria

(B, C (A)F (B)) is not a subgame perfect Nash equilibrium: C (A) won’t be played in the subgame starting after A,

(A, D(A)E (B)) is not a subgame perfect Nash equilibrium: E (B) won’t be played in the subgame starting after B,

(A, D(A)F (B))

(106)

Solution concepts

Subgame perfect equilibrium

Strategic form of the original game:

H H H H H 1

2 C (A), E (B) C (A), F (B) D(A), E (B) D(A), F (B)

A (3, 0) (3, 0) (5, 1) (5, 1)

B (4, 1) (3, 2) (4, 1) (3, 2)

This subgame has three pure strategy Nash equilibria

(B, C (A)F (B)) is not a subgame perfect Nash equilibrium: C (A) won’t be played in the subgame starting after A,

(A, D(A)E (B)) is not a subgame perfect Nash equilibrium: E (B) won’t be played in the subgame starting after B,

(A, D(A)F (B)) is subgame perfect Nash equilibrium: D(A) and F (B) are

pure strategy Nash equilibria of the subgames associated

(107)

Subgame perfect equilibrium

1

2

(3, 0) C

(5, 1) D A

2

(4, 1) E

(3, 2) F B

This game has one subgame perfect Nash equilibrium: (A, D(A)F (B)).

For the games with perfect information, the subgame perfect Nash

equilibria correspond to the equilibria found with backward induction

(108)

1

Formalisation of a game

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(109)

Prisoner’s dilemma

Two men are arrested, but the police do not possess enough information for a conviction. Following the separation of the two men, the police offer both a similar deal:

If one testifies against his partner (defects / betrays), and the other remains silent (cooperates / assists), the betrayer goes free and the cooperator receives the full one-year sentence

If each ’rats out’ the other, each receives a three-month sentence If both remain silent, both are sentenced to only one month in jail for a minor charge

Each prisoner must choose to either betray or remain silent; the decision

(110)

Repeated games

Prisoner’s dilemma

Two men are arrested, but the police do not possess enough information for a conviction. Following the separation of the two men, the police offer both a similar deal:

If one testifies against his partner (defects / betrays), and the other remains silent (cooperates / assists), the betrayer goes free (+5) and the cooperator receives the full one-year sentence (0)

If each ’rats out’ the other, each receives a three-month sentence (+3) If both remain silent, both are sentenced to only one month in jail for a minor charge (+1)

Each prisoner must choose to either betray or remain silent; the decision

of each is kept quiet. What should they do?

(111)

Prisoner’s dilemma

H H H

H H 1

2 C D

C (3, 3) (0, 5) D (5, 0) (1, 1)

Nash equilibrium 6= Pareto optimum

Equilibrium: both prisoners choose to defect even though their individual utility would be greater if they cooperated

Each prisoner is incitated to betray after having promised to cooperate. It

(112)

Repeated games

Prisoner’s dilemma: real life examples

Some examples of the prisonner’s dilemma in real life:

Politics: two states engaged in an arms race

Environmental studies: global climate change. All countries will benefit from a stable climate, but any single country is often hesitant to curb CO2 emissions

Economics: two firms advertising the same product at the same time

(113)

Iterated game

Iterated game (or repeated game): some number of repetitions of some (two-players) base game

A player will have to take into account the impact of his current action on the future actions of other players

The threat of such retaliation is often enough: benefits related to the prospects for future earnings may outweigh the short-term gains achieved by breaking an agreement

Logic of r´ eciprocity

(114)

Repeated games

Iterated prisoner’s dilemma

You do not have the same musical tastes as your neighbor. He often listen to loud music. You may also (in retaliation) listen to your music at a more than reasonable sound level. This has for consequences that he will start again the next day. But you both enjoy the periods when none of you bother the other.

Suppose that we weighted your satisfaction:

You have a satisfaction of 5 to listen to your music very loud Your satisfaction is of 0 when your neighbor puts his music very loud A calm evening, without music, brings you a satisfaction of 3

The fact of hearing both your music and the one of your neighbor gives you a satisfaction of a 1

You know what was the behavior of your neighbor in previous days, what are

you doing today?

(115)

Iterated dilemma

Flood and Dresher, 1952 Non-zero sum game 2 simultaneaous players 2 strategies :

Cooperate (beeing “nice”), denoted C Deviate (beeing “mean”), denoted D

Utility of the players are denoted S , P, R and T . They are up to the game, but have to fulfil the following condition:

S < P < R < T

(116)

Repeated games

Iterated dilemma

Players meet each other several times

At each iteration, players know their previous actions They do not know when the game will stop

The final gain of a player is the sum of his gains obtained at each encounter

In order to promote the cooperation, we add the following constraint:

S + T < 2 × R

(117)

Iterated prisoner’s dilemma

H H H

H H 1

2 D C

D (P =1, P =1) (T = 5, S = 0) C (S = 0, T = 5) (R = 3, R = 3)

P: Punishment, punishment for mutual trahison S : Sucker’s payoff

R: Reward, reward for mututal cooperation

T : Temptation

(118)

Repeated games

Iterated prisoner’s dilemma: some strategies

all-C: always cooperate ([c]

) all-D: always deviate ([d ]

)

tit-for-tat: cooperate, and then do the same that the opponent did in the previous encounter

tester: deviate, and then do the same that the opponent did in the previous encounter

spiteful: cooperate until the opponent has defected, after that always defect.

random : cooperate with a probability of

12

.

per_ccd : plays periodically [c, c, d ]

per_ddc : plays periodically [d , d , c]

(119)

Encounters between strategies

1 2 3 4 5 6 7 8 9 10

Results of all-C 0 0 0 0 0 0 0 0 0 0 = 0

Actions of all-C C C C C C C C C C C

Actions of all-D D D D D D D D D D D

Results of all-D 5 5 5 5 5 5 5 5 5 5 = 50

Results of per_ddc 5 1 0 1 1 0 1 1 0 1 = 11

Actions of per_ddc D D C D D C D D C D

Actions of spiteful C D D D D D D D D D

(120)

Repeated games

What strategy to choose?

Confrontation of 100 encounters:

all-C all-D all-C 300, 300 0, 500 all-D 500,0 100,100

Each strategy is good against some strategies, and bad against others.

The key point is to note that a player does not know what strategy has

his opponent.

(121)

tit-for-tat: a good strategy

tit-for-tat: cooperate, and then do the same that the opponent did in the previous encounter

tit-for-tat does not ever win

In the best case, this strategy obtains the same results than its opponent

But in the worst case, this strategy has only 5 points less than its

opponent, whatever the length of the game is

(122)

Repeated games

The tournaments

Several strategies meet

The final gain of a strategy is the sum of all of its gains against each of its opponent

Every games have the same length, that no strategy can know

(123)

Repeated games

Example of one tournament

5 strategies, 10 iterations for each game

all-C all-D per_ddc tit-for-tat

all-C 30 50 44 30

all-D 0 10 7 9

per_ddc 9 22 16 18

tit-for-tat 30 14 23 30

Total 99 110 101 117

Ranking: 1. tit-for-tat - 2. all-D - 3. per_ddc - 4. all-C

http://www2.lifl.fr/IPD/

http://www.apprendre-en-ligne.net/jeux/dilemme/home.html

(124)

Repeated games

Example of one tournament

5 strategies, 10 iterations for each game

all-C all-D per_ddc tit-for-tat

all-C 30 50 44 30

all-D 0 10 7 9

per_ddc 9 22 16 18

tit-for-tat 30 14 23 30

Total 99 110 101 117

Ranking: 1. tit-for-tat - 2. all-D - 3. per_ddc - 4. all-C

More details about the iterated prisoner’s dilemma (websites in French):

http://www2.lifl.fr/IPD/

http://www.apprendre-en-ligne.net/jeux/dilemme/home.html

(125)

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(126)

Two players zero-sum games

Two players zero-sum games

Simplest games No coalitions Strictly competitives:

Players have preferences strictly opposites

For each strategy profile s, we have u

1

(s) + u

2

(s) = 0 Examples:

Board games (chess, draughts, ...) War

...

(127)

Two players zero-sum games

H H H H

H 1

2 E F G H

A (3, -3) (3, -3) (-5, 5) (5, -5)

B (-4, 4) (7, -7) (8, -8) (7, -7)

C (2, -2) (5, -5) (-18, 18) (2, -2)

D (5, -5) (6, -6) (7, -7) (5, -5)

(128)

Two players zero-sum games

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

(129)

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

)))

(130)

Two players zero-sum games

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

)))

(131)

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

)))

(132)

Two players zero-sum games

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

))) Player 2 tries to minimize player 1’s security level:

v

2

= min

s2

(max

s1

(u

1

(s

1

, s

2

)))

(133)

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

))) Player 2 tries to minimize player 1’s security level:

v

2

= min

s2

(max

s1

(u

1

(s

1

, s

2

)))

(134)

Two players zero-sum games

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

))) Player 2 tries to minimize player 1’s security level:

v

2

= min

s2

(max

s1

(u

1

(s

1

, s

2

)))

(135)

Two players zero-sum games

H H H H

H 1

2 E F G H

A 3 3 -5 5

B -4 7 8 7

C 2 5 -18 2

D 5 6 7 5

Player 1 tries to maximize his security level: v

1

= max

s1

(min

s2

(u

1

(s

1

, s

2

))) Player 2 tries to minimize player 1’s security level:

v

2

= min

s2

(max

s1

(u

1

(s

1

, s

2

)))

If v

1

= v

2

= v , then every couple of strategies (s

1

, s

2

), s

1

garanteeing v to

(136)

1

Formalisation of a game

2

Solution concepts

3

Repeated games

4

Two players zero-sum games

5

Bounds of games theory

(137)

Bounds of games theory

Complete preferences

Invariable preferences

Perfect knowledge

Bounded rationality

References

Related documents

and conformable CsPbBr 3 perovskite-based PD array with ultra- light weight (3.12 g m −2 ) and excellent flexibility. Characterization of mechanical stability of the ultrathin

• Poster presentation at the Alabama Dietetic Association State Meeting titled: “The Impact of the Coordinated Approach to Child Health (CATCH) Intervention On

Overall, the results in Tables 2-4 thus conform that neither selectivity, nor the omission of control variables, including general ability, nor the endogenous nature of education

Doing so not only creates an awareness of the mechatronics courses for these students, but since the Instrumentation course is a part of the core curriculum, having the students

The unilateral fin line on a magnetized ferrite substrate has been analyzed using the network analysis technique for electromagnetic fields. applied in

1. Offer a landscape architecture-specific approach to teaching introductory computer programming that combines a) landscape parametrics with b) concepts of computer programming

For each family, the number of gates, the normalized circuit area (to a unit transistor), the logic depth, the normalized delay (to the technology-dependent intrinsic delay.. τ

Data send processes receive their instructions from the data traverse processes; they inform the data send processes to which render node which top-level octree cells data should