Nash Equilibrium - PowerPoint PPT Presentation

1 / 37
About This Presentation
Title:

Nash Equilibrium

Description:

Common payoff games: a game where all action profiles a A1 x ... sa22 is player 2's probability of playing action a2 under his mixed strategy ... – PowerPoint PPT presentation

Number of Views:85
Avg rating:3.0/5.0
Slides: 38
Provided by: adam59
Category:

less

Transcript and Presenter's Notes

Title: Nash Equilibrium


1
Nash Equilibrium
  • A strategy profile s (s1, , sn) is a Nash
    Equilibrium if, for all agents i, si is a best
    response to s-i.
  • two strategies si and s-i are in Nash equilibrium
    if
  • under the assumption that agent i plays si, agent
    j can do no better than play s-i and
  • under the assumption that agent j plays s-i,
    agent i can do no better than play si.
  • Each agent chooses a strategy that is a best
    response to the other agents strategies.

2
Nash Equilibrium (NE)
  • How hard is the Nash Equilibrium to compute?

3
Normal Form Games
  • Finite, n-person normal form game (N, A, O, m,
    u)
  • N is a finite set of n players, indexed by i
  • A A1, . . . , An is a set of actions for each
    player i
  • a A is an action profile
  • O is a set of outcomes
  • m A O
  • u u1, ... , un, a utility function for each
    player, where ui A

4
Game Definitions
  • Common payoff games a game where all action
    profiles a A1 x x An for any pair of agents i,
    j, result in ui(a) uj(a).
  • Constant sum games A game in which a constant c
    exists st for each strategy profile a A1 x A2 it
    is the case that u1(a) u2(a) c.

5
Linear Programming
  • A linear program is defined by
  • A set of real-valued variables
  • A linear objective function
  • A weighted sum of the variables
  • A set of linear constraints
  • The requirement that a weighted sum of the
    variables must be greater than or equal to some
    constant

6
Linear programming
  • maximize Si wixi
  • subject to Si wcixi bc c C
  • xi 0 xi X
  • These problems can be solved in polynomial time
    using interior point methods.
  • Interestingly, the (worst-case exponential)
    simplex method is often faster in practice.

7
NE of two-player zero-sum games
  • Easiest NE to find via linear programming in
    polynomial time.
  • Let G (1,2, A1, A2, u1, u2) be a two-player
    zero-sum game.
  • Ui is the unique expected utility for player i
    in equilibrium.
  • We know that the players interests are opposing,
    what does this tell us about U1 and U2?
  • U1 - U2

8
NE of two-player zero-sum games
  • minimize U1
  • subject to u1(aj1, ak2) sk2 U1
    j A1
  • sk2 1
  • sk2 0 k A2
  • Variables
  • U1 is the expected utility for player 1
  • sa22 is player 2s probability of playing action
    a2 under his mixed strategy
  • each u1(a1, a2) is a constant.

9
NE of two-player zero-sum games
  • minimize U1
  • subject to u1(aj1, ak2) sk2 rj1 U1
    j A1
  • sa22 1
  • sk2 0 k A2
  • rj1 0 j A1
  • Introduce slack variables, rj1.

10
NE of two-player general-sum games
  • The NE of a two-player general-sum game cannot be
    represented by a linear program. Why?

11
NE of two-player general-sum games
  • Computation of the NE in this case is not
    NP-Complete. Why?
  • However, it does appear to be in the PPAD
    complexity class.

12
NE of two-player general-sum games
  • PPAD Class of problems
  • Define a family of directed graphs, G(n).
  • Let each graph in G(n) contain a number of nodes
    that is exponential in n.
  • Let Parent N N and Child N N
  • Let there be one graph g G(n) for every such
    pair of Parent and Child functions as long as
  • An edge exists from a node j to a node k iff
    Parent(k) j and Child(j) k.
  • There must exist one distinguished node 0 N
    with exactly zero parents.

13
NE of two-player general-sum games
  • Reformulate previous solution to explicitly
    consider both players.
  • u1(aj1, ak2) sk2 rj1 U1 j A1
  • u2(aj1, ak2) sk1 rj2 U2 k A1
  • sj1 1, sk2 1
  • sj1 0, sk2 0 j A1, k A2
  • rj1 0, rk2 0 j A1 , k A2
  • rj1 sj1 0, rk2 sk2 0 j A1 , k A2

14
NE of two-player general-sum games
  • The complementarity condition requires that when
    an action is played by a particular player with
    positive probability, the corresponding slack
    variable must be zero.
  • Each slack variable represents the players
    incentive to deviate from the corresponding
    action.
  • Linear Complementarity Problem (LCP)

15
NE of two-player general-sum games
  • Solve the LCP with the Lemke-Howson algorithm

a12
a22
How many pure strategies does agent a1 have?
a11
What about agent a2?
a21
a31
16
NE of two-player general-sum games
  • Lemke-Howson algorithm

a12
a22
a11
a21
a31
How do we represent a2s Strategy space?
How do we represent a1s Strategy space?
17
NE of two-player general-sum games
  • Lemke-Howson algorithm
  • Label the strategies.
  • A pair of strategies (s1, s2) is a NE iff it is
    completely labeled (L(s1) L(s2) A1 A2).

s31
s22
Where are the multiply labeled points?
(0,0,1)
(0,1)
(0,1/3,2/3)
(1/3, 2/3)
(1,0,0)
s11
(2/3, 1/3)
s12
(2/3,1/3,0)
(1,0)
(0,1,0)
s21
18
NE of two-player general-sum games
  • Lemke-Howson algorithm
  • Identify the NE

(0,0,1)
a11 , a21 , a12
(0,1)
a11 , a12
(0,1/3,2/3)
(1/3, 2/3)
a11 , a12 , a22
a11 , a21
a21 , a31 , a12
(1,0,0)
(2/3, 1/3)
(0,0,0)
a21 , a31
a11 , a21 , a31
(2/3,1/3,0)
a31 , a12 , a22
(0,0)
(1,0)
a31 , a22
a11 , a22
(0,1,0)
a11 , a31 , a22
((0,0,1), (0,1)), ((0,1/3,2/3), (2/3,
1/3)), ((2/3,1/3,0), (1/3, 2/3))
19
NE of two-player general-sum games
  • Lemke-Howson algorithm
  • Initialize
  • (s1, s2) (0, 0)
  • Find an s1 G1 s.t. s1 is adjacent to 0
  • x 1
  • Repeat
  • sx sx
  • let aji be the label that occurs in both s1 and
    s2
  • Find an sx Gx s.t. sx is adjacent to sx and
    aji L(sx)
  • x 3 x
  • Until (s1, s2) is a completely labeled pair.

20
NE of two-player general-sum games
(0,0,1)
(0,0,1)
a11 , a21 , a12
a11 , a21 , a12
(0,1)
(0,1)
a11 , a12
a11 , a12
(0,1/3,2/3)
(0,1/3,2/3)
(1/3, 2/3)
(1/3, 2/3)
(1/3, 2/3)
a11 , a12 , a22
a11 , a12 , a22
a11 , a21
a11 , a21
a11 , a21
a21 , a31 , a12
(1,0,0)
(2/3, 1/3)
(2/3, 1/3)
(2/3, 1/3)
(0,0,0)
(0,0,0)
a21 , a31
a21 , a31
a21 , a31
a11 , a21 , a31
a11 , a21 , a31
(2/3,1/3,0)
(2/3,1/3,0)
a31 , a12 , a22
a31 , a12 , a22
(0,0)
(1,0)
(1,0)
(0,0)
(1,0)
a31 , a22
a31 , a22
a31 , a22
a11 , a22
a11 , a22
(0,1,0)
(0,1,0)
a11 , a31 , a22
a11 , a31 , a22
((2/3,1/3,0), (1/3,2/3)) ((0,0,0), (0,0))
((0,1,0), (0,1)) ((2/3,1/3,0), (0,1))
((2/3,1/3,0), (1/3,2/3))
((2/3,1/3,0), (1/3,2/3)) ((2/3,1/3,0),
(1/3,2/3)) ((0,1/3,2/3)), (1/3,2/3)
((0,1/3,2/3), (2/3, 1/3))
((0,0,1), (1,0)) ((0,0,0), (0,0)) ((0,0,1)),
(0,0) ((0,0,1), (1,0))
21
NE of two-player general-sum games
  • Lemke-Howson algorithm
  • Advantages
  • Guaranteed to find a sample NE.
  • Non-determinism is concentrated in the first
    move.
  • Disadvantages
  • Not guaranteed to find all NE.
  • Does not provide guidance on choosing a good
    first move.

22
NE of two-player general-sum games
  • Heuristics and the Support-Enumeration Method can
    be combined to provide an algorithm to find NE.
  • Search for NE can be reduced to searching the
    space of supports.
  • Use a feasibility program that tests the
    specified supports.
  • Complete
  • Worst case performance exponential

23
Specific NE of General-sum Games
  • Idea is to find an equilibrium with a specific
    property.
  • Properties
  • Uniqueness
  • Pareto optimal
  • Guaranteed payoff
  • Guaranteed social welfare
  • Subset inclusion
  • Subset containment
  • NP-Hard when applied to NE.

24
NE for n-player general-sum games
  • Cannot formulate as a linear complementarity
    problem.
  • Sequence of linear complementarity problems
    (SLCP).
  • Each LCP is an approximation of the problem and
    is used to developed the next approximation in
    the sequence.

25
NE for n-player general-sum games
  • Formulate the problem as a minimum of a function.
  • Constrained optimization problem
  • Unconstrained optimization problem
  • Disadvantages
  • Both have local minima that do not correspond to
    the NE.

26
NE for n-player general-sum games
  • Simplicial subdivision algorithms
  • Consider
  • the space of mixed strategies is a simlpex
  • The players best response is a function from
    points on the simplex to other points on the
    simplex.
  • Scarfs algorithm locates the fixed points.
  • Add a variable that expresses the accuracy of the
    current iterations approximation.
  • Worst case complexity exponential in the number
    of players and the number of digits of accuracy.

27
NE for n-player general-sum games
  • Generalize SEM to the n-player case
  • The feasibility program becomes non-linear.
  • Algorithm must accommodate multiple variables in
    the feasibility problem.
  • Use standard numerical techniques for non-linear
    optimization.
  • Reverse the lexicographic ordering between size
    and balance of supports.

28
All NE of General-sum Games
  • Idea is to determine all equilibria of a game.
  • Important when designing a game and need to know
    all possible stable outcomes.
  • Worst case exponential in the number of actions
    for each player.

29
Dominant Strategies
  • A strategy dominates another when the first
    strategy is always at least as good as the
    second, independent of the other players
    actions.
  • Iterative removal
  • Strictly dominant strategies order does not
    matter.
  • Very weakly and weakly dominant strategies
    removal order can have an affect.
  • Potentially remove some equilibrium of the
    original game.
  • Potentially remove a larger set of strategies and
    result in a smaller game.

30
Domination by a Pure Strategy
  • for all pure strategies ai Ai for player i where
    ai ? si do
  • dom true
  • for all pure strategy profiles a-i A-i for the
    players other than i do
  • if ui(si, a-i) ui(ai, a-i) then
  • dom false
  • break
  • end if
  • end for
  • if dom true then return true
  • end for
  • return false

31
Domination by a Mixed Strategy
  • Recall that mixed strategies cannot be
    enumerated.
  • Strict Domination
  • Requires a linear program.
  • Minimize
  • Subject to

32
Domination by a Mixed Strategy
  • Very weakly dominate

33
Domination by a Mixed Strategy
  • Weak domination

Maximize
Subject to
34
Iterated Dominance
  • Strategy Elimination Does there exist some
    elimination path under which the strategy si is
    eliminated?
  • Reduction Identity Given action subsets A-i Ai
    for each player i, does there exist a maximally
    reduced game where each player i has the actions
    A-i?
  • Uniqueness Does every elimination path lead to
    the same reduced game?
  • Reduction Size Given constants ki for each
    player i, does there exist a maximally reduced
    game where each player i has exactly ki actions?
  • Iterated strict dominance problems are all in P.
  • Iterated weak or very weak dominance problems
    are NP-complete.

35
Correlated Equilibrium n-player general-sum games
Variables p(a) constants ui(a)
Could find the social-welfare maximizing the
correlated equilibrium by adding an objective
function
Maximize
36
Correlated Equilibrium
  • Complexity of P when applied to CE
  • Uniqueness
  • Pareto Optimal
  • Guaranteed payoff
  • Subset inclusion
  • Subset containment

37
CE to NE Calculation
Intuitively, correlated equilibrium has only a
single randomization over outcomes, whereas in NE
this is constructed as a product of independent
probabilities.
Changing this program so that it finds NE
requires the first constraint to be
Write a Comment
User Comments (0)
About PowerShow.com