Gametheory: A brief survey of some classic and recent topics for the Intro to AI course at CMU By Tu - PowerPoint PPT Presentation

1 / 29

About This Presentation

Title:

Gametheory: A brief survey of some classic and recent topics for the Intro to AI course at CMU By Tu

Description:

Game-theory: A brief survey of some classic and recent topics for the ... that agent can make at a point in the game ... THEN the game has a (pure ... – PowerPoint PPT presentation

Number of Views:26

Avg rating:3.0/5.0

Slides: 30

Provided by: TuomasS6

Category:

more less

Transcript and Presenter's Notes

Title: Gametheory: A brief survey of some classic and recent topics for the Intro to AI course at CMU By Tu

1
Game-theory A brief survey of some classic and
recent topics for the Intro to AI course at
CMUBy Tuomas SandholmAssociate
ProfessorComputer Science DepartmentCarnegie
Mellon University
2
The heart of the problem

In a 1-agent setting, agents expected utility
maximizing strategy is well-defined
But in a multiagent system, the outcome may
depend on others strategies also

3
Terminology

Agent player
Action move choice that agent can make at a
point in the game
Strategy si mapping from history (to the
extent that the agent i can distinguish) to
actions
Strategy set Si strategies available to the
agent
Strategy profile (s1, s2, ..., sA) one
strategy for each agent
Agents utility is determined after each agent
(including nature that is used to model
uncertainty) has chosen its strategy, and game
has been played ui ui(s1, s2, ..., sA)

4
Game representations
5
Dominant strategy equilibrium

Best response si for all si, ui(si,s-i)
ui(si,s-i)
Dominant strategy si si is a best response
for all s-i
Does not always exist
Inferior strategies are called dominated
Dominant strategy equilibrium is a strategy
profile where each agent has picked its dominant
strategy
Does not always exist
Requires no counterspeculation

6
Nash equilibrium Nash50

Sometimes an agents best response depends on
others strategies a dominant strategy does not
exist
A strategy profile is a Nash equilibrium if no
player has incentive to deviate from his strategy
given that others do not deviate for every agent
i, ui(si,s-i) ui(si,s-i) for all si
Dominant strategy equilibria are Nash equilibria
but not vice versa
Defect-defect is the only Nash eq. in Prisoners
Dilemma
Battle of the Sexes game
Has no dominant strategy equilibria

7
Criticisms of Nash equilibrium

Not unique in all games, e.g. Battle of the Sexes
Approaches for addressing this problem
Refinements of the equilibrium concept
Choose the Nash equilibrium with highest welfare
Subgame perfection
Focal points
Mediation
Communication
Convention
Learning
Does not exist in all games
May be hard to compute
Finding a good one is NP-hard GilboaZemel
GEB-89, ConitzerSandholm IJCAI-03

8
Existence of (pure strategy) Nash equilibria

IF a game is finite
and at every point in the game, the agent whose
turn it is to move knows what moves have been
played so far
THEN the game has a (pure strategy) Nash
equilibrium
(solvable by minimax search at least as long as
ties are ruled out)

9
Rock-scissors-paper game

Sequential moves

10
Rock-scissors-paper game

Simultaneous moves

11
Mixed strategy Nash equilibrium
Mixed strategy agents chosen probability
distribution over pure strategies from its
strategy set
Each agent has a best response strategy and
beliefs (consistent with each other)
Symmetric mixed

strategy Nash eq

Each player

plays each pure

strategy with

probability 1/3
In mixed strategy equilibrium, each strategy that
occurs in the mix of agent i has equal expected
utility to i
Information set (the mover does not know which
node of the set she is in)
12
Existence of mixed strategy Nash equilibria

Every finite player, finite strategy game has at
least one Nash equilibrium if we admit mixed
strategy equilibria as well as pure Nash 50
(Proof is based on Kakutanis fix point theorem)

13
Complexity of finding a mixed-strategy Nash
equilibrium?

Still an open question!
Together with factoring the most important
concrete open question on the boundary of P
today Papadimitriou 2001
We solved several related questions
Conitzer Sandholm, International Joint
Conference on Artificial Intelligence 2003

14
A useful reduction (SAT -gt game) CS, IJCAI-03

Theorem. SAT-solutions correspond to
mixed-strategy equilibria of the following game
(each agent randomizes uniformly on support)

SAT Formula
(x1 or -x2) and (-x1 or x2 )
Solutions
x1true, x2true
x1false,x2false
Game
x1
x2
x1
-x1
x2
-x2
(x1 or -x2)
(-x1 or x2)
default
x1
-2,-2
-2,-2
0,-2
0,-2
2,-2
2,-2
-2,-2
-2,-2
-2,1
x2
-2,-2
-2,-2
2,-2
2,-2
0,-2
0,-2
-2,-2
-2,-2
-2,1
x1
-2,0
-2,2
1,1
-2,-2
1,1
1,1
-2,0
-2,2
-2,1
-x1
-2,0
-2,2
-2,-2
1,1
1,1
1,1
-2,2
-2,0
-2,1
x2
-2,2
-2,0
1,1
1,1
1,1
-2,-2
-2,2
-2,0
-2,1
-x2
-2,2
-2,0
1,1
1,1
-2,-2
1,1
-2,0
-2,2
-2,1
(x1 or -x2)
-2,-2
-2,-2
0,-2
2,-2
2,-2
0,-2
-2,-2
-2,-2
-2,1
(-x1 or x2)
-2,-2
-2,-2
2,-2
0,-2
0,-2
2,-2
-2,-2
-2,-2
-2,1
default
1,-2
1,-2
1,-2
1,-2
1,-2
1,-2
1,-2
1,-2
0,0

Proof sketch
Playing opposite literals (with any probability)
is unstable
If you play literals (with probabilities), you
should make sure that
for any clause, the probability of the literal
being in that clause is high enough, and
for any variable, the probability that the
literal corresponds to that variable is high
enough
(otherwise the other player will play this
clause/variable and hurt you)
So equilibria where both randomize over literals
can only occur when both randomize over same SAT
solution
These are the only good equilibria

15
Complexity of mixed-strategy Nash equilibria with
certain properties

This reduction implies that there is an
equilibrium where players get expected utility 1
each iff the SAT formula is satisfiable
Any reasonable objective would prefer such
equilibria to 0-payoff equilibrium
Corollary. Deciding whether a good equilibrium
exists is NP-hard
1. equilibrium with high social welfare
2. Pareto-optimal equilibrium
3. equilibrium with high utility for a given
player i
4. equilibrium with high minimal utility
Also NP-hard (from the same reduction)
5. Does more than one equilibrium exists?
6. Is a given strategy ever played in any
equilibrium?
7. Is there an equilibrium where a given strategy
is never played?
(5) weaker versions of (4), (6), (7) were known
Gilboa, Zemel GEB-89
All these hold even for symmetric, 2-player games

16
Counting the number of mixed-strategy Nash
equilibria

Why count equilibria? If we cannot even count
the equilibria, there is little hope of getting a
good overview of the overall strategic structure
of the game
Unfortunately, our reduction implies
Corollary. Counting Nash equilibria is P-hard!
Proof. SAT is P-hard, and the number of
equilibria is 1 SAT
Corollary. Counting connected sets of equilibria
is just as hard
Proof. In our game, each equilibrium is alone in
its connected set
These results hold even for symmetric, 2-player
games

17
Subgame perfect equilibrium credible
threatsSelten 72

Proper subgame subtree (of the game tree) whose
root is alone in its information set
Subgame perfect equilibrium strategy profile
that is in Nash equilibrium in every proper
subgame (including the root), whether or not that
subgame is reached along the equilibrium path of
play
E.g. Cuban missile crisis
Pure strategy Nash equilibria (Arm,Fold),
(Retract,Nuke)
Pure strategy subgame perfect equilibria
(Arm,Fold)
Conclusion Kennedys Nuke threat was not credible

18
Conclusions on game-theoretic analysis tools

Tools for building robust, nonmanipulable systems
with self-interested agents and different agent
designers
Different solution concepts
For existence, use strongest equilibrium concept
For uniqueness, use weakest equilibrium concept

19
Mechanism design
Necessary for building nonmanipulable multiagent
systems (e.g. for automated negotiation)
20
Goal of mechanism design

Implementing a social choice function f(u1, ,
uA) using a game
Center auctioneer does not know the agents
preferences
Agents may lie
Goal is to design the rules of the game (aka
mechanism) so that in equilibrium (s1, , sA),
the outcome of the game is f(u1, , uA)
Mechanism designer specifies the strategy sets Si
and how outcome is determined as a function of
(s1, , sA) ? (S1, , SA)
Variants
Strongest There exists exactly one equilibrium.
Its outcome is f(u1, , uA)
Medium In every equilibrium the outcome is f(u1,
, uA)
Weakest In at least one equilibrium the outcome
is f(u1, , uA)

21
Revelation principle

Any outcome that can be supported in Nash
(dominant strategy) equilibrium via a complex
indirect mechanism can be supported in Nash
(dominant strategy) equilibrium via a direct
mechanism where agents reveal their types
truthfully in a single step

22
Uses of the revelation principle

Literal Only direct mechanisms needed
Problems
Strategy formulator might be complex
Complex to determine and/or execute best-response
strategy
Computational burden is pushed on the center
(assumed away)
Thus the revelation principle might not hold in
practice if these computational problems are hard
This problem traditionally ignored in game theory
Agent might not know its own preferences, and
figuring them out can be costly (e.g.
computationally). In an indirect mechanism, the
right outcome can sometimes be chosen without
eliciting everything about each agents
preferences
See e.g., Preference Elicitation in
Combinatorial Auctions by Conen Sandholm
Even if the indirect mechanism has a unique
equilibrium, the direct mechanism can have
additional bad equilibria
As an analysis tool
Best direct mechanism gives tight upper bound on
how well any indirect mechanism can do
Space of direct mechanisms is smaller than that
of indirect ones
One can analyze all direct mechanisms pick best
one
Thus one can know when one has designed an
optimal indirect mechanism (when it is as good as
the best direct one)

23
Implementation in dominant strategies
Strongest form of mechanism design

Goal is to design the rules of the game (aka
mechanism) so that in dominant strategy
equilibrium (s1, , sA), the outcome of the
game is f(u1, , uA)
Nice in that agents cannot benefit from
counterspeculating each other
Others preferences
Others rationality
Others endowments
Others capabilities

24
Gibbard-Satterthwaite impossibility 1972

Thrm. If O 3 (and each outcome would be
the social choice under f for some input profile
(u1, , uA) ) and f is implementable in
dominant strategies, then f is dictatorial

25
(No Transcript)
26
Special case where dominant strategy
implementation is possible Quasilinear
preferences -gt Clarke tax mechanism

Outcome (x1, x2, ..., xk, m1, m2, ..., mA )
Quasilinear preferences ui(x, m) mi vi(x1,
x2, ..., xk)
Utilitarian setting Social welfare maximizing
choice
Outcome s(v1, v2, ..., vA) maxx ?i vi(x1, x2,
..., xk)
Agents payment mi ?j?i vj(s(v)) - ?j?i
vj(s(v-i)) ? 0 is a tax
Thrm Every agents dominant strategy is to
reveal preferences truthfully
Intuition Agent internalizes the negative
externality he imposes on others by affecting the
outcome
Agent pays nothing if he does not change the
outcome
Example k1, x1joint pool built or not,
mi
E.g. equal sharing of construction cost -c / A

27
Clarke tax mechanism

Pros
Social welfare maximizing outcome
Truth-telling is a dominant strategy
Feasible in that it does not need a benefactor
(?i mi ? 0)
Cons
Budget balance not maintained (in pool example,
generally ?i mi lt 0)
Have to burn the excess money that is collected
Thrm. Green Laffont 1979. Let the agents
have arbitrary quasilinear preferences. No
social choice function that is (ex post) welfare
maximizing (taking into account money burning as
a loss) is implementable in dominant strategies
If there is some party that has no private
information to reveal and no preferences over x,
welfare maximization and budget balance can be
obtained by having that partys payment be m0 -
?i1.. mi
Auctioneer could be called agent 0
Clarke tax mechanism gt Second-price sealed-bid
(Vickrey) auction
Vulnerable to collusion
Even by coalitions of just 2 agents

28
Another approach for circumventing the
Gibbard-Satterthwaite impossibility (and
potentially other impossibility results)

Design the game so that (although manipulations
exist), finding a beneficial manipulation is
computationally so complex for an agent that the
agent cannot do that
E.g. Complexity of Manipulating Elections with
Few Candidates
Conitzer Sandholm, National Conference on
Artificial Intelligence 2002
E.g. Universal Voting Protocol Tweaks for Making
Manipulation Hard
Conitzer Sandholm, International Joint
Conference on Artificial Intelligence 2003

29
Yet another approach for circumventing the
Gibbard-Satterthwaite impossibility (and
potentially other impossibility results)

Designing the mechanism automatically to the
situation at hand
Input is the probabilistic information that the
center has about the agents
Output is an optimal mechanism where the agents
are motivated to reveal their preferences
truthfully, and a social objective is satisfied
to the optimal extent
Advantages
Can be used even without side payments
quasilinear preferences
Could achieve better outcomes than Clarke tax
mechanism
Circumvents impossibility in many cases
Complexity of Mechanism Design Conitzer
Sandholm, Conference on Uncertainty in AI 2002
Designing a deterministic mechanism is
NP-complete
Designing a randomized mechanism is fast
No loss in social objective, sometime a gain
Both results hold for dominant strategy
implementation as well as Bayes-Nash
implementation