Optimality in strategic games, CP nets and soft constraints - PowerPoint PPT Presentation

About This Presentation
Title:

Optimality in strategic games, CP nets and soft constraints

Description:

... each player may depend on the strategies of a subset of agents (its neighbours) ... Neighbours: variables in the same constraint ... – PowerPoint PPT presentation

Number of Views:48
Avg rating:3.0/5.0
Slides: 36
Provided by: tob60
Category:

less

Transcript and Presenter's Notes

Title: Optimality in strategic games, CP nets and soft constraints


1
Optimality in strategic games, CP nets
and soft constraints
2
Main aim
  • To compare the notion of optimality used in many
    formalisms
  • To throw the basis for exploiting results in one
    field and reuse them in the other field
  • Strategic games
  • Agent interaction while pursuing their own
    interest (payoff function)
  • CP nets
  • Agents qualitative and conditional preferences
  • Soft constraints
  • Agents quantitative preferences

3
Outline
  • Strategic games
  • Relation between CP nets and games
  • Relation between soft constraints and games

4
Parametrized Strategic games
  • A set of players 1,.., n
  • For each player i
  • A set of strategies Si
  • A strict total order gti over Si depending on s-i
    (a joint strategy of all players but player i)
    payoff function
  • Example (prisoners dilemma) 2 players, 2
    strategies (ci, ni) for each player i

5
Pure Nash equilibria
  • A strategy si is a best response for i to s-i if
    si i si for all si in Si
  • A joint strategy s is a pure Nash equilibrium if
    each si is a best response to s-i
  • Also for all i, for all si in Si, si i si
  • No player has regrets on the strategy he chose
  • But there could be better joint strategies if
    more than one player changed its strategy
  • In the example, one Nash equilibria (NE) (N1,N2)

6
Pareto efficient joint strategies
  • No other joint strategy is better or equal for
    all agents, and better for at least one
  • Example
  • (N1,N2) unique Nash equilibrium
  • All other joint strategies are Pareto efficient
    (PE)

7
Dominance between strategies
  • A strategy si is never a best response for i if
    it is not a best response to any joint strategy
    s-i
  • In the example for each player i, Ci is never a
    best response

8
Elimination of dominated strategies
  • G ?NBR G
  • G subgame of G
  • For all i, each si in Si-Si is never a best
    response for i in G
  • Eliminate from the strategies of each players
    those that are never a best response

9
Nash equilibria and strategy elimination
  • If G ?NBR G, then s Nash equilibrium of G iff
    Nash equilibrium of G
  • In the example Ci is nbr, thus G has one row
    and one column, which is the unique Nash
    equilibrium 1,1

10
From CP-nets to games
  • Given a CP-net N, we build the game g(N)
  • Players features
  • Strategies of player i domain of feature xi
  • Payoff function of player i CP table for xi
  • Given s-i, si gti si iff s-ipar(xi) si gti si
    in the cp table for variable i
  • Thm opt(N) NE(g(N))

11
Example CP net
fishgtmeat
peaches gt strawberries
12
CP net ? Param. Strategic Game
Three players 1 main course, 2 wine, 3
fruit Two strategies for each player S1 meat,
fish S2red, white S3peaches, strawberries
fishgtmeat
Payoff functions For 1 main course fish gt meat,
always For 2 wine fish, -- ? white gt red meat,
-- ? red gt white For 3 fruit peaches gt
strawberries, always
peaches gt strawberries
13
Example optimals and Nash equilibria
  • Unique optimal for CP-net (fish, white, peaches)
  • Hard constraints fish, peaches, fish ? white,
    meat ? red
  • For the game
  • Meat is nbr for main course
  • Strawberries is nbr for fruit
  • Once meat is eliminated, red is nbr for wine
  • Nash equilibrium fish, white, peaches

14
From games to CP-nets
  • Given a game G, we build a CP-net n(G)
  • Feature xi player i
  • Domain of xi strategies for player i
  • Parents of xi all the other features
  • CP table of xi s-i si gt si if si gti si given
    s-i
  • Thm. NE(G) opt(n(G))

15
Example
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2,n2
  • x1 depends on x2
  • x2c2 n1 gt c1
  • x2n2 n1 gt c1
  • x2 depends on x1
  • X1c1 n2 gt c2
  • X1n1 n2 gt c2
  • Hard constraints
  • x2c2 ? x1n1
  • x2n2 ? x1n1
  • x1c1 ? x2n2
  • x1n1 ? x2n2
  • Unique solution x1n1, x2n2

16
Reduced CP-nets
  • If y is a parent of x, but the preference over
    the domain of x does not depend on y, then we can
    remove y from the parents of x ? eliminate rows
  • From a CP net N to its reduced version r(N)

17
Example reduced CP-net
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2, n2
  • x1 depends on x2
  • x2c2 n1 gt c1
  • x2n2 n1 gt c1
  • x2 depends on x1
  • X1c1 n2 gt c2
  • X1n1 n2 gt c2
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2, n2
  • x1 and x2 independent
  • For x1 n1 gt c1
  • For x2 n2 gt c2

18
CP-net techniques in games
  • From game G to n(G)
  • From n(G) to r(n(G))
  • Hard constraints for r(n(G))
  • Optimals of r(n(G)) Nash equilibria of G

19
Example
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2,n2
  • x1 depends on x2
  • x2c2 n1 gt c1
  • x2n2 n1 gt c1
  • x2 depends on x1
  • X1c1 n2 gt c2
  • X1n1 n2 gt c2

20
Example reduced CP-net
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2, n2
  • x1 depends on x2
  • x2c2 n1 gt c1
  • x2n2 n1 gt c1
  • x2 depends on x1
  • X1c1 n2 gt c2
  • X1n1 n2 gt c2
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2, n2
  • x1 and x2 independent
  • For x1 n1 gt c1
  • For x2 n2 gt c2

21
Example reduced CP-net
  • Two features x1, x2
  • D(x1)c1, n1
  • D(x2)c2, n2
  • x1 and x2 independent
  • For x1 n1 gt c1
  • For x2 n2 gt c2
  • Hard constraints
  • x1n1
  • x2n2
  • Thus x1n1, x2n2 unique optimal solution of the
    CP-net and Nash equilibrium of the game

22
Games and acyclic CP-nets
  • From game G to r(n(G))
  • If r(n(G)) is acyclic, then G has one Nash
    equilibrium, and linear time to find it

23
Combination operator
  • Extensive (always) for all a,b in A, a x b ? a,b
  • Idempotent for all a in A, a x a a
  • Ex. max, min, and
  • Ex. of instances fuzzy, classical
  • It is possible to apply soft constraint
    propagation
  • Strictly monotonic for all a,b,c in A, a lt b ? a
    x c lt b x c
  • Ex. sum, product
  • Ex. of instances weighted
  • It cannot be idempotent and strictly monotonic at
    the same time

24
Pareto efficient joint strategies
  • No other joint strategy is better or equal for
    all agents, and better for at least one
  • Example
  • (N1,N2) unique Nash equilibrium
  • All other joint strategies are Pareto efficient
    (PE)

25
From soft CSPs to games a local approach
  • From a soft CSP P to a game L(P)
  • Graphical games the payoff of each player may
    depend on the strategies of a subset of agents
    (its neighbours)
  • Players one for each variable
  • Strategies for a player i all values in domain
    of xi
  • Neighbours variables in the same constraint
  • Payoff function of player i for strategy s
    preference for assignment s in constraints
    involving xi

26
Optimality in SCSPs, NE in Games
  • From Games to CSPs
  • Full power of SCSPs no needed to model NE

Game G
CP-net n(G)
Greco et al.2005
CSP C(G)
Optimality Constraints of N(G)
Equivalent
27
Optimality in SCSPs, NE in Games
  • From a SCSP P to a game L(P)
  • Local Approach
  • Players one for each variable
  • Strategies for a player i all values in domain
    of xi
  • Payoff of player i for joint strategy s
    preference for assignment s in constraints
    involving xi

28
Example 1 Fuzzy SCSP ? game
local
X
Y
Z
  • Three players x,y,z
  • Two strategies a,b
  • Payoff functions
  • For x px(aa-)0.4, px(ba-)0.3
  • For y
  • p(aaa) min(0.4,0.4) 0.4
  • p(aba) min(0.1,0.1)0.1
  • ...
  • Two Nash equilibria aaa and bbb
  • Optimal solutions only bbb

(a,a) ? 0.4 (a,b) ? 0.1 (b,a) ? 0.3 (b,b) ? 0.5
(a,a) ? 0.4 (a,b) ? 0.3 (b,a) ? 0.1 (b,b) ? 0.5
29
Example 2 Fuzzy SCSP ? game
local
X
Y
Z
  • Three players x,y,z
  • Two strategies a,b
  • Payoff functions
  • For x px(aa-)0.9, px(ba-)0.6
  • For y
  • p(aaa) min(0.9,0.1) 0.1
  • p(aab) min(0.6,0.2)0.2
  • ...
  • Two Nash equilibria aab and bbb
  • Optimal solutions only aab, abb, bab, bbb

(a,a) ? 0.9 (a,b) ? 0.6 (b,a) ? 0.6 (b,b) ? 0.9
(a,a) ? 0.1 (a,b) ? 0.2 (b,a) ? 0.1 (b,b) ? 0.2
30
Strictly monotonic combination
  • In general, no relationship between optimal
    solutions of P and Nash equilibria of L(P)
  • However, some relationship exist if combination
    is strictly monotonic
  • Thm. Soft CSP P with strictly monotonic
    combination ? Opt(P) ? NE(L(P))

31
Classical CSPs ? games
  • Classical constraints are combined via logical
    and (which is not strictly monotonic)
  • However, if we consider consistent CSPs, the
    result holds
  • Thm. consistent CSP ? Sol(P) ? NE(L(P))

32
Optimality in SCSPs, NE in Games
  • Given an SCSP P, build a game GL(P)
  • Global mapping
  • Players variables
  • Strategies domain values
  • Payoff for player x for strategy s preference
    value for that assignment (by looking at all
    constraints)
  • Note same payoff for all players
  • Theorem Opt(P) ? NE(GL(P))
  • Subset relation for all classes of SCSPs

33
Optimality in SCSPs, PE in Games
  • From a game G to an SCSP L(G)
  • Variables players (n)
  • Domains strategies
  • Semiring Cartesian product of n semirings
  • For each variable xi, one constraint involving xi
    and its neighborhood
  • pref(t) (d1,...,dn), where dj 1j for j ? i,
    and di F(pi(t))
  • F is bijection from the payoffs to preferences in
    a c-semiring
  • Thm. Game G ? opt(L(G)) PE(G)

34
Example
Semiring weighted x weighted
(c1,c2) ? (7,0) (c1,n2) ? (10,0) (n1,c2) ?
(6,0) (n1,n2) ? (9,0)
X1
x2
(c1,c2) ? (0,7) (c1,n2) ? (0,6) (n1,c2) ?
(0,10) (n1,n2) ? (0,9)
  • Optimal solutions
  • (c,c) with pref. (7,7)
  • (n,c) with pref. (10,6)
  • (c,n) with pref. (6, 10)
  • Pareto efficient joint strategies all but (1,1)

35
Optimality in SCSPs, PE in Games
  • From SCSPs to Games
  • If we use the local mapping
  • Opt(P) ? PE(L(P))
  • If we use global mapping
  • Opt(P) PE(L(P))

36
Summary CP-nets and NE games 1-1
N not reduced
g
r
g
N reduced
g(N)
r
n
n(g(N))
37
Summary SCSPs and Games
  • Nash Equilibria
  • Pareto Efficient



Game
SCSP
Game
CSP
? x st.m.
Game
?
Game
local
local
SCSP
SCSP
global
global
Game
?
Game

38
References
  • CP-nets and Soft Constraints
  • Carmel Domshlak, Steven David Prestwich,
    Francesca Rossi, Kristen Brent Venable, Toby
    Walsh Hard and soft constraints for reasoning
    about qualitative conditional preferences. J.
    Heuristics 12(4-5) 263-285 (2006)
  • C. Boutilier, R. I. Brafman, Carmel Domshlak, H.
    H. Hoos, and D. Poole. Preference-based
    constraint optimization with CP-nets.
    Computational Intelligence, 20(2)137157, 2004
  • Games, CP-nets and Soft Constraints
  • Georg Gottlob, Gianluigi Greco, Francesco
    Scarcello Pure Nash Equilibria Hard and Easy
    Games. J. Artif. Intell. Res. (JAIR) 24 357-406
    (2005)
  • Krzysztof R. Apt, Francesca Rossi, K. Brent
    Venable,Comparing the notions of optimality in
    CP-nets, strategic games, and soft constraints,
    to appear in Annals of Mathematics and
    Artificial Intelligence.
Write a Comment
User Comments (0)
About PowerShow.com