Evolutionary Games - PowerPoint PPT Presentation

1 / 15

About This Presentation

Title:

Evolutionary Games

Description:

Number of Views:49

Avg rating:3.0/5.0

Slides: 16

Provided by: vic36

Category:

Tags: arranged | evolutionary | games | undesirable

Transcript and Presenter's Notes

Title: Evolutionary Games

1
Evolutionary Games

We now turn attention to another kind of
equilibrium-based solution.
This is a solution that is produced by some form
of learning or adaptation process.
we will focus on the kinds of things that can be
learned in a population of learning agents.
We'll have to be careful because evolution has a
huge number of things that can affect it mating,
mutation, environment, catastrophes, other agents
in the population, etc. We'll restrict attention
to just a couple of these factors.

3
Reaching an equilibrium

the main requirement for reaching an equilibrium
in learning is that the learning algorithms stop
changing.
This type of equilibrium can be very weak as
when, for example, a learning agent happens to
select parameter values that cause another
learning agent to stop adapting, and vice versa.
Or, both agents get tired of adapting and just
"freeze" their solutions even though they may not
be good solutions. This type of equilibrium
may also be weak because even the smallest
perturbation to this type of equilibrium can
cause the system to adapt to another solution.
A stronger notion of equilibrium is a learned
solution that is not easily changed by perturbing
the system.
We call such an equilibrium a stable solution.

Finally, not every learning process has an
equilibrium.
Since only certain types of learning processes
and games produce these equilibria, the notion of
a learning-based equilibrium is not as universal
as the notion of a Nash equilibrium

In evolutionary games, the two main factors that
contribute to what is learned are
The types of interactions that occur between the
agents in a population.
The rules that are applied to determine which
strategies within the population are fit and
therefore likely to be learned by the population.

Let's begin by using an example.
Suppose that we have two large and separate
groups of agents (males and females) who will be
playing the battle of the sexes game.
Suppose that each of these two groups has a mix
of agents that either always play cooperate (vote
for what other wants) or always play defect (vote
for what it wants)
One agent from each group, one male and one
female, is selected at random, they each make
their choice, and they get the reward that
results.

9
(No Transcript)
10
(No Transcript)
11
Relative Fitness.

When we look at the strategies, if 1/3 of the
agents are playing strategy A and getting 1/3 of
the total utility, they are getting what they
expected so they shouldnt change.
HOWEVER, if 1/3 are getting ½ of the total
utility for all players, they are playing better
than others. We will do better if we have MORE
agents like these super achieving agents. But
how many more?
The simple thing to do is reset the agents so the
number of each type of agent exactly matches the
percent of utility that group achieved in the
last round.
When we are happy with the division (no under or
over achieving group), we are done learning.

12
Imitator Dynamics

Replicator dynamics and random pairings of
solutions are not the only models for evolution.
Thus, they are not the only learning models that
have some claim to justification.
We will explore a different technique for
selecting the proportion of strategies that
evolve from one generation to another, but first
we will need to explore other models for
selecting which agents interact with each other.

13
Playing with Neighbors

Standard evolutionary game (random interactions)
? all Defect
Modifications- spatial games Interactions no
longer random, but with spatial neighbours
Sum scores. Player with highest score of 9 shaded
takes square (territory, food, mates) in next
generation
Some degree of cooperation evolves!

15
Imitator Dynamics

When agents can only play with their neighbors,
we can introduce a different way (different from
replicator dynamics) of selecting which
strategies propagate to the next generation. One
way to do this is for an agent to imitate its
most successful neighbor. The algorithm for
doing this goes something like this
Interact with all of my neighbors (wraping around
the board as needed), and let all my neighbors
interact with their neighbors.
After the interactions with my neighbors are
complete, identify the interaction strategy from
my neighbors that was most successful unless my
current strategy beat all of my neighbors (in
which case I'll stick to my strategy).
Change my strategy to the most successful
strategy of my neighbors -- imitate them -- on
the next round.
Imitator dynamics can produce vastly different
results than replicator dynamics.