Mixing - PowerPoint PPT Presentation

1 / 60
About This Presentation
Title:

Mixing

Description:

Fundamentals for designing a Markov chain. Bounding running times ... burn in (Dyer-Frieze'01, Molloy'02) - non-Markovian couplings (Hayes-Vigoda'03) Outline ... – PowerPoint PPT presentation

Number of Views:35
Avg rating:3.0/5.0
Slides: 61
Provided by: compu62
Category:
Tags: frieze | mixing

less

Transcript and Presenter's Notes

Title: Mixing


1
Mixing
A tutorial on Markov chains
  • Dana Randall
  • Georgia Tech

( Slides at www.math.gatech.edu/randall )
2
Outline
  • Fundamentals for designing a Markov chain
  • Bounding running times (convergence rates)
  • Connections to statistical physics

3
Markov chains for sampling
Given A large set (matchings, colorings,

independent sets,)
Main Q What do typical elements
look like?
4
Markov chains
Andrei Andreyevich Markov 1856-1922
5
Sampling using Markov chains
State space ?
( ? cn )
6
Sampling using Markov chains
State space ?
( ? cn )
  • Step 1. Connect the state space.

7
Basics of Markov chains
x
y
H
8
The stationary distribution p
9
Sampling from non-uniform distributions
Q What if we want to sample from some other
distribution?
  • Step 2. Carefully define the
  • transition probabilities.

10
The Metropolis Algorithm
(MRRTT 53)
Propose a move from x to y as before, but accept
with probability
min (1, p(y)/p(x))
(with remaining probability stay at x).
11
Basics continued
  • Step 1. Connect the state space.
  • Step 2. Carefully define the
  • transition probabilities.

Starting at any state x0, take a random walk for
some number of steps . . . and output the final
state (from p?).
12
The mixing rate
Defn The total variation distance is
Pt,p max __ ? Pt(x,y) - p(x).
1
2
xÎ ?
yÎ ?
A Markov chain is rapidly mixing if t(e)
is poly (n, log(e-1)).
13
Spectral gap
Let 1 l1 gt l2 l? be
the eigenvalues of P.
Defn Gap(P) 1-l2 is the
spectral gap.
14
Outline
  • Fundamentals for designing a Markov chain
  • Bounding running times (convergence rates)
  • Connections to statistical physics

15
Outline for rest of talk
  • Techniques
  • Coupling
  • Flows and paths
  • Indirect methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

16
Coupling
17
Coupling
Simulate 2 processes
18
Coupling
Defn A coupling is a MC on ? x ?
  • Each process Xt, Yt is a faithful copy of
    the original MC,
  • If Xt Yt, then Xt1 Yt1.

19
Ex1 Walk on the hypercube
  • MCCUBE
  • Start at v0(0,0,,0).
  • Repeat
  • - Pick i Î n, b Î 0,1.
  • - Set vi b.

Symmetric, ergodic p is uniform.

Mixing time? Use coupling
20
Outline
  • Techniques
  • Coupling
  • - path coupling
  • Flows and paths
  • Indirect methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

21
Ex 2 Colorings
Given A graph G (max deg d), k gt 1. Goal Find
a random k-coloring of G.
  • MCCOL (Single point replacement)
  • Starting at some k-coloring C0
  • Repeat
  • - With prob 1/2 do nothing.
  • - Pick v Î V, c Î k
  • - Recolor v with c, if possible.

The lazy chain
If k d 2, then the state
space is connected.
(Therefore p is uniform.)
22
Path Coupling
Bubley,Dyer,Greenhill97-8
Coupling Show for all x,y Î W, E D
(dist(x,y)) lt 0.
-
Path coupling Show for all u,v s.t.
dist(u,v)1, that E D (dist(u,v)) lt
0.
-
23
Path coupling for MCCOL
Thm MCCOL is rapidly mixing if k 3d.
(Jerrum 95)
Pf Use path coupling dist(x,y) 1.
  • o.w. ?dist 0.

24
Summary Coupling
Pros Can yield very easy proofs
Cons Demands a lot from the chain
  • Extensions
  • Careful coupling (k 2d) (Jerrum95)
  • Change the MC (Luby-R-Sinclair95)
  • Macromoves
  • - burn in (Dyer-Frieze01, Molloy02)
  • - non-Markovian couplings
  • (Hayes-Vigoda03)

25
Outline
  • Techniques
  • Coupling
  • Flows and paths
  • Indirect methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

26
Conductance and flows
(Jerrum-Sinclair88)
27
Min cut Max flow

(Sinclair92)
  • G Make ?2 canonical

paths gxy from xÎ?, to yÎ?, x ? y,
carrying p(x)p(y) units of flow.
?
28
Ex 3 Back to the hypercube
- Define a canonical path from s to t.
- Bound the number of paths through (u,v) Î E.
29
Outline
  • Techniques
  • Coupling
  • Flows and paths
  • Indirect methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

30
Ex 4 Sampling matchings
31
Ex 4 Sampling matchings
  • MCMATCH
  • Starting at M0, repeat
  • Pick e (u,v) Î E

- If e Î M, remove e - If u and v unmatched in
M, add e - If u matched (by e) and v
unmatched (or vice versa), add e and remove
e - Otherwise do nothing.
Thm Coupling wont work!
(Kumar-Ramesh99)
32
Mixing time of MCMATCH
s
t
Å
33
Outline
  • Techniques
  • Coupling
  • Flows and paths
  • Indirect methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

34
Ex 5 Independent Sets
Goal Given l, sample ind. set I
with prob p(I) lI/Z, Z ?J
lJ.
35
Slow mixing of MCIND (large l)
(Even)
(Odd)
36
Summary Flows
Pros Offers a combinatorial approach to
mixing especially useful for proving slow
mixing.
Cons Requires global knowledge of the chain
to spread out paths.
Extensions Balanced flows
(Morris-Sinclair99) MCMC -- Major
highlights - The permanent
(Jerrum-Sinclair-Vigoda02) -
Volume of a convex polytope
(Dyer-Frieze-Kannan89, )
37
Outline
  • Techniques
  • Coupling
  • Flows and paths
  • Indirect methods
  • - Comparison
  • - Decomposition
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

38
Comparison
(Diaconis,Saloff-Coste93)
39
Comparison
x
y
known P
_
_

(x,y) Î P gx,y (using P) G(z,w) is the
set of paths gx,y using (z,w)
w
unknown P
z
_
1
Thm Gap(P) Gap(P).
A
40
Comparison, aka . . .
Adjacency . . . The Matrix Reloaded
41
Disjoint decomposition
(Madras-R.96, Martin-R.00)
A2
A1
P
A5
A3
A4
A6
?
42
Ex 6 MCIND on small ind. sets
For G(V,E)
Let ? ind. sets of G ?k ind.
sets of size k.
43
Ind. sets w/bounded size (cont.)
Thm MCIND is rapidly mixing on
K
Ç
?k , where KV/2(?1).
k 1
MCSWAP
?0 ?1 ?2 . . . ?K-1 ?K
Projection
Restrictions
a0 a1 a2 . . .aK-1 aK
?k
44
The Restrictions of MCswap
Projection
Restrictions
  • .

45
Summary Indirect methods
Pros Offer a top down approach allow hybrid
methods to be used..
Cons Can increase the complexity.
Extensions Comparison thm for log-Sobolev
(Diaconis-Saloff-Coste96)
Comparison for Glauber dynamics
(R.-Tetali 98)
Decomposition for log-Sobolev
(Jerrum-Son-Tetali-Vigoda 02)
46
Outline
  • Techniques
  • Coupling
  • Flows and paths
  • Hybrid methods
  • Problems
  • Walk on the hypercube
  • Colorings
  • Matchings
  • Independent sets
  • Connections with statistical physics
  • - problems
  • - algorithms
  • - physical insights

47
Why Statistical Physics?
  • They have a need for sampling
  • Use many interesting heuristics
  • Great intuition
  • Experts on large data sets
  • Microscopic Macroscopic
  • details behavior
  • (i.e., phase transitions)

48

49
Models from statistical physics
Hardcore model
Potts model
(3-colorings) (Independent
sets) (Matchings)
(Min cut)
-
-
-
-

Dimer model
50
Models (cont.)
51
Models (The physics perspective)
Given A physical system ? s Define A
Gibbs measure as follows
H(s) (the Hamiltonian),
b 1/kT (inverse temperature),
p(s) e-bH(s)/ Z,
52
Physics perspective (cont.)
Q What about on the infinite lattice?
Use conditional probabilities
53
Phase transitions Ind. sets
Low temperature long range effects
High temperature ? effects die out
TC indicates a phase transition.
54
Slow mixing of MCIND revisited
R/B
8
55
Group by of fault lines
Fault lines are vacant paths of width 2 from top
to bottom (or left to
right).
56
Peierls Argument
57
Peierls Argument cont.
2n/2 3l
S1
SB
( l - n/2 more points)
(and similarly for S2, S3, ) x
58
Conclusions
Techniques
59
Conclusions
Open problems
  • Sampling 4,5,6-colorings on the grid.
  • Sampling perfect matchings on
  • non-bipartite graphs.
  • Sampling acyclic orientations in a graph.
  • Sampling configurations of the Potts
  • model (a generalization
  • of Ising, but with more colors).
  • How can we further exploit phase
  • transitions? Other physical intuition?

...
60
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com