Algorithms Analysis lecture 11 Greedy Algorithms
1
Algorithms Analysis lecture 11 Greedy Algorithms
2
Greedy Algorithms
  • Activity Selection Problem
  • Knapsack Problem
  • Huffman Code Problem

3
Greedy Algorithms
  • Similar to dynamic programming, but a simpler
    approach
  • Also used for optimization problems
  • Idea: make a locally optimal choice in the hope of
    getting a globally optimal solution

4
Greedy Algorithms
  • For optimization problems:
  • Dynamic programming determines the best choices,
    but can be overkill
  • A greedy algorithm makes the choice that looks
    best at the moment, in the hope of reaching an
    optimal solution
  • Greedy algorithms almost always, but not always,
    yield optimal solutions
  • We will take the activity-selection example
    and solve it using both approaches

5
Activity Selection
  • Schedule n activities that require exclusive use
    of a common resource
  • S = {a1, . . . , an}: set of activities
  • ai needs the resource during the period [si, fi)
  • si = start time and fi = finish time of activity
    ai
  • 0 ≤ si < fi < ∞
  • Activities ai and aj are compatible if the
    intervals [si, fi) and [sj, fj) do not overlap,
    i.e., fi ≤ sj or fj ≤ si

(Figure: the two compatible orderings, activity i ending before j
starts when fi ≤ sj, or j ending before i starts when fj ≤ si)
6
Activity Selection Problem
  • Select the largest possible set of non-
    overlapping (mutually compatible) activities
  • E.g. (activities sorted in increasing order of
    finish times):
  • A subset of mutually compatible activities:
    {a3, a9, a11}
  • Maximal sets of mutually compatible activities:
  • {a1, a4, a8, a11} and {a2, a4, a9, a11}

7
Activity Selection Using Dynamic Programming, Step 1
8
Optimal Substructure, step 1
  • Define the space of subproblems:
  • Sij = { ak ∈ S : fi ≤ sk < fk ≤ sj }
  • activities that start after ai finishes and
    finish before aj starts

9
Representing the Problem, step 1
  • Range for Sij is 0 ≤ i, j ≤ n+1 (with fictitious
    activities a0 and an+1 bracketing the real ones)
  • In a set Sij we assume that activities are sorted
    in increasing order of finish times:
  • f0 ≤ f1 ≤ f2 ≤ … ≤ fn < fn+1
  • What happens if i ≥ j ?
  • For an activity ak ∈ Sij: fi ≤ sk < fk ≤ sj < fj
  • contradiction with fi ≥ fj !
  • ⇒ Sij = ∅ (the set Sij must be empty!)
  • We only need to consider sets Sij with
    0 ≤ i < j ≤ n+1

10
Optimal Substructure, step 1
  • Subproblem:
  • Select a maximum-size subset of mutually
    compatible activities from set Sij
  • Assume that a solution to the above subproblem
    includes activity ak (Sij is non-empty)
  • Solution to Sij = (Solution to Sik) ∪ {ak} ∪
    (Solution to Skj)
  • |Solution to Sij| = |Solution to Sik| + 1 +
    |Solution to Skj|

11
Optimal Substructure (cont.)
  • Suppose Aij = an optimal solution to Sij that
    contains ak
  • Claim: the sets Aik and Akj must be optimal
    solutions to Sik and Skj
  • Assume ∃ A'ik that includes more activities than
    Aik
  • Then Size[A'ij] = Size[A'ik] + 1 + Size[Akj] >
    Size[Aij]
  • ⇒ Contradiction: we assumed that Aij is a
    maximum-size subset of activities taken from Sij

12
Recursive Solution, step 2
  • Any optimal solution (associated with a set Sij)
    contains within it optimal solutions to
    subproblems Sik and Skj
  • c[i, j] = size of a maximum-size subset of mutually
    compatible activities in Sij
  • If Sij = ∅ ⇒ c[i, j] = 0 (in particular, whenever
    i ≥ j)

13
Recursive Solution, step 2
  • If Sij ≠ ∅ and we consider that ak is used in
    an optimal solution (a maximum-size subset of
    mutually compatible activities of Sij), then
  • c[i, j] = c[i, k] + c[k, j] + 1
14
Recursive Solution, step 2
  • c[i, j] = 0                               if Sij = ∅
  • c[i, j] = max { c[i, k] + c[k, j] + 1 }   if Sij ≠ ∅
              i < k < j, ak ∈ Sij
  • There are j − i − 1 possible values for k
  • k = i+1, …, j−1
  • ak cannot be ai or aj (from the definition of
    Sij = { ak ∈ S : fi ≤ sk < fk ≤ sj })
  • We check all the values and take the best one
  • We could now write a tabular, bottom-up dynamic
    programming algorithm
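As a concrete illustration, the recurrence above can be turned into a short memoized Python sketch (the function name and the explicit sentinel activities a0, with f0 = 0, and a(n+1), with s(n+1) = ∞, are our own conventions; activities must already be sorted by finish time):

```python
# Memoized form of the recurrence c[i, j]; a sketch only.
from functools import lru_cache

def max_compatible(starts, finishes):
    """starts[k-1], finishes[k-1] hold s_k, f_k for k = 1..n,
    sorted by nondecreasing finish time."""
    n = len(starts)
    # Sentinels: a_0 finishes at 0, a_{n+1} starts at infinity.
    s = [0.0] + list(starts) + [float("inf")]
    f = [0.0] + list(finishes) + [float("inf")]

    @lru_cache(maxsize=None)
    def c(i, j):
        # Size of a maximum set of mutually compatible activities
        # in S_ij = { a_k : f_i <= s_k < f_k <= s_j }.
        best = 0
        for k in range(i + 1, j):
            if f[i] <= s[k] and f[k] <= s[j]:   # a_k lies in S_ij
                best = max(best, c(i, k) + 1 + c(k, j))
        return best

    return c(0, n + 1)
```

This is the (slower) dynamic-programming route the slides describe before converting to the greedy solution.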
15
Converting to a greedy solution
16
Theorem
  • Let Sij ≠ ∅ and let am be the activity in Sij with
    the earliest finish time:
  • fm = min { fk : ak ∈ Sij }
  • Then:
  • am is used in some maximum-size subset of
    mutually compatible activities of Sij
  • (there exists some optimal solution that contains
    am)
  • Sim = ∅
  • (choosing am leaves Smj as the only non-empty
    subproblem)

17
Proof
  • Assume ∃ ak ∈ Sim
  • fi ≤ sk < fk ≤ sm < fm
  • ⇒ fk < fm, a contradiction!
  • (am would not have the earliest finish time)
  • ⇒ There is no ak ∈ Sim ⇒ Sim = ∅

(Figure: within Sij, any such ak would have to finish before am
starts, and hence before fm)
18
Proof
Greedy Choice Property
  • am is used in some maximum-size subset of
    mutually compatible activities of Sij
  • Aij = an optimal solution for activity selection
    from Sij
  • Order the activities in Aij in increasing order of
    finish time
  • Let ak be the first activity in Aij
  • If ak = am, done!
  • Otherwise, replace ak with am (resulting in a set
    A'ij)
  • since fm ≤ fk, the activities in A'ij remain
    compatible
  • A'ij has the same size as Aij ⇒ am is used in
    some maximum-size subset
19
Why is the Theorem Useful?
  • Dynamic programming: 2 subproblems (Sik, Skj) and
    j − i − 1 choices of k per subproblem
  • Greedy: 1 subproblem (Smj, since Sim = ∅) and
    1 choice (the activity with the earliest finish
    time in Sij)
  • Making the greedy choice (the activity with the
    earliest finish time in Sij)
  • Reduces the number of subproblems and choices
  • Lets us solve each subproblem in a top-down fashion

20
Greedy Approach
  • To select a maximum-size subset of mutually
    compatible activities from set Sij:
  • Choose am ∈ Sij with the earliest finish time
    (greedy choice)
  • Add am to the set of activities used in the
    optimal solution
  • Solve the same problem for the set Smj
  • From the theorem:
  • By choosing am we are guaranteed to have used an
    activity included in an optimal solution
  • ⇒ We do not need to solve the subproblem Smj
    before making the choice!
  • The problem has the GREEDY CHOICE property

21
Characterizing the Subproblems
  • The original problem: find a maximum subset of
    mutually compatible activities for S = S0,n+1
  • Activities are sorted by increasing finish time
  • a0, a1, a2, a3, …, an+1
  • We always choose the activity with the earliest
    finish time
  • The greedy choice maximizes the unscheduled time
    remaining
  • The finish times of the selected activities are
    strictly increasing

22
A Recursive Greedy Algorithm
  • Alg.: REC-ACT-SEL(s, f, i, j)
  •   m ← i + 1
  •   while m < j and sm < fi    ▷ Find first activity in Sij
  •     do m ← m + 1
  •   if m < j
  •     then return {am} ∪ REC-ACT-SEL(s, f, m, j)
  •     else return ∅
  • Activities are ordered in increasing order of
    finish time
  • Running time: Θ(n), since each activity is
    examined only once
  • Initial call: REC-ACT-SEL(s, f, 0, n+1)
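A runnable Python rendering of REC-ACT-SEL (a sketch: index 0 holds the fictitious activity a0 with f0 = 0, and the sample start/finish times are taken from the standard textbook table, assumed here to match the deck's example since both select {a1, a4, a8, a11}):

```python
def rec_act_sel(s, f, i, j):
    """REC-ACT-SEL: indices of a maximum set of mutually compatible
    activities in S_ij. Index 0 is the fictitious a_0 with f[0] = 0;
    real activities occupy indices 1..n, sorted by finish time."""
    m = i + 1
    while m < j and s[m] < f[i]:   # find the first activity in S_ij
        m += 1
    if m < j:
        return [m] + rec_act_sel(s, f, m, j)
    return []

# Initial call is REC-ACT-SEL(s, f, 0, n+1); sample data assumed:
s = [0, 1, 3, 0, 5, 3, 5, 6, 8, 8, 2, 12]    # s[0] is unused
f = [0, 4, 5, 6, 7, 9, 9, 10, 11, 12, 14, 16]
print(rec_act_sel(s, f, 0, len(s)))          # [1, 4, 8, 11]
```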
23
Example
(Figure: a table of activities k with start times sk and finish
times fk, drawn on a timeline from 0 to 14; the recursion examines
each activity once and selects m = 1, 4, 8, 11, i.e. the subset
{a1, a4, a8, a11}.)
24
An Iterative Greedy Algorithm
  • Alg.: GREEDY-ACTIVITY-SELECTOR(s, f)
  •   n ← length[s]
  •   A ← {a1}
  •   i ← 1
  •   for m ← 2 to n
  •     do if sm ≥ fi      ▷ activity am is compatible with ai
  •       then A ← A ∪ {am}
  •            i ← m       ▷ ai is the most recent addition to A
  •   return A
  • Assumes that activities are ordered in increasing
    order of finish time
  • Running time: Θ(n), since each activity is
    examined only once
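The same algorithm in 0-based Python (a sketch; function and variable names are ours, and the sample data is again the standard textbook table):

```python
def greedy_activity_selector(starts, finishes):
    """0-based rendering of GREEDY-ACTIVITY-SELECTOR; activities
    must be sorted by nondecreasing finish time. Returns the
    indices of the selected activities."""
    n = len(starts)
    if n == 0:
        return []
    A = [0]                 # the first activity is always chosen
    i = 0                   # index of the most recent addition to A
    for m in range(1, n):
        if starts[m] >= finishes[i]:   # a_m compatible with a_i
            A.append(m)
            i = m
    return A

s = [1, 3, 0, 5, 3, 5, 6, 8, 8, 2, 12]
f = [4, 5, 6, 7, 9, 9, 10, 11, 12, 14, 16]
print(greedy_activity_selector(s, f))   # [0, 3, 7, 10]
```

In the deck's 1-based numbering the result [0, 3, 7, 10] is exactly {a1, a4, a8, a11}.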
25
Activity Selection
(Figure: eight activities B(1), C(2), A(3), E(4), D(5), F(6),
G(7), H(8), numbered in order of finish time and drawn as
intervals on a timeline from 0 to 11.)
26
Activity Selection
(Figure: the same timeline; the algorithm first selects B, the
activity with the earliest finish time.)
27
Activity Selection
(Figure: the same timeline; after B, the algorithm skips the
conflicting activities and selects E, then H, giving the final
schedule {B, E, H}.)
28
Steps Toward Our Greedy Solution
  • Determine the optimal substructure of the problem
  • Develop a recursive solution
  • Prove that one of the optimal choices is the
    greedy choice
  • Show that all but one of the subproblems resulting
    from the greedy choice are empty
  • Develop a recursive algorithm that implements the
    greedy strategy
  • Convert the recursive algorithm to an iterative
    one

29
Another Example: The Knapsack Problem (two
versions)
  • The 0-1 knapsack problem
  • A thief robbing a store finds n items; the i-th
    item is worth vi dollars and weighs wi pounds
    (vi, wi integers)
  • The thief can only carry W pounds in his knapsack
  • Items must be taken entirely (1) or left behind
    (0)
  • Which items should the thief take to maximize the
    value of his load?
  • The fractional knapsack problem
  • Similar to above
  • The thief can take fractions of items

30
The 0-1 Knapsack Problem
  • Thief has a knapsack of capacity W
  • There are n items; the i-th item has value vi and
    weight wi
  • Goal:
  • find xi ∈ {0, 1}, i = 1, 2, …, n, such that
  • Σ wi·xi ≤ W and
  • Σ vi·xi is maximized

31
0-1 Knapsack - Greedy Strategy
  • E.g.: knapsack capacity 50 pounds
  • Item 1: weight 10, value $60 ($6/pound)
  • Item 2: weight 20, value $100 ($5/pound)
  • Item 3: weight 30, value $120 ($4/pound)
  • Greedy by value per pound takes item 1 first; the
    best load containing item 1 is worth $160, while
    the optimal load (items 2 and 3) is worth $220
  • None of the solutions involving the greedy choice
    (item 1) leads to an optimal solution
  • The greedy choice property does not hold

32
0-1 Knapsack - Dynamic Programming
  • P(i, w) = the maximum profit that can be
    obtained from items 1 to i, if the knapsack has
    capacity w
  • Case 1: thief takes item i
  • P(i, w) = vi + P(i − 1, w − wi)
  • Case 2: thief does not take item i
  • P(i, w) = P(i − 1, w)
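The two cases above translate directly into a bottom-up table; here is a minimal Python sketch (function name ours), exercised on the example from the greedy-strategy slide:

```python
def knapsack_01(values, weights, W):
    """P[i][w] = max profit from items 1..i with capacity w,
    per the recurrence above."""
    n = len(values)
    P = [[0] * (W + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        v, wt = values[i - 1], weights[i - 1]
        for w in range(W + 1):
            P[i][w] = P[i - 1][w]                # case 2: leave item i
            if wt <= w:                          # case 1: take item i
                P[i][w] = max(P[i][w], v + P[i - 1][w - wt])
    return P[n][W]

print(knapsack_01([60, 100, 120], [10, 20, 30], 50))   # 220
```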
33
0-1 Knapsack - Dynamic Programming
34
(Figure only; no transcript for this slide.)
35
Reconstructing the Optimal Solution
36
Optimal Substructure
  • Consider the most valuable load that weighs at
    most W pounds
  • If we remove item j from this load:
  • The remaining load must be the most valuable
    load weighing at most W − wj that can be taken
    from the remaining n − 1 items

37
Fractional Knapsack Problem
  • Knapsack capacity W
  • There are n items; the i-th item has value vi and
    weight wi
  • Goal:
  • find xi, 0 ≤ xi ≤ 1, i = 1, 2, …, n, such that
  • Σ wi·xi ≤ W and
  • Σ vi·xi is maximized

38
Example
39
Fractional Knapsack Problem
  • Greedy strategy:
  • Pick the item with the maximum value per pound
    vi/wi
  • If the supply of that item is exhausted and
    the thief can carry more, take as much as
    possible of the item with the next greatest
    value per pound
  • It pays to order the items by their value per
    pound

40
Fractional Knapsack Problem
  • Alg.: Fractional-Knapsack(W, v[1..n], w[1..n])
  •   while w > 0 and there are items remaining
  •     pick the item i with maximum vi/wi
  •     xi ← min(1, w/wi)
  •     remove item i from the list
  •     w ← w − xi·wi
  • w = the amount of space remaining in the knapsack
    (initially w = W)
  • Running time: Θ(n) if the items are already
    ordered, else Θ(n lg n)

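The greedy loop above is short enough to sketch directly in Python (function name ours; the sort makes the Θ(n lg n) case explicit):

```python
def fractional_knapsack(values, weights, W):
    """Greedy by value per pound; returns the maximum total value.
    Theta(n lg n) because of the sort (Theta(n) if pre-sorted)."""
    items = sorted(zip(values, weights),
                   key=lambda vw: vw[0] / vw[1], reverse=True)
    total, w = 0.0, W            # w = remaining capacity
    for v, wt in items:
        if w <= 0:
            break
        x = min(1.0, w / wt)     # fraction x_i of item i taken
        total += x * v
        w -= x * wt
    return total

print(fractional_knapsack([60, 100, 120], [10, 20, 30], 50))   # 240.0
```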
41
Fractional Knapsack - Example
  • E.g.: knapsack capacity 50 pounds
  • Item 1: weight 10, value $60 ($6/pound)
  • Item 2: weight 20, value $100 ($5/pound)
  • Item 3: weight 30, value $120 ($4/pound)
  • Take all of item 1 ($60), all of item 2 ($100),
    and 20 of the 30 pounds of item 3 ($80), for a
    total of $240
42
Huffman Code Problem
  • Huffman's algorithm achieves data compression by
    finding the best variable-length binary encoding
    scheme for the symbols that occur in the file to
    be compressed.

43
Huffman Code Problem
  • The more frequently a symbol occurs, the shorter
    the Huffman binary word representing it should
    be.
  • The Huffman code is a prefix-free code: no
    codeword is a prefix of another codeword.

44
Overview
  • Huffman codes compress data (savings of 20%
    to 90%)
  • Huffman's greedy algorithm uses a table of the
    frequencies of occurrence of each character to
    build up an optimal way of representing each
    character as a binary string

45
Example
  • Assume we are given a data file that contains
    only 6 symbols, namely
  • a, b, c, d, e, f
  • with the following frequency table
  • Find a variable-length prefix-free encoding
    scheme that compresses this data file as much as
    possible

46
Huffman Code Problem
The left tree represents a fixed-length encoding
scheme; the right tree represents a Huffman
encoding scheme
47
Example
48
Huffman Code Problem
49
Constructing A Huffman Code
(Figure: the HUFFMAN pseudocode annotated with costs; each of the
n − 1 iterations performs O(lg n) heap operations, so the total
computation time is O(n lg n).)
50
Cost of a Tree T
  • For each character c in the alphabet C:
  • let f(c) be the frequency of c in the file
  • let dT(c) be the depth of c in the tree
  • (dT(c) is also the length of c's codeword. Why?)
  • Let B(T) be the number of bits required to encode
    the file (called the cost of T):
  • B(T) = Σc∈C f(c)·dT(c)

51
Huffman Code Problem
  • In the pseudocode that follows:
  • we assume that C is a set of n characters and
    that each character c ∈ C is an object with a
    defined frequency f[c]
  • the algorithm builds the tree T corresponding
    to the optimal code
  • a min-priority queue Q is used to identify
    the two least-frequent objects to merge together
  • the result of the merger of two objects is a
    new object whose frequency is the sum of the
    frequencies of the two objects that were merged

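Since the transcript omits the HUFFMAN pseudocode itself, here is a minimal Python sketch of the same procedure, with heapq standing in for the min-priority queue Q (the tie-breaking counter and the tuple tree representation are our own choices):

```python
import heapq

def huffman(freq):
    """freq maps symbol -> frequency; returns symbol -> codeword.
    A tree is either a bare symbol (leaf) or a (left, right) pair."""
    # Heap entries are (frequency, tie-breaker, tree).
    q = [(f, i, sym) for i, (sym, f) in enumerate(sorted(freq.items()))]
    heapq.heapify(q)
    counter = len(q)
    while len(q) > 1:                    # exactly n - 1 merges
        f1, _, t1 = heapq.heappop(q)     # two least-frequent objects
        f2, _, t2 = heapq.heappop(q)
        heapq.heappush(q, (f1 + f2, counter, (t1, t2)))
        counter += 1
    codes = {}
    def walk(tree, prefix):
        if isinstance(tree, tuple):      # internal node: 0 left, 1 right
            walk(tree[0], prefix + "0")
            walk(tree[1], prefix + "1")
        else:                            # leaf: record the codeword
            codes[tree] = prefix or "0"
    walk(q[0][2], "")
    return codes
```

On the classic six-symbol table {a: 45, b: 13, c: 12, d: 16, e: 9, f: 5} this yields the codewords a→0, b→101, c→100, d→111, e→1101, f→1100, for a total cost B(T) = 224.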
52
Running time of Huffman's algorithm
  • The analysis of Huffman's algorithm assumes that
    Q is implemented as a binary min-heap.
  • For a set C of n characters, the initialization
    of Q in line 2 can be performed in O(n) time
    using the BUILD-MIN-HEAP procedure
  • The for loop in lines 3-8 is executed exactly
    n − 1 times, and since each heap operation
    requires O(lg n) time, the loop contributes
    O(n lg n) to the running time. Thus, the total
    running time of HUFFMAN on a set of n characters
    is O(n lg n).

53
Prefix Code
  • Prefix(-free) code: no codeword is also a prefix
    of some other codeword (unambiguous)
  • An optimal data compression achievable by a
    character code can always be achieved with a
    prefix code
  • Prefix codes simplify encoding (compression) and
    decoding
  • Encoding: abc → 0 · 101 · 100 = 0101100
  • Decoding: 001011101 = 0 · 0 · 101 · 1101 → aabe
  • Use a binary tree to represent prefix codes for
    easy decoding
  • An optimal code is always represented by a full
    binary tree, in which every non-leaf node has two
    children
  • |C| leaves and |C| − 1 internal nodes; cost
    B(T) = Σc∈C f(c)·dT(c)

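The encoding and decoding bullets above can be checked with a small sketch. The codeword table is the standard textbook one consistent with the slide's examples (a, b, c, e appear there; d = 111 and f = 1100 are assumed to complete it):

```python
# Codeword table: an assumption completing the slide's examples.
CODES = {'a': '0', 'b': '101', 'c': '100',
         'd': '111', 'e': '1101', 'f': '1100'}

def encode(text):
    """Concatenate the codeword of each character."""
    return ''.join(CODES[ch] for ch in text)

def decode(bits):
    """Scan left to right; because the code is prefix-free, the
    first codeword matching the accumulated bits is correct."""
    inv = {code: ch for ch, code in CODES.items()}
    out, cur = [], ''
    for b in bits:
        cur += b
        if cur in inv:           # a complete codeword has been read
            out.append(inv[cur])
            cur = ''
    return ''.join(out)

print(encode('abc'))         # 0101100
print(decode('001011101'))   # aabe
```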
54
Huffman Code
  • Reduces the size of data by 20%-90% in general
  • If no characters occur more frequently than
    others, there is no advantage over ASCII
  • Encoding:
  • Given the characters and their frequencies,
    run the algorithm to generate a code, then write
    the characters using that code
  • Decoding:
  • Given the Huffman tree, figure out what each
    character is (possible because of the prefix
    property)

55
Application of Huffman code
  • Both the .mp3 and .jpg file formats use Huffman
    coding at one stage of their compression