Chapter 10. More Dynamic Programming All Pairs Shortest Paths. CS 473: Fundamental Algorithms, Spring 2011 February 22, 2011

(1)

Chapter 10 More Dynamic Programming

CS 473: Fundamental Algorithms, Spring 2011 February 22, 2011

10.1 All Pairs Shortest Paths

10.1.0.1 Shortest Path Problems Shortest Path Problems

Input A (undirected or directed) graph G = (V, E) with edge lengths (or costs). For edge e = (u, v), `(e) = `(u, v) is its length.

• Given nodes s, t find shortest path from s to t.

• Given node s find shortest path from s to all other nodes.

• Find shortest paths for all pairs of nodes.

10.1.0.2 Single-Source Shortest Paths Single-Source Shortest Path Problems

Input A (undirected or directed) graph G = (V, E) with edge lengths. For edge e = (u, v),

`(e) = `(u, v) is its length.

• Given nodes s, t find shortest path from s to t.

• Given node s find shortest path from s to all other nodes.

Dijkstra’s algorithm for non-negative edge lengths. Running time: O((m+n) log n) with heaps and O(m + n log n) with advanced priority queues.

Bellman-Ford algorithm for arbitrary edge lengths. Running time: O(nm).

(2)

10.1.0.3 All-Pairs Shortest Paths All-Pairs Shortest Path Problem

Input A (undirected or directed) graph G = (V, E) with edge lengths. For edge e = (u, v),

`(e) = `(u, v) is its length.

• Find shortest paths for all pairs of nodes.

Apply single-source algorithms n times, once for each vertex.

• Non-negative lengths. O(nm log n) with heaps and O(nm + n²logn) using advanced priority queues.

• Arbitrary edge lengths: O(n²m). Can we do better?

10.1.0.4 Shortest Paths and Recursion

• Can we compute the shortest path distance from s to t recursively?

• What are the smaller sub-problems?

Lemma 10.1.1 Let G be a directed graph with arbitrary edge lengths. If s = v0 → v1 → v2 → . . . → v^k is a shortest path from s to vk then for 1≤ i < k:

• s = v⁰ → v¹ → v² → . . . → vⁱ is a shortest path from s to vi

Sub-problem idea: paths of fewer hops/edges

10.1.0.5 Hop-based Recur’: Single-Source Shortest Paths Single-source problem: fix source s.

OP T (v, k): shortest path distance from s to v using at most k edges.

Note: dist(s, v) = OP T (v, n− 1) Recursion for OP T (v, k):

OP T (v, k) = min

u∈V(OP T (u, k− 1) + c(u, v)).

Base case: OP T (v, 1) = c(s, v) if (s, v)∈ E otherwise ∞ Leads to Bellman-Ford algorithm — see text book.

OP T (v, k) values are also of independent interest: shortest paths with at most k hops

(3)

10.1.0.6 All-Pairs: recursion on index of intermediate nodes

• Number vertices arbitrarily as v1, v2, . . . , vn

• dist(i, j, k): shortest path distance between vⁱ and vj among all paths in which the largest index of an intermediate node is at most k

i j

1

2 4 3

1

100 1

1 510 2

dist(i, j, 0) = 100 dist(i, j, 1) = 9 dist(i, j, 2) = 8 dist(i, j, 3) = 5

10.1.0.7 All-Pairs: recursion on index of intermediate nodes

i j

dist(i, k, k − 1) k dist(k, j, k − 1)

dist(i, j, k − 1)

dist(i, j, k) = min(dist(i, j, k− 1), dist(i, k, k − 1) + dist(k, j, k − 1))

Base case: dist(i, j, 0) = c(i, j) if (i, j)∈ E, otherwise ∞

Correctness: If i→ j shortest path goes through k then k occurs only once on the path

— otherwise there is a negative length cycle.

(4)

10.1.1 Floyd-Warshall Algorithm

10.1.1.1 for All-Pairs Shortest Paths

Check if G has a negative cycle using Bellman-Ford in O(mn) time If there is a negative cycle return

for i = 1 to n do for j = 1 to n do

dist(i, j, 0) = c(i, j) (* c(i, j) =∞ if (i, j) not edge, 0 if i = j *)

for k = 1 to n do for i = 1 to n do

for j = 1 to n do

Correctness: Recursion works under the assumption that all shortest paths are defined (no negative length cycle).

Running Time: Θ(n³), Space: Θ(n³).

10.1.2 Floyd-Warshall Algorithm

10.1.2.1 for All-Pairs Shortest Paths

Do we need a separate algorithm to check if there is negative cycle?

for i = 1 to n do for j = 1 to n do

dist(i, j, 0) = c(i, j) (* c(i, j) =∞ if (i, j) not edge, 0 if i = j *)

for j = 1 to n do

for i = 1 to n do

if (dist(i, i, n− 1) < 0) then

Output that there is a negative length cycle in G

Correctness: exercise

10.1.2.2 Floyd-Warshall Algorithm: Finding the Paths Question: Can we find the paths in addition to the distances?

(5)

• Create a n × n array Next that stores the next vertex on shortest path for each pair of vertices

• With array Next, for any pair of given vertices i, j can compute a shortest path in O(n) time.

10.1.3 Floyd-Warshall Algorithm

10.1.3.1 Finding the Paths for i = 1 to n do

for j = 1 to n do

dist(i, j, 0) = c(i, j) (* c(i, j) =∞ if (i, j) not edge, 0 if i = j *) N ext(i, j) =−1

for j = 1 to n do

if (dist(i, j, k− 1) > dist(i, k, k − 1) + dist(k, j, k − 1)) then dist(i, j, k) = dist(i, k, k− 1) + dist(k, j, k − 1)

N ext(i, j) = k

for i = 1 to n do

if (dist(i, i, n− 1) < 0) then

Output that there is a negative length cycle in G

Exercise: Given N ext array and any two vertices i, j describe an O(n) algorithm to find a i-j shortest path.

10.1.3.2 Summary of results on shortest paths Single vertex

No negative edges Dijkstra O(n log n + m) Edges cost might be negative

But no negative cycles Bellman Ford O(nm) All Pairs Shortest Paths

No negative edges n * Dijkstra O(n²logn + nm) No negative cycles n * Bellman Ford O(n²m) = O(n⁴) No negative cycles Floyd-Warshall O(n³)

10.2 Knapsack

10.2.0.3 Knapsack Problem

Input Given a Knapsack of capacity W lbs. and n objects with ith object having weight wi and value vi; assume W, wi, vi are all positive integers

(6)

Goal Fill the Knapsack without exceeding weight limit while maximizing value.

Basic problem that arises in many applications as a sub-problem.

10.2.0.4 Knapsack Example

Example 10.2.1

Item 1 2 3 4 5

Value 1 6 18 22 28

Weight 1 2 5 6 7

If W = 11, the best is {3, 4} giving value 40.

Special Case

When vi =wi, the Knapsack problem is called the Subset Sum Problem.

10.2.0.5 Greedy Approach

• Pick objects with greatest value

– Let W = 2, w1 = w2 = 1, w3 = 2, v1 = v2 = 2 and v3 = 3; greedy strategy will pick {3}, but the optimal is {1, 2}

• Pick objects with smallest weight

– Let W = 2, w1 = 1, w2 = 2, v1 = 1 andv2 = 3; greedy strategy will pick {1}, but the optimal is {2}

• Pick objects with largest vi/wi ratio

– Let W = 4, w1 = w2 = 2, w3 = 3, v1 = v2 = 3 and v3 = 5; greedy strategy will pick {3}, but the optimal is {1, 2}

– Can show that a slight modification always gives half the optimum profit: pick the better of the output of this algorithm and the largest value item. Also, the algorithms gives better approximations when all item weights are small when compared toW .

10.2.0.6 Towards a Recursive Solution

First guess: Opt(i) is the optimum solution value for items 1, . . . , i.

Observation 10.2.2 Consider an optimal solution O for 1, . . . , i Case item i6∈ O O is an optimal solution to items 1 to i − 1

Case item i∈ O Then O − {i} is an optimum solution for items 1 to n − 1 in knapsack of capacity W − wⁱ.

Subproblems depend also on remaining capacity. Cannot write subproblem only in terms of Opt(1), . . . , Opt(i− 1).

(7)

Opt(i, w): optimum profit for items 1 to i in knapsack of size w Goal: compute Opt(n, W )

10.2.0.7 Dynamic Programming Solution

Definition 10.2.3 Let Opt(i, w) be the optimal way of picking items from 1 to i, with total weight not exceeding w

Opt(i, w) =











0 if i = 0

Opt(i− 1, w) if wi > w max

(Opt(i− 1, w)

Opt(i− 1, w − wⁱ) +vi

otherwise

10.2.0.8 An Iterative Algorithm for w = 0 to W do

M [0, w] = 0 for i = 1 to n do

for w = 1 to W do if (wi> w) then

M [i, w] = M [i− 1, w]

else

M [i, w] = max(M [i− 1, w], M[i − 1, w − wⁱ] + vi)

Running Time

• Time taken is O(nW )

• Input has size O(n + log W +Pn

i=1(logvi+ logwi)); so running time not polynomial but “pseudo-polynomial”!

10.2.0.9 Knapsack Algorithm and Polynomial time Input size for Knapsack: O(n) + log W +Pn

i=1(logwi+ logvi) Running time of dynamic programming algorithm: O(nW ) Not a polynomial time algorithm.

Example: W = 2ⁿ and wi, vi ∈ [1..2ⁿ].

Input size is O(n²), running time isO(n2ⁿ) arithmetic/comparisons.

Algorithm is called apseudo-polynomial time algorithm because running time is polynomial if numbers in input are of size polynomial in the combinatorial size of problem.

Knapsack is NP-hard if numbers are not polynomial in n.

(8)

10.3 Traveling Salesman Problem

10.3.0.10 Traveling Salesman Problem

Input A graph G = (V, E) with non-negative edge costs/lengths. c(e) for edge e Goal Find a tour of minimum cost that visits each node.

No polynomial time algorithm known. Problem is NP-Hard.

10.3.0.11 Example: optimal tour for cities of a country (which one?)

10.3.0.12 An Exponential Time Algorithm How many different tours are there? n!

Stirling’s formula: n!'√

n(n/e)ⁿ which is Θ(2^{cn log n}) for some constant c > 1 Can we do better? Can we get a 2^O(n) time algorithm?

10.3.0.13 Towards a Recursive Solution

• Order vertices as v1, v2, . . . , vn

• OP T (S): optimum TSP tour for the vertices S ⊆ V in the graph restricted to S.

Want OP T (V ).

Can we compute OP T (S) recursively?

• Say v ∈ S. What are the two neighbors of v in optimum tour in S?

• If u, w are neighbors of v in an optimum tour of S then removing v gives an optimum path from u to w visiting all nodes in S− {v}.

Path from u to w is not a recursive subproblem! Need to find a more general problem to allow recursion.

(9)

10.3.0.14 A More General Problem: TSP Path

Input A graph G = (V, E) with non-negative edge costs/lengths(c(e) for edge e) and two nodess, t

Goal Find a path from s to t of minimum cost that visits each node exactly once.

Can solve TSP using above. Do you see how?

Recursion for optimum TSP Path problem:

• OP T (u, v, S): optimum TSP Path from u to v in the graph restricted to S (here u, v ∈ S).

10.3.1 A More General Problem: TSP Path

10.3.1.1 Continued...

What is the next node in the optimum path from u to v? Suppose it is w. Then what is OP T (u, v, S)?

OP T (u, v, S) = c(u, w) + OP T (w, v, S− {u}) We do not know w! So try all possibilities for w.

10.3.1.2 A Recursive Solution OP T (u, v, S) = minw∈S,w6=u,v

c(u, w) + OP T (w, v, S− {u})

What are the subproblems for the original problem OP T (s, t, V )?

OP T (u, v, S) for u, v∈ S, S ⊆ V . How many subproblems?

• number of distinct subsets S of V is at most 2ⁿ

• number of pairs of nodes in a set S is at most n²

• hence number of subproblems is O(n²2ⁿ)

Exercise: Show that one can compute TSP using above dynamic program in O(n³2ⁿ) time and O(n²2ⁿ) space.

Disadvantage of dynamic programming solution: memory!

10.3.1.3 Dynamic Programming: Postscript

Dynamic Programming = Smart Recursion + Memoization

• How to come up with the recursion?

• How to recognize that dynamic programming may apply?