Probabilistic Analysis and Randomized Algorithms

[This chapter introduces probabilistic analysis and randomized algorithms. It assumes that the student is familiar with the basic probability material in Appendix C. The primary goals of these notes are to

• _{explain the difference between probabilistic analysis and randomized algo-}

rithms,

• _{present the technique of indicator random variables, and}

• _{give another example of the analysis of a randomized algorithm (permuting an}

array in place).

These notes omit the technique of permuting an array by sorting, and they omit the starred Section 5.4.]

The hiring problem

Scenario:

• _{You are using an employment agency to hire a new of}_Þ_{ce assistant.} • _{The agency sends you one candidate each day.}

• _{You interview the candidate and must immediately decide whether or not to}

hire that person. But if you hire, you must also Þre your current ofÞce assistant—even if it’s someone you have recently hired.

• _{Cost to interview is}_c_i _{per candidate (interview fee paid to agency).}

• _{Cost to hire is}_c_h _{per candidate (includes cost to}_Þ_{re current of}_Þ_{ce assistant +}

hiring fee paid to agency).

• _{Assume that}_c_h >_c_i_.

• _{You are committed to having hired, at all times, the best candidate seen so}

far. Meaning that whenever you interview a candidate who is better than your current ofÞce assistant, you must Þre the current ofÞce assistant and hire the candidate. Since you must have someone hired at all times, you will always hire theÞrst candidate that you interview.

5-2 Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms

Pseudocode to model this scenario: Assumes that the candidates are numbered 1 tonand that after interviewing each candidate, we can determine if it’s better than the current ofÞce assistant. Uses a dummy candidate 0 that is worse than all others, so that theÞrst candidate is always hired.

HIRE-ASSISTANT(n)

best ←0 candidate 0 is a least-qualiÞed dummy candidate

fori←1ton

dointerview candidatei

ifcandidatei is better than candidatebest

thenbest ←i

hire candidatei

Cost: Ifncandidates, and we hiremof them, the cost isO(nci+mch).

• _{Have to pay}_nc_i _{to interview, no matter how many we hire.} • _{So we focus on analyzing the hiring cost}_mc_h.

• _mc_h _{varies with each run—it depends on the order in which we interview the}

candidates.

• _{This is a model of a common paradigm: we need to} _Þ_{nd the maximum or}

minimum in a sequence by examining each element and maintaining a current “winner.” The variable m denotes how many times we change our notion of which element is currently winning.

Worst-case analysis

In the worst case, we hire allncandidates.

This happens if each one is better than all who came before. In other words, if the candidates appear in increasing order of quality.

If we hire alln, then the cost is O(nci+nch)= O(nch)(sincech>ci).

Probabilistic analysis

In general, we have no control over the order in which candidates appear. We could assume that they come in a random order:

• _{Assign a rank to each candidate:} _rank(_i)_{is a unique integer in the range 1 to}_n_. • _{The ordered list}_rank(₁),_rank(₂), . . . ,_rank(_n)_{is a permutation of the candi-}

date numbers1,2, . . . ,n.

• _{The list of ranks is equally likely to be any one of the}_n_{! permutations.}

• _{Equivalently, the ranks form a}_{uniform random permutation}_{: each of the pos-}

siblen! permutations appears with equal probability.

Essential idea of probabilistic analysis: We must use knowledge of, or make as- sumptions about, the distribution of inputs.

• _{The expectation is over this distribution.}

• _{This technique requires that we can make a reasonable characterization of the}

Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms 5-3

Randomized algorithms

We might not know the distribution of inputs, or we might not be able to model it computationally.

Instead, we use randomization within the algorithm in order to impose a distribution on the inputs.

For the hiring problem: Change the scenario:

• _{The employment agency sends us a list of all}_n_{candidates in advance.}

• _{On each day, we randomly choose a candidate from the list to interview (but}

considering only those we have not yet interviewed).

• _{Instead of relying on the candidates being presented to us in a random order,}

we take control of the process and enforce a random order.

What makes an algorithm randomized: An algorithm israndomizedif its behav-

ior is determined in part by values produced by arandom-number generator.

• _RANDOM(_a,_b)_{returns an integer}_r_{, where}_a ≤_r ≤_b_{and each of the}_b−_a+₁

possible values ofr is equally likely.

• _{In practice, RANDOM}_{is implemented by a}_{pseudorandom-number generator}_,

which is a deterministic method returning numbers that “look” random and pass statistical tests.

Indicator random variables

A simple yet powerful technique for computing the expected value of a random variable.

Helpful in situations in which there may be dependence.

Given a sample space and an event A, we deÞne theindicator random variable

I{A} = 1 if Aoccurs,

0 if Adoes not occur.

Lemma

For an event A, letXA =I{A}. Then E [XA]=Pr{A}.

Proof Letting Abe the complement of A, we have E [XA] = E [I{A}]

= 1·Pr{A} +0·Pr{A} (deÞnition of expected value)

5-4 Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms

Simple example: Determine the expected number of heads when weßip a fair coin one time.

• _{Sample space is}{_H,_T}_. • _Pr{_H} =_Pr{_T} =₁/_2.

• _De_Þ_{ne indicator random variable}_X_H =_I{_H}_. _X_H _{counts the number of heads}

in oneßip.

• _{Since Pr}{_H} =₁/_{2, lemma says that E [}_X_H_]=₁/_2.

Slightly more complicated example: Determine the expected number of heads in

ncoinßips.

• _Let _X_{be a random variable for the number of heads in}_n_ß_ips. • _{Could compute E [}_X_]=n

k=0k·Pr{X =k}. In fact, this is what the book does

in equation (C.36).

• _{Instead, we’ll use indicator random variables.}

• _For_i =₁,₂, . . . ,_n_{, de}_Þ_ne _X_i =_I{_the_i_th_ß_{ip results in event} _H}_. • _Then_X =n

i=1Xi.

• _{Lemma says that E [}_X_i_]=_Pr{_H} =₁/_{2 for}_i =₁,₂, . . . ,_n_. • _{Expected number of heads is E[}_X_]=_{E [}n

i=1Xi].

• _Problem: _{We want E [}n

i=1Xi]. We have only the individual expectations E [X1],E [X2], . . . ,E [Xn].

• _Solution: _{Linearity of expectation says that the expectation of the sum equals}

the sum of the expectations. Thus, E [X] = E _n i=1 Xi = n i=1 E [Xi] = n i=1 1/2 = n/2.

• _{Linearity of expectation applies even when there is dependence among the ran-}

dom variables. [Not an issue in this example, but it can be a great help. The hat-check problem of Exercise 5.2-4 is a problem with lots of dependence. See the solution on page 5-10 of this manual.]

Analysis of the hiring problem

Assume that the candidates arrive in a random order.

Let X be a random variable that equals the number of times we hire a new ofÞce assistant.

DeÞne indicator random variables X1,X2, . . . ,Xn, where

Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms 5-5

Useful properties:

• _X =_X₁+_X₂+ · · · +_X_n.

• _Lemma⇒_{E [}_X_i_]=_Pr{_candidate_i_{is hired}}_.

We need to compute Pr{candidatei is hired}.

• _Candidate _i_{is hired if and only if candidate} _i_{is better than each of candidates}

1,2, . . . ,i−1.

• _{Assumption that the candidates arrive in random order}⇒_{candidates 1},₂, . . . ,_i

arrive in random order⇒any one of theseÞrsticandidates is equally likely to be the best one so far.

• _{Thus, Pr}{_candidate_i_{is the best so far}} =₁/_i_. • _{Which implies E [}_X_i_]=₁/_i_. Now compute E [X]: E [X] = E _n i=1 Xi = n i=1 E [Xi] = n i=1 1/i

= lnn+O(1) (equation (A.7): the sum is a harmonic series) .

Thus, the expected hiring cost is O(chlnn), which is much better than the worst- case cost ofO(nch).

Randomized algorithms

Instead of assuming a distribution of the inputs, we impose a distribution.

The hiring problem

For the hiring problem, the algorithm is deterministic:

• _{For any given input, the number of times we hire a new of}_Þ_{ce assistant will}

always be the same.

• _{The number of times we hire a new of}_Þ_{ce assistant depends only on the input.} • _{In fact, it depends only on the ordering of the candidates’ ranks that it is given.} • _{Some rank orderings will always produce a high hiring cost. Example:}₁,₂,₃,

4,5,6, where each candidate is hired.

• _{Some will always produce a low hiring cost. Example: any ordering in which}

the best candidate is theÞrst one interviewed. Then only the best candidate is hired.

5-6 Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms

Instead of always interviewing the candidates in the order presented, what if we

Þrst randomly permuted this order?

• _{The randomization is now in the algorithm, not in the input distribution.} • _{Given a particular input, we can no longer say what its hiring cost will be. Each}

time we run the algorithm, we can get a different hiring cost.

• _{In other words, each time we run the algorithm, the execution depends on the}

random choices made.

• _{No particular input always elicits worst-case behavior.}

• _{Bad behavior occurs only if we get “unlucky” numbers from the random-}

number generator.

Pseudocode for randomized hiring problem:

RANDOMIZED-HIRE-ASSISTANT(n)

randomly permute the list of candidates HIRE-ASSISTANT(n)

Lemma

The expected hiring cost of RANDOMIZED-HIRE-ASSISTANTisO(chlnn).

Proof After permuting the input array, we have a situation identical to the probabilistic analysis of deterministic HIRE-ASSISTANT.

Randomly permuting an array

[The book considers two methods of randomly permuting ann-element array. The

Þrst method assigns a random priority in the range 1 ton3_{to each position and then}

reorders the array elements into increasing priority order. We omit this method from these notes. The second method is better: it works in place (unlike the priority-based method), it runs in linear time without requiring sorting, and it needs fewer random bits (n random numbers in the range 1 tonrather than the range 1 ton3_{). We present and analyze the second method in these notes.]}

Goal: Produce a uniform random permutation (each of the n! permutations is

equally likely to be produced).

Non-goal: Show that for each element A[i], the probability that A[i] moves to position j is 1/n. (See Exercise 5.3-4, whose solution is on page 5-13 of this manual.)

The following procedure permutes the array A[1. .n] in place (i.e., no auxiliary array is required).

RANDOMIZE-IN-PLACE(A,n)

fori←1ton

Lecture Notes for Chapter 5: Probabilistic Analysis and Randomized Algorithms 5-7

Idea:

• _{In iteration}_i_{, choose} _A_[_i_{] randomly from}_A_[_i. ._n_]. • _{Will never alter} _A_[_i_{] after iteration}_i_.

Time: O(1)per iteration⇒O(n)total.

Correctness: Given a set ofnelements, ak-permutationis a sequence containing

kof thenelements. There aren!/(n−k)! possiblek-permutations.

Lemma

RANDOMIZE-IN-PLACE computes a uniform random permutation.

Proof Use a loop invariant:

Loop invariant: Just prior to the ith iteration of the for loop, for each possible (i−1)-permutation, subarray A[1. .i −1] contains this(i−1)- permutation with probability(n−i+1)!/n!.

Initialization: Just beforeÞrst iteration,i =1. Loop invariant says that for each possible 0-permutation, subarray A[1. .0] contains this 0-permutation with probability n!/n! = 1. A[1. .0] is an empty subarray, and a 0-permutation has no elements. So, A[1. .0] contains any 0-permutation with probability 1.

Maintenance: Assume that just prior to theith iteration, each possible (i −1)- permutation appears in A[1. .i−1] with probability(n−i+1)!/n!. Will show that after theith iteration, each possiblei-permutation appears in A[1. .i] with probability(n−i)!/n!. Incrementingifor the next iteration then maintains the invariant.

Consider a particular i-permutation π = x1, x2, . . . , xi. It consists of an (i−1)-permutationπ= x1,x2, . . . ,xi−1, followed byxi.

LetE1be the event that the algorithm actually putsπinto A[1. .i−1]. By the

loop invariant, Pr{E1} =(n−i+1)!/n!.

LetE2be the event that theith iteration putsxi into A[i].

We get thei-permutationπ in A[1. .i] if and only if bothE1andE2occur⇒

the probability that the algorithm producesπ in A[1. .i] is Pr{E2∩E1}.

Equation (C.14)⇒Pr{E2∩E1} =Pr{E2|E1}Pr{E1}.

The algorithm chooses xi randomly from then−i+1 possibilities inA[i. .n]

⇒Pr{E2|E1} =1/(n−i+1). Thus, Pr{E2∩E1} = Pr{E2|E1}Pr{E1} = 1 n−i+1· (n−i+1)! n! = (n−i)! n! .

Termination: At termination,i =n+1, so we conclude thatA[1. .n] is a given

In document Introduction to Algorithms Cormen Solution pdf (Page 63-70)