QUEUING. Chapter CHAPTER OUTLINE

(1)

KEY TERMS

SELF-REVIEW EXERCISES PROBLEMS

CASE 1: HOW MANY OPERATORS? REFERENCES

C H A P T E R O U T L I N E 15.1 Introduction

15.2 The Basic Model

15.3 A Taxonomy of Queuing Models 15.4 Little’s Flow Equation and Related Results 15.5 The M/G/1 Queue

15.6 Model 1: An M/M/sQueue (Hematology Lab) 15.7 Economic Analysis of Queuing Systems 15.8 Model 2: A Finite Queue (WATS Lines) 15.9 Model 3: The Repairperson Model

15.10 Transient Versus Steady-State Results: Order Promising 15.11 The Role of the Exponential Distribution

15.12 Queue Discipline 15.13 Notes on Implementation 15.14 Summary

15

Chapter

Q

UEUING

(2)

15.1

INTRODUCTION _{Queuing models are everywhere. This fact is obvious even to the most casual observer.} Airplanes “queue up” in holding patterns, waiting for a runway so they can land, and then they line up again to take off. People line up for tickets, to buy groceries, and, if they happen to live in England, for almost everything else. Jobs line up for machines, orders line up to be filled, and so on. As you can probably tell,queueis the British term for any type of line for waiting. The Danish engineer A. K. Erlang is credited with founding queuing theory by studying telephone switchboards in Copenhagen for the Danish Telephone Company. He developed many of the queuing results used today. One of the greatest uses of queuing the-ory in the United States is for analyzing automobile traffic flow—studying how many lanes to have, how to regulate the traffic lights, and so forth—in order to maximize the flow of traffic.

In 1988, New York City (NYC)’s arrestees were in custody waiting to be arraigned for an average of 44 hours, occasion-ally for more than 72 hours. Moreover, they were held in crowded, noisy conditions that were emotionally stressful, unhealthy, and often physically dangerous. In March 1990, the New York Timesran a front-page story on a woman who spent 45 hours in pre-arraignment detention in the Bronx with the headline “Trapped in the Terror of New York’s Holding Pens.” Arrestees were being denied a speedy court appearance and the lengthy delays greatly diminished the efficiency of the jus-tice system. That same year, the NY Supreme Court ruled that the city was to attempt to arraign within 24 hours or release the prisoner.

Under these circumstances, NYC undertook the single most ambitious management science project to date in its his-tory with the goal of reducing the arrest-to-arraignment (ATA) time. There were basically four different boroughs (Manhattan, Bronx, Brooklyn, and Queens), each with its own idiosyncratic ways of doing things. The basic process included the following basic steps, which comprise a large queuing sys-tem: arrest by an officer of NYPD; taken to precinct where prisoner is searched, fingerprinted, and detained while an arrest report is completed; taken to central booking where fin-gerprints are faxed to state capital for identification and report of criminal history, the arresting officer fills out more paper-work including the sworn complaint with the assistant district attorney, and the arrestee is lodged to await arraignment.

In 1988, in these four boroughs alone, over 325,000 arrests were made for which the defendants could be detained awaiting arraignment (i.e., more serious crimes). Unlike many other jurisdictions in the United States, in NYC felonies predominate, many of them involving violence or illicit drugs. Thus, an arrestee might find himself in the same hold-ing cell with violent repeat offenders or defendants in drug episodes.

The project team mounted an extensive two-year effort. While the lengthy delays in ATA time were the key factor to study, the high costs associated with the current ATA process were an additional item of study. One of the contributors to these high costs was that arresting officers were spending an average of more than 8 hours from the time they departed central booking until they swore out the complaint. Much of this time was overtime and the task itself only required 30 minutes! They were waiting in line for hours!

The whole process was modeled as a series of stages. Some of the stages were modeled as single-server queues, oth-ers as multiple-server queues, and some were even more com-plex. The statistical distributions and their corresponding parameters for each stage had to be determined. The overall model could then look at various different what-if scenarios involving combinations of workloads and arrest processing policies. The model generated several types of output includ-ing average overall ATA time and average times for completinclud-ing individual stages of the process. The cost of each of the scenar-ios could also be generated by a companion spreadsheet model. NYC could then choose between several different alter-natives, each with its own cost and performance measure.

In May 1990, Mayor David Dinkins released the findings of this project at a press conference with his strong endorse-ment of its recommended changes. The model saved the city over $10 million per year in police overtime costs alone. The city has reduced the average ATA time delay from 44 hours to about 24 hours citywide. Arrestees gain the right to a speedier trial and are no longer “warehoused” under horrible condi-tions for longer than absolutely necessary. The city has also greatly reduced its costs for prisoner supervision and trans-portation by about $11 million per year. One final recommen-dation was the elimination of a single arraignment court-room. This resulted in additional savings of $9.5 million for the city and state. (See Larson et al.)

71 \₂

(3)

Monte Jackson might not subscribe to the notion that all of life is a queue, but as administrative director of St. Luke’s Hospital in Philadelphia, he must deal with a number of situations that can be described as queuing models. Briefly, a queuing modelis one in which you have a sequence of items (such as people) arriving at a facility for service, as shown in Figure 15.1. At this moment, Monte is concerned about three particular “queuing models.”

Model 1: St. Luke’s Hematology Lab St. Luke’s treats a large number of patients on an outpatient basis; that is, there are many patients who come to the hospital to see the staff doctors for diagnosis and treatment but who are not admitted to the hospital. Outpatients plus those admitted to the 600-bed hospital produce a large flow of new patients each day. Most new patients must visit the hematology laboratory as part of the diagnostic process. Each such patient has to be seen by a technician. The system works like this: After seeing a doctor, the patient arrives at the laboratory and checks in with a clerk. Patients are assigned on a first-come, first-served basis to test rooms as they become avail-able. The technician assigned to that room performs the tests ordered by the doctor. When the testing is complete, the patient goes on to the next step in the process (perhaps X-ray), and the technician sees a new patient.

Monte must decide how many technicians to hire. Superficially, at least, the trade-off is obvious. More technicians means more expense for the hospital, but quicker service for the patients.

Model 2: Buying WATS Lines As part of its remodeling process, St. Luke’s is design-ing a new communications system. Monte must decide how many WATS lines the hospital should buy. WATS (Wide Area Telephone Service) is an acronym for a special flat-rate, long-distance service offered by some phone companies. When all the phone lines allocated to WATS are in use, the person dialing out will get a busy signal, indicating that the call can’t be completed. Monte knows that when people pick up the phone, they want to get through without having to try several times. How many lines he needs to achieve that result at a reasonable cost is not so clear.

Model 3: Hiring Repairpeople St. Luke’s hires repairpeople to maintain 20 individ-ual pieces of electronic equipment. The equipment includes measuring devices such as the electrocardiogram machine, small dedicated computers like the one used for lung analysis, and equipment such as the CAT scanner. If a piece of equipment fails and all the repairpeo-ple are occupied, it must wait to be repaired. Monte must decide how many repairpeorepairpeo-ple to hire. He must balance their cost against the cost of having broken equipment.

As Table 15.1 indicates, all three of these models fit the general description of a queu-ing model. Monte will resolve these models by usqueu-ing a combination of analytic and simula-tion models. However, before we reach the level of sophisticasimula-tion required to deal with Monte’s specific models, it is necessary for us to spend some time with the basic queuing model. In the process we will learn some terminology, and we will see the type of analytic results that are available.

FIGURE 15.1

General Queuing Models

Arrivals

00000 Service facility

Table 15.1

Some Queuing Models

PROBLEM ARRIVALS SERVICE FACILITY

1 Patients Technicians

2 Telephone Calls Switchboard

(4)

15.2

THE BASIC MODEL _{Consider the Xerox machine located in the fourth-floor secretarial service suite. Assume that} users arrive at the machine and form a single line. Each arrival in turn uses the machine to perform a specific task. These tasks vary from obtaining a copy of a 1-page letter to produc-ing 100 copies of a 25-page report. This system is called a sproduc-ingle-server (or sproduc-ingle-channel) queue. Questions about this or any other queuing system center on four quantities:

1. The number of people in the system: the number of people currently being served, as well as those waiting for service.

2. The number of people in the queue: the number of people waiting for service.

3. The waiting time in the system: the interval between when an individual enters the system and when he or she leaves the system. Note that this interval includes the service time.

4. The waiting time in the queue: the time between entering the system and the begin-ning of service.

ASSUMPTIONS OF THE BASIC MODEL

1. Arrival Process.Each arrival will be called a “job.” Since the time between arrivals (the

interarrival time) is not known with certainty, we will need to specify a probability distribution for it. In the basic model a particular distribution, called the exponential distribution(sometimes called the negative exponential distribution), is used. This dis-tribution plays a central role in many queuing models. It provides a reasonable repre-sentation of the arrival process in a number of situations, and its so-called lack of memoryproperty makes it possible to obtain analytic results. The exponential distrib-ution is notsymmetric, a fact that bothers people who think that an “average” must have as many values above the mean as below it. For example, if customers arrive, on the average, every 5 minutes according to an exponential distribution, then approxi-mately of them will have interarrival times less than 5 minutes, and only about of them longer than 5 minutes (but some of those may be very long and thus “skew” the average). The exponential distribution describes many services (bank tellers, postal clerks). About of the service times will be below the mean time (a lot of short, quick transactions) and of the service times will be above the mean (someone with the cash receipts from his or her business, or a person mailing a package overseas).

The words Poisson inputare also used to describe the arrival process when the time between arrivals has an exponential distribution. This is because of the relation-ship between the exponential distribution and the Poisson distribution. In particular, if the interarrival time has an exponential distribution, the number of arrivals in a specified length of time (say, three hours) has a Poisson distribution.

The exponential distribution and its relationship to the Poisson is discussed in some detail in Section 15.11. At this point, it is only necessary to understand that the exponential distribution is completely specified by one parameter. This parameter, called , is the mean arrival rate; that is, how many jobs arrive (on the average) during a specific period of time. In a moment we will consider an example in which = 0.05 jobs per minute. This implies that on the average of a job arrives every minute. It is probably more natural to think in terms of a longer time interval. An equivalent statement is that on the averageone job arrives every 20 minutes. Using more techni-cal terms, we say that the mean interarrival time is20 minutes. Mean interarrival time is the average time between two arrivals. Thus, for the exponential distribution

average time between jobs = mean interarrival time = (15.1) Thus, if= 0.05,

mean interarrival time = 1 = 20 = 0.051 1 5 \₁₀₀ 1 \₃ 2 \₃ 1 \₃ 2 \₃

(5)

2. Service Process.In the basic model, the time that it takes to complete a job (the ser-vice time) is also treated with the exponential distribution. The parameter for this exponential distribution is called . It represents the mean service ratein jobs per minute. In other words,Tis the number of jobs that would be served (on the aver-age) during a period ofTminutes if the machine were busy during that time. In the upcoming example we will assume that = 0.10. This implies that on the average 0.10 of a job is completed each minute. An equivalent statement is that on the average one job is completed every 10 minutes. The mean, or average, service time(the average time to complete a job) is 1/. When , the mean service rate, is 0.10, the average ser-vice time is 10 since 1/= 1/0.10 = 10.

3. Queue Size. There is no limit on the number of jobs that can wait in the queue. The queue is said to be infinite.

4. Queue Discipline. Jobs are served on a first-come, first-served basis; that is, in the same order as they arrive at the queue.

5. Time Horizon. The system operates as described continuously over an infinite hori-zon.

6. Source Population. There is an infinite population available to arrive.

Consider these assumptions in the context of the Xerox model. Suppose that the aver-age arrival time between jobs is 20 minutes. As we have seen, the fact that the interarrival time has an exponential distribution means that 1/= 20, and thus = 0.05, or that the jobs arrive at the rate of 0.05 job per minute. Similarly, if the average time to complete a job is 10 minutes, we know that 1/= 10, and thus = 0.10, or that jobs are completed at the rate of 0.10 job per minute when the machine is operating.

CHARACTERISTICS OF THE BASIC MODEL

The values of these two parameters (together with the assumptions) are all that is needed to calculate several important operating characteristicsof the basic model. The necessary formulas are presented in Table 15.2 WARNING! The formulas in Table 15.2 hold only if

< . If this condition does not hold (i.e., if≥), the number of people in the queue will grow without limit.

Consider, for example, a specific case where = 0.25 and = 0.10. Remember that 1/is the average interarrival time. Thus, since 1/= 1/0.25 = 4, on the average a job arrives every 4 minutes. Similarly, 1/is the average time it takes to complete a job. Since 1/= 1/0.10 = 10, on the average it takes 10 minutes to complete a job. It seems clear that in this case the service operation will get further behind (the queue will grow longer) as time goes by.

Now return to the Xerox model, in which < and the formulas in Table 15.2 hold. Spreadsheets are ideal for crunching the numerical results from such formulas. We will use

Table 15.2

Operating Characteristics for the Basic Model

CHARACTERISTIC SYMBOL FORMULA

Utilization — /

Expected Number in System L

Expected Number in Queue L_q

Expected Waiting Time (Includes Service Time) W

Expected Time in Queue W_q

Probability that the System is Empty P₀ 1 – /

( – ) 1 – 2 ( – ) –

(6)

an Excel spreadsheet (Q.XLS) that was originally developed by Professor David Ashley and that already has these formulas entered. When you first open the spreadsheet you see the introductory page as shown in Figure 15.2 (note that there are four different worksheets [MMs, MG1, finiteQ, finitePopulation] to be used as indicated by the tabs at the bottom of the spreadsheet).

Plugging the numerical values from the Xerox model,= 0.05 and = 0.10, into the appropriate cells (E2 and E3) of the appropriate worksheet (“MMs” in this example) yields the results presented in Figure 15.3. We also have to tell it that we have only one server (i.e., one copy machine) and that our time unit is minutes.

Steady-State Results These numbers require some interpretation.L, for example, is the expected number of people in the system (those being served plus those waiting) after the queue has reached steady state. In this sentence,steady statemeans that the probability that you will observe a certain number of people (say, 2) in the system does not depend on the time at which you count them. If a steady state has been achieved, the probability that there are two people using and/or waiting for the Xerox machine should be the same at 2:30 P.M. and at 4:00 P.M.

The other characteristics presented in Figure 15.3 have a similar interpretation. Thus, in a steady state, (1) the system is empty with a probability of one-half (cell F7 shows that FIGURE 15.2

Introductory Page of Queuing Workbook

FIGURE 15.3

Evaluating the Operating Characteristics of the Basic Model

(7)

15.3

A TAXONOMY OF QUEUING MODELS

P₀= 0.5); (2) on the average there is 0.5 person in the queue (cell F8 shows that L_q= 0.5); (3) on the average an arrival must wait 10 minutes before starting to use the machine (cell F10 shows that W_q= 10); and (4) on the average an arrival will spend 20 minutes in the sys-tem (cell F11 shows that W= 20). Remember that these values are averages and, as such, may have the same characteristics as the exponential distribution ( of observations below mean, above). Thus, of the customers will spend less than 10 minutes in line, while will spend more than 10 minutes in line.

Using the Results These results hold for the basic model and the particular values for the parameters (= 0.05 and = 0.10). They provide information that is useful to man-agement in analyzing this service facility. Suppose, for example, that manman-agement makes the following calculations: Since = 0.05, on the average of a job arrives each minute. During each 8-hour day there are 8 ×60 = 480 minutes. Thus, during each day there is on the average a total of

(0.05)(480) = 24

arrivals. From the calculations in Figure 15.3 we know that on the average each person spends 20 minutes in the system (W= 20). Thus, a total of (24 arrivals per day) (20 minutes per arrival) = 480 minutes, or 8 hours, is spent at this facility. Management might well feel that this is too long. A variety of steps might be taken:

1. A new machine might be purchased with a smaller mean service time.

2. Another machine might be purchased and both machines used to satisfy the demand. This would change the system to a two-server queue.

3. Some personnel might be sent to a different and less busy copying facility. This would change the arrival process.

Management might select one of these alternatives, or perhaps some other option. But in any case, management must balance the cost of providing service against the cost of wait-ing. The results in Figure 15.3 and similar results for other systems would be a central part of the analysis. These ideas will be developed in more detail in the context of Monte Jackson’s models.

There are many possible queuing models. For example, if the interarrival time in the basic model had been given a different distribution (not the exponential) we would have had a different model, in the sense that the previous formulas for L,L_q, and so on, would no longer hold. To facilitate communication among those working on queuing models, D. G. Kendall proposed a taxonomy based on the following notation:

A/B/s

where A= arrival distribution

B= service distribution

s= number of servers

Different letters are used to designate certain distributions. Placed in the Aor the B posi-tion, they indicate the arrival or the service distribuposi-tion, respectively. The following con-ventions are in general use:

M= exponential distribution

D= deterministic number

G= any (a general) distribution of service times

GI= any (a general) distribution of arrival times

We can see, for example, that the Xerox model is an M/M/1 model; that is, a single-server queue with exponential interarrival and service times.

5 \₁₀₀ 1_\ 3 2_\ 3 1_\ 3 2_\ 3

(8)

15.4

LITTLE’S FLOW EQUATION

AND RELATED RESULTS It can be proven that in a steady-state queuing process

L= W (15.2)

This result states that L, the expected number of people in the system, equals , the arrival rate, times W, the expected waiting time. To perform a quick numerical check, see if the numbers derived for the Xerox model (Figure 15.3) satisfy (15.2). The calculation is shown in (15.3).

L= 1.0 = 0.05 ×20 = W (15.3)

To understand the intuitive foundation for this result, consider the diagram in Figure 15.4. In Scene 1 our hero arrives and joins the queue. In Scene 2 he has just completed ser-The Westinghouse Hanford Company in Richland,

Washing-ton, is a secured work facility: All vehicles and passengers are checked at a guard station before being allowed onto the premises. This security checkpoint created enormous traffic backups during shift changes, when the volume of entering vehicles was greatest. The result was a severe hazard for the workforce and a major loss of productivity for the company as personnel were detained in long lines. An in-house engineer-ing group was therefore asked to study the problem and make recommendations for changes.

The study group found that each workday morning an average of 7 buses and 283 cars and vans arrived at the plant. Upon approaching the entrance gate, the vehicles formed one line to pass through the checkpoint, which was normally manned by two guards during rush periods. The line extended past the available queue space (which could accommodate only 40 cars) and spilled out onto the adjacent highway, causing a major safety problem. Because of the long line, drivers of other vehicles often elected to continue down the highway to a sec-ond gate. This option meant additional time and distance for the employees, as well as an unknown wait at the other gate.

The standard analytical queuing model predicted— correctly—that because the service rate at the checkpoint was equal to the arrival rate, the queue would continue to grow without limit as long as cars kept arriving. This, however,

merely confirmed what had already been observed. A simula-tion was therefore developed. The model was run to repro-duce the current situation and then to try out alternatives. • The first alternative scenario increased the number of

guards to three while keeping the single lane of traffic. This approach reduced the maximum queue length from 45.5 to 28, but increased costs.

• The second scenario had vehicles forming two lines, with a security guard assigned to each line. When a bus arrived, it was routed around the two vehicle lines and serviced immediately by one guard while the other guard temporarily worked both lines. This solution produced a maximum queue length of 14 vehicles and a waiting time of only about 15 minutes, compared to over 30 minutes for the existing configuration.

The second scenario appeared to be a good solution involving no additional cost. When it was implemented on a trial basis, the queue length was indeed drastically reduced. The biggest surprise was that the number of vehicles using the gate rose from 285 to 345. Obviously vehicles that had been regularly bypassing the main gate had started using it again. Thanks to the shorter queues, the new system easily handled the increased traffic load. (See Landauer and Becker.)

APPLICATION CAPSULE

Merging Traffic: A Queuing Simulation Helps Eliminate a Costly Bottleneck

FIGURE 15.4

Little’s Flow Equation Scene 1:

Scene 2: He completes

service Arrived during the time our

hero waited and was served Our hero arrives

(9)

vice. Assume the system is in steady state. Since in this case the average number of people in the system is independent of time, let us measure this quantity when our hero completes being served. At this time, the number of people in the system is precisely the total number who arrived after he did (i.e., the individuals who arrived during his waiting time). Therefore, ifWis his waiting time and people arrive at a rate of, we would expect L, the average number in the system, to equal W.

Equation (15.2) is often called Little’s flow equation. Note that it applies to any steady-state queuing process and is thus applicable to a wide variety of models. The proof used to establish (15.2) also shows that

L_q= W_q (15.4)

A numerical check for the Xerox model shows that

L_q= 0.5 = 0.05 ×10 = W_q

which again agrees with the result in Figure 15.3.

One must take some care in applying this result in more complicated cases. It is essential that represents the rate at which arrivals jointhe queue. This may be differ-ent from the rate at which people actually “arrive.” Consider, for example, a queue with an upper limit on the number of items that can wait in the queue (called a finite queue). A modern phone system that will hold a certain number of calls (say, 10) in a queue until a service representative becomes available provides a good example of such a queue. In such a system a person who calls and finds the system full simply receives a busy signal—in other words, is sent away. He or she does not join the queue. This is called a balk. Thus, if= 0.25 (the arrival rate) and the mean time between calls is 4 minutes, this is notthe rate at which people join. Thus, the relationship L= 0.25Wwill not hold for this system. Similarly, a customer may tire of waiting in line (or being on hold) and leave without being served. This is called reneging. Here again,L= 0.25W will not hold for this system.

Another important general result depends on the observation that

expected waiting time = expected waiting time in queue + expected service time For the basic model we have already made use of the fact that

expected service time = Putting the general result in symbols yields

For the Xerox model we have

(15.5) Not only does this hold for the basic model, but the general result (equation [15.5]) holds for any queuing model in which a steady state occurs.

Equations (15.2), (15.4), and (15.5) make it possible to compute the four operating characteristics L,L_q,W, and W_qonce one of them is known. To illustrate this fact, let us start the Xerox model all over again. We begin as last time using the second formula in Table 15.2 to calculate L:

Now rather than using the other formulas in Table 15.2 that are specifically for the basic model, we will use the two general results that we have just presented. First, from Little’s flow equation (15.2) we know that

L= W L = – = 0.05 0.10 – 0.05 = 1 W = 20 = 10 + 1 0.10 = Wq + 1 W = W_q + 1 1

(10)

Thus, knowing L= 1 and = 0.05, we obtain W= L/= 20. Then, turning to (15.5), we see that

Finally, (15.4) shows that

L_q= W_q= 0.05 × 10 = 0.5

This alternative method of obtaining numerical results will turn out to be most useful when analyzing more complicated systems than the basic model.

While the exponential distribution accurately describes the arrival process in many situa-tions, it may not fit the service process very well. Fortunately, there is a generalization of the basic model that permits the distribution of the service time to be arbitrary. It is not even necessary to know the service time distribution, only its mean, 1/, and its variance,2_. The operating characteristics for the generalized model are given in Table 15.3.

Note that we have made use of the results of Section 15.4 in obtaining all the operating characteristics except for L_q. As a check on the validity of these formulas, suppose that the arbitrary service time distribution is exponential. The variance of an exponential distribu-tion is (1/)2_{if the mean is 1/}_{. Therefore,}

which is the same result as in the basic model. As 2_increases,_L_,_L

q,W, and Wqall increase. This means that the consistency of a

server may be as important as the speed of the server. Suppose you must hire a secretary, and you have to select one of two candidates. Secretary 1 is very consistent, typing any doc-ument in exactly 15 minutes. Secretary 2 is somewhat faster, with an average of 14 minutes per document, but with times varying according to the exponential distribution. The aver-age workload in the office is three documents per hour, with interarrival times varying according to the exponential distribution. Which secretary will give you shorter average turnaround times on documents? This can be easily solved with the “MG1” and “MMs” worksheets of SECRETRY.XLS shown in Figures 15.5 and 15.6.

Since Secretary 1 types every document in exactly 15 minutes,2_{is equal to 0. The} val-ues of the other parameters are = 3 per hour (or 0.05 per minute) and = per minute. These values are entered in the input parameters section of the “MMs” spreadsheet (cells E3:E6), which automatically incorporates the appropriate values into the “MG1” worksheet. The results are shown in Figure 15.5.

1_\ 15 L_q = 2_(1/₎2_{+ (}_/₎2 2(1 – /) = 2 ( – ) W_q = W – 1 = 20 – 1 0.10 = 10 W = W_q + 1 15.5 THE M/G/1 QUEUE Table 15.3 Operating Characteristics for the Generalized Model

CHARACTERISTIC SYMBOL FORMULA

Utilization — /

Expected Number in System L L_q+

Expected Number in Queue L_q

Expected Waiting Time W W_q+

Expected Time in Queue W_q

Probability that the System is Empty P₀ 1 – /

L_q 1 22_{+ (}_/₎2 2(1 – /)

(11)

We could also verify this manually by using the formulas shown in Table 15.3:

Again, using the spreadsheet model for Secretary 2, we enter the parameters as

= 0.05,= per minute, and = 14 minutes. The results are shown in Figure 15.6. We could also use either the basic model (Table 15.2) or the generalized model (Table 15.3) to verify the spreadsheet answer for Secretary 2. Using the generalized model we get

Even though Secretary 2 is “faster,” her average turnaround times are longer because of the high variability of her service times.

W = 98_\

3 + 14 = 46.67 minutes average turnaround time W_q = (49_\ 30)/0.05 = 98\3 = 32.67 minutes L_q = (0.05) 2₍₁₄₎2_{+ [0.05/(}1_\ 14)]2 2[1 – 0.05/(1_\ 14)] = 49_\ 30 = 1.633 minutes 1 \₁₄ W = 45_\

2 + 15 = 37.5 minutes average turnaround time W_q = (9\₈)/0.05 = 45\₂ = 22.5 minutes L_q = (0.05) 2_{(0) + [0.05/(}1_\ 15)]2 2[1 – 0.05/(1_\ 15)] = 9_\ 8 FIGURE 15.5

Turnaround Time for Secretary 1

FIGURE 15.6 Turnaround Time for Secretary 2

(12)

Recall that as we started this chapter, our stated goal was to attack three particular models at St. Luke’s Hospital with queuing models. In the preceding sections we have laid the groundwork for this process. We have introduced, defined, and illustrated the characteris-tics of the systems that we will consider (e.g., expected number in queue, expected waiting time, etc.) as well as developed a spreadsheet that can automatically calculate such charac-teristics. We have also made some general results such as Little’s flow equation available for use in future analysis. We are now in a position to turn our attention to Monte Jackson’s models.

The system described in Model 1 of Section 15.1, the blood-testing model, is illus-trated in Figure 15.7. Note that each patient joins a common queue and, on arriving at the head of the line, enters the first examining room that becomes available. This type of sys-tem must not be confused with a syssys-tem in which a queue forms in front of each server, as in the typical grocery store.

Assume that the interarrival time is given by an exponential distribution with parame-ter = 0.20 per minute. This implies that a new patient arrives every 5 minutes on the aver-age, since

mean interarrival =

Also, assume that each server is identical and that each service time is given by an exponen-tial distribution with parameter = 0.125 per minute. This implies that the mean service time is 8 minutes, since

mean service time for an individual server =

Note that if there were only one server, the queue would grow without limit, since >

(0.20 > 0.125). For a multiserver queue, however, a steady state will exist as long as < s, where sis the number of servers. For example, if we have two servers, we will achieve a steady state because 0.20 < 0.25 (= 2 * 0.125).

The Key Equations As before, we want to find values L,L_q,W, and W_q. However, since this is a multiserver queue (not a single-sever queue as in the Xerox model), we must use different formulas. To evaluate these formulas it is convenient to start with the expression for P₀, the probability that the system is empty. For this model

(15.6)

and L_q, the expected number of people in the queue, is expressed as

(15.7) L_q = P₀

(/) s+1 (s – 1)!(s – /)2

P₀ = 1

s–1 n=0 (/)n n! + (/)s s!

1 1 – (/s)

1 = 1 0.125 = 8 1 = 0.201 = 5 15.6 MODEL 1: AN M/M/s

QUEUE (HEMATOLOGY LAB)

FIGURE 15.7 Multiserver Queue Arrivals 000 Server 1 Server 2 Server n

(13)

Equations (15.6) and (15.7) and the general results in (15.2), (15.4), and (15.5) make it possible to calculate values for W_q,W, and Lfor any specified parameter values (and ) and any number of servers (value ofs). Again, these new formulas are already entered in the “MMs” worksheet of our queuing template in our workbook (HEMATLGY.XLS).

Example Calculations Assume, for example, that Monte decided to hire two techni-cians. Then, since s= 2,= 0.20, and = 0.125, we can put these values into the “MMs” worksheet of HEMATLGY.XLS and get the following results shown in Figure 15.8. We see that the utilization (/s) = 0.8 (cell F6) and the probability that the system is empty is 0.11 (cell F7). These two values can be used in equation (15.7) to find L_q= 2.84 (cell F8). That is, the expected number of people in the queue is somewhat less than 3. Using equation (15.4),L_q= W_q, we see that, on the average, a patient waits for 14.22 minutes (cell F10) before entering an examining room.

Lastly, the template uses the general observation that

expected waiting time = expected waiting time in queue + expected service time

to calculate that the expected waiting time (W) = 22.22 minutes (cell F11). On the average, then, a patient spends 22.10 minutes in the hematology area, waiting for a technician and having tests.

Monte now wants to check what happens if he adds a third or fourth technician (server). These results are shown in Figures 15.9 and 15.10 respectively.

FIGURE 15.8

Results for Hematology Lab with 2 Servers

FIGURE 15.9

(14)

15.7

ECONOMIC ANALYSIS OF QUEUING SYSTEMS

There is a dramatic shortening of waiting time (W_q) with a third server (down to 1.57 minutes), but at the cost of having an extra server. Adding a fourth server doesn’t make such a dramatic difference, as it reduces waiting time in the queue to 0.30 minutes. Another factor to take into account is how busy the servers would be in each scenario. We see in Figures 15.8 to 15.10 that the utilization drops from 80% to 53.3% to 40%. The more servers added, the higher the percentage of idle time for the technicians, which could lead to boredom and sloppy work.

These calculations provide lots of information to help Monte make his decision. With one technician, since > , the system is unstable and the queue will steadily grow. This could be considered irresponsible. With two technicians, the average waiting time in the queue is less than 15 minutes. By current hospital standards, this is a small and acceptable value. Obviously by adding more servers, Monte can reduce the average waiting time but at significant expense to St. Luke’s. If, in some cases the queue gets uncomfortably long with two servers (remember that W_qis an expected value, and the actual time in the queue will vary), the supervisors of the hematology laboratory can temporarily move one of the blood analysts to a technician’s position. Monte thus feels comfortable with the idea of hiring two full-time technicians without performing a detailed cost analysis.

We note that this example model is identical to the models that are faced by fast-food franchise managers: how many people to put on a shift to keep the average customer wait below a certain value. McDonalds reportedly figures it will lose a customer if the total wait is more than five minutes.

Monte selected the number of lab technicians to hire by looking at the operating character-istics and using his judgment. This is not an unusual approach in queuing models and is especially common in the not-for-profit sector. Monte realizes that he is balancing the cost of hiring more technicians against the costs he incurs by forcing the patients to wait. The cost of hiring additional technicians is fairly clear. The waiting cost is not.

Monte first notes that the cost to the patient is irrelevant to his decision, except as it affects the patient’s willingness to use the hospital. It really does not matter who is wait-ing—a consultant who charges $250 per hour for his services or an unemployed person with no opportunity cost—unless the waiting time persuades the patient to use another health facility. This observation explains why certain monopolies like government agencies and utilities can be so casual about your waiting time. There is no place else to go!

Besides the possible effect on demand, the hematology lab could cost the hospital money if it reduced the output of the hospital. Suppose, for example, that the outpatient FIGURE 15.10

(15)

clinics could process 50 new patients each day, but that the hematology lab could handle only 10 patients. (This is clearly an extreme example to establish a point.) In this case, the hospital would be wasting a valuable resource, the doctors and other staff in the clinics, because of a bottleneck in the hematology lab. However, having stated this, it still is not easy to assess an explicit cost of a patient waiting.

Cost Parameters If you are willing and able to estimate certain costs, you can build expected cost models of queuing systems. Consider, for example, the hematology lab model (in general terms any multiserver queue with exponential interarrival and service times), and suppose the manager is willing to specify two costs:

C_s= cost per hour of having a server available

C_w= cost per hour of having a person wait in the system (a very “fuzzy” or qualitative cost) With these it is possible to calculate the total costs associated with the decision to use any particular number of servers. Let us start by calculating the total cost of hiring 2 servers for an 8-hour day. There are two components:

server cost = (C_s)(2)(8)

where C_sis the cost per hour for one server, 2 is the number of servers, and 8 is the number of hours each server works, and

waiting cost = (C_w)(L₂)(8)

where L₂is the number of people in the queue when there are 2 servers. This second calcu-lation may not be as obvious, but the rationale is the same as for the server cost. If there are, on the average,L₂people waiting when the system has 2 servers, then L₂times 8 is the aver-age number of waiting “hours” per day. Hence, (C_w)(L₂)(8) is the average waiting cost for the 8-hour day.

If we wanted to calculate the total cost of using 4 servers for a 6-hour day, we would take

(C_s)(4)(6) + (C_w)(L₄)(6) or

[(C_s)(4) + (C_w)(L₄)]6

The term in square brackets, [(C_s)(4) + (C_w)(L₄)], then, is the total cost per hour of using 4 servers.

The Total Cost per Hour We now define

TC(s) = total cost per hour of using sservers and we see that

TC(s) = (C_s)(s) + (C_w)(L_s)

Our goal is to choose s, the number of servers, to minimize this function. We can see that as sincreases, the waiting cost will decrease and the server cost will increase. The idea is to find that value ofsthat minimizes the sum of these two costs.

Figure 15.11 shows the worksheet Monte created called “Econ. Analysis” in his HEMATLGY.XLS workbook to determine the optimal value ofs. Unfortunately it is not possible to derive a formula that gives the optimal value ofs. (This is in contrast to the EOQ model, where we can find the optimal order quantity, Q*, with the equation Q* =

, as in Chapter 7.)

In this example, let’s put a relatively large cost on waiting and see if the decision changes from Monte’s original decision of choosing two servers. We establish C_s = $50/server/hour and C_w= $100/customer/hour (see cells B1 and B2), and then we can cal-culate the server cost and waiting cost for 2, 3, and 4 servers. We’ll assume we want to

(16)

pare the cost over an 8-hour shift (cell E1), and we must enter in cells B6:B8 the values for L(expected number in system) for each value of swe want to explore (obtained from Figures 15.8 to 15.10). We can see that 3 servers minimizes the Total Cost at $2,730 (cell E7).

Next Monte creates a data table to determine the sensitivity of this decision to the “fuzzy” cost,C_w. He decides he wants to explore values for C_wfrom 0 to $180. The steps for Monte to do this in his spreadsheet are:

1. Enter the initial value of 0 in cell A11.

2. Click back on cell A11, then choose Edit, Fill, and then “Series.”

3. Click on “Series in Columns,” enter a step value of 20 and a terminal value of 180. Click on OK.

4. Enter the formulas for the quantities we want to track (total cost with 2 servers, total cost with 3 servers, total cost with 4 servers) in cells B10:D10. These formulas are =E6, =E7, and =E8, respectively.

5. Highlight the range A10:D20, and click on Data, then “Table.”

6. Enter the Column Input Cell as B2. Click on OK.

7. Excel automatically fills in the table as shown in cells A10:D20 of Figure 15.11. Finally, Monte wants to graph the results of this sensitivity analysis to look for patterns and trends. To do this, he highlighted the range A11:D20, clicked on the Chart Wizard, and then followed the steps to generate the graph shown in Figure 15.12.

We can see that 2 servers is optimal for C_w= 0, while 3 servers is optimal from C_w= $20 up to $180. It does look as though 4 servers will become the optimal decision for values ofC_w≥$200.

Let us complete our examination of the hematology lab model with these observa-tions: We have seen how to find values for L,L_q,W, and W_q. These values were then used to select the appropriate number of technicians (servers). This decision might be made on intuitive grounds or on the basis of an explicit economic analysis. Sensitivity analysis is also important to perform, especially on harder-to-quantify parameters like C_w. We now move on to Monte’s second model.

FIGURE 15.11 Economic Analysis for Hematology Lab with 2, 3, or 4 Servers

Cell Formula Copy To

C6 $B$1*A6*$E$1 C7:C8

D6 $B$2*B6*$E$1 D7:D8

(17)

15.8

MODEL 2: A FINITE QUEUE

(WATS LINES) Do not be misled by the title of this section. It is devoted to Model 2, Monte’s attempt to_{select the appropriate number of WATS lines for St. Luke’s. Fortunately, in this case he can} expect help from the telephone company. It has a great deal of expertise in such matters, since queuing models have found extensive use in the field of telephone traffic engineering. The problem of how many lines are needed by a switchboard was typically attacked by using the M/G/smodel, “with blocked customers cleared.” You already know that this model is a multichannel queue with sservers (slines), exponential interarrival times for the calls, and a general distribution for the service time, which in this case is the length of each call. The phrase “blocked customers cleared” is queuing jargon. It means that when an arrival finds all of the servers occupied (all of the lines busy), he or she does not get in a queue but simply leaves. This phrase clearly describes the behavior of the traditional telephone switchboard. More sophisticated systems now provide for queuing of a finite number of customers, in some cases even providing the lucky customer the opportunity to enjoy a Muzak version of Elton John’s “Can You Feel the Love Tonight?” or the “Macarena.”

Probability of jBusy Servers The problem of selecting the appropriate number of lines (servers) is attacked by computing the steady-state probability that exactly jlines will be busy. This, in turn, will be used to calculate the steady-state probability that all slines are busy. Clearly, if you have slines and they are all busy, the next caller will not be able to place a call.

The steady-state probability that there are exactly jbusy servers given that slines (servers) are available is given by the expression

(15.8)

where = arrival rate (the rate at which calls arrive)

s= number of servers (lines)

The expression is called the truncated Poisson distributionor the Erlang loss distribution. It is noteworthy that although we are considering a general service-time distribution, the value P_jdefined by (15.8) depends only on the mean of this distribution.

Consider a system in which = 1 (calls arrive at the rate of 1 per minute) and 1/= 10 (the average length of a conversation is 10 minutes). Here /= 10. Suppose that we have

1

= mean service time (the average length of a conversation)

P_j = (/) j_/_j_!

s k=0 (/)k_/_k_! FIGURE 15.12

Graph of Sensitivity Analysis on Cost of Waiting for Hematology Lab

(18)

five lines in the system (s= 5) and want to find the steady-state probability that exactly two are busy (j= 2). From (15.8) we see that

In other words, on the average, two lines would be busy 3.4% of the time. An alternative way of obtaining P_jthat is easy to implement in a spreadsheet (because of its sequential for-mulation) is as follows:

P_i= P_i_–1(/)/i

So, for example once we know P₂, we can calculate P₃as:

P₃= P₂(10)/3 = (0.034)(10)/3 = 0.1133 Likewise,P₄is found as P₄= P₃(10)/4 = (0.1133)(10)/4 = 0.2833

Each successive P_i_–1is multiplied by (/) and divided by ito achieve the new P_i.

The more interesting question is: “What is the probability that all of the lines are busy?” since in this case a potential caller would not be able to place a call on the WATS lines. To find the answer to this question, we simply set j= s(in our example s= 5) and we obtain

P₅= P₄(10)/5 = (0.2833)(10)/5 = 0.564

or on the average the system is totally occupied 56.4% of the time.

Again, it is easy enough to implement all these formulas in a spreadsheet. The prob-ability that the system is totally occupied (all servers are busy) is calculated in a new worksheet called “finiteQ” in the workbook WATS.XLS. We can see that the probability that a customer balks with 5 servers is much more easily calculated in a spreadsheet, as shown in Figure 15.13. Of course, we get the same value (0.564 in cell F13). We can then build a data table to determine this value for several different values ofs. Figure 15.14 shows such a data table that Monte built for examining the possibility of between 0 and 10 phone lines.

The steps for Monte to do this in the spreadsheet are:

1. Enter the initial value of 0 in cell A23.

2. Click back on cell A23, then choose Edit, Fill, and then “Series.”

3. Click on “Series in Columns,” enter a step value of 1 and a terminal value of 10. Click on OK. = 50 1477.67 = 0.034 = 50 1 + 10 + 50 + 166.67 + 416.67 + 833.33 = (10) 2_/(2_⋅₁₎ 1 + 101/1 + 102/(2 ⋅ 1) + 103/(3 ⋅ 2 ⋅ 1) + 104/(4 ⋅ 3 ⋅ 2 ⋅ 1) + 105/(5 ⋅ 4 ⋅ 3 ⋅ 2 ⋅ 1) P₂ = (/) 2_/2!

5 k=0 (/)k_/_k_!

(19)

4. Enter the formulas for the quantity we want to track (probability that a customer balks) in cell C22. The formula is =F13.

5. Highlight the range B22: C33, and click on Data, then “Table.”

6. Enter the Column Input Cell as E4. Click on OK.

Excel automatically fills in the table as shown in Figure 15.14. Next we can create col-umn D that calculates the marginal improvement in this probability as we add servers. This is also shown in Figure 15.14. Here it is clear that the marginal effect of adding more servers decreases. For example, adding a second line when there was one in service decreases the probability of the system being busy by 0.089, whereas adding the tenth line when there were already nine in service decreases this probability by 0.059.

Average Number of Busy Servers Another interesting and useful quantity in the design of phone installations is the average number of busy lines. This quantity is called the carried loadin queuing jargon. If we define as the average number of busy servers, then

(15.9) Assume now that in Monte’s model with WATS lines for St. Luke’s,= 1 and 1/= 10. Thus, if he purchases 10 lines, we see in Figure 15.14 that the probability that all 10 are busy =0.215 (cell C33). It follows from (15.9) that

N = 10(1 – 0.215) = 7.85

N =

[1 – Prob. That a Customer Will Balk] N

FIGURE 15.13 Finite Queue Spreadsheet Calculation of Probability that Customer Balks

FIGURE 15.14 Data Table for Probability that Customer Balks for Different Values of s

(20)

15.9

MODEL 3: THE REPAIRPERSON MODEL

In other words, the entire system will be busy with probability 0.215 or about one fifth of the time and, on the average, almost 8 lines will be busy. After has been calculated, the server utilization can be calculated by dividing by s(the number of servers). Thus, for the situation at St. Luke’s, the server utilization is 7.85/10 = 78.5%, which means that each server (on average) is busy 78.5% of the time and idle 21.5% of the time.

Monte feels that ten lines is a reasonable compromise. There does not seem to be a great deal of excess capacity, but, on the other hand, the probability of finding the system busy is in a region (70% to 80%) that he feels is appropriate for the hospital. If he is uncomfortable with this solution, based on a subjective balancing of the number of lines and the probability of finding the system busy, and is willing to specify a cost for each time a caller finds the system busy, he can select the number of lines to minimize the expected cost per hour. He would proceed in the same manner as in the M/M/ssystem of Sections 15.6 and 15.7.

In this model Monte must decide how many repairpersons to hire to maintain 20 pieces of electronic equipment. Repairpersons deal with machines on a come (perhaps first-failed is more accurate), first-served basis.

A single repairperson treats each broken machine. You can thus think of the failed machines as forming a queue in front of multiple servers (the repairpersons).

This is another M/M/smodel, but it differs in a fundamental way from the M/M/s sys-tem (the blood-testing model) considered in Section 15.6. In this model there is a limited number of items (20) that can join the queue, whereas in the hematology lab model an unlimited number could potentially join the queue.

A queuing model, like the repairperson model, in which only a finite number of “people” are eligible to join the queue is said to have a finitecalling population. Models with an unlimited number of possible participants are said to have an infinite calling population.

Consider the model with 20 machines and 2 repairpersons. Assume that when a machine is running, the time between breakdowns has an exponential distribution with parameter = 0.25 per hour; that is, the average time between breakdowns is 1/= 4 hours. Similarly, assume that the time it takes to repair a machine has an exponential distribution and that the mean repair time is 0.50 hour (i.e., 1/= 0.50). This model is an M/M/2 model with a maximum of 18 items in the queue (20 including the 2 in service) and a finite call-ing population. In this case the general equations for the steady-state probability that there are njobs in the system is a function of,,s(the number of repairpersons), and N(the number of machines). In particular,

(15.10) We also know that

(15.11) We thus have N+ 1 linear equations (Nfrom equation [15.10] and 1 from equation [15.11]) in the N+ 1 variables of interest (P₀,P₁, . . . . ,P_n). This makes it possible (if painful) to calculate values ofP_nfor any particular model. One can see that each model becomes a bit more complex than the previous one, and that the formulas ofP_nbecome more complex.

There are, however, no simple expressions (even by these standards) for the expected number of jobs (broken machines) in the system or for waiting. If the values for P_nare

N n=0 P_n = 1 P_n = N! (N – n)!s!sn–s (/)nP0 for s < n≤N P_n = N! n!(N – n)! (/) n_P 0 for 0 ≤n≤s N N

(21)

15.10

TRANSIENT VERSUS STEADY-STATE RESULTS: ORDER PROMISING

computed, then it is (truly) a simple task to find a numerical value for the expected num-ber in the system. You must just calculate

expected number in system =

If God had intended humans to perform this kind of calculation by hand, He would not have let spreadsheets be invented. Figure 15.15 shows the setup of the “finite Population” worksheet in the REPAIR.XLS workbook that can be used to compute values ofP_n,the expected number in the system, and the expected number waiting, for a variety of different systems. As shown, the user enters the typical system parameters (,, and s) in the “MMs” worksheet, then clicks on the new worksheet “finitePopulation” and adds the size of the finite population (cell E5), and the spreadsheet does the rest. In this case, “the rest” consists of numerically evaluating the equations for P_nand using the results to find the expected number in the system. There is one “trick” with this kind of model, when you enter the value for the arrival rate () in the “MMs” worksheet, you actually need to enter N_*, or the entire population’s arrival rate.

As you see in Figure 15.15, for this system,L_q, the average number of machines waiting for service, is 3.348 (cell F8) and W, the expected time in the system, is 1.405 hours (cell F11).

The utilization is high at 92.5% and we see from the histogram that there’s a 3.3% chance that the system is empty, an increasing probability of there being one machine (8.3%), and the probabilities keep growing for increasing numbers of machines until it reaches a maximum at four and five machines (they’re tied at 11.8%). Then the probabili-ties decline for machines beyond five.

It is not always the case that we are interested in steady-state results, or that an analytical model is available to predict the behavior of the queuing system of interest. In this section we will consider a situation in which we are interested in the transientbehavior of the sys-tem and must use simulation to obtain the desired answers.

Manufacturing processes can be viewed as complex queuing systems. Probably the most frequently used management science tool in manufacturing is queuing systems simu-lation. Larry Lujack, a production planner at SONOROLA and a recent graduate of the University of Chicago’s Graduate Business School, is wondering if what he learned about queuing models in his last semester of school could help him decide when to promisea new customer order. The order is for 20 units of an item that requires sequential processing at 2

L =

N n=0 nP_n FIGURE 15.15 Finite Population Spreadsheet for the Repairperson Model

(22)

work stations. The average time to process a unit at each work station is 4 hours. Each work station is available for 8 hours every working day.

By considering when the last of the 20 units will be completed, Larry initially estimates that it will take 10.5 days to process the order. The last unit must wait at Work Station 1 for the first 19 units to be completed, then it must be processed at Work Station 1 and then at Work Station 2. Assuming that it does not have to wait when it gets to Work Station 2, Larry calculates as follows:

(19 units ×4 hours/unit + 4 hours + 4 hours) ÷ 8 hours/day = 10.5 days

However, this analysis is somewhat simplistic. It ignores the variability of the process-ing times and the possibility of queuprocess-ing at Work Station 2. Larry feels that the exponential distribution is an appropriate distribution for processing times because the 4-hour figure was arrived at by averaging many processing times that were less than 4 hours with a few processing times that were significantly longer than 4 hours (see Section 15.11). These few long processing times were due to equipment failures at a work station while processing a unit.

Next Larry checks to see whether the assumptions of the basic queuing model are met. The output from Work Station 1 are the arrivals to Work Station 2, and the time between arrivals is exponential because the processing time at Work Station 1 is exponen-tial. The service time at Work Station 2 is exponential because it is the same as the pro-cessing time. The units are processed on a first-come, first-served basis at Work Station 2, and there is sufficient buffer capacity between the work stations so that the queue size is for all practical purposes infinite. However, the assumption of an infinite time horizon is not met. Larry is interested only in the behavior of the system until “customer” 20 ends its processing.

Larry decides to apply the basic model anyway and use it as an approximation. He approximates the time it takes to process 20 units as follows. First, he estimates that the last unit in the batch of 20 will leave Work Station 1 after 20 ×4 = 80 hours. Then this unit will wait in the queue in front of Work Station 2. Finally, it will complete processing at Work Station 2, at which time all 20 units will have been completed. The total time that the last unit spends at Work Station 2 is W. Thus, Larry’s estimate is 20 ×4 + W. In the basic model, Wis given by the formula 1/(– ). Larry now realizes his dilemma:and are equal (1 unit per 4 hours) and the formula is valid only when is greaterthan .

Larry decides to set up a spreadsheet (ORDER.XLS) to stimulate the flow of the 20 units through the 2 work stations. His spreadsheet is shown in Figure 15.16. Larry assumes that raw material is always available at Work Station 1 (WS1) so that the next unit at Work Station 1 can start as soon as the current unit is finished. This means that for Work Station 1 the start time of a unit is the stop time of the previous unit. For example, the formula in cell B8 of the spreadsheet is =C7. The start time of a unit at Work Station 2 is either the stop time of that unit on Work Station 1 or the stop time of the previous unit on Work Station 2, whichever is larger. So, for example, the formula in cell D8 is =MAX(C8,E7). The stop time of a unit is just the start time plus the processing time. The finish time in days is shown in cell F2 and is calculated by dividing the stop time at Work Station 2 of the last unit by the number of hours/day (8 in cell F1). The spreadsheet calculates this time to be 10.5 days ifevery unit takes exactly 4 hours at every work station.

To analyze the impact of processing time variability, Larry replaces the constant pro-cessing time of 4 at WS1 in his spreadsheet with the appropriate random distribution (in Crystal Ball, he chooses the exponential distribution from the gallery; in @RISK, he enters =RiskExpon($B$1) ), which samples from an exponential distribution with a mean of 4. This makes the finish time a random variable. Larry would like to know the 99th percentile of this random variable (the number that 99% of the time the random variable will be less than or equal to). He could then promise the order in that number of days and be 99% sure that it would actually be completed on time.

(23)

FIGURE 15.16 Order Promising Spreadsheet

Cell Formula Copy To

C7 $B$1 + B7 C8:C26 D7 C7 — E7 B8 D8 F2 D7+$B$2 C7 MAX(C8, E7) E26/F1 E8:E26 B9:B26 D9:D26 — FIGURE 15.17

Statistical Output for Order Completion Model

Using Crystal Ball or @RISK, it is a simple matter to find any percentile of any random variable in the spreadsheet. Figure 15.17 shows the 99th percentile for cell F2 (the finish time in days) based on 1000 iterations (1000 sets of 40 random processing times—20 for Work Station 1, 20 for Work Station 2). First, note that the expected finish time (“Mean” for cell F2 in Figure 15.17) is 15.52 days, 2 days longer than Larry’s initial calculation. The extra

(24)

15.11

THE ROLE OF THE

EXPONENTIAL DISTRIBUTION

2 days is the average queuing delay caused by the variability of the processing times. If Larry wants to be 99% sure of having the order completed by the time he promises, he should set the due date to be 18.28 days (“Target #1(Value)=” in Figure 15.17) after the material becomes available at Work Station 1.

The queuing that takes place at Work Station 2 has increased the lead time(time from the start of the order to its completion) by nearly 8 days (18.28 less 10.5) over what it would be if there were no variability in the processing times. Figure 15.18 shows the histogram of the finish time, and we can see that the time can vary from 6.5 days up to 21 days.

Even though the basic model was not applicable in this case, it helped Larry to think about the model and to understand this answer from his spreadsheet simulation. Problem 15-29 explores a slightly different situation in which the basic model can be used to esti-mate the average finish time and the 99th percentile.

There is an enormous body of literature concerning queuing systems, and it is virtually impossible for a manager to be aware of all the results. There are, however, some general considerations that are useful in helping a manager think about the use of queuing models. One such consideration is the role of the exponential distribution in analytic queuing models.

There are essentially no analytic results for queuing situations that do not involve the exponential distribution either as the distribution of interarrival times or service times or both. This fact makes it important for a manager to recognize the set of circumstances in which it is reasonable to assume that an exponential distribution will occur. The following three properties of the exponential distribution help to identify it:

1. Lack of memory: In an arrival process this property implies that the probability that an arrival will occur in the next few minutes is not influenced by when the last arrival occurred; that is, the system has no memory of what has just happened. This situa-tion arises when (1) there are many individuals who could potentially arrive at the system, (2) each person decides to arrive independently of the other individuals, and (3) each individual selects his or her time of arrival completely at random. It is easy to see why the assumption of exponential arrivals fits the telephone system so well. FIGURE 15.18

Histogram of Finish Times for Order Completion Model

(25)

15.12

QUEUE DISCIPLINE

2. Small service times: With an exponential distribution, small values of the service time are common. This can be seen in Figure 15.19. This figure shows the graph of the probability that the service time Sis less than or equal to t(Prob {S≤t}) if the mean service time is 10; that is = 0.1 and 1/= 10. Note that the graph rises rapidly and then slowly approaches the value 1.0. This indicates a high probability of having a short service time. For example, when t= 10, the probability that S≤tis 0.632. In other words, more than 63% of the service times are smaller than the average service time. This compares to a normal distribution where only 50% of the service times are smaller than the average. The practical implication of this fact is that an exponential distribution can best be used to model the distribution of service times in a system in which a large proportion of “jobs” take a very short time and only a few “jobs” run for a long time.

It is said that engineers think that the whole world is exponentially distributed, while social scientists think that the whole world is normally distributed. A quick and dirty way to check what kind of distribution you have is to see if the average of the data is close to its standard deviation. If it is, then most likely the numbers are expo-nentially distributed. If it is or less of the mean, it is probably normally distributed.

3. Relation to the Poisson distribution: While introducing the basic model (Section 15.2) we noted the relationship between the exponential and Poisson distributions. In particular, if the time between arrivals has an exponential distribution with parame-ter , then in a specified period of time (say ,T) the number of arrivals will have a Poisson distribution with parameter T. Then, ifXis the number of arrivals during the time T, the probability that Xequals a specific number (say,n) is given by the equation

This equation holds for any nonnegative integer value ofn(i.e.,n= 0, 1, 2, and so on). The relationship between the exponential and the Poisson distributions plays an important role in the theoretical development of queuing theory. It also has an important practical implication. By comparing the number of jobs that arrive for service during a spe-cific period of time with the number that the Poisson distribution suggests, the manager is able to see if his or her choices of a model and parameter values for the arrival process are reasonable.

In the previous sections we specified the arrival distribution, the service distribution, and the number of servers to define a queuing system. Queue discipline is still another charac-teristic that must be specified in order to define a queuing system. In all of the models that we have considered so far, we have assumed that arrivals were served on a come, first-served basis (often called FIFO, for “first-in, first-out”). This is certainly an appropriate

Prob{X = n} = e –T₍_T₎n n! 1_\ 3 FIGURE 15.19

A High Probability of Short Service Times Prob S t 1.0 10 20 30 40 0.632 t