A submodular optimization approach to bicriteria scheduling problems with controllable processing times on parallel machines

(1)

promoting access to White Rose research papers

White Rose Research Online

[email protected]

Universities of Leeds, Sheffield and York

http://eprints.whiterose.ac.uk/

White Rose Research Online URL for this paper:

http://eprints.whiterose.ac.uk/78189/

Published chapter

Shioura, A, Shakhlevich, NV and Strusevich, VA (2013)

A submodular

optimization approach to bicriteria scheduling problems with controllable

processing times on parallel machines.

SIAM Journal on Discrete Mathematics,

27 (1). 186 - 204 (19).

(2)

A SUBMODULAR OPTIMIZATION APPROACH TO BICRITERIA SCHEDULING PROBLEMS WITH CONTROLLABLE PROCESSING

TIMES ON PARALLEL MACHINES∗

AKIYOSHI SHIOURA†, NATALIA V. SHAKHLEVICH‡, AND VITALY A. STRUSEVICH§

Abstract. In this paper, we present a general methodology for designing polynomial-time algorithms for bicriteria scheduling problems on parallel machines with controllable processing times. For each considered problem, the two criteria are the makespan and the total compression cost, and the solution is delivered in the form of the break points of the efficient frontier. We reformulate the scheduling problems in terms of optimization over submodular polyhedra and give efficient procedures for computing the corresponding rank functions. As a result, for two of the considered problems we obtain the first polynomial-time algorithms, while for the third problem we considerably improve the known running time.

Key words. submodular optimization, parallel machine scheduling, controllable processing times, bicriteria problems

AMS subject classifications.90C27, 90B35, 90C05

DOI.10.1137/110843836

1. Introduction. In this paper, we study preemptive scheduling problems on parallel machines with controllable processing times. In the model under consider-ation, the jobs of set _N = {1_,2_{, . . . , n}} have to be processed on parallel machines

M1, M2, . . . , Mm, where m ≥2. Throughout this paper, it is assumed that n≥m.

For a job_j∈_N, its processing time_p(_j) is not given in advance but has to be chosen by the decision maker from a given interval [_l(_j)_{, u}(_j)]. That selection process can be seen as either compressing (also known as crashing) the longest processing time

u(_j) down to_p(_j) ordecompressing the shortest processing time_l(_j) up to_p(_j). In the former case, the value_x(_j) =_u(_j)−_p(_j) is called thecompression amount of job

j. Compression may decrease the completion time of job_j but incurs additional cost

w(_j)_x(_j), where _w(_j) is a given nonnegative unit compression cost. The total cost associated with a choice of the actual processing times is represented by the linear function_W =_j∈N_w(_j)_x(_j).

Each job_j ∈_N can be given arelease date _r(_j), before which it is not available. In the processing of any job, preemption and migration are allowed, so that the processing can be interrupted on any machine at any time and resumed later on, possibly on another machine. It is not allowed to process a job on more than one machine at a time, and a machine processes at most one job at a time.

∗_{Received by the editors August 8, 2011; accepted for publication (in revised form) November 28,}

2012; published electronically January 31, 2013. This research was supported by the EPSRC funded project EP/J019755/1, “Submodular Optimisation Techniques for Scheduling with Controllable Pa-rameters.”

http://www.siam.org/journals/sidma/27-1/84383.html

†_{Graduate School of Information Sciences, Tohoku University, Sendai, Japan ([email protected].}

tohoku.ac.jp). This author was partially supported by the Humboldt Research Fellowship of the Alexander von Humboldt Foundation and by Grant-in-Aid of the Ministry of Education, Culture, Sports, Science and Technology of Japan.

‡_{School of Computing, University of Leeds, Leeds LS2 9JT, UK ([email protected]).} §_{School of Computing and Mathematical Sciences, University of Greenwich, Old Royal Naval}

College, London SE10 9LS, UK ([email protected]).

(3)

Given a schedule, let _C(_j) denote the completion time of job_j, i.e., the time at which the last portion of job_j is ﬁnished on the corresponding machine. A schedule is calledfeasible if the processing of a job_j ∈_N takes place no earlier than its release date_r(_j). The value_C_max= max{_C(_j)|_j∈_N}determines the maximum completion time of all jobs and is called themakespan.

The machines can be eitheridentical, i.e., they have the same speed, oruniform, i.e., machine _M_v has speed _s_v, 1 ≤_v ≤_m. Without loss of generality, throughout this paper we assume that the uniform machines are numbered in nonincreasing order of their speeds, i.e.,

(1) _s₁≥_s₂≥ · · · ≥_s_m_.

For some schedule, denote the total time during which a job _j∈_N is processed on machine_M_v, 1≤_v≤_m, by_qv(_j). Taking into account the speed of the machine, we call the quantity_s_v_qv(_j) theprocessing amountof job_jon machine_M_v. It follows that

p(_j) =

m

v=1

svqv(_j)_.

Scheduling problems with controllable processing times have received consider-able attention since the 1980s; see, e.g., surveys by Nowicki and Zdrzalka [9] and by Shabtay and Steiner [13]. The models with controllable processing times have found applications in supply chain management and scheduling, imprecise computation, make-or-buy decision making, etc.

Traditionally, in this area two functions determine the quality of a schedule: (a) total compression cost_W given by a linear function_w(_j)_x(_j), and (b) a func-tion _F of the job completion times. The following four types of models are mainly considered in the literature:

Π1: to minimize_W, subject to a bounded value of _F; Π2: to minimize_F, subject to a bounded value of_W;

Π3: to minimize some aggregated function, e.g., a linear combination of_W and

F;

Π4: to minimize both functions_W and_F, i.e., to determine the set of the Pareto-optimal solutions.

The problems considered in this paper fall in the category Π4. We need to find the set of Pareto-optimal solutions defined by the break points of the so-called efficiency frontier; see [18] for definitions and a state-of-the-art survey of multicriteria schedul-ing. Recall that a schedule_S is calledPareto-optimal if there exists no schedule_S such that _C_max(_S) ≤_C_max(_S) and _W(_S) ≤ _W(_S), where at least one of these inequalities is strict.

Adapting standard notation for scheduling problems by Lawler et al. [7], we denote a bicriteria problem in the most general setting by _Q|_r(_j)_{, p}(_j) = _u(_j)−

(4)

(_C_max_{, W}), which means that we are searching for a set of Pareto-optimal solutions with respect to the two given criteria.

Notice that the bicriteria models are most general and if we know how to solve the Π4 version of a scheduling model, then we can deduce a solution to any other single-criterion counterpart, Π1–Π3_. The bicriteria models are also most important since a solution delivers to a decision maker the whole range of options to choose from. Although the ﬁrst studies on the Π4 problems date to as early as 1982 [21], positive results for these problems are still quite rare, as admitted in the survey [13]. Besides, attempts to handle the bicriteria problems lack a general methodology.

Below we mainly review the previously known results on the bicriteria scheduling problems (model Π4), as most relevant to this study.

Van Wassenhove and Baker [21], Tuzikov [19], and Hoogeveen and Woeginger [5] consider various versions of a bicriteria single machine problem with a function_F rep-resenting the maximum completion penalty. The break points of the eﬃciency frontier are found by tracking the changes in the structure of a schedule as the processing times of the jobs change.

A similar method is applied to problem_P|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) with iden-tical parallel machines by Nowicki and Zdrzalka [10], who obtain an_O(_n2)-time algo-rithm.

For problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) with uniform machines, Nowicki and Zdrzalka in [9] describe an approach that allows ﬁnding an _ε-approximation of the eﬃciency frontier in pseudopolynomial_O(_nm(_d−_d)_/ε) time, where_dand_dare the optimal makespan values if all jobs are fully decompressed and fully compressed, respectively.

A systematic development of a general framework for solving scheduling problems with controllable processing times via submodular methods has been initiated by Shakhlevich and Strusevich [14, 15] and further advanced by Shakhlevich, Shioura, and Strusevich [16] and Shioura, Shakhlevich, and Strusevich [17]. This paper makes an additional contribution to the development of this approach.

In [14, 15] a number of scheduling problems with controllable processing times have been formulated in terms of maximization linear programming problems defined over special polyhedra with submodular constraints. For several models, including those studied in this paper, the corresponding rank functions of submodular polyhe-dra have been developed. Still, at that stage, the solution methods developed in those papers have mainly remained schedule-based and implemented the greedy procedures of compression or decompression of the processing times of the jobs. The only advan-tage of the submodular reformulations has been that of an easy justification of the greedy approach for the whole range of related problems. Prior to [14, 15], researchers justified the greedy reasoning from the first principles and in a problem-dependent way. As far as the bicriteria problems are concerned, Shakhlevich and Strusevich [14] combine submodular and scheduling reasoning to develop an _O(_nlog_n) time algo-rithm for problem _P|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}), while in [15] they design the first polynomial-time algorithm for problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}), that requires

O(_nlog_n+_nm4) time. While submodular reasoning has been crucial in deriving those results, its potential was not explored in [14, 15] in depth.

(5)

Table 1

Time complexity of the algorithms.

Machines Release dates Previously known This paper

Identical parallel Zero O(nlogn) [14] N/A

Identical parallel Diﬀerent N/A O(n2_log_m₎

Uniform parallel Zero O(nlogn+nm4_{) [15]} _O₍_nm_log_m₎

Uniform parallel Diﬀerent N/A O(n2m)

contained in that paper has become one of the essential tools that is used in all our subsequent papers, including this study. This statement, quoted below as Theorem 1, states that a linear programming problem over a submodular polyhedron intersected with a box can be reduced to a problem deﬁned over a better structured polyhedron, a so-called base polyhedron with a modiﬁed rank function. It is well known that the resulting problem admits a greedy solution algorithm with the optimal decision variables written in closed form; see Theorem 2 in section 2.

For the single criterion counterparts (model Π1) of the models considered in this paper, Shioura, Shakhlevich, and Strusevich [17] develop the fastest known algorithms based on a submodular reformulation and decomposition. These results are yet to appear in the form of a journal publication.

In this paper, we continue the line of research that links scheduling with submodu-lar optimization and present fast algorithms that solve bicriteria problems_P|_r(_j)_{, u}(_j)−

x(_j)_{, pmtn}|(_C_max_{, W}), _Q|_u(_j) − _x(_j)_{, pmtn}|(_C_max_{, W}), and _Q|_r(_j)_{, u}(_j) − _x(_j)_,

pmtn|(_C_max_{, W}). Our reasoning is schedule-free and is based on reformulation of the corresponding problems as optimization problems with submodular constraints. For each of these problems we develop an appropriate routine for computing the cor-responding rank functions as piecewise-linear functions followed by computing their sum, with several stages of the solution process being problem-independent, either technically or at least ideologically.

The summary of the results relevant to this paper is given in Table 1. Notice that here we do not consider problem _P|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}), because the algorithm from [14] has the running time of _O(_nlog_n), leaving no room for further improvements, since solving any bicriteria problem under consideration requires the sorting of the jobs with respect to their unit compression costs. Observe that for problem_Q|_r(_j)_{, u}(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) no polynomial-time algorithm has been previously known, even for its special case with identical machines, i.e., problem

P|r(_j)_{, u}(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}).

The remainder of this paper is organized as follows. Section 2 gives a brief review of the necessary facts on submodular optimization, demonstrates how the schedul-ing problems under consideration can be formulated in terms of maximization linear programming problems with submodular constraints, and provides the explicit expres-sions for the corresponding rank functions. The key new outcome of that section is a collection of general principles that are applicable to solving any bicriteria scheduling problem under consideration. Those principles are aimed at obtaining an expression for the cost function as a piecewise linear function of makespan. Sections 3 through 5 consider each of the three problems individually and contain algorithms for computing the corresponding rank functions and for ﬁnding the cost functions in the piecewise linear form. Some concluding remarks are given in section 6.

2. General principles. In this section, we describe algorithms for ﬁnding the set of Pareto-optimal solutions to the bicriteria problems_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_,

(6)

We start with the features that are common to all problems. Let _S(_d) denote a feasible preemptive schedule in which all jobs are completed by time _d. The task of checking whether such a schedule exists for a given_d can be formulated in terms of submodular optimization.

For completeness, we introduce the necessary deﬁnitions. For a set_N ={1_,2_{, . . . ,}

n}, let 2N denote the family of all subsets of_N. For a subset_X⊆_N, letRX denote the set of all vectorspwith real components_p(_j), where_j∈_X. For a vectorp∈RN, deﬁne_p(_X) =_j∈X_p(_j) for every nonempty set _X∈2N and deﬁne_p(∅) = 0.

A set function_ϕ: 2N →Ris calledsubmodular if the inequality (2) _ϕ(_X∪_Y) +_ϕ(_X∩_Y)≤_ϕ(_X) +_ϕ(_Y)

holds for all sets_{X, Y} ∈2N. For a submodular function_ϕdeﬁned on 2N such that

ϕ(∅) = 0, the pair (2N_{, ϕ}) is called asubmodular system on_N, while_ϕis referred to as therank function of that system.

For a submodular system (2N_{, ϕ}), deﬁne two polyhedra

P(_ϕ) ={p∈RN |_p(_X)≤_ϕ(_X)_{, X}∈2N}_, (3)

B(_ϕ) ={p∈RN |p∈_P(_ϕ)_{, p}(_N) =_ϕ(_N)}_, (4)

called a submodular polyhedron and a base polyhedron, respectively, associated with the submodular system. Notice that _B(_ϕ) represents the set of all maximal vectors in_P(_ϕ).

For a scheduling problem under consideration, below we explain that the set of feasible schedules_S(_d) can be described as a polyhedron of the form

(5) _P(_ϕ)u_l ={p∈RN |_p(_X)≤_ϕ(_X)_{, X}∈2N;_l(_j)≤_p(_j)≤_u(_j)_{, j}∈_N}_, where _ϕ : 2N → R is a set function and l_,u ∈ RN are vectors of lower and upper bounds on the processing times, respectively. For a set of jobs_X ⊆_N the value_p(_X) is the total processing requirement for the jobs of set _X with respect to their actual processing times, while function_ϕ(_X) represents the total largest processing capacity available for these jobs. Notice that if function_ϕis submodular, then the polyhedron

P(_ϕ)u_l is a submodular polyhedron_P(_ϕ) of the form (3) intersected with a box. All problems under consideration share the same necessary and suﬃcient con-ditions for the existence of a feasible schedule with a given common deadline _d, as formulated, e.g., in [1]. Informally, these conditions state that for a given deadline _d a feasible schedule exists if and only if

(i) for each _v,1 ≤ _v ≤ _m−1, _v longest jobs can be processed on _v fastest machines by time_d, and

(ii) all_njobs can be completed on all_mmachines by time_d.

Thus, for a problem at hand, we need to ﬁnd an expression for the largest process-ing capacity available to process any subset_X of jobs. Such expressions are presented below in the form of a set function_ϕ(_X), which in all cases appears to be submodular. As a result, checking the existence of a feasible schedule_S(_d) reduces to determining a feasible point in the polyhedron_P(_ϕ)u_l associated with the relevant rank function_ϕ. Consider problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) in which all jobs are simultane-ously available at time zero, i.e.,_r(_j) = 0. The machines are numbered in accordance with (1). Deﬁne

(6) _S₀= 0_{, S}_v=

v

i=1

si, 1≤_v≤_m,

(7)

To complete before time_d, for any set that contains more than_mjobs, including the whole set of jobs_N, the total processing capacity_dS_mshould be enough, while for any set_X with less than_mjobs it should be suﬃcient to use the|_X|fastest machines during the time interval [0_{, d}]. Therefore, problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) is associated with the polyhedron_P(_ϕ)u_l with the rank function_ϕof the form

(7) _ϕ(_X) =

dS|X| if |X| ≤m−1,

dSm otherwise.

The conditions_p(_X)≤_ϕ(_X),_X ∈2N, for the function_ϕ(_X) of the form (7) cor-respond to conditions (i) and (ii) above. As proved in [15], function_ϕis submodular. Consider now problem_Q|_r(_j)_{, u}(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) in which the jobs have individual release dates. To derive the rank function for the polyhedron_P(_ϕ)u_l associ-ated with checking the existence of a feasible schedule_S(_d) observe the following. For a feasible solution vectorp, the following conditions should be satisﬁed: any job can be completed by time_dif it is processed by the fastest machine, any pair of jobs can be completed by_dif they are processed on the two fastest machines, etc., any subset of at most_m−1 jobs can be completed by_don_m−1 fastest machines, and ﬁnally, all jobs can be completed by_don all_m machines. In order to take into account the release dates, assume that the jobs are numbered in such a way that

(8) _r(1)≤_r(2)≤ · · · ≤_r(_n)_,

and for a set_X∈2N deﬁne_r_i(_X) as the_ith smallest release date in set_X, 1≤_i≤ |_X|. Similarly to the piece of notation_p(_X), we denote the sum of the release dates of the jobs of set_X by_r(_X).

For a nonempty set _X of jobs, the largest processing capacity available on the fastest machine_M₁is_s₁(_d−_r₁(_X)), the total largest processing capacity on the two fastest machines_M₁and_M₂is equal to_s₁(_d−_r₁(_X)) +_s₂(_d−_r₂(_X)), etc. Thus, we deduce that

(9) _ϕ(_X) =

dS|X|−|X|i=1siri(X) if |X| ≤m−1,

dSm−m_i=1siri(_X) otherwise_.

This formula is shown in [8, 15] in a diﬀerent (but equivalent) form. Function _ϕ(_X) can be proved to be submodular as in [15].

If the machines are identical, then for the resulting problem_P|_r(_j)_{, u}(_j)−_x(_j)_,

pmtn|(_C_max_{, W}) function (9) can be simpliﬁed. For a set of jobs_X ⊆_N, let _R(_X) denote the sum of min{|_X|_{, m}} smallest release dates for the jobs of set _X. Notice that if|_X| ≤_m−1, then_R(_X) =_r(_X). For completeness, deﬁne_R(∅) = 0. Then

(10) _ϕ(_X) =

dS|X|−r(X) if |X| ≤m−1,

dSm−R(_X) otherwise_.

This formula is also shown in [17], where the roles of the release dates and the deadlines are exchanged. Observe that_S_|X|=|_X|and_S_m=_mfor identical machines, assuming that the speed of any machine is 1.

(8)

function over the intersection of a submodular polyhedron and a box is equivalent to maximizing the same objective function over a base polyhedron associated with another rank function.

Theorem 1 (cf. [16]). Polyhedron P(ϕ)u

l is nonempty if and only if l ∈P(ϕ)

andl≤u. If_P(_ϕ)u_l is nonempty, then the set of maximal vectors in_P(_ϕ)u_l is a base polyhedron_B(_ψ)associated with the submodular system(2N_{, ψ}), where the submodular rank function _ψ: 2N →R is given by

(11) _ψ(_X) = min

Y∈2N{ϕ(Y) +u(X\Y)−l(Y \X)}, X ∈2

N_.

A detailed proof of Theorem 1 is given in [16]. Notice that Theorem 1 can also be derived from Proposition II.2.11 of [2], which addresses the truncation operation for generalized polymatroids.

Since_u(_X\_Y) =_u(_X)−_u(_X∩_Y) for_{X, Y} ⊆_N, we may rewrite (11) to obtain

(12) _ψ(_X) =_u(_X) + min

Y∈2N{ϕ(Y)−u(X∩Y)−l(Y \X)}.

For the problems under consideration, ﬁnding a schedule with the makespan

Cmax = _d and the minimum total compression cost _W reduces to the problem of maximizing the function_j∈N_w(_j)_p(_j) over_P(_ϕ)u_l. In turn, due to Theorem 1, the latter problem reduces to

Maximize

j∈N

w(_j)_p(_j) (13)

subject to p∈_B(_ψ)_.

The beneﬁt gained by such a reduction is the possibility of using the most well-known result of submodular optimization that guarantees that a solution to the problem of maximizing a linear function over a base polyhedron can be found by a greedy algorithm. Informally, to determine an optimal vectorp∗ such an algorithm starts with p∗ =land considers the components of the current p∗ in the sequence

σ= (_σ(1)_{, σ}(2)_{, . . . , σ}(_n)) such that

(14) _w(_σ(1))≥_w(_σ(2))≥ · · · ≥_w(_σ(_n))_,

giving the current component the largest possible increment that keeps the vector feasible.

Another advantage of the reduction to a problem of the form (13) is that the solution vector p∗ can be obtained essentially in a closed form, as stated in the theorem below. Deﬁne

(15) _N_t(_σ) ={_σ(1)_{, . . . , σ}(_t)}_, 1≤_t≤_n;

for completeness, deﬁne_N₀(_σ) =∅.

Theorem 2 (_{cf. [3]}). _{Given an LP problem of the form}_(13),_let_σ_{= (}_σ₍₁₎_{, σ}₍₂₎_,

. . . , σ(_n)) be an ordering of elements in_N that satisfies(14). Then, vector p∗∈RN given by

p∗(_σ(_t)) =_ψ(_N_t(_σ))−_ψ(_N_t−1(_σ))_{, t}= 1_,2_{, . . . , n,}

(9)

Theorem 2 provides the foundation for our new approach that ﬁnds the eﬃciency frontier of the bicriteria scheduling problems in a closed form.

Let_S∗(_d) denote a schedule with a makespan_C_max=_dthat minimizes the total compression cost. The solution to a bicriteria problem will be delivered as a collection of break points of the eﬃciency frontier (_{d, W}(_d)), where_dis a value of the makespan of schedule _S∗(_d) and_W(_d) is a (piecewise-linear in_d) function that represents the total optimal compression cost. Let also_p∗(_{j, d}) denote the optimal value of the actual processing time of job_j in schedule_S∗(_d). It follows that

(16) _W(_d) =

n

t=1

w(_σ(_t))_p∗(_σ(_t)_{, d})_.

For the problems under consideration, due to (7), (9), and (10), the rank function

ϕ(_X) as well as the function_ψ(_X) are functions of_d; therefore in this paper we may write_ϕ(_{X, d}) and_ψ(_{X, d}) whenever we want to stress that dependence.

Given a value of_dsuch that all jobs can be completed by time_d, deﬁne a function

(17) _ψ_t(_d) =_ψ(_N_t(_σ)_{, d})_, 1≤_t≤_n,

computed for this value of_d. By (12),

ψt(d) =u(Nt(σ)) + min

Y∈2N{ϕ(Y)−u(Nt(σ)∩Y)−l(Y \Nt(σ))}.

For all scheduling problems under consideration, due to (7), (9), and (10), there are

mexpressions for_ϕ(_Y), depending on|_Y| ∈ {1_,2_{, . . . , m}−1}or|_Y| ≥_m. As we show in the following sections, ﬁnding the minimum in the above formula results in_ψ_t(_d) represented as a piecewise-linear function of the form of an envelope

(18) _ψ_t(_d) =_u(_N_t(_σ)) +

⎧ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨ ⎪ ⎪ ⎪ ⎪ ⎪ ⎩

dSm+_Am_t for_{d < d}≤_dm_t _,

dSm−1+_Am−1_t for_dm_t _{< d}≤_dm−1_t _, ..

.

dS1+_A1_t for_d2_t _{< d}≤_d1_t_, 0 for_d1_t _{< d <}+∞_,

where_ddenotes the smallest deadline_dfor which there exists a feasible schedule_S(_d), while the values _Av_t, 1 ≤_v ≤ _m, are appropriately determined problem-dependent constants. Their calculation will be explained in the subsequent sections. Notice that if for some_i, 0≤_i≤_m−1, the function_u(_N_t(_σ))+_dS_m−i+_Am−i_t does not contribute into_ψ_t(_d) as a piece, the break point_dm−i_t is set equal_dm−i−1_t .

The value of _d can be found for each problem in advance, since it is equal to the minimum makespan, provided that the processing times are equal to their lower bounds _l(_j). For the model with uniform machines this takes_O(_n+_mlog_m) time if the release dates are equal [4] and _O(_nlog_n+_nm) time if the release dates are diﬀerent [12]. For the model with identical machines and diﬀerent release dates we can use the algorithm from [11] that requires_O(_nlog_m) time, provided that the jobs are numbered in accordance with (8).

Recall that by Theorem 2

W(_d) =

n

t=1

w(_σ(_t))_p∗(_σ(_t)_{, d}) =

n

t=1

(10)

For completeness, deﬁne_w(_σ(_n+ 1)) = 0 and rewrite

(19) _W(_d) =

n

t=1

w(_σ(_t)) (_ψ_t(_d)−_ψ_t−1(_d)) =

n

t=1

(_w(_σ(_t))−_w(_σ(_t+ 1)))_ψ_t(_d)_.

Thus, in order to be able to compute the function_W(_d)_,we first have to compute the functions _ψ_t(_d), _t = 1_,2_{, . . . , n,} for all relevant values of _d and then compute their weighted sum by (19). This function fully defines the efficiency frontier for the corresponding bicriteria scheduling problem.

Theorem 2 implies that we need a procedure that computes the value of a sub-modular function_ψ(_X) for a given_X∈2N. In the following sections, we explain how to compute function_ψ(_X) for the scheduling problems under consideration and how to adapt the corresponding procedures for computing the functions_ψ_t(_d) of the form (18).

Another feature of our approach that is common for all scheduling problems under consideration is related to computing function_W(_d). Assume that for a scheduling problem piecewise-linear functions_ψ_t(_d) of the form (18) are found. We can organize the break points of these functions as an_n×_mmatrix

(20) _D=

⎛ ⎜ ⎜ ⎜ ⎝

dm1 dm−11 · · · d11

dm2 dm−12 · · · d12

..

. . .. ...

dmn dm−1n · · · d1n

⎞ ⎟ ⎟ ⎟ ⎠,

where each row is a nondecreasing array. Then, the weighted sum_W(_d) given by (19) can be found by merging the arrays of the break points, obtaining a nondecreasing sequence of_O(_nm) potential break points of function_W(_d). It is straightforward to compute the value of_W(_d) between any two consecutive break points.

It is well known that merging _k sorted arrays of _l elements each into a single sorted array requires _O(_kllog_k) time; see, e.g., section 5.4.1 of [6]. Merging_k = _n rows of_l=_melements of matrix_Dwould take_O(_nmlog_n) time. Thus, having found the functions_ψ_t(_d), 1≤_t≤_n, the function_W(_d) of the form (19) can be computed in_O(_nmlog_n) time.

On the other hand, if the columns of matrix_D are known to be sorted, then the list of the break points of _W(_d) can be obtained in _O(_nmlog_m) time by merging

k =_m columns of _l =_n elements each, which is an improvement over _O(_nmlog_n) for_n≥_m.

Below we present a suﬃcient condition for the columns of matrix_Dto be ordered. The following lemma holds for all scheduling problems under consideration, provided the rank functions_ψ_t(_d) satisfy (18).

Lemma 1. LetDbe the matrix of the break points of the piecewise-linear functions

ψt(_d)_, 1≤_t≤_n, of the form (18). Then, for_i,1≤_i≤_m, and_t,1≤_t≤_n−1_,the inequality

(21) _Ai−1_t −_Ai_t≤_Ai−1_t+1−_Ai_t+1

implies that

(11)

Proof. For simplicity, we present the proof assuming that the intervals (_di−1_t _{, d}i_t] and (_di−1_t+1_{, d}i_t+1] are both nonempty. The proof can be appropriately adjusted to handle the case that at least one of these intervals is empty.

The break point_di_tis the solution of the equation

u(_N_t(_σ)) +_dS_i−1+_Ai−1_t =_u(_N_t(_σ)) +_dS_i+_Ai_t_,

i.e.,_di_t= (_Ai−1_t −_Ai_t)_/(_S_i−_S_i−1) = (_Ai−1_t −_A_ti)_/s_i_,where_s_i is the speed of machine

Mi. Similarly, _di_t+1= (_A_t+1i−1−_Ai_t+1)_/s_i. Then (21) implies (22), as required.

As discussed in section 3, for one of the scheduling problems due to its spe-cial structure the running time for computing function _W(_d) can be reduced to

O(_nmlog_m). For the other problems, such a reduction, even if possible, will not reduce the overall running time; see sections 4 and 5.

For all scheduling problems studied below, the corresponding rank functions_ϕ(_X) given by (7), (9), and (10) depend on the cardinality of set _X. This is why in the subsequent consideration it is convenient to use the sets

(23) Yv={_Y ∈2N | |_Y|=_v}_, 1≤_v≤_n,

which contain all subsets of the ground set with exactly_v elements; for completeness we deﬁneY0={∅}.

3. Uniform machines, common release date: Rank function computa-tion. In this section, we consider problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) and present a procedure for computing the rank function _ψ(_X) for a given deadline _d and set

X ⊆N. Then we show how that procedure can be adapted for ﬁnding all functions

ψt(_d) =_ψ(_N_t(_σ)_{, d}) for all_t∈_N as piecewise-linear functions of _d.

We assume that the machines are numbered in accordance with (1) and the values

S0, S1, . . . , Smare deﬁned by (6).

For given_dand set_X, due to (7) and (12) we may write_ψ(_X) = min{_ψ(_X)_{, ψ}(_X)}, where

ψ(_X) =_u(_X) + min

0≤v≤m−1

dSv−max Y∈Yv{u

(_X∩_Y) +_l(_Y\_X)}

; (24)

ψ(_X) =_u(_X) +_dS_m−max

v≥m

max

Y∈Yv{u

(_X∩_Y) +_l(_Y\_X)}

; (25)

recall the deﬁnition ofY_v in (23).

Lemma 2. For problemQ|u(j)−x(j), pmtn|(C_max, W), given a deadlinedand a

set_X, define _U to be a list of values(_u(_j)|_j ∈_X)and define _L to be a list of values

(_l(_j)|_j ∈ _N\_X); let _β_z denote the _zth largest element in the merger of these lists. Then

ψ(_X) =_u(_X) + min

0≤v≤m−1

⎧ ⎨ ⎩dSv−

v

j=1

βj

⎫ ⎬ ⎭,

(26)

ψ(_X) =_dS_m−_l(_N\_X)_. (27)

Proof. Observe that for each_v, 0≤_v≤_m−1, the equality

max

Y∈Yv{u

(_X∩_Y) +_l(_Y\_X)}=

v

j=1

βj

(12)

To compute_ψ(_X), we need to determine max_Y_∈Y_v_,v≥m{_u(_X∩_Y) +_l(_Y \_X)}. Since each job in _Y contributes either the lower bound or the upper bound on its processing time, it follows that

max

Y∈Yv,v≥m{u

(_X∩_Y) +_l(_Y \_X)}=_u(_X) +_l(_N\_X)_,

so that (25) becomes_ψ(_X) =_u(_X) +_dS_m−_u(_X)−_l(_N\_X) and (27) is valid. The value_ψ(_X) can be found by the procedure below.

Procedure PsiCompQr0.

Input: An instance of problemQ|u(j)−x(j), pmtn|(C_max, W), a deadline d, and a set_X ⊆_N

Output: _{The value}_ψ₍_X₎

Step 1. Determine a list _U of values (_u(_j)|_j ∈ _X) and a list_L of values (_l(_j)|_j ∈

N\X). Determine the values _β₁, _β₂_{, . . . , β}_m−1, where_β_z is the_zth largest element in the merger of the lists_U and_L.

Step 2. Compute_ψ(_X) and_ψ(_X) by (26) and (27), respectively.

Step 3. Output_ψ(_X) = min{_ψ(_X)_{, ψ}(_X)}.

The most time-consuming part is Step 1, where choosing the_m−1 largest elements in the merger of the lists _U and _L can be done in _O(_n) time by using the median ﬁnding technique, and sorting the_m−1 largest elements can be done in _O(_mlog_m) time. Hence, Procedure PsiCompQr0 requires_O(_n+_mlog_m) time.

Procedure PsiCompQr0 will be the basis of our algorithm for solving the bicriteria problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) presented below. Recall that we need to ﬁnd the functions_ψ_t(_d) =_ψ(_N_t(_σ)_{, d}) computed for all values of_dand all_t∈_N, where the sets_N_t(_σ) are deﬁned by (15). As mentioned in section 2, the value of_d, the smallest deadline_dfor which a feasible schedule exists, can be found in _O(_n+_mlog_m) time by an algorithm from [4].

Given a _t, 1 ≤ _t ≤ _n, deﬁne _β_t,z_,1 ≤ _z ≤ _m−1, as the _zth largest element in the merger of the lists _U_t of the values (_u(_j)|_j ∈ _N_t(_σ)) and _L_t of the values (_l(_j)|_j∈_N\_N_t(_σ))_. It follows from Lemma 2 that

(28) _ψ_t(_d) =_u(_N_t(_σ)) + min

0≤v≤m{dSv+A v t},

where

(29) _Av_t =

⎧ ⎨ ⎩

0 for_v= 0_,

−v_z=1βt,z for 1≤v≤m−1, −l(_N\_N_t(_σ))−_u(_N_t(_σ)) for_v=_m.

Notice that once all values_Av_t are found for some_t, 1≤_t≤_n, determining_ψ_t(_d) in the form (18) or (28) is equivalent to the problem of ﬁnding the lower envelope of

mlinear functions given in increasing order of their slopes_S_v. Exactly this problem has been studied in [20] and has been shown to be solvable in _O(_m) time. We use that method as part of our algorithm presented below. Our algorithm is based on Procedure PsiCompQr0 applied to_X =_N_t(_σ) for all_t= 1_{, . . . , n.}For each_t, 0≤_t≤

n, it maintains a sorted list_V_tof_m−1 largest values among{_u(_j)|_j∈_N_t(_σ)}∪{_l(_j)|

(13)

Algorithm AllPsiQr0.

Input: An instance of problemQ|u(j)−x(j), pmtn|(C_max, W)

Output: A collection of functionsψ_t(d), 1≤t≤n, each in a piecewise-linear form

Step 1. Find a sequence _σdeﬁned by (14). If required, renumber the machines so that (1) holds and compute the values_S_v, 1≤_v≤_m, by (6). Compute_dby running an algorithm from [4] applied to_p(_j) =_l(_j)_{, j}∈_N.

Step 2. Create list_V₀that contains_m−1 largest values_l(_j),_j∈_N, sorted in nonin-creasing order. Deﬁne_N₀(_σ) :=∅ and set_u(_N₀(_σ)) := 0 and_l(_N\_N₀(_σ)) :=

l(_N).

Step 3. For_tfrom 1 to_ndo:

(a) Set_N_t(_σ) :=_N_t−1(_σ)∪ {_σ(_t)}, _u(_N_t(_σ)) :=_u(_N_t−1(_σ)) +_u(_σ(_t)), and

l(_N\_N_t(_σ)) :=_l(_N\_N_t−1(_σ))−_l(_σ(_t)).

If _u(_σ(_t)) is less than the smallest element of _V_t−1, rename_V_t−1 as _V_t without updating it and go to Step 3(b).

Else delete from _V_t−1 the element _l(_σ(_t)), if it belongs to _V_t−1, or its smallest element, otherwise. Insert _u(_σ(_t)) in that list keeping the re-sulting list in nonincreasing order. Call the rere-sulting list_V_t.

(b) Taking the elements in list _V_tin the order of appearance, rename them by _β_t,z_,1 ≤_z ≤ _m−1. Scanning the values _β_t,z in the order of their numbering, compute the sums v_z=1_β_t,z, 1≤_v ≤_m−1, and thereby ﬁnd the values_Av_t by (29).

(c) Compute_Am_t :=−_l(_N\_N_t(_σ))−_u(_N_t(_σ))_.

(d) Use the algorithm from [20] to determine function_ψ_t(_d) in the form (18), as a lower envelope given by its break points_d≤_dm_t ≤_dm−1_t ≤ · · · ≤_d1_t_. Let us estimate the running time of Algorithm AllPsiQr0. Step 1 is the preprocess-ing stage that requires_O(_nlog_n) time. Step 2 can be implemented in_O(_n+_mlog_m) time, since choosing the _m−1 largest values takes _O(_n) time and sorting them re-quires _O(_mlog_m) time. For a typical iteration _t of the loop in Step 3 the updates in Step 3(a) require _O(_m) time. Since the list _V_t is kept sorted, the values _β_t,z_, 1≤_z≤_m−1, and all their partial sums can be found in_O(_m) time. Thus, all values

Avt, 1 ≤v ≤m, can be found inO(m) time. The algorithm from [20] employed in

Step 3(d) also needs_O(_m) time. Thus, the following statement holds.

Lemma 3. For problem Q|u(j)−x(j), pmtn|(C_max, W) the functions ψ_t(d) for

all_t,1≤_t≤_n, can be computed in _O(_nlog_n+_nm) time.

As mentioned in section 2, computing function_W(_d) of the form (19) that deter-mines the eﬃciency frontier for the original bicriteria scheduling problem additionally requires_O(_nmlog_n) time. However, for the problem under consideration this running time can be reduced due to the following statement.

Lemma 4. _Let_D_{be the matrix of the break points of the piecewise-linear functions}

ψt(d),1≤t≤n, of the form(18)computed for problemQ|u(j)−x(j), pmtn|(Cmax, W).

Then, for all _i,1≤_i≤_m and all_t,1≤_t≤_n−1_, the inequality(21)holds.

Proof. Take an arbitrary_t, 1≤_t≤_n−1_. First, consider the case that 1≤_i≤

m−1, so that in accordance with (29)

Ai−1t =− i−1

z=1

βt,z, Ait=− i

z=1

βt,z,

and _Ai−1_t −_Ai_t = _β_t,i, where _β_t,i is the _ith largest element in list _V_t. Similarly,

(14)

from list_V_tby replacing an element_xof list _V_tby_u(_σ(_t+ 1)), where

(30) _l(_σ(_t+ 1))≤_x≤_u(_σ(_t+ 1)) ;

see Step 3(a) of Algorithm AllPsiQr0. As the result, the _ith largest element in list

Vt+1 is no smaller than in listVt, i.e.,

Ai−1t −Ait=βt,i≤βt+1,i=_Ai−1_t+1−_Ai_t+1_,

and (21) holds.

Now we look at the case_i=_m. It follows from (29) that

Am−1t =− m−1

z=1

βt,z, Am−1t+1 =− m−1

z=1

βt+1,z.

Also (29) implies that

Amt+1=−l(N\Nt+1(_σ))−_u(_N_t+1(_σ))

=−_l(_N\_N_t(_σ)) +_l(_σ(_t+ 1))−_u(_N_t(_σ))−_u(_σ(_t+ 1)) =_Am_t +_l(_σ(_t+ 1))−_u(_σ(_t+ 1))≤_Am_t _.

If the lists_V_t and_V_t+1 coincide, then_Am−1_t =_Am−1_t+1 =−m−1_z=1 _β_t,z and

Am−1t −Amt =Am−1t+1 −Amt ≤Am−1t+1 −Amt+1,

as required.

If the two lists are diﬀerent, then, as explained above, this is due to the fact that in Step 3(a) of Algorithm AllPsiQr0 an element_xof list_V_tis replaced by_u(_σ(_t+ 1)), so that (30) holds. This implies that

Am−1_t+1 −Am_t+1=

−m−1 z=1

βt,z+x−u(σ(t+ 1))

−(_Am_t +_l(_σ(_t+ 1))−_u(_σ(_t+ 1)))

=_Am−1_t −_Am_t +_x−_l(_σ(_t+ 1))≥_Am−1_t −_Am_t _,

which completes the proof.

Lemma 4 allows us to give the following estimate of the running time required to solve problem_Q|_u(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}).

Theorem 3. _Problem_Q|_u₍_j₎−_x₍_j₎_{, pmtn}|₍_C_max_{, W}₎_{is solvable in}_O₍_nm_log_m₎

time.

4. Uniform machines, diﬀerent release dates: Rank function computa-tion. In this section, we consider problem _Q|_r(_j)_{, u}(_j)−_x(_j)_{, pmtn}|(_C_max_{, W}) and present a procedure for computing the rank function _ψ(_X) for a given deadline _d and set _X ⊆_N. Then we show how that procedure can be adapted for ﬁnding all functions_ψ_t(_d) =_ψ(_N_t(_σ)_{, d}) for all_t∈_N as piecewise-linear functions of_d.

(15)

Due to (9) and (12) we represent the function_ψ(_X) as_ψ(_X) = min{_ψ(_X)_{, ψ}(_X)}, where

ψ(_X) =_u(_X) (31)

+ min

0≤v≤m−1

dSv−_Ymax_∈Y

v

_v

i=1

siri(Y) +u(X∩Y) +l(Y\X)

,

ψ(_X) =_u(_X) +_dS_m− max

Y∈Yv,v≥m

_m

i=1

siri(Y) +u(X∩Y) +l(Y\X)

.

(32)

We describe a dynamic programming procedure that computes the function_ψ(_X) for given_dand_X. To compute _ψ(_X) we deﬁne

(33) _λ(_j) =

u(_j) if_j ∈_X,

l(_j) if_{j /}∈_X

and introduce

zv(Y) = v

i=1

siri(Y) +λ(Y),

so that (31) can be rewritten as

ψ(_X) =_u(_X) + min

0≤v≤m−1

dSv−_Ymax_∈Y

v{zv

(_Y)}

.

For some _v, 1≤_v ≤_m−1_,consider job _k, _v ≤_k ≤_n, and the subsets in Yv that contain no jobs _{j > k}. Denote the set of such subsets by Yv[_k]. Let_z_v[_k] be a real number given by

zv[_k] = max{_z_v(_Y)| _Y ∈ Yv[_k]}_.

We note that Yv[_n] = Yv, and therefore in order to compute _ψ(_X) we need to determine the value

zv[_n] = max{_z_v(_Y)|_Y ∈ Yv}_.

The lemma below shows that the values _z_v[_j] can be computed by a dynamic programming approach.

Lemma 5. For each v= 1,2, . . . , m−1, we have

zv[v] =zv−1[v−1] +svr(v) +λ(v),

(34)

zv[_j] = max{_z_v[_j−1]_{, z}_v−1[_j−1] +_s_v_r(_j) +_λ(_j)} for _j=_v+ 1_{, . . . , n,} (35)

where it is assumed that _z₀[_j] = 0for_j = 0_,1_,2_{, . . . , n}.

Proof. For each _v = 1_,2_{, . . . , m}−1, the set family Yv[_v] contains only one set

{1_,2_{, . . . , v}}. Therefore, we obtain (34) as follows:

zv[_v] =_z_v({1_,2_{, . . . , v}}) =

v

j=1

(16)

We then suppose that 1≤_v≤_m−1 and 2≤_j≤_n. We have

Yv[_j] =Yv[_j−1]∪ {_Y ∪ {_j} |_Y ∈ Yv−1[_j−1]}_.

For_Y ∈ Y_v−1[_j−1], the equality_z_v(_Y ∪ {_j}) =_z_v−1(_Y) +_s_v_r(_j) +_λ(_j) holds, since the jobs are numbered in nondecreasing order of the release dates. Hence, (35) can be obtained as follows:

max{_z_v(_Y)|_Y ∈ Yv[_j]}

= max{max{_z_v(_Y)|_Y ∈ Y_v[_j−1]}_,max{_z_v(_Y ∪ {_j})|_Y ∈ Y_v−1[_j−1]}} = max{_z_v[_j−1]_{, z}_v−1[_j−1] +_s_v_r(_j) +_λ(_j)}_.

We now explain how to compute function_ψ(_X). Suppose that we know the set

Y such that|_Y| ≥_mand

(36)

m

i=1

siri(Y) +λ(Y) = max

_m

i=1

siri(Y) +λ(Y)Y ∈ Yv, v≥m

.

Let _k ∈ _N be the job such that the set {_j ∈ _Y | _j ≤ _k} contains exactly _m elements. It should be noted that_k itself need not be an element of _Y. Since the jobs are numbered in nondecreasing order of the release dates, it follows that the jobs of set_Ywith_msmallest release dates are contained in the set_Y₁={_j∈_Y|_j≤_k}, so that

m

i=1

siri(Y) = m

i=1

siri(Y1).

Denote_Y₂={_j∈_Y|_{j > k}}, so that_Y=_Y₁∪_Y₂. Since

m

i=1

siri(Y) +λ(Y) = m

i=1

siri(Y1) +λ(Y1) +λ(Y2),

we may include all jobs_{j > k}into set_Y₂ to achieve the maximum in (36). Thus, for a chosen_k, we deﬁne_Y₂={_k+ 1_{, k}+ 2_{, . . . , n}}.

In order to find set _Y, it suffices to determine set _Y₁, which can be done by performing a systematic search among all _m-element subsets of jobs from _N. In our computation of _ψ(_X) we use the values _z_v[_k] and the sets Yv[_k] defined for 1≤_v ≤_m−1 and_v ≤_k≤_n. We now extend our definitions to the case of_v =_m. In other words,

(37) _z_m[_k] = max

Y∈Ym[k]

_m

i=1

siri(_Y) +_λ(_Y)

,

whereY_m[_k] denotes the set of all_m-element subsets with no jobs_{j > k}. For ﬁnding the values _z_m[_k], we simply extend the procedure outlined in Lemma 5 for _v =_m. Let_Y₁[_k] be a set inY_m[_k] such that

m

i=1