A strongly polynomial algorithm for line search in submodular polyhedra


www.elsevier.com/locate/disopt


Kiyohito Nagano

Department of Mathematical Informatics, Graduate School of Information Science and Technology, University of Tokyo, Bunkyo-ku, Tokyo 113-8656, Japan

Received 16 June 2004; received in revised form 2 May 2005; accepted 21 September 2007

Available online 31 October 2007

Abstract

A submodular polyhedron is a polyhedron associated with a submodular function. This paper presents a strongly polynomial time algorithm for line search in submodular polyhedra with the aid of a fully combinatorial algorithm for submodular function minimization. The algorithm is based on the parametric search method proposed by Megiddo.


Keywords: Submodular function; Network flow; Exchange capacity

1. Introduction

Let $U$ be a finite nonempty set. A function $\rho$ defined on $2^U$ is submodular if

$$\rho(X)+\rho(Y)\ge\rho(X\cup Y)+\rho(X\cap Y),\quad \forall X,Y\subseteq U. \tag{1}$$

Iwata, Fleischer and Fujishige [13] and Schrijver [21] independently presented combinatorial, strongly polynomial time algorithms for submodular function minimization. Iwata [11] presented a fully combinatorial strongly polynomial time algorithm, which uses only additions, subtractions, comparisons, and oracle calls for function values.

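For a vector $x\in\mathbb{R}^U$ and $u\in U$, we denote by $x(u)$ the component of $x$ on $u$. For a submodular function $\rho:2^U\to\mathbb{R}$ with $\rho(\emptyset)=0$, the submodular polyhedron $P(\rho)$ and the base polyhedron $B(\rho)$ are defined by

$$P(\rho)=\{x\in\mathbb{R}^U\mid x(X)\le\rho(X)\ (\forall X\subseteq U)\}, \tag{2}$$

$$B(\rho)=\{x\in\mathbb{R}^U\mid x\in P(\rho),\ x(U)=\rho(U)\}, \tag{3}$$

where $x(X)=\sum_{u\in X}x(u)$.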
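Definitions (2) and (3) can be checked directly on a small ground set. The following is a minimal sketch that enumerates all subsets (exponential time, for illustration only); the function names and the example function `rho` are assumptions introduced here and do not appear in the paper.

```python
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def in_submodular_polyhedron(x, rho, ground):
    """Definition (2): x(X) <= rho(X) for every subset X."""
    return all(sum(x[u] for u in X) <= rho(X) for X in subsets(ground))

def in_base_polyhedron(x, rho, ground):
    """Definition (3): x lies in P(rho) and x(U) = rho(U)."""
    total = sum(x[u] for u in ground)
    return in_submodular_polyhedron(x, rho, ground) and total == rho(frozenset(ground))

# Hypothetical example: rho(X) = min(|X|, 2) on U = {0, 1, 2}.
U = {0, 1, 2}
rho = lambda X: min(len(X), 2)
```

With this `rho`, the vector `{0: 1, 1: 1, 2: 0}` lies in both polyhedra, while the zero vector lies in $P(\rho)$ but not in $B(\rho)$.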

Let $V$ be a finite nonempty set with $|V|=n$ and let $f:2^V\to\mathbb{R}$ be a submodular function with $f(\emptyset)=0$. In this paper we consider the following problem:

Problem Line Search in Submodular Polyhedra (LSSP)

Instance: A submodular function $f:2^V\to\mathbb{R}$ with $f(\emptyset)=0$, a starting point $x_0\in P(f)$ and a direction vector $a\in\mathbb{R}^V$.

Task: Find $t^*=\max\{t\in\mathbb{R}\mid x_0+ta\in P(f)\}$.

E-mail address: kiyohito [email protected].


Fig. 1. Problem LSSP (n=2).

Problem LSSP is a basic problem, but it was previously unknown whether it can be solved in strongly polynomial time. We denote the set of nonpositive real numbers by $\mathbb{R}_-$ and the set of nonnegative real numbers by $\mathbb{R}_+$. If $a\in\mathbb{R}_-^V$, then $x_0+ta\in P(f)$ for every $t\ge0$, so Problem LSSP does not have an optimal solution. Hence throughout we assume that $a\notin\mathbb{R}_-^V$. We can assume $f(X)\ge0$ for all $X\subseteq V$ and $x_0=0$ by resetting $f(X):=f(X)-x_0(X)$ for all $X\subseteq V$. So throughout we assume that $f$ is nonnegative, $f(\emptyset)=0$ and $x_0=0$. An example of Problem LSSP is illustrated in Fig. 1.

The lexicographically optimal base problem, which was introduced by Fujishige [6], is a generalization of the lexicographically optimal flow problem considered by Megiddo [14]. Fujishige [6] (see also [7, Section 9.2]) showed that the lexicographically optimal base can be obtained by solving Problem LSSP with $a\ge0$ at most $n$ times.

Let us consider the following problem, which is a special case of Problem LSSP:

Problem Line Search in Base Polyhedra (LSBP)

Instance: A submodular function $f:2^V\to\mathbb{R}$ with $f(\emptyset)=0$, a starting point $x_0\in B(f)$ and a direction vector $a\in\mathbb{R}^V$ with $a(V)=0$.

Task: Find $t^*=\max\{t\in\mathbb{R}\mid x_0+ta\in B(f)\}$ $(=\max\{t\in\mathbb{R}\mid x_0+ta\in P(f)\})$.

As is the case with Problem LSSP, it was previously unknown whether Problem LSBP can be solved in strongly polynomial time. We will see some problems associated with Problem LSBP in the following paragraphs.

Hartvigsen [9,10] addressed a constrained submodular optimization problem:

$$\max\Big\{\sum_{v\in V}w(v)x(v)\ \Big|\ x\in B(f),\ \sum_{v\in V}b_i(v)x(v)=d_i\ (i=1,\dots,p)\Big\},$$

where $p$ is a nonnegative integer, $w\in\mathbb{R}^V$, and $b_i\in\mathbb{R}^V$, $d_i\in\mathbb{R}$ for each $i=1,\dots,p$. Hartvigsen [10] showed that the constrained submodular optimization problem can be solved in strongly polynomial time for fixed values of $p$. Problem LSBP is a special case of the constrained submodular optimization problem such that the feasible set has dimension 1 (or 0), that is, $p=O(n)$. Thus Hartvigsen's result applied to Problem LSBP does not lead to a strongly polynomial time algorithm.

For each $v\in V$, let $\chi_v\in\mathbb{R}^V$ be the characteristic vector that has value 1 on $v$ and 0 elsewhere. As an instance of Problem LSBP, if $a=\chi_v-\chi_{v'}$ for $v,v'\in V$, $v\ne v'$, the optimal value $t^*=\max\{t\in\mathbb{R}\mid x_0+t(\chi_v-\chi_{v'})\in B(f)\}$ is said to be an exchange capacity (see, for example, Fujishige [7]) and can be computed directly by a submodular function minimization algorithm. So Problem LSBP can be interpreted as a problem of computing a generalized exchange capacity. In many algorithms for optimization problems associated with submodular functions, e.g. submodular flow problems, we compute exchange capacities iteratively. Approaches using generalized exchange capacities may provide good algorithms for problems associated with submodular functions.
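The reduction of an exchange capacity to submodular function minimization can be made explicit. The short derivation below is a sketch that uses only definitions (2) and (3) and the assumption that the starting point lies in $B(f)$; it is not quoted from [7].

```latex
% Direction a = \chi_v - \chi_{v'}, so a(X) \in \{-1, 0, 1\} for every X \subseteq V.
\begin{align*}
x_0 + t a \in P(f)
  &\iff t\,a(X) \le f(X) - x_0(X) \quad (\forall X \subseteq V),\\
a(X) > 0
  &\iff v \in X,\ v' \notin X \quad (\text{and then } a(X) = 1),\\
t^* &= \min\{\, f(X) - x_0(X) \mid v \in X \subseteq V \setminus \{v'\} \,\}.
\end{align*}
```

Since the starting point lies in $P(f)$, the right-hand side $f(X)-x_0(X)$ is nonnegative, so for $t\ge0$ only sets with $a(X)>0$ can become binding. The last line minimizes a submodular function over the sets that contain $v$ and avoid $v'$, which amounts to one submodular function minimization on the ground set $V\setminus\{v,v'\}$.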

The problems of finding maximum flow values in capacitated networks can be reduced to Problem LSBP. Consider a network $N$ with a directed graph $G=(V,E)$, where $V$ is a vertex set and $E$ is an arc set, and with a nonnegative capacity vector $c\in\mathbb{R}^E$. For $X\subseteq V$, we denote the arc set $\{e=(v,v')\in E\mid v\in X,\ v'\in V\setminus X\}$ by $\delta(X)$. We define a function $\kappa_c:2^V\to\mathbb{R}$ as $\kappa_c(X)=c(\delta(X))$ $(X\subseteq V)$. This function $\kappa_c$ is called a cut function and it is a nonnegative submodular function with $\kappa_c(\emptyset)=\kappa_c(V)=0$. A vector $x\in\mathbb{R}^V$ is said to be feasible for $N$ if there exists a vector $y\in\mathbb{R}^E$ such that

$$0\le y\le c,\quad\text{and}\quad\sum\{y(e)\mid e\in\delta(\{v\})\}-\sum\{y(e)\mid e\in\delta(V\setminus\{v\})\}=x(v),\ \forall v\in V, \tag{4}$$

that is, there exists a flow $y$ in network $N$ which satisfies capacity constraints w.r.t. $c$ and supply constraints w.r.t. $x$.

Maximum $r$–$s$ flows. Let $r,s\in V$, $r\ne s$, and consider finding the maximum flow value from $r$ to $s$. The maximum $r$–$s$ flow value is $t^*=\max\{t\in\mathbb{R}\mid t(\chi_r-\chi_s)\text{ is feasible for }N\}$. Gale [8] showed that the set $\{x\in\mathbb{R}^V\mid x\text{ is feasible for }N\}$ is equal to the base polyhedron $B(\kappa_c)$. Therefore we can find the maximum $r$–$s$ flow value by solving Problem LSBP.
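To illustrate this reduction, the sketch below evaluates the line search on the cut function by enumerating all vertex subsets, using the minimum-ratio form of the optimal value with direction $\chi_r-\chi_s$ and starting point 0. The network, the function names and the enumeration are assumptions made for illustration; this is not a practical maximum flow algorithm.

```python
from fractions import Fraction
from itertools import combinations

def cut_value(X, capacity):
    """kappa_c(X): total capacity of the arcs leaving X."""
    return sum(c for (u, v), c in capacity.items() if u in X and v not in X)

def max_flow_value_by_line_search(vertices, capacity, r, s):
    """min{kappa_c(X) / a(X) : a(X) > 0} for a = chi_r - chi_s, by enumeration."""
    best = None
    items = sorted(vertices)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            X = set(combo)
            a_X = (r in X) - (s in X)      # a(X) is 1, 0 or -1
            if a_X > 0:
                ratio = Fraction(cut_value(X, capacity), a_X)
                best = ratio if best is None else min(best, ratio)
    return best

# Hypothetical network with integer capacities.
capacity = {("r", "u"): 3, ("r", "v"): 2, ("u", "v"): 1, ("u", "s"): 2, ("v", "s"): 3}
value = max_flow_value_by_line_search({"r", "u", "v", "s"}, capacity, "r", "s")
```

Only sets that contain $r$ and exclude $s$ have a positive denominator, so the ratio minimum is exactly the minimum $r$–$s$ cut value, in line with the max-flow min-cut theorem.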

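Maximum $w$-proportional flows. Let $S^+,S^-\subseteq V$, $S^+\cap S^-=\emptyset$ and let $w\in\mathbb{R}^V$ be a vector which satisfies

$$\sum\{w(v)\mid v\in S^+\}=1,\quad\sum\{w(v)\mid v\in S^-\}=-1\quad\text{and}\quad\begin{cases}w(v)>0&(v\in S^+),\\ w(v)<0&(v\in S^-),\\ w(v)=0&(v\in V\setminus(S^+\cup S^-)).\end{cases}$$

For $t\in\mathbb{R}$, we say that $y\in\mathbb{R}^E$ is a $w$-proportional flow with flow value $t$ if $y$ satisfies (4) w.r.t. $x=tw$. The maximum $w$-proportional flow value is $t^*=\max\{t\in\mathbb{R}\mid tw\in B(\kappa_c)\}$ and this is the optimal value of Problem LSBP.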

The Newton method (Section 4) is a simple approach to Problem LSSP. If $a\in\mathbb{R}_+^V\setminus\{0\}$, it is shown that the number of iterations of the Newton method is at most $n+1$ and Problem LSSP can be solved in strongly polynomial time. (See Fujishige [7, Section 7.2], Fleischer and Iwata [5].) If $a\in\mathbb{R}^V$ and $a\notin\mathbb{R}_+^V$, however, only a weakly polynomial running time bound is given, and it is left open whether the Newton method for Problem LSSP runs in strongly polynomial time.

In this paper, we propose an algorithm for Problem LSSP, which is quite different from the Newton method. The algorithm uses a fully combinatorial algorithm for submodular function minimization [11,12], within the framework of the parametric search method proposed by Megiddo [15,16]. It solves Problem LSSP in strongly polynomial time. From the definition of a submodular polyhedron (2), it is easy to see that the optimal value $t^*$ of Problem LSSP is equal to $\min\{f(X)/a(X)\mid X\subseteq V,\ a(X)>0\}$. So Problem LSSP can be regarded as a minimum-ratio problem. Using the same technique as the algorithm for Problem LSSP, we also show that a minimum-ratio problem which is a generalization of Problem LSSP can be solved in strongly polynomial time.

The paper is organized as follows. In Section 2, we provide preliminaries for the following sections. Section 3 discusses algorithms for submodular function minimization. In Section 4, we describe the Newton method using an algorithm for submodular function minimization as a subroutine. Section 5 presents a strongly polynomial time algorithm for Problem LSSP using a fully combinatorial algorithm for submodular function minimization within the framework of the parametric search method proposed by Megiddo, and Section 6 gives a strongly polynomial time algorithm for a minimum-ratio problem which is a generalization of Problem LSSP.

2. Preliminaries

2.1. Definitions and basic properties

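Let $U$ be a finite nonempty set. A family $\mathcal{D}\subseteq2^U$ is said to be a ring family if it satisfies $X,Y\in\mathcal{D}\Rightarrow X\cup Y,\ X\cap Y\in\mathcal{D}$. Of course $2^U$ is a ring family. Let $\mathcal{D}\subseteq2^U$ be a ring family. A function $\rho:\mathcal{D}\to\mathbb{R}$ is called submodular if

$$\rho(X)+\rho(Y)\ge\rho(X\cup Y)+\rho(X\cap Y),\quad\forall X,Y\in\mathcal{D}. \tag{5}$$

A function $\rho:\mathcal{D}\to\mathbb{R}$ is called supermodular if $-\rho$ is submodular, and a function $\rho:\mathcal{D}\to\mathbb{R}$ is called modular if $\rho$ is submodular and supermodular. A modular function $\rho:\mathcal{D}\to\mathbb{R}$ can be expressed as $\rho(X)=b_0+b(X)$, $\forall X\in\mathcal{D}$, using some $b_0\in\mathbb{R}$ and $b\in\mathbb{R}^U$.

Let $\mathcal{D}\subseteq2^U$ be a ring family and let $\rho:\mathcal{D}\to\mathbb{R}$ be a submodular function. Let $\arg\min\rho\subseteq\mathcal{D}$ denote the family of all the minimizers of $\rho$. Using the submodularity of $\rho$, it is not difficult to see that $\arg\min\rho$ forms a ring family. As $\arg\min\rho$ is closed under union and intersection, there exist the minimal minimizer $X_{\min}=\bigcap\arg\min\rho$ and the maximal minimizer $X_{\max}=\bigcup\arg\min\rho$.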
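The ring-family structure of the minimizers can be observed on a tiny instance. The sketch below lists all minimizers by enumeration and forms the minimal and maximal minimizers as their intersection and union; the example function (a cut function with a single arc) and all names are assumptions chosen for illustration.

```python
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def minimizers(rho, ground):
    """All minimizers of rho, found by enumeration (exponential time)."""
    all_sets = list(subsets(ground))
    minimum = min(rho(X) for X in all_sets)
    return {X for X in all_sets if rho(X) == minimum}

# Hypothetical example: cut function of the digraph on U = {0, 1, 2} with the single arc (0, 2).
U = {0, 1, 2}
rho = lambda X: 1 if (0 in X and 2 not in X) else 0
family = minimizers(rho, U)
x_min = frozenset.intersection(*family)   # minimal minimizer
x_max = frozenset.union(*family)          # maximal minimizer
```

In this example the minimizers are exactly the sets that contain 2 whenever they contain 0, and the family is closed under union and intersection.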

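Let $\rho:2^U\to\mathbb{R}$ be a submodular function with $\rho(\emptyset)=0$ and let $x\in P(\rho)$. A subset $X\subseteq U$ is said to be $x$-tight w.r.t. $\rho$ if $x(X)=\rho(X)$. We denote the family of $x$-tight sets w.r.t. $\rho$ by $\mathcal{D}_\rho(x)$. Namely, $\mathcal{D}_\rho(x)=\{X\subseteq U\mid x(X)=\rho(X)\}$. For any $y\in\mathbb{R}^U$, a function $\rho_y:2^U\to\mathbb{R}$ defined by $\rho_y(X)=\rho(X)-y(X)$ $(X\subseteq U)$ is obviously a submodular function and $\rho_y(\emptyset)=0$. As $x\in P(\rho)$, $\rho_x(\emptyset)=0$ and $\rho_x(X)\ge0$, $\forall X\subseteq U$. This implies

$$X\in\mathcal{D}_\rho(x)\iff X\in\arg\min\rho_x.$$

So $\mathcal{D}_\rho(x)=\arg\min\rho_x$, and therefore $\mathcal{D}_\rho(x)$ forms a ring family. Note that $\emptyset\in\mathcal{D}_\rho(x)$.

Let $\mathcal{D}\subseteq2^U$ be a ring family. Now we assume $\{\emptyset,U\}\subseteq\mathcal{D}$. Although the cardinality of $\mathcal{D}$ may be exponentially large in general, $\mathcal{D}$ can always be represented by a directed graph with at most $|U|$ vertices, that is, of $O(|U|^2)$ size.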

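For a directed graph $G=(U,A)$, we call $X\subseteq U$ a closure of $G$ if $(u,u')\in A$ and $u\in X$ imply $u'\in X$. It is known that there exists a directed graph $G_{\mathcal{D}}=(U,A_{\mathcal{D}})$ such that $\mathcal{D}$ is the family of closures of $G_{\mathcal{D}}$. For example, if $G_{\mathcal{D}}=(U,A_{\mathcal{D}})$ is a directed graph with

$$A_{\mathcal{D}}=\Big\{(u,u')\ \Big|\ u,u'\in U,\ u\ne u',\ u'\in\bigcap\{X\mid u\in X\in\mathcal{D}\}\Big\}, \tag{6}$$

then $\mathcal{D}=\{X\mid X\subseteq U\text{ is a closure of }G_{\mathcal{D}}\}$ (see, e.g., [7, Section 3.2]). We say that $G_{\mathcal{D}}$ is a directed graph representation of $\mathcal{D}$. We decompose $G_{\mathcal{D}}$ into strongly connected components $\{\Gamma_{\mathcal{D}}(s)\mid s\in S\}$, where $\Gamma_{\mathcal{D}}(s)$ is a subset of $U$ for each $s\in S$. Let $\mathcal{G}_{\mathcal{D}}=(S,F_{\mathcal{D}})$ be the directed acyclic graph determined by $G_{\mathcal{D}}$ in a natural way. For each $T\subseteq S$, we define $\Gamma_{\mathcal{D}}(T)=\bigcup_{s\in T}\Gamma_{\mathcal{D}}(s)\subseteq U$. Now $\mathcal{D}=\{\Gamma_{\mathcal{D}}(T)\mid T\subseteq S\text{ is a closure of }\mathcal{G}_{\mathcal{D}}\}$. We say that $(\Gamma_{\mathcal{D}},\mathcal{G}_{\mathcal{D}})$ is a contracted directed graph representation of $\mathcal{D}$. A ring family $\mathcal{D}$ is called simple if $G_{\mathcal{D}}$ is acyclic. For a simple ring family $\mathcal{D}$, we can identify $G_{\mathcal{D}}$ with $(\Gamma_{\mathcal{D}},\mathcal{G}_{\mathcal{D}})$.

For a ring family $\mathcal{D}$, we consider the case when $\{\emptyset,U\}\not\subseteq\mathcal{D}$. Let $U_{\mathrm{res}}=\bigcup\mathcal{D}\setminus\bigcap\mathcal{D}$ and $\mathcal{D}_{\mathrm{res}}=\{X\setminus\bigcap\mathcal{D}\mid X\in\mathcal{D}\}$. Note that $\mathcal{D}_{\mathrm{res}}$ is a ring family with $\{\emptyset,U_{\mathrm{res}}\}\subseteq\mathcal{D}_{\mathrm{res}}\subseteq2^{U_{\mathrm{res}}}$. We define $G_{\mathcal{D}}:=G_{\mathcal{D}_{\mathrm{res}}}$, $\mathcal{G}_{\mathcal{D}}:=\mathcal{G}_{\mathcal{D}_{\mathrm{res}}}$, and $\Gamma_{\mathcal{D}}:=\Gamma_{\mathcal{D}_{\mathrm{res}}}$. We say that $(\bigcap\mathcal{D},\bigcup\mathcal{D},G_{\mathcal{D}})$ is a directed graph representation of $\mathcal{D}$, and that $(\bigcap\mathcal{D},\bigcup\mathcal{D},(\Gamma_{\mathcal{D}},\mathcal{G}_{\mathcal{D}}))$ is a contracted directed graph representation of $\mathcal{D}$. All the information about $\mathcal{D}$ is equivalent to $(\bigcap\mathcal{D},\bigcup\mathcal{D},G_{\mathcal{D}})$ or $(\bigcap\mathcal{D},\bigcup\mathcal{D},(\Gamma_{\mathcal{D}},\mathcal{G}_{\mathcal{D}}))$.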
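The arc set of (6) and the notion of closure can be sketched on a small ring family that contains both the empty set and the ground set. The family below and the function names are assumptions chosen for illustration; closures are found by enumeration, which is only practical for tiny ground sets.

```python
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def arcs_from_ring_family(family, ground):
    """Arc set of (6): (u, w) whenever w lies in every member of the family containing u."""
    arcs = set()
    for u in ground:
        containing = [X for X in family if u in X]
        common = frozenset.intersection(*containing)
        arcs |= {(u, w) for w in common if w != u}
    return arcs

def closures(arcs, ground):
    """All X such that (u, w) in arcs and u in X imply w in X."""
    return {X for X in subsets(ground) if all(w in X for (u, w) in arcs if u in X)}

# Hypothetical ring family on U = {0, 1, 2}; it contains both the empty set and U.
U = {0, 1, 2}
family = {frozenset(s) for s in [(), (1,), (2,), (1, 2), (0, 2), (0, 1, 2)]}
arcs = arcs_from_ring_family(family, U)
```

Here the only arc is `(0, 2)`, and the closures of the resulting graph recover the family, as the statement following (6) asserts.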

2.2. Optimality conditions for problem LSSP

As an instance of Problem LSSP, w.l.o.g., we assume that $f$ is nonnegative, $f(\emptyset)=0$, $x_0=0$, and $a\notin\mathbb{R}_-^V$. We explain that the optimal value $t^*$ of Problem LSSP is nonnegative and finite. The optimal value is by definition

$$t^*=\max\{t\mid ta\in P(f)\}. \tag{7}$$

Since $0\in P(f)$, $t^*$ is nonnegative. Let $A\subseteq V$ be a subset which satisfies $a(A)>0$. If $t>f(A)/a(A)$, then $ta(A)>f(A)$ and hence $ta\notin P(f)$. So $t^*\le f(A)/a(A)$ and $t^*$ is finite.
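Applying the bound above to every set with positive $a$-value gives the minimum-ratio expression for $t^*$ stated in Section 1. The following sketch evaluates that expression by enumeration on a two-element instance; the instance and the name `optimal_step` are assumptions made for illustration, and exact rational arithmetic is used to avoid rounding.

```python
from fractions import Fraction
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def optimal_step(f, a, ground):
    """Return min{f(X)/a(X) : a(X) > 0} by enumerating all subsets."""
    ratios = []
    for X in subsets(ground):
        a_X = sum(a[v] for v in X)
        if a_X > 0:
            ratios.append(Fraction(f(X), a_X))
    return min(ratios)

# Hypothetical instance on V = {0, 1}: f is nonnegative, submodular, and f(empty) = 0.
f_values = {frozenset(): 0, frozenset({0}): 2, frozenset({1}): 3, frozenset({0, 1}): 4}
f = f_values.__getitem__
a = {0: 2, 1: -1}
t_star = optimal_step(f, a, {0, 1})   # candidates: f({0})/2 and f({0, 1})/1
```

The direction has a negative component, so the set `{1}` never constrains the step and only the two sets with positive $a$-value are compared.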

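For any $t\in\mathbb{R}$ we consider deciding whether $ta\in P(f)$ or $ta\notin P(f)$. Since, for any $x\in\mathbb{R}^V$, $f(\emptyset)-x(\emptyset)=0$, we have

$$x\in P(f)\iff\min\{f_x(X)\mid X\subseteq V\}=0,$$

and if $x$ can be represented as $ta$, using $ta(\emptyset)=0$,

$$\begin{aligned}ta\in P(f)&\iff\min\{f_{ta}(X)\mid X\subseteq V\}=0,\\ ta\notin P(f)&\iff\min\{f_{ta}(X)\mid X\subseteq V\}<0.\end{aligned} \tag{8}$$

So we can decide whether $ta\in P(f)$ or $ta\notin P(f)$ by minimizing $f_{ta}$.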

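Now, let us consider the optimality condition for Problem LSSP. For $t\ge0$, we consider the conditions of “$t<t^*$”, “$t=t^*$” and “$t>t^*$”. See Fig. 2 to understand each condition intuitively. Note that $t=t^*$ and $ta$ lying in the boundary of $P(f)$ are not equivalent (see Ex. 1.2 and Ex. 1.3 in Fig. 2). Eq. (7) directly implies that

$$\begin{aligned}t=t^*&\iff ta\in P(f),\ \forall\varepsilon>0\ (t+\varepsilon)a\notin P(f),\\ &\iff ta\in P(f),\ \exists X\subseteq V\text{ such that }\forall\varepsilon>0\ \varepsilon a(X)>f_{ta}(X)\ (\ge0),\\ &\iff ta\in P(f),\ \exists X\in\mathcal{D}_f(ta)\text{ such that }a(X)>0,\\ &\iff ta\in P(f),\ \max\{a(X)\mid X\in\mathcal{D}_f(ta)\}>0.\end{aligned} \tag{9}$$

For $ta\in P(f)$, $\mathcal{D}_f(ta)$ always includes $\emptyset$, so $\max\{a(X)\mid X\in\mathcal{D}_f(ta)\}\ge0$. Thus using (8) and (9), we obtain the following conditions for $t^*$ and any $t\ge0$:

$$\begin{aligned}t<t^*&\iff\min\{f_{ta}(X)\mid X\subseteq V\}=0,\ \max\{a(X)\mid X\in\mathcal{D}_f(ta)\}=0,\\ t=t^*&\iff\min\{f_{ta}(X)\mid X\subseteq V\}=0,\ \max\{a(X)\mid X\in\mathcal{D}_f(ta)\}>0,\\ t>t^*&\iff\min\{f_{ta}(X)\mid X\subseteq V\}<0.\end{aligned} \tag{10}$$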
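Conditions (10) translate directly into a three-way test. The sketch below applies them by enumeration to a small instance; the function name, the instance and the use of enumeration in place of a submodular function minimization algorithm are assumptions for illustration.

```python
from fractions import Fraction
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def compare_with_optimum(t, f, a, ground):
    """Return '<', '=' or '>' according to t < t*, t = t*, t > t*, using conditions (10)."""
    shifted = {X: f(X) - t * sum(a[v] for v in X) for X in subsets(ground)}
    if min(shifted.values()) < 0:
        return ">"                                   # t*a lies outside P(f)
    tight_sets = [X for X, value in shifted.items() if value == 0]
    best_gain = max(sum(a[v] for v in X) for X in tight_sets)
    return "=" if best_gain > 0 else "<"

# Hypothetical instance on V = {0, 1}; its optimal value is t* = 1.
f_values = {frozenset(): 0, frozenset({0}): 2, frozenset({1}): 3, frozenset({0, 1}): 4}
f = f_values.__getitem__
a = {0: 2, 1: -1}
```

The empty set is always tight, so `best_gain` is well defined and nonnegative whenever the first test passes.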


Fig. 2. Relation betweentandt∗.

3. Submodular function minimization

3.1. Finding a minimizer of a submodular function

Let $U$ be a finite nonempty set and let $\rho:2^U\to\mathbb{R}$ be any submodular function. We assume that for any given $X\subseteq U$ a function value $\rho(X)$ can be acquired by an oracle call. Let $\gamma(\rho)$ denote the upper bound on the time to compute the function value of $\rho$. An algorithm for submodular function minimization is said to be a strongly polynomial time algorithm if for any submodular function $\rho:2^U\to\mathbb{R}$ the total number of oracle calls for function evaluation and arithmetic operations, that is, additions, subtractions, multiplications, divisions and comparisons, is bounded by some polynomial in $|U|$. Combinatorial strongly polynomial time algorithms for submodular function minimization are given independently by Iwata, Fleischer and Fujishige (IFF) [13] and Schrijver [21]. Iwata [12] described an improved variant of the IFF algorithm and this algorithm achieves the best known bound on the running time, $O((|U|^6\log|U|)\cdot\gamma(\rho)+|U|^7\log|U|)$. Let Algorithm SFM be an algorithm which finds a minimizer of a submodular function $\rho:2^U\to\mathbb{R}$ with $O(T^{\mathrm{O}}(|U|))$ oracle calls for function evaluation and $O(T^{\mathrm{A}}(|U|))$ arithmetic operations, where $T^{\mathrm{O}}(|U|)$ and $T^{\mathrm{A}}(|U|)$ are some polynomials in $|U|$. Let $T(|U|,\gamma(\rho))=T^{\mathrm{O}}(|U|)\cdot\gamma(\rho)+T^{\mathrm{A}}(|U|)$.

An algorithm for submodular function minimization is said to be a fully combinatorial strongly polynomial time algorithm if the total number of oracle calls for function evaluation of $\rho$ and fully combinatorial operations, that is, additions, subtractions and comparisons, is bounded by some polynomial in $|U|$. Iwata [11] presented a fully combinatorial strongly polynomial time algorithm for submodular function minimization as a variant of the IFF algorithm, and later, Iwata [12] described an improved algorithm, which runs in $O(|U|^8\log^2|U|\cdot\gamma(\rho))$ time. Let Algorithm FC-SFM be a fully combinatorial algorithm which finds a minimizer of a submodular function $\rho:2^U\to\mathbb{R}$ with $O(T^{\mathrm{O}}_{\mathrm{FC}}(|U|))$ oracle calls for function evaluation of $\rho$ and $O(T^{\mathrm{FC}}_{\mathrm{FC}}(|U|))$ fully combinatorial operations, where $T^{\mathrm{O}}_{\mathrm{FC}}(|U|)$ and $T^{\mathrm{FC}}_{\mathrm{FC}}(|U|)$ are some polynomials in $|U|$. Let $T_{\mathrm{FC}}(|U|,\gamma(\rho))=T^{\mathrm{O}}_{\mathrm{FC}}(|U|)\cdot\gamma(\rho)+T^{\mathrm{FC}}_{\mathrm{FC}}(|U|)$.

3.2. Constructing all the minimizers of a submodular function

Let $\rho:2^U\to\mathbb{R}$ be a submodular function and let $X_{\min}$, $X_{\max}$ be the minimal and maximal minimizers of $\rho$, respectively. We are interested in constructing $\arg\min\rho$, that is, finding $X_{\min}$, $X_{\max}$, and the directed graph representation $G_{\arg\min\rho}$ (or the contracted representation $(\Gamma_{\arg\min\rho},\mathcal{G}_{\arg\min\rho})$). A (fully combinatorial) algorithm which computes $X_{\min}$, $X_{\max}$ and $G_{\arg\min\rho}$ can be designed by using any (fully combinatorial) algorithm which finds the minimal minimizer of a submodular function $|U|$ times. For each $u\in U$, let $\rho_u:2^{U\setminus\{u\}}\to\mathbb{R}$ be a submodular function defined by $\rho_u(X)=\rho(X\cup\{u\})$ $(X\subseteq U\setminus\{u\})$. We compute the minimal minimizer $M_u$ of $\rho_u$ for each $u\in U$, and put $\alpha=\min\{\rho(\emptyset),\min\{\rho(M_u)\mid u\in U\}\}$. Note that $\alpha$ is equal to the minimum value of $\rho$. It is easy to see that

$$X_{\min}=\bigcap\{M_u\mid u\in U,\ \rho(M_u)=\alpha\}\quad\text{and}\quad X_{\max}=\bigcup\{M_u\mid u\in U,\ \rho(M_u)=\alpha\}.$$

We can obtain $G_{\arg\min\rho}$ using (6) and the information about $\{M_u\mid u\in U,\ \rho(M_u)=\alpha\}$. It is known that the IFF algorithm and its variants always find the maximal minimizer of a submodular function. For a submodular function $\rho:2^U\to\mathbb{R}$, the function $\rho^{\#}:2^U\to\mathbb{R}$ defined by $\rho^{\#}(X)=\rho(U\setminus X)$ $(X\subseteq U)$ is also submodular. So any algorithm finding the maximal (minimal) minimizer of a submodular function can be transformed to an algorithm finding the minimal (maximal) minimizer of a submodular function. Therefore we can construct $\arg\min\rho$ using the IFF algorithm (or its variants) $|U|$ times. The minimal minimizer of a submodular function can be easily computed using any submodular function minimization algorithm $|U|+1$ times. (See, e.g., Note 10.12 in [17].) Hence $\arg\min\rho$ can be constructed using any submodular function minimization algorithm $O(|U|^2)$ times.

As for currently known combinatorial, strongly polynomial time algorithms [11–13,21], we can do much better than that: using each one of them, we can construct arg minρin the same asymptotic running time as a single computation of the original algorithm. Now we explain how to achieve this.

As resetting $\rho(X):=\rho(X)-\rho(\emptyset)$ $(X\subseteq U)$ does not change the problem, we can assume $\rho(\emptyset)=0$. For $x\in\mathbb{R}^U$, we define $x^-\in\mathbb{R}^U$ by $x^-(u)=\min\{0,x(u)\}$ for $u\in U$, and define $\mathrm{supp}^-(x),\mathrm{supp}^+(x)\subseteq U$ by $\{u\in U\mid x(u)<0\}$, $\{u\in U\mid x(u)>0\}$ respectively. For any $x\in B(\rho)$ and any $X\subseteq U$, we have $x^-(U)\le x(X)\le\rho(X)$, and the vector reduction theorem on polymatroids due to Edmonds [4] immediately implies

$$\max\{x^-(U)\mid x\in B(\rho)\}=\min\{\rho(X)\mid X\subseteq U\}. \tag{11}$$

So, for a maximizer $x_0$ of the left-hand side of (11), we have

$$\arg\min\rho=\{X\mid X\in\mathcal{D}_\rho(x_0),\ \mathrm{supp}^-(x_0)\subseteq X\subseteq U\setminus\mathrm{supp}^+(x_0)\}. \tag{12}$$

An extreme point of a base polyhedron is said to be an extreme base. The following theorem plays an important role for the construction of $\arg\min\rho$.

Theorem 3.1 (Bixby, Cunningham and Topkis [2]). Let $\rho:2^U\to\mathbb{R}$ be a submodular function with $\rho(\emptyset)=0$, and let $b$ be an extreme base of $B(\rho)$.

(1) $\mathcal{D}_\rho(b)$ includes $\{\emptyset,U\}$ and is a simple ring family.

(2) A directed graph representation of $\mathcal{D}_\rho(b)$ can be constructed in $O(|U|^2\cdot\gamma(\rho))$ time.

If a maximizer $x_0$ of the left-hand side of (11) is given as a convex combination of $k$ extreme bases, then using (12) and Theorem 3.1 we can construct $\arg\min\rho$ in $O(k|U|^2\cdot\gamma(\rho))$ time. (Refer to Note 10.11 in [17] for details.) Schrijver's algorithm finds not only a minimizer of the right-hand side of (11), but also a maximizer of the left-hand side of (11) which is represented as a convex combination of, we may assume, at most $|U|$ extreme points of $B(\rho)$. Vygen [22] showed that Schrijver's algorithm runs in $O(|U|^7\cdot\gamma(\rho)+|U|^8)$ time. So $\arg\min\rho$ can also be constructed in $O(|U|^7\cdot\gamma(\rho)+|U|^8)$ time.

Unfortunately, the IFF algorithm and its variants do not find a maximizer of the left-hand side of (11). But we can overcome this difficulty. They use the same framework and we can construct $\arg\min\rho$ in the same way. In the algorithms, we maintain $Z,H\subseteq U$, a partition $\{\Gamma(s)\mid s\in S\}$ of $U\setminus(Z\cup H)$ and a directed acyclic graph $G=(S,F)$ such that

⟨IFF-1⟩ $Z\subseteq X_{\min}$, $H\subseteq(U\setminus X_{\max})$,

⟨IFF-2⟩ for each $s\in S$ and $X\in\arg\min\rho$, $\Gamma(s)\subseteq X$ or $\Gamma(s)\cap X=\emptyset$,

⟨IFF-3⟩ for each arc $(s,s')\in F$ and $X\in\arg\min\rho$, $\Gamma(s)\subseteq X$ implies $\Gamma(s')\subseteq X$.

Note that $\{Z,H\}\cup\{\Gamma(s)\mid s\in S\}$ is a partition of $U$. Intuitively, in the algorithms, we update $(Z,U\setminus H,(\Gamma,G))$ toward the construction of a contracted directed graph representation of $\arg\min\rho$, $(X_{\min},X_{\max},(\Gamma_{\arg\min\rho},\mathcal{G}_{\arg\min\rho}))$. For each $T\subseteq S$, we define $\Gamma(T)=\bigcup_{s\in T}\Gamma(s)$. We define a function $\hat\rho:2^S\to\mathbb{R}$ by

$$\hat\rho(T)=\rho(\Gamma(T)\cup Z)-\rho(Z)\quad(T\subseteq S).$$

It is obvious that $\hat\rho$ is submodular and $\hat\rho(\emptyset)=0$. We denote $\bigcap\arg\min\hat\rho$, $\bigcup\arg\min\hat\rho$ by $\hat X_{\min}$, $\hat X_{\max}$ respectively. By ⟨IFF-1⟩, ⟨IFF-2⟩ and the definition of $\hat\rho$, for each minimizer $T\subseteq S$ of $\hat\rho$, $\Gamma(T)\cup Z$ is a minimizer of $\rho$, and, conversely, for each minimizer $X$ of $\rho$ there exists a minimizer $T$ of $\hat\rho$ such that $X=\Gamma(T)\cup Z$. For $s\in S$, let $R(s)\subseteq S$ denote the set of vertices reachable from $s$ in $G$. In the algorithms, $Z$, $H$, $\Gamma$, and $G=(S,F)$ finally satisfy the following

inequality:

$$\eta:=\max\{\hat\rho(R(s))-\hat\rho(R(s)\setminus\{s\})\mid s\in S\}\le0.$$

We assume $\eta\le0$. As $G=(S,F)$ is acyclic, there exists a linear order $\hat L$ on $S$ in which for each $(s,s')\in F$, $s'$ is a predecessor of $s$. Let $\hat L=(s_1,\dots,s_{|S|})$ be such a linear order. We define $\hat L(s_j)=\{s_1,\dots,s_j\}$ $(j=1,\dots,|S|)$. By definition, for each $s\in S$, $R(s)\subseteq\hat L(s)$. Let $\hat b\in\mathbb{R}^S$ be a vector defined by

$$\hat b(s)=\hat\rho(\hat L(s))-\hat\rho(\hat L(s)\setminus\{s\})\quad(s\in S).$$

Now $\hat b$ is an extreme base of $B(\hat\rho)$ (see Edmonds [4]). For each $s\in S$, we have, by the submodularity of $\hat\rho$, $\hat b(s)\le\hat\rho(R(s))-\hat\rho(R(s)\setminus\{s\})\le\eta\le0$, that is, $\hat b\le0$. Since $\hat b\in B(\hat\rho)$ and $\hat b\le0$, we have for any $T\subseteq S$

$$\hat b^-(S)=\hat b(S)=\hat\rho(S)\le\hat b(T)\le\hat\rho(T). \tag{13}$$

By (13), $S$ is a minimizer of $\hat\rho$ and, of course, $\hat X_{\max}=S$. So $\Gamma(S)\cup Z$ is the maximal minimizer of $\rho$. The original algorithms output $U\setminus H=\Gamma(S)\cup Z$ as a minimizer of $\rho$ and stop. We can construct $\arg\min\hat\rho$ using $\hat b\in B(\hat\rho)$ as follows. By (13) we have

$$\arg\min\hat\rho=\{T\mid T\in\mathcal{D}_{\hat\rho}(\hat b),\ \mathrm{supp}^-(\hat b)\subseteq T\subseteq S\}. \tag{14}$$

As $\hat b$ is an extreme base of $B(\hat\rho)$, we can easily obtain $G_{\mathcal{D}_{\hat\rho}(\hat b)}=(S,A_{\mathcal{D}_{\hat\rho}(\hat b)})$ by Theorem 3.1. It follows from (14) that $\hat X_{\min}$ is the set of vertices reachable from $\mathrm{supp}^-(\hat b)$ in $G_{\mathcal{D}_{\hat\rho}(\hat b)}$. Let $G'$ be the subgraph of $G_{\mathcal{D}_{\hat\rho}(\hat b)}$ induced by $S\setminus\hat X_{\min}$. Note that $G'$ is acyclic. It is easy to see that $(\hat X_{\min},S,G')$ is a directed graph representation of $\arg\min\hat\rho$. From the correspondence between $\arg\min\hat\rho$ and $\arg\min\rho$, we can construct $\arg\min\rho$ in a straightforward way. Let $\Gamma':S\setminus\hat X_{\min}\to2^U$ be a set-valued function defined by $\Gamma'(s)=\Gamma(s)$ $(s\in S\setminus\hat X_{\min})$. Then $(X_{\min},X_{\max},(\Gamma',G'))$ is a contracted directed graph representation of $\arg\min\rho$, where $X_{\min}=\Gamma(\hat X_{\min})\cup Z$ and $X_{\max}=\Gamma(S)\cup Z$. As a result, we can design a combinatorial algorithm which constructs $\arg\min\rho$ in $O((|U|^6\log|U|)\cdot\gamma(\rho)+|U|^7\log|U|)$ time. Moreover, we can design a fully combinatorial algorithm which constructs $\arg\min\rho$ in $O(|U|^8\log^2|U|\cdot\gamma(\rho))$ time.

Let Algorithm $\mathrm{SFM}_{\mathrm{am}}$ be some combinatorial strongly polynomial time algorithm which constructs all the minimizers of a submodular function $\rho:2^U\to\mathbb{R}$, and let Algorithm $\mathrm{FC\text{-}SFM}_{\mathrm{am}}$ be some fully combinatorial strongly polynomial time algorithm which constructs all the minimizers of a submodular function $\rho:2^U\to\mathbb{R}$. For simplicity we assume that the running time of $\mathrm{SFM}_{\mathrm{am}}$ is $O(T(|U|,\gamma(\rho)))$ and that of $\mathrm{FC\text{-}SFM}_{\mathrm{am}}$ is $O(T_{\mathrm{FC}}(|U|,\gamma(\rho)))$.
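The extreme base used in this section is obtained from a linear order by taking successive differences of function values along its prefixes. A minimal sketch of that construction follows; the example function, the chosen order and the function names are assumptions for illustration, and membership in the base polyhedron is verified by enumeration.

```python
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def greedy_base(rho, order):
    """b(s) = rho(L(s)) - rho(L(s) minus {s}), where L(s) is the prefix of `order` ending at s."""
    base, prefix = {}, frozenset()
    for s in order:
        extended = prefix | {s}
        base[s] = rho(extended) - rho(prefix)
        prefix = extended
    return base

# Hypothetical submodular function on U = {0, 1, 2}: a truncated cardinality minus a modular term.
U = {0, 1, 2}
rho = lambda X: min(len(X), 2) - 2 * (2 in X)
b = greedy_base(rho, [0, 1, 2])
```

For this order the vector is `{0: 1, 1: 1, 2: -2}`. Its negative part sums to -2, which is at most the minimum value -1 of `rho`, consistent with the inequality that precedes (11).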

4. The Newton method for problem LSSP

In this section we describe the Newton method for Problem LSSP. It uses an algorithm for submodular function minimization as a subroutine. We define a function $h:\mathbb{R}\to\mathbb{R}$ as

$$h(t)=\min\{f_{ta}(X)\mid X\subseteq V\}=\min\{f(X)-ta(X)\mid X\subseteq V\}. \tag{15}$$

It is obvious that $h$ is concave. As $f(\emptyset)=0$ and $0\in P(f)$, $h(0)=0$. Since $f_{ta}(\emptyset)=0$ for any $t\in\mathbb{R}$, $h(t)\le0$ for any $t\in\mathbb{R}$. Using (7), (8) and (15), we have $t^*=\max\{t\in\mathbb{R}\mid h(t)=0\}$. The graph of $h$ is illustrated in Fig. 3 by a thick curve. For any $t\in\mathbb{R}$ we can obtain the value $h(t)$ by running SFM$(f-ta)$. For simplicity, we assume $n=O(\gamma(f))$ in the rest of the paper. (Remember that we reset $f(X):=f(X)-x_0(X)$ for all $X\subseteq V$ in Section 1.) Using this assumption, for any $x\in\mathbb{R}^V$, the function value of $f_x$ can be acquired in $O(\gamma(f))$ time. So $f-ta$ can be minimized in $O(T(n,\gamma(f)))$ time. The Newton method is described below. Fig. 3 illustrates the process of the algorithm.

The Newton method for Problem LSSP

Step 0: For $X_0\subseteq V$ such that $a(X_0)>0$, set $t_1:=f(X_0)/a(X_0)$ $(\ge t^*)$. Set $i:=1$.

Step 1: Obtain $X_i\subseteq V$ such that $h(t_i)=f(X_i)-t_ia(X_i)$ by running SFM$(f-t_ia)$.

Step 2: If $h(t_i)=0$, return $t^*:=t_i$ and stop. If $h(t_i)<0$ then set $t_{i+1}:=f(X_i)/a(X_i)$ and $i:=i+1$. Go to Step 1.

Fig. 3. The Newton method.

As $h(t)$ has at most $2^n$ linear segments, the Newton method terminates in a finite number of iterations. If $a\in\mathbb{R}_+^V\setminus\{0\}$, it is known that the number of iterations of the Newton method for Problem LSSP is at most $n+1$. (See Fujishige [7, Section 7.2], Fleischer and Iwata [5] for details.) But it is left open whether the Newton method for Problem LSSP with arbitrary $a\in\mathbb{R}^V$ runs in strongly polynomial time. An analysis based on Radzik [19,20] gives a weakly polynomial bound on the number of iterations.
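Steps 0 to 2 of the Newton method can be sketched as follows. Enumeration of all subsets stands in for Algorithm SFM, so the sketch only runs on tiny ground sets; the instance, the function names and the returned iteration count are assumptions added for illustration, and $f$ and $a$ are taken to be integer valued so that exact fractions can be used.

```python
from fractions import Fraction
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def newton_line_search(f, a, ground):
    """Return (t*, number of iterations) for nonnegative f with f(empty) = 0."""
    all_sets = list(subsets(ground))
    a_of = lambda X: sum(a[v] for v in X)
    # Step 0: any set with a(X0) > 0 gives an upper bound t_1 >= t*.
    X0 = next(X for X in all_sets if a_of(X) > 0)
    t = Fraction(f(X0), a_of(X0))
    iterations = 0
    while True:
        iterations += 1
        # Step 1: minimize f - t a (enumeration replaces Algorithm SFM).
        X = min(all_sets, key=lambda Y: f(Y) - t * a_of(Y))
        h = f(X) - t * a_of(X)
        # Step 2: stop when h(t) = 0, otherwise move to the ratio of the minimizer.
        if h == 0:
            return t, iterations
        t = Fraction(f(X), a_of(X))   # h < 0 and f >= 0 force a(X) > 0 here

# Hypothetical instance on V = {0, 1}.
f_values = {frozenset(): 0, frozenset({0}): 2, frozenset({1}): 3, frozenset({0, 1}): 4}
f = f_values.__getitem__
result = newton_line_search(f, {0: 1, 1: 2}, {0, 1})
```

On this instance the method starts at $t_1=2$, moves to $4/3$ after one minimization, and confirms optimality in the second iteration.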

Theorem 4.1. Let $f$ be an integer-valued nonnegative submodular function with $f(\emptyset)=0$ and $a$ be an integer vector which satisfies $a\notin\mathbb{R}_-^V$. If $\max_{X\subseteq V}|f(X)|\le U_1$ and $\max_{X\subseteq V}|a(X)|\le U_2$, then the Newton method for Problem LSSP runs in $O(\log U_1+\log U_2)$ iterations.

5. A strongly polynomial algorithm

In this section we present a combinatorial strongly polynomial time algorithm for Problem LSSP. We use a fully combinatorial strongly polynomial algorithm for submodular function minimization [11,12] and the parametric search method proposed by Megiddo [15,16].

5.1. Framework

Later we will describe two procedures for Comparison with the Optimal Value $t^*$: Procedure COV and Procedure L-COV.

Procedure COV. For any given nonnegative value $t\ge0$, we can tell whether “$t<t^*$”, “$t=t^*$” or “$t>t^*$” by running COV$(t)$ in $O(T^{\mathrm{O}}_{\mathrm{COV}}(n)\cdot\gamma(f)+T^{\mathrm{A}}_{\mathrm{COV}}(n))$ time, where $T^{\mathrm{O}}_{\mathrm{COV}}(n)$ and $T^{\mathrm{A}}_{\mathrm{COV}}(n)$ are some polynomials in $n$.

Procedure L-COV. For any given nonnegative value $t\ge0$, once $ta(v)$ is computed for each $v\in V$, Procedure L-COV compares $t$ to $t^*$ with $O(T^{\mathrm{O}}_{\mathrm{L\text{-}COV}}(n))$ oracle calls for function evaluation of $f$, $O(T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n))$ comparisons, and $O(nT^{\mathrm{O}}_{\mathrm{L\text{-}COV}}(n)+T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n))$ additions and subtractions, where $T^{\mathrm{O}}_{\mathrm{L\text{-}COV}}(n)$ and $T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n)$ are some polynomials in $n$. As we assumed $n=O(\gamma(f))$ (see Section 4), the running time of L-COV$(t)$ is $O(T^{\mathrm{O}}_{\mathrm{L\text{-}COV}}(n)\cdot\gamma(f)+T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n))$. Moreover, if $t=t^*$, Procedure L-COV returns a subset $X\subseteq V$ such that $f(X)=t^*a(X)$ and $a(X)>0$.

By running COV$(0)$ we can tell whether $t^*=0$ or $t^*>0$. So we can assume that $t^*>0$. If we knew the value of $t^*$ and ran L-COV$(t^*)$, then it would return “$t^*=t^*$” and a subset $X\subseteq V$ such that $f(X)=t^*a(X)$ and $a(X)>0$, that is, $t^*=f(X)/a(X)$. We try to run L-COV$(t^*)$ without knowing the value of $t^*$. If we can run L-COV$(t^*)$ successfully without knowing the value of $t^*$, we can obtain $t^*$ as $f(X)/a(X)$ using $X\subseteq V$ such that $f(X)=t^*a(X)$ and $a(X)>0$. The point is how to run L-COV$(t^*)$ successfully without knowing the value of $t^*$. To achieve this goal, we use Megiddo's parametric search method [15,16].

5.2. Megiddo’s parametric search

We give a strongly polynomial time algorithm for Problem LSSP using the parametric search technique of Megiddo [15,16]. We explain this technique in the following paragraphs.

Operations used in running L-COV$(t^*)$ are additions, subtractions, comparisons, oracle calls for function evaluation of $f$, and only $n$ multiplications to obtain $t^*a(v)$ for each $v\in V$. So each value which appears in running L-COV$(t^*)$ can be represented in the form $p-qt^*$, where $p$ and $q$ are known values and not functions of $t^*$. We consider trying to run L-COV$(t^*)$ without knowing the value of $t^*$, with all the values represented as linear functions of $t^*$. When values are represented as linear functions of $t^*$, each operation is done as follows:

Operation

An addition: $(p_1-t^*q_1)+(p_2-t^*q_2):=(p_1+p_2)-t^*(q_1+q_2)$.

A subtraction: $(p_1-t^*q_1)-(p_2-t^*q_2):=(p_1-p_2)-t^*(q_1-q_2)$.

A comparison: $(p_1-t^*q_1)\gtreqless(p_2-t^*q_2)$ := decide whether $(p_1-t^*q_1)>(p_2-t^*q_2)$, $(p_1-t^*q_1)=(p_2-t^*q_2)$ or $(p_1-t^*q_1)<(p_2-t^*q_2)$.

An addition of two linear functions of $t^*$ needs 2 scalar additions. A subtraction of two linear functions of $t^*$ needs 2 scalar subtractions. So, even though $t^*$ is not known, additions and subtractions do not change the asymptotic running time of the procedure. A comparison of two linear functions of $t^*$, however, is not so easy. Let $p_1,p_2,q_1,q_2$ be known values and consider the comparison of $p_1-t^*q_1$ and $p_2-t^*q_2$. Setting $p=p_1-p_2$, $q=q_1-q_2$, we want to decide whether $p-t^*q>0$, $p-t^*q=0$ or $p-t^*q<0$. Now we assume $t^*>0$. The sign of $p-t^*q$ can be determined either immediately, if $pq\le0$, or by running Procedure COV with parameter $p/q$ to compare $p/q$ with $t^*$. We describe below Algorithm LSSP, which solves Problem LSSP within Megiddo's parametric search method.

Algorithm LSSP

Step 1: Decide whether “$t^*=0$” or “$t^*>0$” by running COV$(0)$. If $t^*=0$, then stop.

Step 2: Run L-COV$(t^*)$ without knowing the value of $t^*$, with all the values represented as linear functions of $t^*$. Each comparison of two linear functions of $t^*$ encountered during the computation can be evaluated (if necessary) by running Procedure COV. We can obtain $X\subseteq V$ such that $f(X)=t^*a(X)$ and $a(X)>0$.

Step 3: Return $t^*:=f(X)/a(X)$.

We will show that Algorithm LSSP solves Problem LSSP in strongly polynomial time after describing two procedures: Procedure COV and Procedure L-COV.
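The three steps of Algorithm LSSP can be sketched on a toy scale. In the sketch below, enumeration of all subsets replaces both the submodular function minimization inside Procedure COV and the fully combinatorial one inside Procedure L-COV, so only the control flow of the parametric search is illustrated, not its running time. All names are assumptions, and $f$ and $a$ are taken to be integer valued so that exact fractions can be used.

```python
from fractions import Fraction
from itertools import combinations

def subsets(ground):
    """Yield every subset of `ground` as a frozenset, the empty set first."""
    items = sorted(ground)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)

def lssp_parametric(f, a, ground):
    """Toy version of Algorithm LSSP for nonnegative integer f with f(empty) = 0."""
    sets = list(subsets(ground))
    a_of = {X: sum(a[v] for v in X) for X in sets}

    def cov(t):
        """Return -1, 0 or 1 according to t < t*, t = t*, t > t* (conditions (10))."""
        shifted = {X: f(X) - t * a_of[X] for X in sets}
        if min(shifted.values()) < 0:
            return 1
        gain = max(a_of[X] for X in sets if shifted[X] == 0)
        return 0 if gain > 0 else -1

    # Step 1: decide whether t* = 0.
    if cov(Fraction(0)) == 0:
        return Fraction(0)

    def sign_at_optimum(p, q):
        """Sign of p - t* q for the unknown t* > 0."""
        if p * q <= 0:
            d = p - q                      # same sign as p - t q for every t > 0
            return (d > 0) - (d < 0)
        c = cov(Fraction(p, q))            # sign of p/q - t*
        return c if q > 0 else -c

    # Step 2: simulate L-COV(t*); the value f(X) - t* a(X) is kept as the pair (f(X), a(X)).
    best, tight = sets[0], [sets[0]]
    for X in sets[1:]:
        s = sign_at_optimum(f(X) - f(best), a_of[X] - a_of[best])
        if s < 0:
            best, tight = X, [X]
        elif s == 0:
            tight.append(X)
    X = max(tight, key=a_of.get)           # a tight set with a(X) > 0 exists at t*

    # Step 3: recover t* from that tight set.
    return Fraction(f(X), a_of[X])

# Hypothetical instance on V = {0, 1}.
f_values = {frozenset(): 0, frozenset({0}): 2, frozenset({1}): 3, frozenset({0, 1}): 4}
f = f_values.__getitem__
```

The helper `sign_at_optimum` mirrors the comparison rule of this section: a comparison with $pq\le0$ is settled without any call to `cov`, and every other comparison costs one call with parameter $p/q$.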

5.3. Comparison of t with t∗

As a preparation for describing Procedure COV and Procedure L-COV, we introduce Algorithm MFM. Let $U$ be a finite nonempty set and $\mathcal{D}\subseteq2^U$ be a ring family. For a vector $b\in\mathbb{R}^U$, we define a modular function $b_{\mathcal{D}}:\mathcal{D}\to\mathbb{R}$ as

$$b_{\mathcal{D}}(X)=b(X)\quad(X\in\mathcal{D}).$$

Let us consider minimizing a modular function $b_{\mathcal{D}}:\mathcal{D}\to\mathbb{R}$. We assume that a directed graph representation of $\mathcal{D}$ is known. Using a result of Picard [18], modular function minimization over a ring family can be reduced to the minimum cut problem of a network with $O(|U|)$ vertices in $O(|U|^2)$ time, and Cunningham [3] showed the equivalence between the modular function minimization problem and the minimum cut problem. For the minimum cut problem, many combinatorial strongly polynomial time algorithms are known [1], and most of them are fully combinatorial. So we can design a fully combinatorial strongly polynomial time algorithm for modular function minimization over ring families. For example, $b_{\mathcal{D}}$ can be minimized with $O(|U|^3)$ fully combinatorial operations. For a vector $b\in\mathbb{R}^U$ and a ring family $\mathcal{D}\subseteq2^U$ (for which we know a directed graph representation), let Algorithm MFM be an algorithm which finds a minimizer of a modular function $b_{\mathcal{D}}$ with $T_{\mathrm{MFM}}(|U|)$ fully combinatorial operations, where $T_{\mathrm{MFM}}(|U|)$ is some polynomial in $|U|$. We assume $T_{\mathrm{MFM}}(|U|)=O(T^{\mathrm{A}}(|U|))$ and $T_{\mathrm{MFM}}(|U|)=O(T^{\mathrm{FC}}_{\mathrm{FC}}(|U|))$.

We describe below Procedure COV, which decides, for any given nonnegative value $t\ge0$, whether “$t<t^*$”, “$t=t^*$” or “$t>t^*$” using conditions (10) directly. In Step 1, we examine whether $ta\in P(f)$ or not. In Step 2, we maximize $a_{\mathcal{D}_f(ta)}$ and examine whether $t=t^*$ or $t<t^*$.

Procedure COV (Comparison with the Optimal Value)

Input: A nonnegative value $t\ge0$.

Output: A decision whether “$t<t^*$”, “$t=t^*$” or “$t>t^*$”.

Step 1: Minimize $f_{ta}$ on $2^V$ by running $\mathrm{SFM}_{\mathrm{am}}(f_{ta})$. If $\min\{f_{ta}(X)\mid X\subseteq V\}<0$ then stop ($t>t^*$). (If $\min\{f_{ta}(X)\mid X\subseteq V\}=0$ we obtain a directed graph representation of $\mathcal{D}_f(ta)$.)

Step 2: Maximize $a_{\mathcal{D}_f(ta)}:\mathcal{D}_f(ta)\to\mathbb{R}$ by running MFM$(-a,\mathcal{D}_f(ta))$. If $\max\{a(X)\mid X\in\mathcal{D}_f(ta)\}=0$ then stop ($t<t^*$). If $\max\{a(X)\mid X\in\mathcal{D}_f(ta)\}>0$ then return a maximizer of $a_{\mathcal{D}_f(ta)}$ and stop ($t=t^*$).

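As we assumed $n=O(\gamma(f))$ (see Section 4), for any $x\in\mathbb{R}^V$, the function value of $f_x$ can be acquired in $O(\gamma(f))$ time. So the running time of Step 1 is $O(T(n,\gamma(f)))$. Thus, the total running time of Procedure COV is $O(T(n,\gamma(f))+T_{\mathrm{MFM}}(n))=O(T(n,\gamma(f)))$. Let $T^{\mathrm{O}}_{\mathrm{COV}}(n)=T^{\mathrm{O}}(n)$, $T^{\mathrm{A}}_{\mathrm{COV}}(n)=T^{\mathrm{A}}(n)$, and $T_{\mathrm{COV}}(n,\gamma(f))=T(n,\gamma(f))$.

Let Procedure L-COV be the procedure obtained by replacing Algorithm $\mathrm{SFM}_{\mathrm{am}}$ by Algorithm $\mathrm{FC\text{-}SFM}_{\mathrm{am}}$ in Procedure COV. Moreover, if $t=t^*$, Procedure L-COV returns a subset $X\subseteq V$ such that $f(X)=t^*a(X)$ and $a(X)>0$. For any given $t\ge0$, once $ta(v)$ is computed for each $v\in V$, Procedure L-COV compares $t$ to $t^*$ with $O(T^{\mathrm{O}}_{\mathrm{FC}}(n))$ oracle calls for function evaluation of $f$, $O(T^{\mathrm{FC}}_{\mathrm{FC}}(n))$ comparisons, and $O(nT^{\mathrm{O}}_{\mathrm{FC}}(n)+T^{\mathrm{FC}}_{\mathrm{FC}}(n))$ additions and subtractions. Let $T^{\mathrm{O}}_{\mathrm{L\text{-}COV}}(n)=T^{\mathrm{O}}_{\mathrm{FC}}(n)$, $T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n)=T^{\mathrm{FC}}_{\mathrm{FC}}(n)$, and $T_{\mathrm{L\text{-}COV}}(n,\gamma(f))=T_{\mathrm{FC}}(n,\gamma(f))$. The running time of L-COV$(t)$ is $O(T_{\mathrm{L\text{-}COV}}(n,\gamma(f)))$.

5.4. Complexity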

The following theorem is the main result in the paper.

Theorem 5.1. Algorithm LSSP solves Problem LSSP in strongly polynomial time.

Proof. The running time in Step 1 is $O(T_{\mathrm{COV}}(n,\gamma(f)))$. In Step 2, $O(T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n))$ comparisons of linear functions of $t^*$ are evaluated and the running time of the other part is $O(T_{\mathrm{L\text{-}COV}}(n,\gamma(f)))$. So Algorithm LSSP runs in $O(T^{\mathrm{FC}}_{\mathrm{L\text{-}COV}}(n)\cdot T_{\mathrm{COV}}(n,\gamma(f))+T_{\mathrm{L\text{-}COV}}(n,\gamma(f)))$ time.

6. The minimum-ratio problem

Let f : 2V _→ _R_{be a submodular function with} _f₍_∅₎₌_{0 and} _f0 _:₂V _→ _R_{be a supermodular function with}

f0(∅)=0. We assume that f(X)≥0 for allX ⊆V and f0(X) >0 for some X ⊆V. Let us consider the following minimum-ratio problem.

Problem MR: Minimize f(X)

f0(_X)subject toX ⊆V,f 0₍

X) >0.

We give a strongly polynomial time algorithm for Problem MR. We can easily see that Problem MR is equivalent to the following maximization problem of a parametert.

Problem P-MR: Findt∗ =max{t∈R| f(X)−t f0(X)≥0,∀X ⊆V},

and findX⊆V such that f(X)−t f0(X)=0 and f0(X) >0.

Problem P-MR is a generalization of Problem LSSP. The optimal valuet∗of Problem P-MR is nonnegative. In the same way as the discussion in Section2, we have the following conditions fort∗and anyt≥0:

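$$
\begin{aligned}
t < t^* &\iff
\begin{cases}
\min\{f(X) - t f_0(X) \mid X \subseteq V\} = 0, \\
\max\{f_0(X) \mid X \in \arg\min(f - t f_0)\} = 0,
\end{cases} \\
t = t^* &\iff
\begin{cases}
\min\{f(X) - t f_0(X) \mid X \subseteq V\} = 0, \\
\max\{f_0(X) \mid X \in \arg\min(f - t f_0)\} > 0,
\end{cases} \\
t > t^* &\iff \min\{f(X) - t f_0(X) \mid X \subseteq V\} < 0.
\end{aligned}
\tag{16}
$$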

For any $t \ge 0$, $f - t f_0$ is a submodular function defined on $2^V$. Using (16) and the same technique as the algorithm for Problem LSSP (Section 5), we can develop a strongly polynomial time algorithm for Problem P-MR and simultaneously for Problem MR. Note that a supermodular function maximization problem on a ring family $D$, or a submodular function minimization problem on $D$, can be reduced to a normal submodular function minimization problem if we know a directed graph representation of $D$. (See Schrijver [21].)
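To make the role of conditions (16) concrete, the following Python sketch classifies a given $t \ge 0$ against $t^*$ on a toy instance. It is a brute-force illustration only: the minimization and the maximization over the set of minimizers are done by enumerating all subsets, whereas the algorithm discussed here would use submodular function minimization. The instance and the name `classify` are assumptions made for the example.

```python
from fractions import Fraction
from itertools import combinations

# Hypothetical toy instance on V = {0, 1, 2}; its optimal value is t* = 3/2.
V = (0, 1, 2)
CONCAVE = (0, 3, 5, 6)           # f(X) = CONCAVE[|X|]: submodular, f >= 0, f(empty) = 0
WEIGHT = {0: 2, 1: -1, 2: 1}     # f0(X) = sum of weights: modular, so supermodular


def f(X):
    return CONCAVE[len(X)]


def f0(X):
    return sum(WEIGHT[v] for v in X)


def subsets(ground):
    """Yield every subset of `ground` as a frozenset."""
    for k in range(len(ground) + 1):
        for chosen in combinations(ground, k):
            yield frozenset(chosen)


def classify(t):
    """Return '<', '=' or '>' according to whether t < t*, t = t* or t > t*."""
    values = {X: f(X) - t * f0(X) for X in subsets(V)}
    minimum = min(values.values())       # never positive: the empty set gives 0
    if minimum < 0:
        return ">"                       # third condition of (16)
    # minimum == 0: inspect the largest f0 value among the minimizers
    best = max(f0(X) for X, value in values.items() if value == minimum)
    return "=" if best > 0 else "<"


outcomes = [classify(Fraction(1)), classify(Fraction(3, 2)), classify(Fraction(2))]
```

The three queries fall below, at, and above the optimal value of this instance, so each branch of (16) is exercised once.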

Acknowledgments

I would like to thank Satoru Iwata for a number of useful suggestions, including the efficient method of constructing all the minimizers of a submodular function via the IFF algorithm, and two anonymous referees for their helpful comments on an earlier draft of this paper.


References

[1] R.K. Ahuja, T.L. Magnanti, J.B. Orlin, Network Flows: Theory, Algorithms, and Applications, Prentice-Hall, Englewood Cliffs, NJ, 1993.

[2] R.E. Bixby, W.H. Cunningham, D.M. Topkis, The partial order of a polymatroid extreme point, Mathematics of Operations Research 10 (1985) 367–378.

[3] W.H. Cunningham, Minimum cuts, modular functions, and matroid polyhedra, Networks 15 (1985) 205–215.

[4] J. Edmonds, Submodular functions, matroids, and certain polyhedra, in: R. Guy, H. Hanani, N. Sauer, J. Schönheim (Eds.), Combinatorial Structures and Their Applications, Gordon and Breach, New York, 1970, pp. 69–87.

[5] L. Fleischer, S. Iwata, A push-relabel framework for submodular function minimization and applications to parametric optimization, Discrete Applied Mathematics 131 (2003) 311–322.

[6] S. Fujishige, Lexicographically optimal base of a polymatroid with respect to a weight vector, Mathematics of Operations Research 5 (1980) 186–196.

[7] S. Fujishige, Submodular Functions and Optimization, North-Holland, Amsterdam, 1991.

[8] D. Gale, A theorem on flows in networks, Pacific Journal of Mathematics 7 (1957) 1073–1082.

[9] D. Hartvigsen, A submodular optimization problem with side constraints, Mathematics of Operations Research 23 (1998) 661–679.

[10] D. Hartvigsen, A strongly polynomial time algorithm for a constrained submodular optimization problem, Discrete Applied Mathematics 113 (2001) 183–194.

[11] S. Iwata, A fully combinatorial algorithm for submodular function minimization, Journal of Combinatorial Theory (B) 84 (2002) 203–212.

[12] S. Iwata, A faster scaling algorithm for minimizing submodular functions, SIAM Journal on Computing 32 (2003) 833–840.

[13] S. Iwata, L. Fleischer, S. Fujishige, A combinatorial strongly polynomial algorithm for minimizing submodular functions, Journal of the ACM 48 (2001) 761–777.

[14] N. Megiddo, Optimal flows in networks with multiple sources and sinks, Mathematical Programming 7 (1974) 97–107.

[15] N. Megiddo, Combinatorial optimization with rational objective functions, Mathematics of Operations Research 4 (1979) 414–424.

[16] N. Megiddo, Applying parallel computation algorithms in the design of serial algorithms, Journal of the ACM 30 (1983) 852–865.

[17] K. Murota, Discrete Convex Analysis, Society for Industrial and Applied Mathematics, Philadelphia, 2003.

[18] J.C. Picard, Maximal closure of a graph and applications to combinatorial problems, Management Science 22 (1976) 1268–1272.

[19] T. Radzik, Parametric flows, weighted means of cuts, and fractional combinatorial optimization, in: P.M. Pardalos (Ed.), Complexity in Numerical Optimization, World Scientific, Singapore, 1993, pp. 351–386.

[20] T. Radzik, Fractional combinatorial optimization, in: D.Z. Du, P.M. Pardalos (Eds.), Handbook of Combinatorial Optimization, vol. 1, Kluwer Academic Publishers, Boston, 1998, pp. 429–478.

[21] A. Schrijver, A combinatorial algorithm minimizing submodular functions in strongly polynomial time, Journal of Combinatorial Theory (B) 80 (2000) 346–355.