Contents lists available atScienceDirect
Theoretical Computer Science
journal homepage:www.elsevier.com/locate/tcs
Stable normal forms for polynomial system solving
Bernard Mourrain
a,∗, Philippe Trébuchet
baGALAAD, INRIA Méditerranée, 2004 Route des Lucioles, BP 93, 06902 Sophia Antipolis, Cedex, France bUPMC, LIP6, Equipe APR, 4 place jussieu, 75015 Paris, France
a r t i c l e i n f o Keywords: Multivariate polynomial Quotient algebra Normal form Border basis Root-finding Symbolic-numeric computation a b s t r a c t
The paper describes and analyzes a method for computing border bases of a zero-dimensional idealI. The criterion used in the computation involves specific commutation polynomials, and leads to an algorithm and an implementation extending the ones in [B. Mourrain, Ph. Trébuchet, Generalised normal forms and polynomial system solving, in: M. Kauers (Ed.), Proc. Intern. Symp. on Symbolic and Algebraic Computation, ACM Press, New-York, 2005, pp. 253–260]. This general border basis algorithm weakens the monomial ordering requirement for Gröbner bases computations. It is currently the most general setting for representing quotient algebras, embedding into a single formalism Gröbner bases, Macaulay bases and a new representation that does not fit into the previous categories. With this formalism, we show how the syzygies of the border basis are generated by commutation relations. We also show that our construction of normal form is stable under small perturbations of the ideal, if the number of solutions remains constant. This feature has a huge impact on practical efficiency, as illustrated by the experiments on classical benchmark polynomial systems, at the end of the paper.
©2008 Elsevier B.V. All rights reserved. 1. Introduction
Solving polynomial systems is the cornerstone of many computations in robotics, geometric modeling, signal processing, chromatology, structural molecular biology etc. In these applications, the systems have, most of the time, finitely many solutions and the equations often appear with approximate coefficients.
From a computational point of view, it is a challenge to develop efficient and numerically stable methods for solving such systems. Backward stability is required, that is, the computed solution should be the exact solution of a system in a small neighborhood of the input. Efficiency is also mandatory to tackle the systems. One can expect, for instance, that the behavior of the method depends mainly on the number of solutions, and partially on other extrinsic parameters, such as the number of variables.
To ensure backward stability, one can apply classical numerical methods such as Newton-like iterations. These local methods, however, provide neither guarantee of global convergence nor a complete description of all the roots. Algebraic methods, on the contrary, handle all the roots simultaneously. They reduce the problem to computing the structure of the quotient algebraAof the polynomial ring modulo, the idealIgenerated by the input system [8]. Such a structure is given by a basisBof the algebraAas a vector space and by the tables of multiplication in the algebraA. Equivalently, it can be described by an algorithm that projects the ring of polynomialsK
[
x]
onto the vector spaceh
Bi
, generated by the basisB along the idealI. We call such a projection, a normal form for the idealI.From a knowledge of multiplication tables, we deduce either an exact encoding of the roots through a rational univariate representation [32,12], or a numerical approximation by means of eigenvector computation [1,24,34]. Since the
∗Corresponding author.
E-mail addresses:[email protected](B. Mourrain),[email protected](P. Trébuchet). 0304-3975/$ – see front matter©2008 Elsevier B.V. All rights reserved.
eigencomputation is a numerically well-controlled process [13], the remaining challenge is to compute efficiently and in a stable way the quotient algebra structureA.
In the family of algebraic methods, resultant-based techniques (see, e.g. [19,9,3]) exploit the properties of the coefficients matrices of the monomial multiples in the input equations. The multiplication tables can be expressed via Schur complements in these matrices [9]. Their construction is deeply linked to the geometry of the underlying variety, which makes these methods very tolerant to a small perturbation. Unfortunately they heavily rely on genericity hypotheses, and this reduces their applicability. Moreover, the size of the constructed matrices usually grows exponentially with the number of variables.
To avoid such a pitfall, the so-called H-bases have been studied [20,22]. The construction proceeds degree by degree, and stops when the terms of the highest degree of the computed polynomials generate the terms of the highest degree in all the idealI. The stopping criterion requires the computation of generators of the syzygies of these highest degree terms. Though it also yields a basis of the quotient ringA, without any apriori knowledge of its dimension, practically speaking, it also suffers from a swelling of the size of the linear systems to be solved.
The approach can be refined further, by using a grading in which the highest term of a polynomial is a monomial. This leads to Gröbner basis computation (see, e.g. [6]). The approach also yields a basis of the quotient algebraAand a normal form for the idealI. As in the other methods, their computation can be seen just as a triangulation of a certain matrix. Unfortunately these methods suffer from unavoidable instability: the monomial ordering attached to the Gröbner basis makes the pivot selection strategy in the triangulation depend only on the symbolic structure of the rows of the matrix, and not on the numerical values of the coefficients appearing in these rows. Thiscanlead to unwanted artificial singularities in the representation of the quotient ring. (Compare, for instance, the degree reverse lexicographic Gröbner basisp1
=
ax21+
bx2
2
+
ε
1x1x2,p2=
cx21+
dx 22
+
ε
2x1x2withε
1=
ε
2=
0 andε
16=
0,ε
26=
0.)To circumvent this artificial difficulty, a new approach was proposed in [25], based on a normal form criterion and involving commutation relations (for basesBconnected to 1). Further investigations of this technique in [27,28] and the Ph.D. dissertation [36] have led to a first version [29] of a normal form algorithm, which allowed one to construct efficiently and in a stable way the zero-dimensional quotient algebra representations. Similar investigations were also pursued in [15,
16], but in the more restrictive case, where the basis is stable by division. Other investigations related to the stabilization of the normal form process can be found in [34].
In this paper, we discuss border basis algorithms in the general sense, that is, for the basesBconnected to 1. See [26] for an introductory presentation of their properties. As in [11] or [10], our approach is based on the linear algebra tools. As for the H-bases, we use a grading of the polynomial ring, but the construction is optimized in the spirit of [27]. We describe an efficient criterion based on commutation polynomials to check the normal form property. This leads to an algorithm, presented in [29], which has been improved to treat optimally the case when a syzygy of the components of highest degree is found, which is not a syzygy of the corresponding polynomials. We prove that the syzygies of a (general) border basis are generated by the commutation relations, giving a short and concise answer to a conjecture in [16] for bases stable by division. Meanwhile, work related to the conjecture for this special case was also investigated in [14]. Regarding the numerical stability of border bases, we prove that our construction of normal form is stable against small perturbations of the input system, if the number of solutions remains constant.
The paper is structured as follows. The next section is devoted to the notations, and Section3to the stopping criterion for generalized normal forms. In Section4, we show how to recover the syzygies from the commutation relations. In Section5, we recall briefly the main idea of the algorithm in [29]. In Section6, we analyze the stability of this algorithm from a symbolic-numeric perspective. Finally, we show the efficiency of our implementation and its numerical behavior on classical polynomial benchmarks.
2. Notations
We recall some definitions in [25,27,28] and add a few more ones.
LetKbe an effective field. The ring ofn-variate polynomials overKwill be denoted byR,R
=
K[
x] =
K[
x1, . . . ,
xn]
. Weconsidern-variate polynomialsf1
, . . . ,
fs∈
R. These polynomials generate an ideal ofK[
x]
that we denoteI. The quotient of K[
x]
moduloIwill be denoted byA. From now on, we suppose thatI is zero dimensional, so thatAis a finite dimensional K-vector space. Our goal is to solve the system of equationsf1=
0, . . . ,
fs=
0 over the algebraic closureKofK. The roots,with coordinates in the algebraic closure ofK, are denoted
ζ
1, . . . , ζ
d, withζ
i=
(ζ
i,1, . . . , ζ
i,n)
∈
Kn.The support supp
(
p)
of a polynomialp∈
K[
x]
is the set of monomials appearing inpwith non-zero coefficients. Given a setSof elements ofK[
x]
, we denote byh
Si
theK-vector space spanned by the elements ofS. Finally, we denote the set of all the monomials in the variablesx=
(
x1, . . . ,
xn)
byM. A term is an element of the formλ
·
mwithλ
∈
K− {
0}
andm
∈
M. For a subsetSofM, we denote byScthe set-theoretical complement of the setSinM. For any monomialm∈
M,m
·
Mdenotes the set of all monomial multiples ofm.For any subsetSofR, we denote byS+orD
(
S)
the setS+=
S∪
x1S
∪ · · · ∪
xnS,∂
S=
S+\
S.S+is called theprolongationofS. For anyk
∈
N,S[k]isSk+···+timesthe result of applyingktimes the operator+onS. By convention,S[0]isS.IfB
⊂
Mcontains 1, then for any monomialm∈
Mthere existsksuch thatm∈
B[k]. We say that a monomialmis ofA set of monomialsBis said to be connected to 1 if and only if, 1
∈
Band for every monomialminB, there exists a finite sequence of variables(
xij)
j∈[1,l]such that∀
l0
∈ [
1,
l]
,
Q
j∈[1,l0]xij
∈
BandQ
j∈[1,l] xij
=
m.A set of monomialsBis stable by division if for anym
∈
Band any variablexisuch thatm=
xim0(m0∈
M), we havem0inB. Remark that a setBstable by division is connected to 1.
Definition 2.1. LetΛbe a monoid with a well order relation
≺
, such that:∀
α, β, γ
∈
Λ, α
≺
β
⇒
γ
+
α
≺
γ
+
β.
A
(
Λ,
≺
)
-grading ofK[
x]
is the decomposition ofK[
x]
as the direct sumK[
x] =
L
λ∈ΛK
[
x]
[λ], with the following property:∀
f∈
K[
x]
[α],
g∈
K[
x]
[β]⇒
f g∈
K[
x]
[α+β].
The following element ofΛis denoted degΛ
(
f)
or deg(
f)
(when no confusion is possible) and byΛ(
f)
, and is said to be the degree off: Λ(
f)
=
min(
λ
∈
Λ|
f∈
M
λ0λ K[
x]
[λ0])
.
For any setV
⊂
K[
x]
, letVλ=
L
λ0λ K
[
x]
[λ0]∩
V. For anyλ
∈
Λ, letλ
+=
min{
λ
0∈
Λ;
K[
x]
+λ⊂
K[
x]
λ0}
and letλ
−=
max{
λ
0∈
Λ;
K
[
x]
+λ0⊂
K[
x]
λ}
.
In order not to be confused with different notions of degree, for any monomialmofK
[
x]
, we define the size ofm, denotedby
|
m|
, to be the integerdsuch thatm=
xi1· · ·
xid, which itself imposes a grading.Another classical grading, is the one associated with a monomial ordering, whereΛ
=
Nnand≺
is a monomial order(see [7, p. 328]) such that for all
α
=
(α
1, . . . , α
n)
∈
Nnwe haveK
[
x]
[α]=
Kxα11· · ·
xαnn.
Definition 2.2. We say thatΛis a reducing grading, ifΛis a grading and if the following property holds: for all monomials m
,
m0∈
Msuch thatm0strictly dividesmwe haveΛ(
m0)
≺
Λ(
m),
Λ(
m0)
6=
Λ(
m).
Both gradings induced by the classical degree and a monomial ordering are reducing grading. Hereafter,we denote byΛa reducing grading.
Definition 2.3. A rewriting familyFfor a monomial setBis a set of polynomialsF
= {
fi}
i∈I, such that supp(
fi)
⊂
B+,fihasexactlyonemonomial
γ (
fi)
in∂
B(also called theleading monomialoffi) such that ifγ (
fi)
=
γ (
fj)
theni=
j,Remark that the elements ofFcan be seen as rewriting rules for the leading monomial using monomials of the setB. Definition 2.4. A reducing familyFof a degree
λ
∈
Λfor a setBis a set of polynomials, such thatFis a rewriting family for the setB,∀
m∈
∂
Bof degree at mostλ
,∃
f∈
F|
γ (
f)
=
m.For the setB
= {
1,
x0,
x1,
x0x1}
, the set of polynomialsF= {
x20−
1,
x 21
−
x1,
x20x1−
x1,
x21x0−
x1}
is a reducing family ofdegree 3.
A reducing family of degree
λ
for a setB(connected to 1) allows us to rewrite the monomials ofh
B+i
λ moduloF as elements of
h
Bi
λ. This leads, in fact, to the definition of the linear projectionπ
F, associated to a reducing family for a setBconnected to 1.
Definition 2.5. Given a reducing familyF of degree
λ
∈
Λfor a setBconnected to 1, we define the linear projectionπ
F: h
B+i
λ→ h
Bi
λsuch that∀
m∈
Bλ, π
F(
m)
=
m,
∀
m∈
∂
Bλ, π
F(
m)
=
m−
f;
wheref
∈
Fis the unique member ofF, such asm=
γ (
f)
. We extend this construction toh
B+i
λbyK-linearity.
There is a parallel with the differential algebra terminology, which we want to highlight here. To a polynomial f
(
x1, . . . ,
xn)
∈
Rof a degree of at mostδ
, we can associate the differential equationf(∂
1, . . . , ∂
n)
Φ(
t1, . . . ,
tn)
=
0, where∂
iis the derivation with respect to the variableti. It is a linear equation inΦ. The solutionΦ(
t1, . . . ,
tn)
lives in the ring K[[
t1, . . . ,
tn]]
of formal power series int1, . . . ,
tn, which we can truncate in degreeλ
Λ(
fi)
. In the literature this setJλ
(
t)
of truncated series in degreeλ
is called the space ofλ
-jets. See, e.g. [33] for more details.Given a set of polynomialsF
⊂
K[
x]
of degree at mostλ
, we consider the so-calledsolution manifoldRλof the differential systemF(∂)(
Φ)
=
0 inJλ(
t)
. (In our case, it is just a linear space.) Givenλ
0≺
λ
andΦ∈
Jλ
(
t)
, we can delete the coefficients of degrees exceedingλ
0, and project the set of solutions ofF(∂)(
Φ)
=
0 projected onJλ0(
t)
. Then this set would be defined by the equationsFλ0(
Φ0)
=
0 whereh
Fλ0i = h
Fi ∩
K[
x]
λ0, and we denote it byπ
λ0(
F)
. We denote byD(
F)
the new system that extendsFwith the equations∂
if(∂)
=
0, fori∈
1. . .
n,f∈
F, obtained by formal multiplication by the∂
i. This systemis called the prolongation ofF.
The polynomial systemFof degree at most
λ
is said to beformally integrableifπ
λ(
Dr+1(
F))
=
Ffor anyr≥
0. A technicalcondition of involutivity is introduced to ensure the regularity of the differential system [30]. A result of Cartan–Kuranishi [4,17,30] asserts that any systemFcan be transformed by prolongation and projection into a (involutive) formally integrable
system. The connection of this construction with the notion of Mumford regularity of polynomial systems has been detailed in [21]. This correspondence has been used explicitly for solving polynomial equations in [31]. This involutive division is also used to construct a family of normal form projection related to the so-called involutive bases (see e.g. [2]). Unlike the Janet basis construction, however, the variables do not play a symmetric role. The so-called multiplicative variables are used to perform reduction and non-multiplicative ones to extend the polynomial vector space.
In the following, instead of working in the vector space of all polynomials of a given degree, we will consider completion procedures based on prolongations and projections relative to a specific set of monomialsB. Dealing locally with this set of monomials related to the number of solutions of the system, will allow us to improve significantly the linear algebra stages when we deal with this type of methods.
In the sequel, we will make heavy use of multiplication operators by one variable that we define as follows: Mi,λ
: h
Bi
λ−→ h
Bi
λb
7→
π
F(
xib).
The subscriptλis redundant, as soon as we know thatFis a reducing family of degree
λ
, and we will omit this subscript in the sequel.Definition 2.6. LetF
= {
f1, . . . ,
fs}
be a polynomial set, we denote byFhλithe vector space:Fhλi
= h{
xαfi|
Λ(
xαfi)
≤
λ
}i
.
Obviously, we haveFhλi
⊂
(
F)
λwhere(
F)
is the ideal generated byF. Next we introduce a definition, which weakens thenotion of monomial ordering for Gröbner basis.
Definition 2.7. We say that a function
γ
:
K[
x] →
M(whereMis the set of all monomials in the variablesx1, . . . ,
xn) isa choice function refining the gradingΛif, for any polynomialp,
γ (
p)
is a monomial such that (a)γ (
p)
∈
supp(
p)
, (b) if m∈
supp(
p)
,m6=
γ (
p)
thenγ (
p)
does not dividem, and (c)Λ(γ (
p))
=
max{
Λ(
m),
m∈
supp(
p)
}
. The coefficient of the monomialγ (
p)
inpwill be denoted byκ(
p)
.Example 2.8. In the following, we consider aMacaulay1choice function
γ
, such that for allp∈
K
[
x]
, the functionγ (
p)
=
xα1
1
· · ·
xαnn satisfies the following property|
γ (
p)
| =
max{|
m|;
m∈
supp(
p)
} =
d, and∃
i0st.α
i0=
max{
degxi(
m),
m∈
supp
(
p)
and,
|
m| =
d;
i=
1, . . . ,
n}
. In case more than one monomial satisfies these conditions, the greatest monomial according to the lexicographic order is chosen.The monomial returned by the choice function has the same name as theleadingmonomial of an element of a reducing family. This is intended, as we will define a reducing family on behalf of the choice functions, and in this framework the two monomials coincide. Hereafter if S
= {
p1, . . . ,
ps}
is a polynomial set, then we denote byγ (
S)
the set:γ (
S)
=
{
γ (
p1) . . . γ (
ps)
}
.
Definition 2.9. Let
γ
be a choice function refining a gradingΛ. For any polynomialsp1,
p2∈
K[
x]
, let the C-polynomialrelative to
γ
and(
p1,
p2)
be C(
p1,
p2)
=
lcm(γ (
p1), γ (
p2))
κ(
p1) γ (
p1)
p1−
lcm(γ (
p1), γ (
p2))
κ(
p2) γ (
p2)
p2.
Let us writeΛ
(
lcm(γ (
p1)
,γ (
p2)))
for theC-degree of(
p1,
p2)
and lcm(γ (
p1), γ (
p2))
for the leading monomial of the pair(
p1,
p2)
.If
γ
is a monomial ordering, then this is almost the same definition as for anS-polynomial in [6]. We, however, use a new name to emphasize that nowγ
can be not a monomial ordering (i.e. a total order compatible with monomial multiplication). As we will see in the next section, theC-polynomials express commutation conditions for theMi,λ.3. Generalized normal form criterion
LetF
= {
f1, . . . ,
fs}
be a polynomial system, and letIbe the ideal generated by it. Remember thatFhλi(theK-vector spacespanned by the monomial multiples of thefi,xαfiof degrees
λ
∈
Λ) is included intoIλ. Thus, whenIλ=
Fhλiwe can definea normal form moduloI, up to the degree
λ
as the projection ofK[
x]
λalongFhλionto a supplementary spaceh
Bi
λ. Hereafter,we consider a setBof monomials, containing 1.
LetFbe a rewriting family and letH
= {
m,
∃
p∈
F, γ (
p)
=
m}
be the set of their leading monomials. Then, obviously, F allows us to define the projectionπ
F of the intersectionB∪
Hon the setBalongh
Fi
. We may, however, extend thisprojection using the following extension process:
Definition 3.1. LetFbe a rewriting family. For allm
∈
B, we defineπ
Fe(
m)
=
m. Form6∈
B, there existsm0∈
∂
Bandr integersi1, . . . ,
ir∈ [
1,
n]
, such thatm=
xi1· · ·
xirm0. We defineπ
eF
(
m)
by induction onk, as follows.•
ifr=
0,π
e F(
m 0)
is defined asπ
e F(
m 0)
=
π
F
(
m0)
=
m0−
f wheref∈
Fis such thatγ (
f)
=
m0.• ∀
k,
1≤
k≤
r,π
Fe(
xir−k· · ·
xirm0
)
=
π
F(
xir−kπ
e
F
(
xir−k+1· · ·
xirm0
))
where the latter quantity is defined. Otherwise, we saythat
π
Fe(
m)
is undefined.Remark that the above process allows us to define
π
Feonly on monomials, and we extend it implicitly, by linearity. Remark also, that this extension process is not defined in a unique way. Indeed, two different decompositions of a monomialmmay lead to two different values ofπ
Fe(
m)
. The following theorem, however, shows that this extension process becomes canonical as soon as we check some commutativity conditions.Theorem 3.2. Assume that B is connected to1. Let F be a rewriting family, and let E be the set of monomials m such that for all decompositions of m into the products of variables, m
=
xi0· · ·
xik,π
F(
xi0π
F(
xi1· · ·
π
F(
xik)
· · ·
))
is defined. Suppose that for allm
∈
E and all indices i,
j∈ [
1,
n]
, such that xixjm∈
E, we haveπ
eF
(
xiπ
Fe(
xjm))
=
π
Fe(
xjπ
Fe(
xim)).
Then
π
eF coincides with the linear projection
π
Sofh
Ei
onh
Bi
along the vector space spanned by the polynomials S= {
xαf, α
∈
Nn,
f∈
F and xαγ (
f)
∈
E}
.Proof. Remark that the way we define it, makes
π
Feinherently alinearmultivalued map. Hence to prove the theorem, we first have to show that, under the above hypotheses,π
Febecomes a well defined map, and next that this well defined linear map coincides with the projectionπ
Sofh
Ei
onh
Bi
alongh
Si
.Remark also thatEis obviously stable by monomial division: if all the possible decompositions ofmas a product of variablesm
=
xi0· · ·
xikare such thatπ
F(
xi0· · ·
π
F(
xik))
is defined, then a fortiori ifm0
is a divisor ofm, this property is true form0. Let us show that the extension process is independent of the waymis decomposed as a product of variables. Let
m
=
xi0m0
=
x i1m00withi
0
6=
i1andm,
m0,
m00∈
E. Then there existsm000∈
E(sinceEis stable by monomial division), suchthatm
=
xi0xi1m000
. Sincem
,
m0,
m00,
andm000are inE, the projectionsπ
Fe(
m0)
,π
Fe(
m00)
,π
Fe(
xi0m0
)
,
π
Fe(
xi1m00
),
and
π
Fe(
m000)
are defined and we have:π
e F(
xi0π
e F(
m 0))
=
π
e F(
xi0π
e F(
xi1π
e F(
m 000))),
π
e F(
xi1π
e F(
m 00))
=
π
e F(
xi1π
e F(
xi0π
e F(
m 000))).
The commutation condition guarantees that the two quantities are equal, so that the definition of
π
eF does not depend on
the way of writingmas a product of variables.
Next, we have to show that the projections
π
Feandπ
Scoincide on their common set of definition. We do this by inductionon the size of the monomials as follows:
It is true that
π
Fe(
1)
=
π
S(
1)
=
1 (since 1∈
B). For any monomialm6=
1 inE, the property of connectivity ofBand the definition ofEimplies that∃
m0∈
Eandi0∈ [
1,
n]
such thatm=
xi0m0
and
π
Fe(
m0)
is defined, so that we haveπ
e F(
m)
=
π
Fe(
xi0m 0)
=
defπ
e F(
xi0π
e F(
m 0))
=
inductionπ
e F(
xi0π
S(
m 0))
∈ h
Bi
.
Now by induction,m0−
π
S(
m0)
∈
Sλ−whereλ
=
Λ(
m)
and m−
π
Fe(
m)
=
xi0(
m 0−
π
S(
m0))
+
xi0π
S(
m 0)
−
π
e F(
xi0π
S(
m 0))
∈ h
Si
.
Thus
π
Fe(
m)
is the projection ofmonh
Bi
alongh
Si
.Now, suppose that we are given a reducing family of degree
δ
instead of a rewriting family. Then, we can further extend the above theorem with the help from the following lemma.Lemma 3.3. Let F be a reducing family of degree
λ
, for a set B connected to1and suppose that∀
f∈
F,
Λ(γ (
f))
=
Λ(
f)
. With the notation ofTheorem3.2, the set E contains the set of monomials of degree less than or equal toλ
.Proof. Letm
∈
Mλ be a monomial of degree less than or equal toλ
, so thatmcan be written asm=
xi1. . .
xid withd
= |
m|
. Let us prove by induction onk≤
dthat the projection pk=
π
F(
xikπ
F(
· · ·
π
F(
xi1))
· · ·
)
is defined and that Λ(
pk)
≤
Λ(
xik· · ·
xi1)
.Considerxik+1pk
∈
B+. Form0
∈
supp(
xik+1pk
)
∩
B, we haveπ(
m0
)
=
m0and by the induction hypothesisΛ(
m0)
≤
Λ
(
xik+1· · ·
xi1)
. We have a rewriting rule for m0
∈
supp
(
xik+1pk)
∩
∂
BbecauseF is a reducing family of degreeλ
. The hypothesis that∀
f∈
F,
Λ(γ (
f))
=
Λ(
f)
implies thatm0rewrites in terms of the monomials of degrees bounded byΛ
(
xik+1· · ·
xi1)
. Thereforepk+1:=
π
F(
xik+1pk)
is defined andΛ(
pk+1)
≤
Λ(
xik+1· · ·
xi1)
. By inductionπ
Fe(
xi1· · ·
π
e
F
(
xid))
is defined for any decompositionm=
xi1· · ·
xid∈
Mλ, so thatm∈
E. This completes theproof.
Theorem 3.4. Let F be a reducing family of degree
λ
for a set B connected to1. If we have:• ∀
f∈
F,
Λ(γ (
f))
=
Λ(
f)
.•
Mj,λ◦
Mi,λ−=
Mi,λ◦
Mj,λ−, for1≤
i,
j≤
n,Proof. ByLemma 3.3, we haveE
⊃
MλbecauseFis a reducing family of degreeλ
.Let us prove that for allm
∈
Mλ−− and all pairs of indices(
i,
j)
, there exists a way to defineπ
Fe such thatπ
Fe(
xiπ
eF
(
xjm))
=
π
Fe(
xjπ
Fe(
xim)).
AsMλ−−
⊂
Mλ⊂
E,π
Fe(
m)
is defined and supp(π
Fe(
m))
⊂
B. We defineπ
Fe(
xim)
=
π
F(
xiπ
Fe(
m))
and, similarly,π
eF
(
xjm)
=
π
F(
xjπ
Fe(
m))
. With this definition we have:π
eF
(
xiπ
Fe(
xjm))
=
Mi,λ(
Mj,λ−(π
eF
(
m)))
=
Mj,λ(
Mi,λ−(π
Fe(
m)))
=
π
Fe(
xjπ
Fe(
xim)),
which proves the commutation property. We end the proof by applyingTheorem 3.2. Corrolary 3.5. With the hypothesis ofTheorem3.4, we haveK
[
x]
λ= h
Bi
λ⊕
Fhλi.Let us show another effective way of checking whether we can define a projection fromFhλi(vector space spanned by the
monomial multiples of thefiof degree
λ
) ontoh
Bi
λ(the elements of degreeλ
in the vector space spanned byB) starting froma reducing family of degree
λ
, and without computingexplicitlythe multiplication operators.Theorem 3.6. Let
λ
∈
Λ. Let F be a reducing family of degreeλ
∈
Λ, for B. Assume that∀
f∈
F,
Λ(γ (
f))
=
Λ(
f)
and letπ
Fbe the induced projection from
h
B+i
λonto
h
Bi
λ. Then∀
f,
f0∈
Fhλisuch that C
(
f,
f0)
∈ h
B+i
λ,π
F(
C(
f,
f 0))
=
0
iff
π
Fextends uniquely as a projectionπ
FefromK[
x]
λontoh
Bi
λsuch thatker(π
e F)
=
Fhλi.Proof. ByTheorem 3.4, we have to show that this condition is equivalent to the commutation of the operatorsMi,λ0
, λ
0< λ
on the monomials ofBλ−−.For anym
∈
Bλ−− and anyi16=
i2, such thatxi1m
∈
∂
B,xi2m∈
∂
B, there existsf,
f 0∈
F hλ−isuch thatγ (
f)
=
xi 1m,γ (
f0)
=
x i2m. Thus, we haveπ
F(
xi1m)
=
γ (
f)
−
f,π
F(
xi2m)
=
γ (
f 0)
−
f0andC(
f,
f0)
=
x i2f−
xi1f 0∈ h
Bi
+ λ. Consequently, Mi2,λ(
Mi1,λ−(
m))
−
Mi1,λ(
Mi2,λ−(
m))
=
Mi2,λ(γ (
f)
−
f)
−
Mi1,λ(γ (
f 0)
−
f0)
=
π
F(
xi2γ (
f)
−
xi2f)
−
π
F(
xi1γ (
f 0)
−
xi1f 0)
=
π
F(
xi1f 0−
xi2f)
=
π
F(
C(
f 0,
f)),
which is zero by hypothesis. A similar proof applies ifxi1m∈
Borxi2m∈
B.Conversely, since ker
(π
Fe)
=
FhλiandC(
f,
f0)
∈
Fhλi∩ h
Bi
+λ, we have thatπ
F(
C(
f0,
f))
=
π
Fe(
C(
f 0,
f
))
=
0, which proves the equivalence andTheorem 3.6.Remark 3.7. In this proof, we have shown that if theC-polynomials up to the degree
λ
reduce to 0, then the multiplication operatorsMi,λcommute.Definition 3.8. A reducing familyFfor all degrees
λ
∈
Λon a setBof monomials, connected to 1 such thatπ
F extendsuniquely toRwill be called aborder basisforB.
Finally, the previous results lead us to a new proof of Theorem 3.1 of [25]:
Theorem 3.9. Let F be a reducing family for a set B of monomials, connected to1, let
π
F be the corresponding reduction fromh
B+i
ontoh
Bi
, and let Mi
: h
Bi → h
Bi
, such that∀
b∈ h
Bi
, Mi(
b)
=
π
F(
xib)
. ThenMj
◦
Mi=
Mi◦
Mj,
for 1≤
i,
j≤
niff there exists a unique projection
π
FefromK[
x]
ontoh
Bi
such thatker(π
Fe)
=
(
F)
and(π
eF
)
|hB+i=
π
F.Proof. Under the above hypotheses, byTheorem 3.4, the projection
(π
F)
|hB+iλextends uniquely for anyλ
∈
Λto a projectionπ
eFλfromK
[
x]
λontoh
Bi
λ, such that ker(π
e
Fλ
)
=
Fhλi. Since for anyλ, λ
0∈
Λsuch thatλ
≺
λ
0, we have(
Bλ0)
λ=
Bλ, and Fhλi⊂
(
Fhλ0i)
λ, we also have(π
eFλ0
)
|K[x]λ=
π
Feλ. This defines a unique linear operatorπ
e F onK
[
x]
, such thatπ
e |K[x]λ=
π
e Fλ and ker(π
e F)
=
P
λ∈ΛFhλi
=
(
F)
. We have completed the proof of the direct implication. The converse implication isimmediate.
4. Syzygies and commutation relations
In this section, we analyze more precisely the relations between the polynomials of the border basisF
=
(
fω)
ω∈∂Bwherefω
=
ω
−
ρ
ωwithρ
ω∈ h
Bi
. These relations or syzygies form a module that we denote by Syz(
F)
=
(
X
ω hωeω∈
K[
x]
∂B;
X
ω hωfω=
0)
,
where
(
eω)
ω∈∂Bis the canonical basis ofK[
x]
∂B. IfFis a border basis family constructed from an initial set of polynomialsH
= {
h1, . . . ,
hs}
, we can expressfω∈
Fin terms of the polynomials inH(and conversely) so that a syzygy onFinduces asyzygy onH(and conversely). Therefore, we are going to consider only the syzygies onF. For anyb
∈ h
Bi
andi∈ [
1,
n]
,xib∈ h
B+i
can be projected inh
Bi
alongF:π
F(
xib)
=
xib−
X
ω∈∂Bµ
i b,ωfω.
More generally, for anyp
∈
K[
x]
, we denote byµ
ip,ω the coefficientµ
ib,ωinπ
F(
xib)
forb=
π
F(
p)
∈ h
Bi
and writeµ
i(
p)
=
P
ω∈∂Bµ
i
pfω. Notice that ifxip
∈
Bthenµ
i(
p)
=
0.For anym
=
xi3· · ·
xik∈
Bandi1,
i2∈
1. . .
n, the two decompositionsπ
F(
xi1π
F(
xi2m))
=
π
F(
xi2π
F(
xi1m))
yield the syzygy xi1X
ω∈∂Bµ
i2 m,ωfω−
xi2X
ω∈∂Bµ
i1 m,ωfω−
X
ω∈∂B(µ
i2 xi1m,ω−
µ
i1 xi2m,ω)
fω=
0.
(1)These relations can be also rewritten as: xi1
µ
i2
(
m)
+
µ
i1(
xi2m
)
−
xi2µ
i1
(
m)
−
µ
i2(
xi1m
)
=
0.
We denote byΞ the module ofK
[
x]
∂B generated by these syzygies. It is also the module generated by the relations ofcommutationMi1
◦
Mi2(
b)
−
Mi2◦
Mi1(
b)
=
0 for allb∈
B,ii,
i2∈ [
1. . .
n]
. We distinguish the following relations:•
Ifxi1m,
xi2m∈
B, thenµ
i1
(
m)
=
µ
i2(
m)
=
0,µ
i1(
xi2m
)
=
µ
i2
(
xi−1m
)
=
0 andmdoes not yield a non-trivial relation ofthe form (1).
•
Ifxi1m∈
Bbutxi2m∈
∂
B, andxi1xi2m∈
∂
B, thenπ
F(
xi2m)
=
xi2m−
fxi2mand we have the relationxi1fxi2m
−
fxi1xi2m+
X
ω∈∂Bµ
i1xi2m,ωfω
=
0.
(2)•
Ifxi1m∈
B,xi2m∈
∂
B, andxi1xi2m∈
B, thenπ
F(
xi2m)
=
xi2m−
fxi2mand we have the relationxi1fxi2m
+
X
ω∈∂Bµ
i1xi2m,ωfω
=
0.
(3)•
Ifxi1m∈
∂
Bandxi2m∈
∂
B, thenπ
F(
xi1m)
=
xi1m−
fxi1m,π
F(
xi2m)
=
xi2m−
fxi2mand we have the syzygy xi1fxi2m−
xi2fxi1m−
X
ω∈∂B(µ
i2 xi1m,ω−
µ
i1 xi2m,ω)
fω=
0.
(4)The syzygies (2) and (4) are called respectivelynext-doorandacross-the-street relationsin [16]. The syzygies (3) do not exist ifBis stable under division (since in this case,xi2m
6∈
Bimpliesxi1xi2m6∈
B). All these syzygies are simply the non-trivial relations induced by the ‘‘border’’C-polynomials (seeDefinition 2.9).Now let us prove that the module of syzygies is generated by these commutations syzygies. For any monomialm
=
xi1
· · ·
xik, we can rewrite by induction its projection as:π
F(
m)
=
π
F(
xi1π
F(
xi2· · ·
)
=
m−
X
l=1...k xi1· · ·
xil−1X
ω∈∂Bµ
il xil+1···xk,ωfω (5)with the convention thatxik+1
=
1. We denote by Ξxi1,...,xik=
X
l=1...k xi1· · ·
xil−1X
ω∈∂Bµ
il xil+1···xk,ωeωthe corresponding element ofK
[
x]
∂B. Notice that this decomposition depends on the order of the decompositionm=
xi1
· · ·
xikas a product of variables.Lemma 4.1. If m
=
xi1· · ·
xik=
xj1· · ·
xjkthen Ξxi1···xik−
Ξxj1,...,xjk∈
Ξ.
Proof. First consider a permutation of two variablesm
=
m1xilxil+1m2=
m1xil+1xilm2 (withm1=
xi1· · ·
xil−1,m2=
xil+2· · ·
xik). Using (5), the two ways of projectingm=
m1xilxil+1m2=
m1xil+1xilm2yield the syzygyΞ...,xil,xil+1,...
−
Ξ...,xil+1,xil,...=
m1 xilX
ω∈∂Bµ
il+1 m2,ωeω−
xil+1X
ω∈∂Bµ
il m2,ωeω+
X
ω∈∂B(µ
il xil+1m2,ω−
µ
il+1 xilm2,ω)
eω!
which is inΞ. By iterated permutations of two variables, we transform the sequence xi1
, . . . ,
xik into the sequenceLemma 4.2. If m
θ
=
m0θ
0withθ, θ
0∈
∂
B and m,
m0∈
M, then m eθ≡
m0eθ0+
X
ω∈∂B
pωeωmodΞ
with pω
∈
K[
x]
of degree<
max(
|
m|
,
|
m0|
)
.Proof. Ifm
=
xi1· · ·
xidwithθ
=
xid+1b∈
∂
Bandb∈
B, the projection formula (5) ofmθ
has the formπ
F(
mθ)
=
mθ
−
xi1· · ·
xidfθ−
X
l=1...d xi1· · ·
xil−1X
ω∈∂Bµ
il xil+1···xidθ,ωfω sincexid+1b=
θ
andπ
F(θ)
=
θ
−
fθ. Ifm 0=
xj1· · ·
xjd0 andθ
0∈
∂
Bwithm
θ
=
m0θ
0, then byLemma 4.1the two decompositions yield the syzygyxi1
· · ·
xideθ−
xj1· · ·
xjd0eθ0+
X
l=1...d xi1· · ·
xil−1X
ω∈∂Bµ
il xil+1···xidθ,ωeω−
X
l=1...d0 xj1· · ·
xjl−1X
ω∈∂Bµ
jl xjl+1···xjd0θ0,ωeω as an element ofΞ. The syzygy is of the formm eθ−
m0eθ0
+
P
ω∈∂Bpωeωwith deg
(
pω) <
max(
d,
d0
)
, which proves thelemma.
This yields the following result for a border basis, conjectured in [16] for the case whereBis stable by division: Theorem 4.3. Let B
⊂
Mbe connected to1and let F=
(
fω)
ω∈∂Bbe a border basis for B. Then Syz(
F)
is generated by the relations(2),(3)and(4). Proof. Let
σ
=
P
ωpωeω
∈
Syz(
F)
. In this sum consider any termλ
m eθ whereλ
∈
K− {
0}
,m∈
M,θ
∈
∂
Bandδ
B(
mθ) <
|
m| +
1. Then there existsm0∈
M,θ
0∈
∂
Bsuch thatmθ
=
m0θ
0andδ
B(
m0θ
0)
= |
m0| +
1. This implies that|
m|
>
|
m0|
. ApplyLemma 4.2to reduce the productm eθ moduloΞ, to m0eθ0
+
X
ω∈∂Bqωeω
,
with deg
(
qω) <
|
m|
. Iterate this reduction, assuming for each termλ
m eω(λ
∈
K− {
0}
,m∈
supp(
pω)
,ω
∈
∂
B) thatδ
B(
mω)
= |
m| +
1. Notice that the polynomialm fωhas only one monomial of maximalB-index, which ismω
.Since
P
ωpωfω
=
0, there existθ
6=
θ
0∈
∂
Band monomialsm∈
supp(
pθ)
,m0∈
supp(
pθ0)
such thatmθ
=
m0θ
0andδ
B(
mθ)
is maximal among all terms of the syzygy. ByLemma 4.2, we can replace this pair(
m eθ,
m0eθ0)
moduloΞby a sum of terms of smaller degree. This transformation reduces either the number of terms with maximalB-index or the degree of the polynomialspω.Since we cannot iterate it infinitely, we deduce that
σ
is inΞ. 5. Algorithmic issuesBased on our study in the previous sections, we can arrive at an algorithm in [29]. The main idea is to translate the previous concepts into linear algebra setting. According to Section3, to compute effectively a normal form, one has to find a (monomial) basisBof the quotient algebra connected to 1 and to define border relations such that the multiplication operators commute. The algorithm in [29] is a fix point method, which updates
•
a potential monomial basisBof the quotient algebra and•
a setPof polynomials or rewriting rules (with one monomial of their support in∂
Band the remaining monomials inB), until a fix point is reached. At each step of the algorithm, the following operations are performed:1. The setP0of polynomials ofP+with support inB+is computed.
2. By taking linear combinations, a basisP
˜
of the vector spaceh
P0i
is computed such that each element of this basis has at most one monomial in∂
B(and has the other monomials inB).3. TheC-polynomials of the elements ofPwith their support inB+are computed and reduced byP.
˜
4. If non-zero polynomials with support inBappear, the potential basisBand the polynomial setPare updated. The update ofBis done by removing some parts ofB. The update ofPis done by combining the elements ofP
˜
and the reduced C-polynomials, in order to obtain rewriting rules for the new setB.The details and the technical proof of termination of the algorithm are available in [29]. We mention here that we have improved the algorithm since then: unlike [29] we avoid repeated execution of similar reductions, when we treat the degree drops.
6. Stability of the bases
This section is devoted to the study of the stability under numerical perturbations of the bases computed by the previous algorithm.
In many real-life problems, the systemf
=
(
f1, . . . ,
fs)
to be solved is given only with limited accuracy. However, most ofthe time, one also knows that the structure of the solutions is invariant in a small neighborhood of the system. Thus, it is often required that the polynomial solvers produce a representation of the quotient algebra that is stable in a small neighborhood of the initial system. The structural numerical stability of the basis is required in order to have a smooth behavior of the coefficients of the representation in the neighborhood off
=
(
f1, . . . ,
fs)
. By definition, a neighborhood offis an open set(containingf) in the vector space of polynomials
(
h1, . . . ,
hs)
such thatΛ(
hi)
Λ(
fi)
(i=
1, . . . ,
n). For>
0, we definebyN
(
f)
the set of systems(
h1, . . . ,
hs)
such thatΛ(
hi)
Λ(
fi)
and the coefficient vector ofhiis at most at distancefromthe coefficient vector offi(for the
∞
-norm).Assumption 6.1. Hereafter
γ
denotes a choice function refining a reducing gradingΛsuch that, for allp∈
R,γ (
p)
depends only on the support ofpand not on the numerical value of its monomial coefficients (e.g., Macaulay’s choice function, grevlex choice function,. . . ). Letγ
be the choice function that for anyp∈
R, applies the choice functionγ
on the monomials ofp, whose coefficient norm is larger than.Theorem 6.2. Letf
=
(
f1, . . . ,
fs)
be a zero dimensional polynomial system, such that in a neighborhood U off, all systems havethe same number D of complex solutions, counted with multiplicities. Let a positive
be small enough, there existsν >
0such that for any systemf0∈
Nν
(
f)
⊂
U , the basis B computed withγ
satisfyingAssumption6.1for the systemfis also a basis for the systemf0.Proof. Consider the matrixM, whose rows correspond to the coefficient vectors of the monomial multiplesm fi, with
deg
(
m)
+
deg(
fi)
≤
κ
+
d0whereκ
is the number of loops in Algorithm [29] andd0is the maximum degree of the polynomialsfi. The columns are indexed by all the monomials of degree
≤
κ
+
d0.We denote byBthe monomial set obtained as a basis ofA
=
R/
Iby applying Algorithm [29] to the polynomialsf1, . . . ,
fswith the choice function
γ
. By construction, the set of monomials indexing the columns ofMcontains∂
B. LetMgbe the same matrix asM, but constructed with the polynomialsfireplaced bygenericequations, i.e. equations with
indeterminate coefficients having the same support asf1
, . . . ,
fs. LetNbe the block of columns ofMindexed by monomialsnot inBand letNgbe the corresponding block inMg.
Since the operations of the algorithm in [29] consist in computing linear combinations of some monomial multiples of the polynomialsP(see Section5and [29]), and thus of monomial multiples offiof degree
≤
κ
+
d0, the complete computationcan be reinterpreted as an optimized triangulation procedure of the blockN.
The blockNgspecialized atfis invertible, because any monomial not inBcan be reduced by the computed border basis offto an element in
h
Bi
. This implies thatNgspecialized atf0is also invertible, forf0∈
Nν
(
f)
and sufficiently small positiveν
. Therefore, any monomial of∂
Bcan be reduced modulo the ideal(
f0)
to an element inh
Bi
. Consequently,B(which contains1) is a generating set of the quotient algebraA0
=
R/(
f0)
for anyf0
∈
Nν
(
f)
.Since the number of solutionsD
= |
B|
(counted with multiplicity) is left unchanged by small perturbations in the neighborhood Nν(
f)
of f, the monomial setB, which has cardinality exactly D, is also a basis of the quotient algebra A0=
R/(
f0)
for the perturbed systemf0.Consider now a slight modification of the algorithm in [29]: the coefficients whose norm is less than
are simply ignored in all the steps of the algorithm. This means that they will not be taken into account when we choose aleadingmonomial, or deciding if a polynomial is nonzero. This minor variation of the algorithm will be called the-algorithm in the next theorem. Similar deletion of the absolutely small input coefficients is routine in numerical algorithms. Remark, however, that the following theorem is stated rigorously and that the computed result isnotan approximation of the true quotient algebra!Theorem 6.3. Letf
=
(
f1, . . . ,
fs)
be a zero dimensional polynomial system, such that in a neighborhood U offall systems havethe same number D of complex solutions, counted with multiplicities. Let a positive
be small enough. Then there existsν >
0, such that for any systemf0∈
Nν(
f)
⊂
U , the basis B computed withγ
satisfyingAssumption6.1for the systemfis also the basis obtained withγ
and the-algorithm for the systemf0.
Proof. ByTheorem 6.2,Bis also a basis of the quotient algebraR
/(
f0)
. This basisBis obtained by applying Algorithm [29] to the systemf. The result of this algorithm does not change if we replace the choice function
γ
byγ
where a positiveis small enough (e.g., is smaller than the minimum of the norm of the coefficients of the polynomials on whichγ
is applied).Let us show by induction on the loop indexkof the algorithm, that the steps of and the polynomials computed by the
-algorithm withγ
are the same as for the direct algorithm, up to the terms of norm smaller that.This is clearly true for the first stepk