The strength of the approximate GCD and its computation

5.5 The numerical computation of the strength of an approximate GCD

Solving the minimization problems (5.16) or (5.17) requires an appropriate numerical procedure. However, the successful computation of a global minimum is not always guaranteed. The minimization problem in (5.16) or (5.17) is non-convex, and for sets of many polynomials, where the number of arbitrary parameters is usually large, it is very likely to lead to unsatisfactory results. On the other hand, it is easier to find bounds for the main function in (5.16), which is

\[ \|S_Q\| = \left\| S_P - \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] \cdot \hat{\Phi}_v \right\| \]

We analyse how the norm $\|S_Q\|$ is bounded and what information we can get from these bounds. Without loss of generality, we assume a given polynomial $v(s)$ with no zero roots. Combining the relations (5.4) and (5.13) gives the following equation:

\[ S_Q \cdot \hat{\Phi}_v^{-1} = S_P \cdot \hat{\Phi}_v^{-1} - \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] \tag{5.18} \]

Let $\widehat{S}_P = S_P \cdot \hat{\Phi}_v^{-1}$ and split $\widehat{S}_P$ such that

\[ \widehat{S}_P = \hat{S}'_P + \hat{S}''_P \tag{5.19} \]

where $\hat{S}''_P$ has the same structure as $S^{(r)}_{P^*} = \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right]$. Specifically, if we denote by $A[i,j]$ the $(i,j)$ element of a matrix $A$, the partitioning of $\widehat{S}_P$ is based on the following rule:

\[ \hat{S}'_P[i,j] = \begin{cases} \widehat{S}_P[i,j], & \text{if } S^{(r)}_{P^*}[i,j] = 0 \\ 0, & \text{if } S^{(r)}_{P^*}[i,j] \neq 0 \end{cases} \qquad \forall\, i,j \tag{5.20} \]

Therefore, $\hat{S}''_P$ can be presented as $\hat{S}''_P = \left[\, O_{m,r} \mid \bar{S} \,\right]$, where $\bar{S}$ is an $m \times (n+p-r)$ matrix. From (5.18) and (5.19) we get the following relations:

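\[ S_Q \cdot \hat{\Phi}_v^{-1} = \hat{S}'_P + \left[\, O_{m,r} \mid \bar{S} \,\right] - \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] = \hat{S}'_P + \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \tag{5.21} \]

It readily follows that:

\[ S_Q = \hat{S}'_P \cdot \hat{\Phi}_v + \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \cdot \hat{\Phi}_v \tag{5.22} \]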

and if we use the Frobenius norm, which relates to the set of polynomials in a direct way, we get:

\[ \|S_Q\| \le \|\hat{S}'_P \cdot \hat{\Phi}_v\| + \left\| \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \cdot \hat{\Phi}_v \right\| \tag{5.23} \]

or

\[ \|S_Q\| \le \|\hat{S}'_P\| \, \|\hat{\Phi}_v\| + \left\| \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \right\| \, \|\hat{\Phi}_v\| \tag{5.24} \]

Furthermore, from the equation (5.21) we have:

\[ \|S_Q\| \, \|\hat{\Phi}_v^{-1}\| \ge \left\| \hat{S}'_P + \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \right\| \tag{5.25} \]

It is clear that any exact common factor of the polynomials of the set is expected to give $\|S_Q\| = 0$. Therefore, we may consider a polynomial a good approximation of an exact common divisor or of the exact GCD of a given set if $\|S_Q\|$ is close enough to zero.

The structure of the matrices here allows us to select the arbitrary parameters of the set $P^*$ such that

\[ \bar{S} = \tilde{S}^{(r)}_{P^*} \tag{5.26} \]

Then, if we apply the result (5.26) to the inequalities (5.23) and (5.25), and since the condition number of $\hat{\Phi}_v$ with respect to the Frobenius norm is

\[ \mathrm{Cond}(\hat{\Phi}_v) = \|\hat{\Phi}_v\| \, \|\hat{\Phi}_v^{-1}\| \ge n + p > 1 \tag{5.27} \]

the following important inequality is obtained:

\[ \frac{\|\hat{S}'_P\|}{\|\hat{\Phi}_v^{-1}\|} \le \|S_Q\| \le \|\hat{S}'_P \cdot \hat{\Phi}_v\| \tag{5.28} \]

Obviously, if $\|\hat{S}'_P\| = 0$, then $\|S_Q\| = 0$ and therefore the given polynomial $v(s)$ can be considered as an exact common divisor of degree $r$ of the original set.

Otherwise, the inequality (5.28) gives a lower bound of $\|S_Q\|$, which indicates the minimum distance from $\|S_Q\| = 0$.
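A short sketch of why (5.28) follows may be helpful. It reads "has the same structure" in (5.19) as "shares the zero pattern of $S^{(r)}_{P^*}$", which is an interpretation of the rule (5.20) rather than a statement taken from the text. Under that reading the two summands in (5.25) have disjoint supports, so their squared Frobenius norms add:

```latex
\begin{align*}
\bar{S} = \tilde{S}^{(r)}_{P^*}
  &\;\Longrightarrow\; S_Q = \hat{S}'_P \cdot \hat{\Phi}_v
  \;\Longrightarrow\; \|S_Q\|_F = \|\hat{S}'_P \cdot \hat{\Phi}_v\|_F
  && \text{by (5.22)} \\
\left\| \hat{S}'_P + \left[\, O_{m,r} \mid \bar{S} - \tilde{S}^{(r)}_{P^*} \,\right] \right\|_F^2
  &= \|\hat{S}'_P\|_F^2 + \|\bar{S} - \tilde{S}^{(r)}_{P^*}\|_F^2
  \;\ge\; \|\hat{S}'_P\|_F^2
  && \text{disjoint supports} \\
\|S_Q\|_F \, \|\hat{\Phi}_v^{-1}\|_F
  &\ge \|\hat{S}'_P\|_F
  && \text{by (5.25)}
\end{align*}
```

The last line holds for every choice of the arbitrary parameters, whereas the first line is attained for the particular choice (5.26). The minimum of $\|S_Q\|$ over all parameters therefore lies between the two sides of (5.28).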

Definition 5.5. Given a polynomial $v(s)$ with no zero roots, we define:

i) $S(v)$, the strength of $v(s)$, given by the minimization problems (5.16), (5.17).

ii) $\underline{S}(v) \triangleq \|\hat{S}'_P\| \left( \|\hat{\Phi}_v^{-1}\| \right)^{-1}$, the lower strength bound of $v(s)$.

iii) $\overline{S}(v) \triangleq \|\hat{S}'_P \cdot \hat{\Phi}_v\|$, the upper strength bound of $v(s)$.

iv) $S^a(v) \triangleq \dfrac{\underline{S}(v) + \overline{S}(v)}{2}$, the average strength of $v(s)$.

The computation of the strength bounds $\underline{S}(v)$ and $\overline{S}(v)$ is straightforward and the results can be used as an indicator of the strength of the given approximation.

For example, if $\underline{S}(v) \gg 1$, then the given approximation has very poor quality, and the opposite holds if $\overline{S}(v) \ll 1$. The strength bounds are very reliable indicators of the strength of a given GCD approximation $v(s)$, provided that the respective matrix $\hat{\Phi}_v$ is well-conditioned ($\mathrm{Cond}(\hat{\Phi}_v) \approx O(n+p)$).

The following algorithm establishes a method for the evaluation of the strength bounds, and hence of the average strength $S^a(v)$, of a given approximation $v(s)$.

ALGORITHM 5.1. The algorithm of Average Strength.

Input  : A set $P \in \Pi(n, p; h+1)$ of univariate polynomials.
         A univariate polynomial $v(s)$ of degree $r \le p$ with no zero roots.

Step 1 : Construct the $(n, p)$-expanded Sylvester matrix $S_P$ of $P$.
         Construct the special Toeplitz representation $\hat{\Phi}_v$ of $v(s)$.
         Compute the first column of the inverse of $\hat{\Phi}_v$ and construct the matrix $\hat{\Phi}_v^{-1}$.

Step 2 : Compute the matrix $\widehat{S}_P$ by solving the linear system $\hat{\Phi}_v^t \cdot \widehat{S}_P^t = S_P^t$.

Step 3 : Split $\widehat{S}_P$ such that $\widehat{S}_P = \hat{S}'_P + \hat{S}''_P$ using (5.20).
         Compute the Frobenius norms $\|\hat{S}'_P\|$, $\|\hat{\Phi}_v^{-1}\|$ and $\|\hat{S}'_P \cdot \hat{\Phi}_v\|$.

Output : $\underline{S}(v) = \dfrac{\|\hat{S}'_P\|}{\|\hat{\Phi}_v^{-1}\|}$, $\overline{S}(v) = \|\hat{S}'_P \cdot \hat{\Phi}_v\|$, $S^a(v) = \dfrac{\underline{S}(v) + \overline{S}(v)}{2}$
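The steps of Algorithm 5.1 can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the implementation used in this work. The function name `strength_bounds` is hypothetical. Each row of $S_P$ is assumed to hold the coefficients of one polynomial in descending powers, shifted by one column per row; $\hat{\Phi}_v$ is assumed to be lower-triangular Toeplitz with the coefficients of $v(s)$ in ascending powers down its first column; and the nonzero pattern of $S^{(r)}_{P^*}$ is assumed to be the Sylvester pattern of the quotient set preceded by $r$ zero columns. The sample coefficients are read off from the constants appearing in (5.47) of Example 5.2, which is an inference and not a quotation of the data set.

```python
import math

def strength_bounds(polys, v):
    """Lower, upper and average strength bounds in the sense of Algorithm 5.1.

    polys : coefficient lists in descending powers; polys[0] has degree n and
            every other polynomial is assumed to have exact degree p <= n.
    v     : coefficients of v(s) in ascending powers, with v[0] != 0
            (v has no zero roots).
    """
    n, p, r = len(polys[0]) - 1, len(polys[1]) - 1, len(v) - 1
    width = n + p
    lower_sq = upper_sq = 0.0
    for coeffs in polys:
        d = len(coeffs) - 1
        for k in range(width - d):  # p shifts for degree n, n shifts for degree p
            # Step 1: one row of the (assumed) expanded Sylvester matrix S_P.
            row = [0.0] * k + [float(a) for a in coeffs] + [0.0] * (width - d - 1 - k)
            # Step 2: solve x * Phi_v = row by back substitution.
            x = [0.0] * width
            for j in range(width - 1, -1, -1):
                tail = sum(v[i] * x[j + i] for i in range(1, r + 1) if j + i < width)
                x[j] = (row[j] - tail) / v[0]
            # Step 3: keep only the entries outside the assumed pattern of S^(r)_{P*}.
            pattern = range(r + k, d + k + 1)
            y = [0.0 if j in pattern else x[j] for j in range(width)]
            lower_sq += sum(t * t for t in y)
            # Row of S'_P * Phi_v, formed explicitly here for clarity only.
            prod = [sum(v[i] * y[j + i] for i in range(r + 1) if j + i < width)
                    for j in range(width)]
            upper_sq += sum(t * t for t in prod)
    # First column of Phi_v^{-1} by forward substitution; the inverse is Toeplitz,
    # so the entry col[i] fills a diagonal of length width - i.
    col = [1.0 / v[0]] + [0.0] * (width - 1)
    for i in range(1, width):
        col[i] = -sum(v[k] * col[i - k] for k in range(1, min(r, i) + 1)) / v[0]
    inv_norm = math.sqrt(sum((width - i) * col[i] ** 2 for i in range(width)))
    lower = math.sqrt(lower_sq) / inv_norm
    upper = math.sqrt(upper_sq)
    return lower, upper, 0.5 * (lower + upper)

# Inferred data of Example 5.2 and the approximation v(s) = s + 1.0022.
P = [[1, 3, -1, -3], [1, -1.0005, -1.9990], [1, -1.995, -3.015]]
lower, upper, average = strength_bounds(P, [1.0022, 1.0])
print(lower, upper, average)
```

Under these assumptions the three printed values should agree with the row $s + 1.0022$ of Table 5.2. The sketch works row by row, so neither $\widehat{S}_P$ nor $\hat{\Phi}_v^{-1}$ is ever stored as a full matrix.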

◮ Computational complexity

Due to the special structure of the matrices $S_P$ and $\hat{\Phi}_v$, it is possible to avoid the matrix operations and compute the norms explicitly and more efficiently.

The inverse of the lower triangular matrix $\hat{\Phi}_v$ is computed by solving a simple linear system of the form $\hat{\Phi}_v \, x = e^1_{n+p}$, where $x$ represents the first column of the matrix $\hat{\Phi}_v^{-1}$ and $e^1_{n+p} = [1, 0, \ldots, 0]^t \in \mathbb{R}^{n+p}$. The multiple linear system $\hat{\Phi}_v^t \cdot \widehat{S}_P^t = S_P^t$ can be solved by using only backward substitution, since $\hat{\Phi}_v^t$ is an upper triangular matrix. The multiplication $\hat{S}'_P \cdot \hat{\Phi}_v$ actually zeros specific entries of $\hat{S}'_P$ and produces an extended-resultant-like matrix, so it does not need to be performed explicitly. The number of operations required by the above algorithm is given in Table 5.1. The total number of operations is

\[ O\!\left( \frac{3n^2h + 10n^2 + 2nr - r^2}{2} \right) \]

for $n = p$. For an effective computation of the GCD, the respective matrix $\hat{S}'_P$ is quite sparse and the required operations are fewer than $O(2hn^2)$.

$\hat{\Phi}_v^{-1}$    $\widehat{S}_P$    $\|\hat{S}'_P \cdot \hat{\Phi}_v\|$    $\|\hat{\Phi}_v^{-1}\|$    $\|\hat{\Phi}_v\|$

Table 5.1: Required operations for the computation of the strength bounds.

◮ Computational examples

In the following, we demonstrate the steps of Algorithm 5.1 for computing the average strength of a given approximation by considering the next example.

Example 5.1. Consider a polynomial set $P_{3,3} \in \Pi(3, 2; 3)$ with arbitrary coefficients. The set $P_{3,3}$ is a $\{3, 2\}$-ordered polynomial set defined as:

P3,3 =

According to the representation (5.5), the generalised resultant matrix of the set $P_{3,3}$ has the form:

For simplicity and without loss of generality, we consider a 1st degree GCD approximation for the set $P_{3,3}$ such that

\[ v(s) = s + c, \quad c \neq 0 \]

As described in Definition 5.3, a Toeplitz-like matrix can represent the polynomial $v(s)$ in the form:

Φˆv =

Then, the respective inverse of $\hat{\Phi}_v$ is a lower triangular matrix of the form:

Φˆ⁻¹_v =

The next computations follow steps 2 and 3 of Algorithm 5.1 and lead to the evaluation of the strength bounds and the average strength of the approximation $v(s)$.

Sˆ_P^′ =

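Now, we can give the algebraic expression of the strength bounds using the Frobenius matrix norm. The upper strength bound for $v(s)$ is:

\[ \overline{S}(v) = \|\hat{S}'_P \cdot \hat{\Phi}_v\|_F \tag{5.36} \]

and the lower strength bound of $v(s)$ is:

\[ \underline{S}(v) = \|\hat{S}'_P\|_F \cdot \left( \|\hat{\Phi}_v^{-1}\|_F \right)^{-1} \tag{5.37} \]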

The condition number of the matrix $\hat{\Phi}_v$ characterises the stability of the computation of the inverse of $\hat{\Phi}_v$ [18, 81], which also affects the computation of the matrix $\hat{S}'_P$ and hence the strength bounds. Since we use the Frobenius matrix norm, our computations can be considered sufficiently numerically stable if $\mathrm{Cond}(\hat{\Phi}_v)$ is about $O(n + p)$. Otherwise, the result is very likely to be unreliable. Therefore, when working in a variable-precision computational environment, it is preferable to compute the strength bounds using enough digits of precision, for example quadruple precision (34 digits). In the present case, the condition number of $\hat{\Phi}_v$ can act as an indicator for using scaling techniques [18] in order to prevent unreliable results.
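The Frobenius condition number for $v(s) = s + c$ can be evaluated directly from the first column of the inverse. The sketch below assumes that $\hat{\Phi}_v$ is the $(n+p) \times (n+p)$ lower-triangular Toeplitz matrix with $c$ on the diagonal and $1$ on the first subdiagonal; this structure is not quoted from Definition 5.3 but is consistent with the denominator of (5.42) and with the Cond column of Table 5.2. The function name `cond_frobenius` is hypothetical.

```python
import math

def cond_frobenius(c, size):
    """Frobenius condition number of the assumed size x size matrix Phi_v, v(s) = s + c."""
    # Assumed structure: c on the diagonal (size entries), 1 on the subdiagonal.
    norm_sq = size * c * c + (size - 1)
    # First column x of the inverse from Phi_v x = e_1 by forward substitution:
    # c * x[0] = 1 and c * x[i] + x[i - 1] = 0 for i >= 1.
    x = [1.0 / c]
    for _ in range(1, size):
        x.append(-x[-1] / c)
    # The inverse is lower-triangular Toeplitz: x[i] fills a diagonal of length size - i.
    inv_norm_sq = sum((size - i) * x[i] ** 2 for i in range(size))
    return math.sqrt(norm_sq * inv_norm_sq)

for c in (1.0019, 1.0022, 1.0025):
    print(c, cond_frobenius(c, 5))
```

For $n + p = 5$ and $c$ close to 1 the values stay near 11.6, that is, of the order of $n + p$, which is the well-conditioned regime described above. As $c$ approaches zero the powers of $1/c$ in the inverse grow quickly, which is where scaling becomes relevant.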

Example 5.2. Consider a set of polynomials $P \in \Pi(3, 2; 3)$ with numeric floating-point coefficients:

which obviously has the same structure as (5.29). Then, the respective generalised resultant matrix of P has the form:

SP =

Clearly, for the typical 16-digit numerical precision the three polynomials of the above set $P$ are considered coprime. The three polynomials of the set have been constructed so as to have a root around $-1$. The data are given in 4-digit numerical precision and thus, for 2-digit numerical precision, the polynomials should have approximately the same root. However, it is interesting and perhaps more appropriate to search for a 4-digit approximate root and, consequently, a 1st degree approximate common factor $v(s) = s + c$, which could also be considered here as the approximate $\varepsilon$-GCD of the set $P$ for $\varepsilon = 10^{-4}$.

Using the results from the previous theoretical Example 5.1 for the set $P_{3,3}$, we computed the strength bounds and the average strength of 150 approximations of the form $v(s) = s + c$ for the set $P$. The constant $c$ ranged from 0.995 to 1.010 with increment 0.0001. The values of the strength bounds and the average strength of these approximations are presented in the graph of Figure 5.2.

This particular graph shows that as the constant $c$ increases, the average strength $S^a(v)$ of $v(s)$ decreases, following a nearly parabolic line, until it reaches a minimum at $c = 1.0022$. Then it increases, again following a nearly parabolic line in a symmetrical way. The results presented in Table 5.2 show that, for a specified numerical accuracy $\varepsilon = 10^{-4}$, the approximation

\[ v(s) = s + 1.0022 \]

has a minimum average strength equal to

\[ S^a(v) = 0.01821270609 \]

and thus we may consider it as a “good” approximate $\varepsilon$-GCD of the set $P$.

However, in Figure 5.2 we notice that the strength bounds $\underline{S}(v)$ and $\overline{S}(v)$ have graphs similar to that of $S^a(v)$, and their minimum values are obtained for different approximations, $v(s) = s + 1.0023$ and $v(s) = s + 1.0021$, respectively. These minimum values are highlighted in Table 5.2.

Figure 5.2: Measuring the strength of a 1st degree approximate GCD of the set $P \in \Pi(3, 2; 3)$ in Example 5.2.

v(s)          $\underline{S}(v)$    $\overline{S}(v)$    $S^a(v)$         $\mathrm{Cond}(\hat{\Phi}_v)$
s + 1.0019    0.009662968139    0.02696250452    0.01831273633    11.57988227
s + 1.0020    0.009615551663    0.02689041331    0.01825298248    11.57783592
s + 1.0021    0.009581612104    0.02685759025    0.01821960118    11.57579055
s + 1.0022    0.009561282419    0.02686412976    0.01821270609    11.57374615
s + 1.0023    0.009554638914    0.02690995372    0.01823229632    11.57170274
s + 1.0024    0.009561699564    0.02699481292    0.01827825624    11.56966031
s + 1.0025    0.009582423551    0.02711829199    0.01835035777    11.56761885

Table 5.2: The strength of the approximate GCD v(s) = s + c of the set P ∈ Π(3, 2; 3) in Example 5.2.

Therefore, it is reasonable to search for approximations which have minimum strength bounds. According to (5.36), the upper strength bound for an arbitrary approximation $v(s) = s + c$, $c \in \mathbb{R} \setminus \{0\}$, is given as a real function of $c$ such that

\[ \overline{S}_v(c) = \left( 8 + \frac{5.9730}{c} - \frac{1.14092425}{c^2} - \frac{24.0895470}{c^3} + \frac{5.2586780}{c^4} - \frac{12}{c^5} + \frac{18}{c^6} \right)^{\frac{1}{2}} \tag{5.41} \]

We will attempt to minimize this function and see if it is possible to obtain an approximation with a smaller upper strength bound and without the restriction of 4-digit numerical precision. Using standard methods of calculus (first derivative test) and by observing the graph of the real function $\overline{S}_v(c)$ in Figure 5.3, we can show that $\overline{S}_v(c)$ is minimized for $c_0 = 1.00213337932$.¹ Therefore, the upper strength bound $\overline{S}(v)$ is minimized by the approximation:

\[ v(s) = s + 1.00213337932 \]

with minimum value:

\[ \min_c \overline{S}(v) = \overline{S}_v(c_0) = 0.02685539677 \]

Figure 5.3: Graphical representation of the upper strength bound of $v(s)$ in Example 5.2.
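The first derivative test on (5.41) can be reproduced with a few lines of Python. The sketch below is an illustration only and is not the Maple routine used for the reported value: it applies plain bisection to the derivative of the radicand, and the bracket $[1.0, 1.005]$ is an assumption chosen from the location of the minimum in Table 5.2. The function names are hypothetical.

```python
import math

# Coefficients a_k of 1/c^k, k = 0..6, inside the square root of (5.41).
COEFFS = [8.0, 5.9730, -1.14092425, -24.0895470, 5.2586780, -12.0, 18.0]

def upper_bound(c):
    """Upper strength bound of v(s) = s + c according to (5.41)."""
    return math.sqrt(sum(a / c ** k for k, a in enumerate(COEFFS)))

def radicand_derivative(c):
    # d/dc of sum a_k c^(-k) is sum -k a_k c^(-k-1). It has the same sign as the
    # derivative of the square root wherever the radicand is positive.
    return sum(-k * a / c ** (k + 1) for k, a in enumerate(COEFFS))

def bisect_root(f, lo, hi, tol=1e-13):
    """Plain bisection; assumes f(lo) and f(hi) have opposite signs."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0) == (f_lo > 0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

c0 = bisect_root(radicand_derivative, 1.0, 1.005)
print(c0, upper_bound(c0))
```

The derivative changes sign from negative to positive inside the bracket, so the stationary point found this way is a minimum; it can be compared with $c_0$ and the minimum value quoted above.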

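Similarly, considering (5.37), the lower strength bound for an arbitrary approximation $v(s) = s + c$, $c \in \mathbb{R} \setminus \{0\}$, is given as a real function of $c$ such that

\[ \underline{S}_v(c) = \left( \frac{\left( \begin{aligned} & 88344904 - 88119396\, c + 88501909\, c^2 - 104274792\, c^3 + 16658914\, c^4 \\ & - 72430188\, c^5 + 15436303\, c^6 + 23892000\, c^7 + 32000000\, c^8 \end{aligned} \right) \cdot 2.5 \cdot 10^{-7}}{5 c^8 + 4 c^6 + 3 c^4 + 2 c^2 + 1} \right)^{\frac{1}{2}} \tag{5.42} \]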

Using again standard methods of calculus (first derivative test) and by observing the graph of the real function $\underline{S}_v(c)$ in Figure 5.4, we can show that $\underline{S}_v(c)$ is minimized for $c'_0 = 1.002298464659$.

Figure 5.4: Graphical representation of the lower strength bound of $v(s)$ in Example 5.2.

¹ The results are provided by the built-in Maple routine minimize.

Therefore, the lower strength bound $\underline{S}(v)$ is minimized by the approximation:

\[ v(s) = s + 1.002298464659 \]

with minimum value:

\[ \min_c \underline{S}(v) = \underline{S}_v(c'_0) = 0.009554637298 \]
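The lower bound (5.42) is equally easy to tabulate. The following sketch evaluates it with Horner's rule and scans 4-digit values of $c$ between 0.995 and 1.010, which is one way to reproduce the lower-bound column of Table 5.2. The helper names are hypothetical, and the scan is a simple illustration rather than the procedure used to produce the table.

```python
import math

# Ascending coefficients (c^0 .. c^8) of the numerator and denominator in (5.42).
NUM = [88344904, -88119396, 88501909, -104274792, 16658914,
       -72430188, 15436303, 23892000, 32000000]
DEN = [1, 0, 2, 0, 3, 0, 4, 0, 5]

def horner(coeffs, c):
    """Evaluate a polynomial given by ascending coefficients."""
    value = 0.0
    for a in reversed(coeffs):
        value = value * c + a
    return value

def lower_bound(c):
    """Lower strength bound of v(s) = s + c according to (5.42)."""
    return math.sqrt(2.5e-7 * horner(NUM, c) / horner(DEN, c))

# 4-digit grid c = 0.9950, 0.9951, ..., 1.0100.
grid = [round(0.995 + 1e-4 * k, 4) for k in range(151)]
best = min(grid, key=lower_bound)
print(best, lower_bound(best))
```

On this grid the smallest lower bound is expected at the 4-digit neighbour of $c'_0$, in line with the entry highlighted in Table 5.2. The numerator suffers heavy cancellation (terms of order $10^8$ sum to a few thousand), which double precision still handles comfortably here.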

Finally, we conclude that the “best” approximate GCD of the form $v(s) = s + c$ for the polynomial set $P$ is expected to have a strength $S(v)$ such that:

\[ 0.009554637298 \le S(v) \le 0.02685539677 \tag{5.43} \]

Nevertheless, it remains important to solve the extended minimisation problem:

\[ \min_c S(v) = \min_{c,\, P^*} \left\| S_P - \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] \cdot \hat{\Phi}_v \right\|_F \tag{5.44} \]

which derives from the original problem (5.16) for $v(s) = s + c$, $c \neq 0$ and $r = 1$.

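Then, it is required to study the following objective function:

\[ S_v(c, a_{i,j}) = \left\| S_P - \left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] \cdot \hat{\Phi}_v \right\|_F^2 \tag{5.45} \]

The matrix $\left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right]$ with arbitrary parameters has the form:

$\left[\, O_{m,r} \mid \tilde{S}^{(r)}_{P^*} \,\right] =$

Then, the objective function $S_v(c, a_{i,j})$ in (5.45), being a squared Frobenius norm, is a sum of squares of the form:

\[ \begin{aligned} S_v(c, a_{i,j}) ={} & 2\,(1 - a_{1,2})^2 + 2\,(3 - a_{1,2}c - a_{1,3})^2 + 2\,(1 + a_{1,3}c + a_{1,4})^2 \\ & + 2\,(3 + a_{1,4}c)^2 + 3\,(1 - a_{3,2})^2 + 3\,(1.995 + a_{3,2}c + a_{3,3})^2 \\ & + 3\,(3.015 + a_{3,3}c)^2 + 3\,(1 - a_{2,2})^2 + 3\,(1.0005 + a_{2,2}c + a_{2,3})^2 \\ & + 3\,(1.9990 + a_{2,3}c)^2 \end{aligned} \tag{5.47} \]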

The minimisation of the above function $S_v(c, a_{i,j})$ is essentially a non-linear least-squares problem. Furthermore, it is necessary here to compute the global minimum of this function rather than a local minimum. For the purpose of this study, we used the built-in Optimization[Minimize] routine of Maple, which is based on the Gauss-Newton and modified Newton iterative methods [25]. Unfortunately, the results showed that the minimum value of $S_v(c, a_{i,j})$ is 6.115444877153 for $c = -2.703773531865$. This result is not acceptable, and obviously we are dealing with an ill-conditioned optimisation problem.

However, if we apply scaling such as $v(s) = \frac{s}{c} + 1$ and set $w := \frac{1}{c}$, then the objective function is given by

\[ \begin{aligned} S_v(w, a_{i,j}) ={} & 2\,(1 - a_{1,2}w)^2 + 2\,(3 - a_{1,2} - a_{1,3}w)^2 + 2\,(1 + a_{1,3} + a_{1,4}w)^2 \\ & + 2\,(3 + a_{1,4})^2 + 3\,(1 - a_{3,2}w)^2 + 3\,(1.995 + a_{3,2} + a_{3,3}w)^2 \\ & + 3\,(3.015 + a_{3,3})^2 + 3\,(1 - a_{2,2}w)^2 + 3\,(1.0005 + a_{2,2} + a_{2,3}w)^2 \\ & + 3\,(1.9990 + a_{2,3})^2 \end{aligned} \tag{5.48} \]

Using again the Optimization[Minimize] routine of Maple, we find that the function $S_v(w, a_{i,j})$ has a minimum value equal to $2.2860982654 \cdot 10^{-4}$ at $w_0 = 0.99770975355$. This immediately shows that the solution of the problem (5.44) is $\min_c S(v) = \sqrt{2.2860982654 \cdot 10^{-4}} = 0.0151198487605$, which is obtained for $c = \frac{1}{w_0} = 1.0022955037196$.

Provided that we have found the global minimum of the minimisation problem (5.44), the “best” approximate GCD of the set $P$ is

\[ v(s) = s + 1.0022955037196 \tag{5.49} \]

with strength

\[ S(v) = 0.0151198487605 \tag{5.50} \]

and, of course, the inequality (5.43) is satisfied.
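For a fixed $c$, the objective (5.47) is linear least squares in the parameters $a_{i,j}$, so they can be eliminated in closed form before searching over $c$. The sketch below uses this reduction, which is a substitute technique and not the Maple Optimization[Minimize] computation described above. It relies on two assumptions: the three polynomials are read off from the constants in (5.47), and each block of (5.47) is interpreted as the weighted squared distance between a polynomial $p(s)$ of degree $d$ and the multiples of $s + c$, whose minimum is $p(-c)^2 / \sum_{j=0}^{d} c^{2j}$. All function names are hypothetical.

```python
import math

# (weight, coefficients in descending powers). The values are inferred from the
# constants in (5.47); they are not quoted from the data set of Example 5.2.
TERMS = [
    (2, [1.0, 3.0, -1.0, -3.0]),
    (3, [1.0, -1.0005, -1.9990]),
    (3, [1.0, -1.995, -3.015]),
]

def reduced_objective(c):
    """Objective (5.47) after minimising over all a_{i,j} for a fixed c."""
    total = 0.0
    for weight, coeffs in TERMS:
        value = 0.0
        for a in coeffs:  # Horner evaluation of the polynomial at s = -c
            value = value * (-c) + a
        degree = len(coeffs) - 1
        total += weight * value ** 2 / sum(c ** (2 * j) for j in range(degree + 1))
    return total

def refine_minimum(f, lo, hi, rounds=12, points=21):
    """Sample [lo, hi] on a grid and zoom in around the best sample, repeatedly.

    Assumes f has a single minimum inside [lo, hi].
    """
    for _ in range(rounds):
        step = (hi - lo) / (points - 1)
        best = min((lo + k * step for k in range(points)), key=f)
        lo, hi = best - step, best + step
    return 0.5 * (lo + hi)

c_min = refine_minimum(reduced_objective, 0.9, 1.1)
strength = math.sqrt(reduced_objective(c_min))
print(c_min, strength)
```

Because only one variable remains, the search is no longer sensitive to the scaling of the parameters; the printed values can be compared with (5.49) and (5.50).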

Example 5.3. Consider a larger set of polynomials $P \in \Pi(5, 4; 12)$ with numeric floating-point coefficients:

We search for an approximate GCD of the above set $P$ and we use the Hybrid ERES algorithm in order to find one. For the typical 16-digit numerical precision and tolerances $\varepsilon_t = 10^{-15}$, $\varepsilon_G = 10^{-15}$, the given solution is

\[ \gcd\{P\} : \; g(s) = s + 0.6 \tag{5.52} \]

with average strength $S^a(g) = 4.467947303 \cdot 10^{-28}$. This solution can be characterised as an exact GCD or an excellent approximate GCD of the set $P$, because the value of the average strength is practically zero for 16-digit accuracy.

The Hybrid ERES algorithm does not provide any other approximate solutions for the given set $P$. However, if we compute the average strength of a simple polynomial factor of the general form:

\[ v(s) = s + c, \quad c \in \mathbb{R} \setminus \{0\} \]

for the set $P$ using the Frobenius norm, then we obtain the average strength $S^a(v)$ as a real function of $c$, denoted by $S^a_v(c)$, with its graph presented in Figure 5.5.

Figure 5.5: Graphical representation of the average strength $S^a_v(c)$ of the common factor $v(s) = s + c$ of the set $P$ in Example 5.3.

If we carefully observe the graph of the average strength $S^a_v(c)$ in Figure 5.5, we notice that there are two local minima. The first local minimum is at $c = 0.6$, which is expected, since it verifies the already computed GCD in (5.52).

However, there is another local minimum near $c = 2$, which reveals the presence of another simple common factor with weaker strength. We may compute this minimum by using the minimisation routine Optimization[Minimize] of Maple with objective function $S^a_v(c)$ and initial point $\hat{c} = 2.1$ (or $\hat{c} = 1.9$). Then, we get

\[ \min_c S^a(v) = 0.002342396475056914 \tag{5.53} \]

for $c = 2.00015927122308631$. Therefore, there exists another approximate common factor of the form:

\[ \hat{v}(s) = s + 2.00015927122308631 \tag{5.54} \]

with average strength $S^a(\hat{v}) = 2.342396475056914 \cdot 10^{-3}$. Obviously, $\hat{v}(s)$ is a “weaker” common factor of $P$ in comparison with $g(s)$, but we may infer that there is an approximate GCD of degree 2, given by the product of these common factors. Thus, we can have an approximate GCD of the form:

\[ \hat{g}(s) = \hat{v}(s) \cdot g(s) = s^2 + 2.60015927122308631\, s + 1.20009556273385179 \tag{5.55} \]

with average strength $S^a(\hat{g}) = 2.951102916 \cdot 10^{-3}$. However, if we evaluate the strength of $\hat{g}(s)$ by solving the problem:

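\[ S(\hat{g}) = \min_{P^*} \left\| S_P - \left[\, O_{m,2} \mid \tilde{S}^{(2)}_{P^*} \,\right] \cdot \hat{\Phi}_{\hat{g}} \right\|_F \tag{5.56} \]

we get $S(\hat{g}) = 3.2297259946 \cdot 10^{-3}$.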
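The product in (5.55) is a plain coefficient convolution and can be checked in a few lines. The helper `poly_mul` is a hypothetical name, and double precision resolves only about 16 significant digits, so the last digits of the 18-digit constants quoted in the text are not reproduced exactly.

```python
def poly_mul(a, b):
    """Multiply two polynomials given by descending coefficient lists."""
    out = [0.0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] += x * y
    return out

v_hat = [1.0, 2.00015927122308631]  # common factor from (5.54)
g = [1.0, 0.6]                       # GCD from (5.52)
g_hat = poly_mul(v_hat, g)           # coefficients of s^2, s^1, s^0
print(g_hat)
```

The same helper applies to factors of any degree, for instance when combining more than two approximate common factors into a candidate GCD.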

Surprisingly, if we solve the extended minimisation problem (5.44) for $v(s) = s^2 + c_1 s + c_2$ with $\deg\{v(s)\} = r = 2$, where $c_1, c_2$ are nonzero constants in $\mathbb{R}$, we get

\[ \min_{c_1, c_2} S(v) = 0.00322861972251372628 \]

for $c_1 = 2.60015931739949568$ and $c_2 = 1.20009515827349452$. This implies that there exists a 2nd degree approximate GCD of the form:

\[ \tilde{g}(s) = s^2 + 2.600159317399495681\, s + 1.20009515827349452 \tag{5.57} \]

with strength $S(\tilde{g}) = 3.2286197225 \cdot 10^{-3}$, which is nearly identical to $\hat{g}(s)$ in (5.55).

Furthermore, if we solve again the extended minimisation problem (5.44) for a simple common factor $v(s) = s + c_1$, $r = 1$, where $c_1$ is a nonzero constant in $\mathbb{R}$, and initial point $\hat{c} = 2.1$, we get:

\[ \min_{c_1} S(v) = 0.00305624668250932152 \tag{5.58} \]

for $c_1 = 2.00015926974934885$. Therefore, we have computed again a 1st degree approximate common factor of the form:

\[ \tilde{v}(s) = s + 2.00015926974934885 \tag{5.59} \]

with strength $S(\tilde{v}) = 3.0562466825 \cdot 10^{-3}$, which is nearly identical to $\hat{v}(s)$ in (5.54).

In conclusion, we used here the strength minimisation problem $\min_c S(v)$ and the average strength minimisation problem $\min_c S^a(v)$ in order to compute a simple approximate common factor $v(s) = s + c$ for a set $P$ of 12 real polynomials of maximum degree 5. The results in (5.54) and (5.59) are nearly identical. The major difference is that the former problem involves 50 arbitrary parameters (including $c$), whereas the latter involves only one arbitrary parameter, $c$, which results in significantly lower computational complexity and improved efficiency.

In document ERES Methodology and Approximate Algebraic Computations (Page 127-142)