Discussion - Structured matrix methods for computations on Bernstein basis polynomials

The results of the examples in Section 6.2 are consistent because the matrices $S(\dot{f}, \dot{g})$ and $\bar{S}(\acute{f}, \alpha_2\acute{g})$ yield incorrect results but $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ yields the correct result. Therefore, it is instructive to consider the superiority of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ with respect to the matrices $S(\dot{f}, \dot{g})$ and $\bar{S}(\acute{f}, \alpha_2\acute{g})$. We will first consider $S(\dot{f}, \dot{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$.

If $A \in \mathbb{R}^{p \times q}$, $p \ge q$, is an arbitrary matrix of rank $r$ that is perturbed to $A + \delta A$, then it is shown in [30] that

$$|\sigma_i(A + \delta A) - \sigma_i(A)| \le \|\delta A\|_2, \qquad i = 1, \ldots, q,$$

where $\sigma_i(A)$, $i = 1, \ldots, q$, are the singular values of $A$. It follows that

$$|\sigma_i(A + \delta A) - \sigma_i(A)| \le \frac{\|\delta A\|_2}{\|A\|_2}\,\sigma_1(A), \qquad i = 1, \ldots, q,$$

and thus if the errors in $\sigma_i(A)$ and $\|A\|_2$ are

$$|\delta\sigma_i(A)| = |\sigma_i(A + \delta A) - \sigma_i(A)| \quad \text{and} \quad \Delta A = \frac{\|\delta A\|_2}{\|A\|_2},$$

respectively, then

$$\frac{|\delta\sigma_i(A)|}{\Delta A} \le \sigma_1(A) = \|A\|_2 \le \sqrt{q}\,\|A\|_1, \qquad i = 1, \ldots, q, \tag{6.31}$$

and it follows that the upper bound on the ratio of the absolute error in the singular values of $A$ to the relative error in $\|A\|_2$ is decreased by reducing $\|A\|_1$. The upper bound in (6.31) is expressed in terms of the 1-norm of $A$, rather than its 2-norm, because the diagonal matrix $Q$ postmultiplies $\bar{S}(\tilde{f}, \alpha_1\tilde{g})$, and its effect is therefore most clearly quantified by examining the column sums of $S(\dot{f}, \dot{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$. The application of (6.31) to $S(\dot{f}, \dot{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ yields

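$$\frac{|\delta\sigma_i(S)|}{\Delta S} \le \sqrt{m+n}\,\|S\|_1, \qquad i = 1, \ldots, m+n, \qquad S = S(\dot{f}, \dot{g}),$$

and

$$\frac{|\delta\sigma_i(\bar{S}Q)|}{\Delta(\bar{S}Q)} \le \sqrt{m+n}\,\|\bar{S}Q\|_1, \qquad i = 1, \ldots, m+n, \qquad \bar{S}Q = \bar{S}(\tilde{f}, \alpha_1\tilde{g})Q,$$

and since $\Delta S = \Delta(\bar{S}Q) \approx \varepsilon$, where $\varepsilon$ is the componentwise signal-to-noise ratio, it follows that if

$$\|\bar{S}Q\|_1 < \|S\|_1, \tag{6.32}$$

the singular nature of $S(\hat{f}, \hat{g})$ is more faithfully preserved when computations are performed on $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ than when computations are performed on $S(\dot{f}, \dot{g})$.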
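The chain of inequalities in (6.31) can be checked numerically. The following is a minimal sketch, assuming NumPy and a random test matrix rather than any of the Sylvester matrices in this chapter; the function name `singular_value_sensitivity` and the test data are illustrative only.

```python
import numpy as np


def singular_value_sensitivity(A, dA):
    """Return the ratios |delta sigma_i(A)| / Delta A, sigma_1(A) and sqrt(q)*||A||_1."""
    A = np.asarray(A, dtype=float)
    dA = np.asarray(dA, dtype=float)
    s = np.linalg.svd(A, compute_uv=False)            # singular values of A
    s_pert = np.linalg.svd(A + dA, compute_uv=False)  # singular values of A + dA
    abs_err = np.abs(s_pert - s)                      # |delta sigma_i(A)|
    delta_A = np.linalg.norm(dA, 2) / np.linalg.norm(A, 2)  # relative error Delta A
    q = A.shape[1]
    bound = np.sqrt(q) * np.linalg.norm(A, 1)         # sqrt(q) * max absolute column sum
    return abs_err / delta_A, s[0], bound


# Hypothetical test data: a random 8 x 5 matrix and a small random perturbation.
rng = np.random.default_rng(0)
A = rng.standard_normal((8, 5))
dA = 1e-6 * rng.standard_normal((8, 5))
ratios, sigma1, bound = singular_value_sensitivity(A, dA)
print(ratios.max(), sigma1, bound)
```

Up to rounding error, every entry of `ratios` should not exceed `sigma1`, and `sigma1` should not exceed `bound`, which mirrors the two inequalities in (6.31).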

Figures 6.5, 6.6 and 6.7 show the column sums of $S(\dot{f}, \dot{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$,

$$\sigma_j = \log \sum_{i=1}^{m+n} \left| S(\dot{f}, \dot{g}) \right|_{i,j}, \qquad \tau_j = \log \sum_{i=1}^{m+n} \left| \bar{S}(\tilde{f}, \alpha_1\tilde{g})Q \right|_{i,j}, \qquad j = 1, \ldots, m+n,$$

for Examples 6.4, 6.5 and 6.6 respectively, where $|P|_{i,j}$ denotes the absolute value of element $(i, j)$ of $P$ and $\log = \log_{10}$. The figures show that the maximum absolute column sum of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ is significantly smaller than the maximum absolute column sum of $S(\dot{f}, \dot{g})$, and thus the inequality (6.32) is satisfied, which implies that the inclusion of $\alpha_1$ and $\theta_1$, and $Q$, yields greatly improved computational results.

Furthermore, the column sums of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ span a smaller range than the column sums of $S(\dot{f}, \dot{g})$ by several orders of magnitude. For example, Figure 6.6 shows that the column sums $\tau_j$ span approximately 3 orders of magnitude, but the column sums $\sigma_j$ span about 10 orders of magnitude.
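The quantities plotted in these figures are straightforward to compute for any matrix. The sketch below, assuming NumPy, evaluates the base-10 logarithm of the absolute column sums and the number of orders of magnitude they span; the function names and the small test matrix are hypothetical and are not taken from the examples.

```python
import numpy as np


def log10_column_sums(M):
    """log10 of the sum of absolute values of the entries in each column of M."""
    M = np.asarray(M, dtype=float)
    return np.log10(np.abs(M).sum(axis=0))


def column_sum_summary(M):
    """Largest log10 column sum (log10 of the 1-norm) and the range spanned."""
    s = log10_column_sums(M)
    return {"log10_one_norm": s.max(), "orders_spanned": s.max() - s.min()}


# Hypothetical 2 x 2 matrix with absolute column sums 10 and 1000.
M = np.array([[1.0, 100.0],
              [-9.0, 900.0]])
summary = column_sum_summary(M)
print(log10_column_sums(M), summary)
```

Applying `column_sum_summary` to two candidate matrices gives a direct comparison of the kind made between $\sigma_j$ and $\tau_j$. A column of zeros yields a logarithm of minus infinity, so such columns would need separate handling.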

Consider now the superiority of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ with respect to $\bar{S}(\acute{f}, \alpha_2\acute{g})$. The above analysis suggests that it is desirable to examine the absolute column sums of $\bar{S}(\acute{f}, \alpha_2\acute{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$.

It follows from (3.7), (3.8), (3.25), (3.28), (6.4) and (6.7) that the sums of the absolute values of the entries in each of the first $n$ columns, and each of the last $m$ columns, of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ are [...]

Similarly, it follows from (3.7), (3.8), (3.11) and (6.9) that the sums of the absolute values of the entries in each of the first $n$ columns, and each of the last $m$ columns, of $\bar{S}(\acute{f}, \alpha_2\acute{g})$ are [...]

[...] faithfully preserved when computations are performed on $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ than when computations are performed on $\bar{S}(\acute{f}, \alpha_2\acute{g})$.

It is seen from Examples 6.4, 6.5 and 6.6 that since $\alpha_1$ for $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ and $\alpha_2$ for $\bar{S}(\acute{f}, \alpha_2\acute{g})$ are $O(1)$, and are therefore much smaller than terms of the form $\binom{m}{i}$, $\binom{n}{j}$ and $\binom{m+n-1}{k}$, they can be set equal to one. In addition, it is seen from these three examples that $\theta_1$ for $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ is approximately equal to $\theta_2$ for $\bar{S}(\acute{f}, \alpha_2\acute{g})$, that is, $\theta_1 \approx \theta_2$. Therefore, the effect of $\alpha_1$ and $\theta_1$ on $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ is similar to the effect of $\alpha_2$ and $\theta_2$ on $\bar{S}(\acute{f}, \alpha_2\acute{g})$, and thus the difference between $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ and $\bar{S}(\acute{f}, \alpha_2\acute{g})$ is mainly caused by the matrix $Q$. In particular, since $\alpha_1 = O(1)$ and $\alpha_2 = O(1)$, and $\theta_1 \approx \theta_2$ for Examples 6.4, 6.5 and 6.6, and $\lambda$, $\mu$, $\eta$ and $\rho$ are normalization constants, it follows that the differences between (6.33) and (6.34), and (6.36) and (6.37), arise from the combinatorial factors $\binom{n-1}{j-1}$, $j = 1, \ldots, n$, and $\binom{m-1}{j-1}$, $j = 1, \ldots, m$, in (6.33) and (6.34) respectively.

Figures 6.8, 6.9 and 6.10 show the column sums of $\bar{S}(\acute{f}, \alpha_2\acute{g})$ and $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ for Examples 6.4, 6.5 and 6.6 respectively. It is seen from these figures that the variation in magnitude of $\{\Gamma_j(a), \Gamma_k(b)\}$ is much smaller than the variation in magnitude of $\{\Sigma_j(a), \Sigma_k(b)\}$ for $j = 1, \ldots, n$, and $k = 1, \ldots, m$. In particular, the maximum value of $\{\Gamma_j(a), \Gamma_k(b)\}$ is less than the maximum value of $\{\Sigma_j(a), \Sigma_k(b)\}$ for $j = 1, \ldots, n$, and $k = 1, \ldots, m$, and it therefore follows from (6.35) and (6.38) that (6.39) is satisfied. This confirms that the inclusion of $Q$ is important for improved computational results because it reduces the column sums of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$.

The parameters $\alpha_1$ and $\theta_1$ are included in the computation of $d$ in order to minimize the ratio of the entry of maximum absolute value to the entry of minimum absolute value of $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$, where $\tilde{f} = \tilde{f}(w, \theta)$ and $\tilde{g} = \tilde{g}(w, \theta)$ are defined in (6.27) and (6.28) respectively. This objective is consistent with the effect of $Q$, but there is an important difference between $\alpha_1$ and $\theta_1$, and $Q$:

• The parameters $\alpha_1$ and $\theta_1$ are functions of the Bernstein basis coefficients $\bar{a}_i$ and $\bar{b}_j$, combinatorial factors of the forms $\binom{m}{i}$, $\binom{n}{j}$ and $\binom{m+n-1}{k}$, and the entries of $Q$. The importance of their inclusion therefore increases as the degrees of $f(x)$ and $g(x)$, and the variation in magnitude of the coefficients $\bar{a}_i$ and $\bar{b}_j$, increase.

• The entries of $Q$ are functions of $m$ and $n$, but they are independent of the coefficients $\bar{a}_i$ and $\bar{b}_j$. They therefore mitigate the effects of the combinatorial factors $\binom{m}{i}$, $\binom{n}{j}$ and $\binom{m+n-1}{k}$, for large values of $m$ and $n$, in a Sylvester matrix.
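The reason a diagonal postmultiplier acts directly on the column sums can be illustrated with a short sketch. It assumes NumPy, and the matrix `A` and the diagonal entries `q` below are hypothetical values chosen for illustration; they are not the Sylvester matrix or the matrix $Q$ defined in this work. Postmultiplying by a diagonal matrix scales column $j$ by its $j$th diagonal entry, so the 1-norm of the product is the largest scaled absolute column sum.

```python
import numpy as np


def one_norm_after_column_scaling(A, q):
    """1-norm of A @ diag(q), computed from the absolute column sums of A."""
    A = np.asarray(A, dtype=float)
    q = np.asarray(q, dtype=float)
    col_sums = np.abs(A).sum(axis=0)      # absolute column sums of A
    return np.max(np.abs(q) * col_sums)   # column j is scaled by |q_j|


# Hypothetical data: the middle column dominates the 1-norm of A.
A = np.array([[1.0, 200.0, 3.0],
              [4.0, -500.0, 6.0]])
q = np.array([1.0, 0.01, 1.0])            # illustrative diagonal entries only

norm_before = np.linalg.norm(A, 1)
norm_after = one_norm_after_column_scaling(A, q)
print(norm_before, norm_after)
```

In this toy case the scaling reduces the dominant column sum, so the 1-norm of the scaled matrix is much smaller than that of `A`. Whether a given diagonal matrix reduces the norm depends on how its entries line up with the large columns.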

6.4 Summary

This chapter has introduced the computation of the degree of an AGCD of two inexact Bernstein polynomials $f(x)$ and $g(x)$ using their Sylvester matrix. It was shown that in order to obtain the best results, it is necessary to include the diagonal matrix $Q$ and preprocess the Sylvester matrix $S(f, g)Q$ by three operations, such that all the computations are performed on the Sylvester matrix $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$, where $\alpha$ is a scaling parameter. The third preprocessing operation introduces the parameter $\theta$, which transforms the polynomials from the Bernstein basis to the modified Bernstein basis. The optimal values of $\alpha$ and $\theta$ are obtained from the solution of a linear programming problem.

In Chapters 5 and 6, the degree of an AGCD of inexact polynomials is determined by observing the variation of the singular values of their Bézout and Sylvester resultant matrices. Furthermore, some advanced techniques using the first principal angle and the residual of an approximate linear algebraic equation can also be used to estimate the degree of an AGCD, which involves the Sylvester subresultant matrices. These issues are discussed in the next chapter.
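As an illustration of what observing the variation of the singular values can mean in practice, the sketch below computes the normalized singular values $\log_{10}(\sigma_i/\sigma_1)$ of a matrix and places the numerical rank at the largest gap between consecutive values. It assumes NumPy and a nonzero matrix. The largest-gap rule is one simple heuristic adopted here as an assumption, not necessarily the criterion used in this work, and the function names and test matrix are hypothetical.

```python
import numpy as np


def normalized_log_singular_values(M):
    """log10(sigma_i / sigma_1) for the singular values of a nonzero matrix M."""
    s = np.linalg.svd(np.asarray(M, dtype=float), compute_uv=False)
    tiny = np.finfo(float).tiny                 # guard against log10(0)
    return np.log10(np.maximum(s, tiny) / s[0])


def rank_from_largest_gap(M):
    """Estimate the numerical rank as the index of the largest drop in log10(sigma_i)."""
    v = normalized_log_singular_values(M)
    gaps = v[:-1] - v[1:]                       # non-negative drops between neighbours
    return int(np.argmax(gaps)) + 1


# Hypothetical 5 x 5 matrix with three significant singular values.
rng = np.random.default_rng(1)
U, _ = np.linalg.qr(rng.standard_normal((5, 5)))
V, _ = np.linalg.qr(rng.standard_normal((5, 5)))
s_true = np.array([1.0, 0.5, 0.1, 1e-12, 1e-13])
M = U @ np.diag(s_true) @ V.T

estimated_rank = rank_from_largest_gap(M)
print(normalized_log_singular_values(M), estimated_rank)
```

The heuristic is reliable only when a single gap clearly dominates, which is the situation the preprocessing operations are intended to produce.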

Figure 6.2: The normalized singular values, $\log_{10}(\sigma_i/\sigma_1)$ against $i$, of (a) $S(\dot{f}, \dot{g})$, (b) $\bar{S}(\acute{f}, \alpha_2\acute{g})$ and (c) $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ for Example 6.4. The index $i = 40$ is marked in each panel.

Figure 6.3: The normalized singular values, $\log_{10}(\sigma_i/\sigma_1)$ against $i$, of (a) $S(\dot{f}, \dot{g})$, (b) $\bar{S}(\acute{f}, \alpha_2\acute{g})$ and (c) $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ for Example 6.5. The index $i = 42$ is marked in each panel.

Figure 6.4: The normalized singular values, $\log_{10}(\sigma_i/\sigma_1)$ against $i$, of (a) $S(\dot{f}, \dot{g})$, (b) $\bar{S}(\acute{f}, \alpha_2\acute{g})$ and (c) $\bar{S}(\tilde{f}, \alpha_1\tilde{g})Q$ for Example 6.6. The index $i = 47$ is marked in each panel.
