Chapter 3 - From Gaussian Elimination to LU Factorization

(1)

Chapter 3 - From Gaussian Elimination to LU Factorization

Maggie Myers Robert A. van de Geijn The University of Texas at Austin

Practical Linear Algebra – Fall 2009

(2)

Gaussian Elimination - Take 1

(3)

Consider the system of linear equations 2x + 4y − 2z = −10 4x − 2y + 6z = 20 6x − 4y + 2z = 18

Notice that x, y, and z are just variables, for which we can pick any name we want. To be consistent with the notation we introduced previously for naming components of vectors, we use the names χ

₀

, χ

₁

, and and χ

₂

instead of x, y, and z, respectively:

2χ

₀

+ 4χ

₁

− 2χ

₂

= −10

4χ

0

− 2χ

₁

+ 6χ

2

= 20

6χ − 4χ + 2χ = 18

(4)

2χ

₀

+ 4χ

₁

− 2χ

₂

= −10 4χ

₀

− 2χ

₁

+ 6χ

₂

= 20 6χ

0

− 4χ

₁

+ 2χ

2

= 18

Solving this linear system relies on the fact that its solution does not change if

1

Equations are reordered (not actually used in this example);

and/or

2

An equation in the system is modified by subtracting a multiple of another equation in the system from it; and/or

3

Both sides of an equation in the system are scaled by a

nonzero.

(5)

Example: Gaussian Elimination

The following steps are knows as Gaussian elimination. They transform a system of linear equations to an equivalent upper triangular system of linear equations:

Subtract λ

10

= (4/2) = 2 times the first equation from the second equation:

Before After

2χ

0

+ 4χ

1

− 2χ

₂

= −10 4χ

₀

− 2χ

₁

+ 6χ

₂

= 20 6χ

0

− 4χ

₁

+ 2χ

2

= 18

2χ

0

+ 4χ

1

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

₂

= 40

6χ

0

− 4χ

₁

+ 2χ

2

= 18

(6)

Subtract λ

₂₀

= (6/2) = 3 times the first equation from the third equation:

Before After

2χ

0

+ 4χ

1

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

2

= 40 6χ

₀

− 4χ

₁

+ 2χ

₂

= 18

2χ

0

+ 4χ

1

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

2

= 40

− 16χ

₁

+ 8χ

₂

= 48 Subtract λ

21

= ((−16)/(−10)) = 1.6 times the second equation from the third equation:

Before After

2χ

₀

+ 4χ

₁

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

₂

= 40

− 16χ

₁

+ 8χ

2

= 48

2χ

₀

+ 4χ

₁

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

₂

= 40

− 8χ

₂

= −16

(7)

This now leaves us with an upper triangular system of linear equations.

Multipliers

In the above Gaussian elimination procedure, λ

₁₀

, λ

₂₀

, and λ

₂₁

are

called the multipliers.

(8)

Back substitution

2χ

0

+ 4χ

1

− 2χ

₂

= −10

− 10χ

₁

+ 10χ

2

= 40

− 8χ

₂

= −16 Solve last equation: χ

₂

= −16/(−8) = 2.

Substitute χ

₂

= 2 into second equation and solve:

χ

1

= (40 − 10(2))/(−10) = −2.

Substitute χ

2

= 2 and χ

1

= −2 into first equation and solve:

χ

₀

= (−10 − (4(−2) + (−2)(−2)))/2 = 1.

Thus, the solution is the vector x =



 χ

0

χ

₁

χ

₂



 =



 1

−2 2



.

(9)

Gaussian Elimination - Take 2

(10)

It becomes very cumbersome to always write the entire equation.

The information is encoded in the coefficients in front of the χ

_i

variables, and the values to the right of the equal signs.

We could just let





2 4 −2 −10

4 −2 6 20

6 −4 2 18



 represent

2χ

₀

+ 4χ

₁

− 2χ

₂

= −10

4χ

0

− 2χ

₁

+ 6χ

2

= 20

6χ

₀

− 4χ

₁

+ 2χ

₂

= 18

Then Gaussian elimination can simply work with this array of

numbers.

(11)

Initial system of equations:





2 4 −2 −10 4 −2 6 20

6 −4 2 18





Subtract λ

10

= (4/2) = 2 times the first row from the second row:





2 4 −2 −10

0 −10 10 40

6 −4 2 18





Subtract λ

20

= (6/2) = 3 times the first row from the third row:





2 4 −2 −10

0 −10 10 40

0 −16 8 48





Subtract λ

21

= ((−16)/(−10)) = 1.6 times the second row from the third row:





2 4 −2 −10

0 −10 10 40

0 0 −8 −16





(12)

Back substitution





2 4 −2 −10

0 −10 10 40

0 0 −8 −16





The last row is shorthand for − 8χ

2

= −16 which implies χ

2

= (−16)/(−8) = 2 The second row is shorthand for − 10χ

1

+ 10χ

2

= 40 which implies − 10χ

1

+ 10(2) = 40

and hence χ

1

= (40 − 10(2))/(−10) = −2 The first row is shorthand for 2χ

0

+ 4χ

1

− 2χ

2

= −10 which implies 2χ

0

+ 4(−2) − 2(2) = −10 and hence χ

0

= (−10 − 4(−2) + 2(2))/(2) = 1

Solution equals x =

0 @ χ

0

χ

1

χ

2

1 A =

0 @ 1

−2 2

1 A

Check the answer (by plugging χ

0

= 1, χ

1

= −2, and χ

2

= 2 into the original system)

2(1) + 4(−2) − 2(2) = −10 X

4(1) − 2(−2) + 6(2) = 20 X

6(1) − 4(−2) + 2(2) = 18 X

(13)

Observations

The above discussion motivates storing only the coefficients of a linear system (the numbers to the left of the |) as a two dimensional array and the numbers to the right as a one dimension array.

We recognize this two dimensional array as a matrix:

A ∈ R

^m×n

is the two dimensional array of scalars

A =







α

_0,0

α

_0,1

· · · α

_0,n−1

α

1,0

α

1,1

· · · α

1,n−1

.. . .. . . .. .. . α

_m−1,0

α

_m−1,1

· · · α

_m−1,n−1





 ,

where α

i,j

∈ R for 0 ≤ i < m and 0 ≤ j < n.

(14)

Observations (continued)

We similarly recognize that the one dimensional array is a (column) vector x ∈ R

ⁿ

where

x =





 χ

0

χ

1

.. . χ

n−1





 .

The length of the vector is n.

Now, given A ∈ R

^m×n

and vector x ∈ R

ⁿ

, the notation Ax stands for







α

_0,0

χ

₀

+ α

_0,1

χ

₁

+ · · · + α

_0,n−1

χ

_n−1

α

1,0

χ

0

+ α

1,1

χ

1

+ · · · + α

1,n−1

χ

n−1

.. . .. . .. . .. .

α

_m−1,0

χ

₀

+ α

_m−1,1

χ

₁

+ · · · + α

_m−1,n−1

χ

_n−1







(15)

Gaussian Elimination - Take 3

(16)

Example





1 0 0

−2 1 0 0 0 1









2 4 −2

4 −2 6

6 −4 2



 =





2 4 −2

0 −10 10

6 −4 2



 .

(17)

Exercise Compute





1 0 0 0 1 0

−3 0 1









2 4 −2

0 −10 10

6 −4 2



 .

How can this be described as an axpy operation?

(18)

0 @

2

4 −2 −10

4

−2 6 20

6 −4 2 18

1 A 0

@

1 0 0

−2 1 0 0 0 1

1 A

0 @

2

4 −2 −10

4

−2 6 20

6 −4 2 18

1 A =

0 @

2

4 −2 −10

0 −10 10 40

6

−4 2 18

1 A 0

@

1 0 0 0 1 0

−3 0 1 1 A

0 @

2

4 −2 −10 0 −10 10 40

6

−4 2 18

1 A =

0 @

2 4 −2 −10 0

−10

10 40 0

−16

8 48

1 A 0

@

1 0 0

0 1 0

0 −1.6 1 1 A

0 @

2 4 −2 −10 0

−10

10 40 0

−16

8 48

1 A =

0 @

2 4 −2 −10 0 −10 10 40 0 0 −8 −16

1 A

(19)





2 4 −2

4 −2 6

6 −4 2









1 0 0

−2 1 0 0 0 1









2 4 −2

4 −2 6

6 −4 2



 =





2 4 −2

2

−10 10

6 −4 2









1 0 0 0 1 0

−3 0 1









2 4 −2

2

−10 10

6 −4 2



 =





2 4 −2

2

−10 10

3

−16 8









1 0 0

0 1 0

0 −1.6 1









2 4 −2

2

−10 10

3

−16 8



 =





2 4 −2

2

−10 10

3 1.6

−8





(20)





−10 20 18









1 0 0

−2 1 0 0 0 1









−10 20 18



 =





−10 40 18









1 0 0 0 1 0

−3 0 1









−10 40 18



 =





−10 40 48









1 0 0

0 1 0

0 −1.6 1









−10 40 48



 =





−10 40

−16





Back substitution as before

(21)

Gaussian Elimination - Take 4

(22)

Example





1 0 0

−2 1 0

−3 0 1









2 4 −2

4 −2 6

6 −4 2



 =





2 4 −2

0 −10 10

0 −16 8





(23)





2 4 −2

4 −2 6 6 −4 2









1 0 0

−2 1 0

−3 0 1









2 4 −2

4 −2 6 6 −4 2



 =





2 4 −2

2

−10 10

3

−16 8









1 0 0

0 1 0

0 −1.6 1









2 4 −2

2

−10 10

3

−16 8



 =





2 4 −2

2

−10 10

3 1.6

−8





(24)

Forward substitution





−10 20 18









1 0 0

−2 1 0

−3 0 1









−10 20 18



 =





−10 40 48









1 0 0

0 1 0

0 −1.6 1









−10 40 48



 =





−10 40

−16





Back substitution as before

(25)

Theorem

Let ˆ L

j

be a matrix that equals the identity, except that for i > jthe (i, j) elements (the ones below the diagonal in the jth column) have been replaced with −λ

i,j

:

L ˆ

_j

=







I

_j

0 0 0 · · · 0

0 1 0 0 · · · 0

0 −λ

_j+1,j

1 0 · · · 0 0 −λ

_j+2,j

0 1 · · · 0 .. . .. . .. . .. . . .. ...

0 −λ

_m−1,j

0 0 · · · 1





 .

Then ˆ L

j

A equals the matrix A except that for i > j the ith row is

(26)

Exercise Verify that





1 0 0

0 1 0

0 −1.6 1









1 0 0

−2 1 0

−3 0 1









2 4 −2

4 −2 6

6 −4 2





=





2 4 −2

0 −10 10

0 0 −8





and





1 0 0

0 1 0

0 −1.6 1









1 0 0

−2 1 0

−3 0 1









−10 20 18



 =





−10 40

−16



 .

(27)

Gaussian Elimination - Take 4

(28)

Example Consider





1 0 0

−λ

₁₀

1 0

−λ

₂₀

0 1









2 4 −2

4 −2 6 6 −4 2





=





2 4 −2

4 − λ

10

(2) −2 − λ

₁₀

(4) 6 − λ

10

(−2) 6 − λ

₂₀

(2) −4 − λ

₂₀

(4) 2 − λ

₂₀

(−2)





How should λ

₁₀

and λ

₂₀

be chosen so that zeroes are introduced below the diagonal in the first column?.

Examine 4 − λ

₁₀

(2) and 6 − λ

₂₀

(2).

λ

₁₀

= 4/2 = and λ

₂₀

= 6/2 = 3 have the desired property.

(29)

Example

Alternatively, we can write this as

0 @

1 `

0 0 ´

−

„ λ

10

λ

20

« „ 1 0 0 1

« 1 A

0 @

2 `

4 −2 ´

„ 4 6

« „

−2 6

−4 2

« 1 A

= 0

@

2 `

4 −2 ´

−

„ λ

10

λ

20

« 2 +

„ 4 6

«

−

„ λ

10

λ

20

«

` 4 −2 ´ +

„ −2 6

−4 2

« 1 A

To zero the elements below the diagonal in the first column:

−

λ

₁₀

λ

20

2 +

4 6

=

0 0

or, equivalently,

(30)

Generalizing this insight

Let A

⁽⁰⁾

∈ R

^n×n

and ˆ L

⁽⁰⁾

a Gauss transform. Partition A

⁽⁰⁾

→ α

⁽⁰⁾₁₁

a

^{(0) T}₁₂

a

⁽⁰⁾₂₁

A

⁽⁰⁾₂₂

!

, ˆ L

⁽⁰⁾

→ 1 0

−l

⁽⁰⁾₂₁

I

! .

Then ˆ L

⁽⁰⁾

A

⁽⁰⁾

=

1 0

−l

⁽⁰⁾₂₁

I

!

α

₁₁⁽⁰⁾

a

^{(0) T}₁₂

a

⁽⁰⁾₂₁

A

⁽⁰⁾₂₂

!

= α

₁₁⁽⁰⁾

a

^{(0) T}₁₂

a

⁽⁰⁾₂₁

− l

⁽⁰⁾₂₁

α

⁽⁰⁾₁₁

A

⁽⁰⁾₂₂

− l

₂₁

a

^{(0) T}₁₂

!

.

(31)

Generalizing this insight (continued)

α

⁽⁰⁾₁₁

a

^{(0) T}₁₂

a

⁽⁰⁾₂₁

− l

⁽⁰⁾₂₁

α

⁽⁰⁾₁₁

A

⁽⁰⁾₂₂

− l

21

a

^{(0) T}₁₂

! .

Choose l

⁽⁰⁾₂₁

so that a

⁽⁰⁾₂₁

− l

⁽⁰⁾₂₁

α

⁽⁰⁾₁₁

= 0:

l

₂₁⁽⁰⁾

= a

⁽⁰⁾₂₁

/α

⁽⁰⁾₁₁

.

A

⁽⁰⁾₂₂

→ A

⁽⁰⁾₂₂

− l

⁽⁰⁾₂₁

a

^{(0) T}₁₂

: this is a rank-1 update (ger).

Update

A

⁽¹⁾

:=

„ 1 0

−l

₂₁⁽⁰⁾

I

« α

⁽⁰⁾₁₁

a

^{(0) T}₁₂

a

⁽⁰⁾₂₁

A

⁽⁰⁾₂₂

!

(32)

Example Consider





1 0 0

0 1 0

0 −λ

₂₁

1 







2 4 −2

0 −10 10

0 −16 8





=





2 4 −2

0 −10 10

0 −16 − λ

₂₁

(−10) 8 − λ

₂₀

(10)





How should λ

₂₁

be chosen?

−16 − λ

₂₁

(−10) = 0 so that λ

₂₁

= −16/(−10) = 1.6 has the desired property.

Alternatively, we notice that, viewed as a vector,

λ

₂₁

= −16 /(−10).

(33)

Moving on

A

⁽¹⁾

→







A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 a

⁽¹⁾₂₁

A

⁽¹⁾₂₂





 , ˆ L

⁽¹⁾

→







I

₁

0 0

0 1 0

0 −l

⁽¹⁾₂₁

I





 .

Then







I

₁

0 0

0 1 0

0 −l

₂₁⁽¹⁾

I













A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 a

⁽¹⁾₂₁

A

⁽¹⁾₂₂







=





A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾

a

^{(1) T}



 .

(34)

Moving on

0 B

@

I

1

0 0

0 1 0

0 −l

₂₁⁽¹⁾

I 1 C A

0 B

@

A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 a

⁽¹⁾₂₁

A

⁽¹⁾₂₂

1 C A

= 0 B

@

A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 a

⁽¹⁾₂₁

− l

₂₁⁽¹⁾

α

⁽¹⁾₁₁

A

⁽¹⁾₂₂

− l

⁽¹⁾₂₁

a

^{(1) T}₁₂

1 C A .

Now,

Choose l

⁽¹⁾₂₁

so that a

⁽¹⁾₂₁

− l

⁽¹⁾₂₁

α

⁽¹⁾₁₁

= 0: l

⁽¹⁾₂₁

= a

⁽¹⁾₂₁

/α

⁽¹⁾₁₁

. A

⁽¹⁾₂₂

→ A

⁽¹⁾₂₂

− l

⁽¹⁾₂₁

a

^{(1) T}₁₂

: this is a rank-1 update (ger).

A

⁽²⁾

= 0 B

@

I

1

0 0

0 1 0

0 −l

⁽¹⁾₂₁

I 1 C A

0 B

@

A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 a

⁽¹⁾₂₁

A

⁽¹⁾₂₂

1 C A =

0 B

@

A

⁽¹⁾₀₀

a

⁽¹⁾₀₁

A

⁽¹⁾₀₂

0 α

⁽¹⁾₁₁

a

^{(1) T}₁₂

0 0 A

⁽²⁾₂₂

1 C

A

(35)

More general yet

A

^(k)

→







A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 a

^(k)₂₁

A

^(k)₂₂





 , ˆ L

^(k)

→







I

_k

0 0

0 1 0

0 −l

^(k)₂₁

I





 ,

where A

^(k)₀₀

and I

_k

are k × k matrices. Then







I

_k

0 0

0 1 0

0 −l

^(k)₂₁

I













A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 a

^(k)₂₁

A

^(k)₂₂







 A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂



(36)

0 B

@

I

k

0 0

0 1 0

0 −l

^(k)₂₁

I 1 C A

0 B

@

A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 a

^(k)₂₁

A

^(k)₂₂

1 C A

= 0 B

@

A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 a

^(k)₂₁

− l

₂₁^(k)

α

^(k)₁₁

A

^(k)₂₂

− l

^(k)₂₁

a

^{(k) T}₁₂

1 C A .

Choose l

^(k)₂₁

so that a

^(k)₂₁

− l

^(k)₂₁

α

^(k)₁₁

= 0: l

₂₁^(k)

= a

^(k)₂₁

/α

^(k)₁₁

. A

^(k)₂₂

→ A

^(k)₂₂

− l

₂₁^(k)

a

^{(k) T}₁₂

:

A

^(k+1)

= 0 B

@

I

1

0 0

0 1 0

0 −l

₂₁^(k)

I 1 C A

0 B

@

A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 a

^(k)₂₁

A

^(k)₂₂

1 C A

= 0 B

@

A

^(k)₀₀

a

^(k)₀₁

A

^(k)₀₂

0 α

^(k)₁₁

a

^{(k) T}₁₂

0 0 A

^(k+1)₂₂

1 C

A

(37)

A := GE Take5 (A) Partition A →

„ A

T L

A

T R

A

BL

A

BR

«

where A

T L

is 0 × 0 while m(A

T L

) < m(A) do

Repartition

„ A

T L

A

T R

A

BL

A

BR

«

→ 0

@

A

00

a

01

A

02

a

^T10

α

11

a

^T12

A

20

a

21

A

22

1 A where α

11

is 1 × 1

a

21

:= a

21

/α

11

(= l

21

)

A

22

:= A

22

− a

21

a

^T12

(= A

22

− l

21

a

^T12

) Continue with

„ A

T L

A

T R

«

←

0 A

00

a

01

A

02

a

^T10

α

11

a

^T12

1

(38)

Insights

Now, if A ∈ R

^n×n

, then

A

⁽ⁿ⁾

= ˆ L

⁽ⁿ⁻¹⁾

· · · ˆ L

⁽¹⁾

L ˆ

⁽⁰⁾

A = U,

an upper triangular matrix. Also, to solve Ax = b, we note that U x = ( ˆ L

⁽ⁿ⁻¹⁾

· · · ˆ L

⁽¹⁾

L ˆ

⁽⁰⁾

A)x = ˆ L

⁽ⁿ⁻¹⁾

· · · ˆ L

⁽¹⁾

L ˆ

⁽⁰⁾

b

| {z }

ˆ b

.

The right-hand size of this we recognize as forward substitution

applied to vector b. We will later see that solving U x = ˆ b where U

is upper triangular is equivalent to back substitution.

(39)

The reason why we got to this point as “GE Take 5” is so that the reader, hopefully, now recognizes this as just Gaussian elimination.

The insights in this section are summarized in the algorithm,

in which the original matrix A is overwritten with the upper

triangular matrix that results from Gaussian elimination and

the strictly lower triangular elements are overwritten by the

multipliers.

(40)

Gaussian Elimination - Take 6

(41)

Inverse of a Matrix

Let A ∈ R

^n×n

and B ∈ R

^n×n

have the property that AB = BA = I.

Then B is said to be the inverse of matrix A and is denoted by A

⁻¹

.

Later we will see that for square A and B it is always the case that

if AB = I then BA = I and that the inverse of a matrix is unique.

(42)

Example Let

L = ˆ 0

@

1 0 0

−2 1 0 0 0 1

1 A and L = 0

@

1 0 0 2 1 0 0 0 1

1 A .

Then

L ˆ L = 0

@

1 0 0 2 1 0 0 0 1

1 A

0 @

1 0 0

−2 1 0 0 0 1

1 A =

0 @

1 0 0 0 1 0 0 0 1

1 A .

This should be intuitively true:

LA subtracts two times the first row from the second row. ˆ LA adds two times the first row from the second row.

L ˆ LA = L( ˆ LA) = A. Why?

Two transformations that always undo each other are inverses

of each other.

(43)

Exercise Compute





1 0 0

−2 1 0

−3 0 1









1 0 0 2 1 0 3 0 1





and reason why this should be intuitively true.

(44)

Theorem If

L = ˆ





I

_k

0 0

0 1 0

0 −l

₂₁

I



 then L =





I

_k

0 0

0 1 0

0 l

₂₁

I





is its inverse: L ˆ L = ˆ LL = I.

Proof

LL ˆ =





I

k

0 0

0 1 0

0 −l

₂₁

I









I

k

0 0

0 1 0

0 l

₂₁

I





=





I

_k

0 0

0 1 0

0 −l

₂₁

+ Il

21

I



 =





I

_k

0 0 0 1 0 0 0 I



 = I.

Similarly ˆ LL = I. (Notice that when we use I without indicating its dimensions, it has the dimensions that are required to fit the situation.)

http://z.cs.utexas.edu/wiki/pla.wiki/ 44

(45)

Exercise Recall that





1 0 0

0 1 0

0 −1.6 1









1 0 0

−2 1 0

−3 0 1









2 4 −2

4 −2 6

6 −4 2





=





2 4 −2

0 −10 10

0 0 −8



 .

Show that





2 4 −2

4 −2 6

6 −4 2





(46)

Exercise Show that





1 0 0 2 1 0 3 0 1









1 0 0

0 1 0

0 1.6 1



 =





1 0 0

2 1 0

3 1.6 1





so that





2 4 −2

4 −2 6

6 −4 2



 =





1 0 0

2 1 0

3 1.6 1









2 4 −2

0 −10 10

0 0 −8



 .

(47)

Theorem

Let ˆ L

⁽⁰⁾

, · · · , ˆ L

⁽ⁿ⁻¹⁾

be the sequence of Gauss transforms that transform an n × n matrix A to an upper triangular matrix:

L ˆ

⁽ⁿ⁻¹⁾

· · · ˆ L

⁽⁰⁾

A = U.

Then

A = L

⁽⁰⁾

· · · L

⁽ⁿ⁻²⁾

L

⁽ⁿ⁻¹⁾

U,

where L

^(j)

= ˆ L

^{(j) −1}

, the inverse of ˆ L

^(j)

.

(48)

Proof If

L ˆ

⁽ⁿ⁻¹⁾

L ˆ

⁽ⁿ⁻²⁾

· · · ˆ L

⁽⁰⁾

A = U.

then

A = L

⁽⁰⁾

· · · L

⁽ⁿ⁻²⁾

L

⁽ⁿ⁻¹⁾

L ˆ

⁽ⁿ⁻¹⁾

| {z }

I

L ˆ

⁽ⁿ⁻²⁾

| {z }

I

· · · ˆ L

⁽⁰⁾

| {z }

I

A

= L

⁽⁰⁾

· · · L

⁽ⁿ⁻²⁾

L

⁽ⁿ⁻¹⁾

L ˆ

⁽ⁿ⁻¹⁾

L ˆ

⁽ⁿ⁻²⁾

· · · L

⁽⁰⁾

A

| {z }

U

= L

⁽⁰⁾

· · · L

⁽ⁿ⁻²⁾

L

⁽ⁿ⁻¹⁾

U.

(49)

Lemma

Let ˆ L

⁽⁰⁾

, . . . , ˆ L

⁽ⁿ⁻¹⁾

be the sequence of Gauss transforms that transforms a matrix A into an upper triangular matrix U :

L ˆ

⁽ⁿ⁻¹⁾

· · · ˆ L

⁽⁰⁾

A = U and let L

^(j)

= ˆ L

^{(j) −1}

. Then

L ˜

^(k)

= L

⁽⁰⁾

· · · L

^(k−1)

L

^(k)

has the structure

L ˜

^(k)

=

L ˜

^(k)_{T L}

0 L ˜

^(k)_BL

I

!

(50)

Proof

Proof by induction on k.

Base case: k = 0. ˜ L

⁽⁰⁾

= L

⁽⁰⁾

= 1 0 l

₂₁⁽⁰⁾

I

!

meets the

desired criteria since 1 is a trivial unit lower triangular matrix.

(51)

Inductive step: Assume ˜ L

^(k)

meets the indicated criteria.

We will show that then ˜ L

^(k+1)

does too. Let

L ˜

^(k)

=

L ˜

^(k)_{T L}

0 L ˜

^(k)_BL

I

!

= 0 B

@

L ˜

^(k)₀₀

0 0

˜ l

^{(k) T}₁₀

1 0 L ˜

^{(k) T}₂₀

0 I

1 C A

where ˜ L

_{T L}

(and hence ˜ L

₀₀

) are unit lower triangular matrices of dimension (k + 1) × (k + 1). Then

L ˜

^(k+1)

= ˜ L

^(k)

L

^(k+1)

= 0 B

@

L ˜

^(k)₀₀

0 0

˜ l

^{(k) T}₁₀

1 0 L ˜

^{(k) T}₂₀

0 I

1 C A

0 B

@

I

k+1

0 0

0 1 0

0 l

₂₁^(k+1)

I 1 C A

= 0 B

@

L ˜

^(k)₀₀

0 0

˜ l

₁₀^{(k) T}

1 0 L ˜

^{(k) T}

l

^(k+1)

I

1 C A =

0 B B

@

L ˜

^(k)₀₀

0 ˜ l

^{(k) T}₁₀

1 ! 0

“ L ˜

^{(k) T}

l

^(k+1)

” I

1 C C A

=

˜ L

^(k+1)_{T L}

0 L ˜

^(k+1)_BL

I

!

,

(52)

By the Principle of Mathematical Induction the result

holds for L

^(j)

, 0 ≤ j < n − 1.

(53)

Corollary

Under the conditions of the Lemma L = ˜ L

⁽ⁿ⁻¹⁾

is a unit lower triangular matrix the strictly lower triangular part of which is the sum of all the strictly lower triangular parts of L

⁽⁰⁾

, . . . , L

⁽ⁿ⁻¹⁾

:

L = ˜ L

⁽ⁿ⁻¹⁾

=







1 0

l

₂₁⁽⁰⁾

1 0

l

⁽²⁾₂₁

. ..

1 0

l

⁽ⁿ⁻²⁾₂₁

1 



 .

(Note that l

⁽ⁿ⁻¹⁾₂₁

is a vector of length zero, so that the last step of

(n−1)

(54)

Example

A consequence of this corollary is that the fact that





1 0 0 2 1 0 3 0 1









1 0 0

0 1 0

0 1.6 1



 =





1 0 0

2 1 0

3 1.6 1





in a previous Exercise is not a coincidence: For these matrices all

you have to do find the strictly lower triangular parts of the

right-hand side is to move the nonzeroes below the diagonal in the

matrices on the left-hand side to the corresponding elements in the

matrix on the right-hand side of the equality sign.

(55)

Exercise

The order in which the Gauss transforms appear is important. In particular, verify that





1 0 0

0 1 0

0 1.6 1









1 0 0 2 1 0 3 0 1



 6=





1 0 0

2 1 0

3 1.6 1



 .

(56)

Theorem

Let ˆ L

⁽⁰⁾

, . . . , ˆ L

⁽ⁿ⁻¹⁾

be the sequence of Gauss transforms that transforms an n × n matrix A into an upper triangular matrix U : L ˆ