Chapter 1. Vector Spaces. 1.5 Gaussian Elimination

I-Liang Chern

National Taiwan University

Fall, 2021

Outline

1.5 Gaussian elimination

- Elimination as a reduction process
- Geometric interpretation of the Gaussian elimination
- Fundamental theorem of linear algebra

Gaussian elimination

We solve the equation

Ax = b

by Gaussian elimination. The idea is to change the equations into a set of equivalent yet simpler equations. In terms of matrices, it is a sequence of row operations on the row vectors of $A$ (or of the augmented matrix $[A|b]$). A row operation replaces a row $A_i$ by a new row $A_i'$.

There are three kinds of row operations:

(1) scaling: $A_i \leftarrow \alpha A_i$, $\alpha \neq 0$,

(2) swapping: $A_i \leftrightarrow A_j$,

(3) shearing: $A_i' = A_i - \alpha A_j$, $\alpha \neq 0$, $j \neq i$.
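The three row operations can be sketched in a few lines of Python. This is only an illustration: the function names `scale`, `swap` and `shear` are hypothetical, rows are indexed from 0, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

def scale(M, i, alpha):
    """Return a copy of M with row i multiplied by alpha (alpha != 0)."""
    if alpha == 0:
        raise ValueError("scaling factor must be nonzero")
    R = [row[:] for row in M]
    R[i] = [alpha * a for a in R[i]]
    return R

def swap(M, i, j):
    """Return a copy of M with rows i and j exchanged."""
    R = [row[:] for row in M]
    R[i], R[j] = R[j], R[i]
    return R

def shear(M, i, j, alpha):
    """Return a copy of M with row i replaced by row i - alpha * row j (i != j)."""
    if i == j:
        raise ValueError("shearing needs two different rows")
    R = [row[:] for row in M]
    R[i] = [a - alpha * b for a, b in zip(R[i], R[j])]
    return R

# Augmented matrix [A|b] of the system x1 + x2 = 3, 2*x1 + 4*x2 = 10.
M = [[Fraction(1), Fraction(1), Fraction(3)],
     [Fraction(2), Fraction(4), Fraction(10)]]
M = shear(M, 1, 0, 2)             # second row minus 2 * first row
M = scale(M, 1, Fraction(1, 2))   # normalize the second pivot
M = shear(M, 0, 1, 1)             # first row minus second row
```

After these three operations the matrix reads $[1\ 0\ |\ 1;\ 0\ 1\ |\ 2]$, that is, $x_1 = 1$, $x_2 = 2$.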

The Gaussian elimination process is divided into two parts:

- Forward elimination

- Backward substitution

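The resulting matrix after forward elimination is called a matrix in echelon form (see the matrix $U$ below), while the resulting matrix after backward substitution is called a matrix in reduced echelon form (see the matrix $C$ below).

\[
U = \left[\begin{array}{ccccc|c}
\otimes & \times & \times & \times & \times & \times \\
0 & 0 & \otimes & \times & \times & \times \\
0 & 0 & 0 & \otimes & \times & \times \\
0 & 0 & 0 & 0 & \otimes & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right],
\qquad
C = \left[\begin{array}{ccccc|c}
1 & \times & 0 & 0 & 0 & \times \\
0 & 0 & 1 & 0 & 0 & \times \\
0 & 0 & 0 & 1 & 0 & \times \\
0 & 0 & 0 & 0 & 1 & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\]

- Echelon form: each row is either zero or has a nonzero leading entry, called the pivot entry (marked by $\otimes$); the entries below a pivot entry are all zeros.

- Reduced echelon form: each pivot entry is normalized to be 1; all entries above and below a pivot entry are zeros.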

The advantage of the reduced echelon form is that we can easily construct a basis of $R(A^T)$ and a basis of $N(A)$ from it.

In MATLAB, the command [R,p] = rref(A) returns the reduced row echelon matrix R and the indices p of the pivot columns.
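For readers without MATLAB, the same computation can be sketched in plain Python. The function below is a hypothetical stand-in, not MATLAB's implementation: it uses exact fractions, and it returns 0-based pivot column indices, whereas MATLAB counts from 1.

```python
from fractions import Fraction

def rref(A):
    """Return (R, pivots): the reduced row echelon form of A and the
    0-based indices of its pivot columns."""
    R = [[Fraction(a) for a in row] for row in A]
    m, n = len(R), len(R[0])
    pivots = []
    row = 0
    for col in range(n):
        if row == m:
            break
        # Entry of largest magnitude in this column among rows row..m-1.
        p = max(range(row, m), key=lambda k: abs(R[k][col]))
        if R[p][col] == 0:
            continue                                   # free column
        R[row], R[p] = R[p], R[row]                    # swapping
        R[row] = [a / R[row][col] for a in R[row]]     # scaling
        for k in range(m):                             # shearing
            if k != row and R[k][col] != 0:
                f = R[k][col]
                R[k] = [a - f * b for a, b in zip(R[k], R[row])]
        pivots.append(col)
        row += 1
    return R, pivots

A = [[1, 2, 1],
     [2, 4, 0],
     [3, 6, 1]]
R, p = rref(A)
```

For this `A` the result is `R = [[1, 2, 0], [0, 0, 1], [0, 0, 0]]` with pivot columns `p = [0, 2]`, so the second variable is free.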

Examples

(a) Consider the system

\[
\begin{cases}
x_1 - 3x_2 + x_4 = 1 \\
x_3 + 2x_4 = 3
\end{cases}
\]

The variables $x_1$ and $x_3$ are the pivot variables, while $x_2$ and $x_4$ are the free variables. We can express $x_1$ and $x_3$ in terms of $x_2$ and $x_4$ as

\[
x_3 = 3 - 2x_4, \qquad x_1 = 1 + 3x_2 - x_4.
\]

In vector form:

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \end{bmatrix}
=
\begin{bmatrix} 1 \\ 0 \\ 3 \\ 0 \end{bmatrix}
+ x_2 \begin{bmatrix} 3 \\ 1 \\ 0 \\ 0 \end{bmatrix}
+ x_4 \begin{bmatrix} -1 \\ 0 \\ -2 \\ 1 \end{bmatrix}.
\]

The solution $[1\ 0\ 3\ 0]^T$ is called a special solution; it corresponds to the choice $x_2 = x_4 = 0$. The variables $x_2$ and $x_4$ are free parameters.

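(b) This is an example of backward substitution and of reading off the solutions from the reduced echelon form. First we perform row scalings to normalize each pivot entry to be 1:

\[
\left[\begin{array}{ccccc|c}
2 & 4 & 6 & 8 & -6 & 4 \\
0 & 0 & 3 & 9 & 12 & 3 \\
0 & 0 & 0 & 5 & -20 & 5 \\
0 & 0 & 0 & 0 & 3 & 6
\end{array}\right]
\to
\left[\begin{array}{ccccc|c}
1 & 2 & 3 & 4 & -3 & 2 \\
0 & 0 & 1 & 3 & 4 & 1 \\
0 & 0 & 0 & 1 & -4 & 1 \\
0 & 0 & 0 & 0 & 1 & 2
\end{array}\right]
\]

Next we perform row operations to make all entries above each pivot entry zero, working from the last pivot column to the first:

\[
\to
\left[\begin{array}{ccccc|c}
1 & 2 & 3 & 4 & 0 & 8 \\
0 & 0 & 1 & 3 & 0 & -7 \\
0 & 0 & 0 & 1 & 0 & 9 \\
0 & 0 & 0 & 0 & 1 & 2
\end{array}\right]
\to
\left[\begin{array}{ccccc|c}
1 & 2 & 3 & 0 & 0 & -28 \\
0 & 0 & 1 & 0 & 0 & -34 \\
0 & 0 & 0 & 1 & 0 & 9 \\
0 & 0 & 0 & 0 & 1 & 2
\end{array}\right]
\to
\left[\begin{array}{ccccc|c}
1 & 2 & 0 & 0 & 0 & 74 \\
0 & 0 & 1 & 0 & 0 & -34 \\
0 & 0 & 0 & 1 & 0 & 9 \\
0 & 0 & 0 & 0 & 1 & 2
\end{array}\right]
\]

A linear system in such reduced echelon form can be solved easily. In this example, the solution is

\[
x_1 + 2x_2 = 74, \quad x_3 = -34, \quad x_4 = 9, \quad x_5 = 2.
\]

Here, $x_2$ is a free variable. In vector form, the solution reads

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5 \end{bmatrix}
=
\begin{bmatrix} 74 \\ 0 \\ -34 \\ 9 \\ 2 \end{bmatrix}
+ x_2 \begin{bmatrix} -2 \\ 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}.
\]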

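(c) This is an example of forward elimination. Consider the system

\[
\begin{cases}
x_1 + x_2 = b_1 \\
-x_1 = b_2 \\
2x_1 + x_2 = b_3 \\
2x_1 + 3x_2 = b_4
\end{cases}
\]

The Gaussian elimination for the augmented matrix is shown below:

\[
\left[\begin{array}{cc|c}
1 & 1 & b_1 \\
-1 & 0 & b_2 \\
2 & 1 & b_3 \\
2 & 3 & b_4
\end{array}\right]
\to
\left[\begin{array}{cc|c}
1 & 1 & b_1 \\
0 & 1 & b_1 + b_2 \\
0 & -1 & -2b_1 + b_3 \\
0 & 1 & -2b_1 + b_4
\end{array}\right]
\to
\left[\begin{array}{cc|c}
1 & 1 & b_1 \\
0 & 1 & b_1 + b_2 \\
0 & 0 & -b_1 + b_2 + b_3 \\
0 & 0 & -3b_1 - b_2 + b_4
\end{array}\right]
\]

This gives the constraints on $b$ that guarantee the existence of a solution:

\[
0 = -b_1 + b_2 + b_3, \qquad 0 = -3b_1 - b_2 + b_4.
\]

When these constraints hold, the solution is given by

\[
x_2 = b_1 + b_2, \qquad x_1 = b_1 - x_2 = -b_2.
\]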
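Example (c) can be checked numerically with a short sketch. The function name `solve_example_c` is hypothetical; it simply encodes the two consistency constraints and the first two rows of the echelon form.

```python
from fractions import Fraction

def solve_example_c(b):
    """Solve x1 + x2 = b1, -x1 = b2, 2*x1 + x2 = b3, 2*x1 + 3*x2 = b4.
    Return (x1, x2), or None when b violates a consistency constraint."""
    b1, b2, b3, b4 = (Fraction(v) for v in b)
    # The two zero rows of the echelon form must have zero right-hand side.
    if -b1 + b2 + b3 != 0 or -3 * b1 - b2 + b4 != 0:
        return None
    x2 = b1 + b2        # second row of the echelon form
    x1 = b1 - x2        # first row: x1 + x2 = b1
    return x1, x2

consistent = solve_example_c([3, -1, 4, 8])     # satisfies both constraints
inconsistent = solve_example_c([3, -1, 4, 0])   # violates the second constraint
```

With $b = (3, -1, 4, 8)$ the sketch returns $x_1 = 1$, $x_2 = 2$, which satisfies all four original equations; with $b = (3, -1, 4, 0)$ it returns `None`.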

[Figure: echelon form and reduced echelon form]

Forward elimination

1. The forward elimination is performed from row 1 to row $m$.

2. Let us start from row 1. First, we search for the entry of largest magnitude in the first column $\{a_{k1} \mid k = 1, \dots, m\}$, say $a_{p1}$. That is,

\[
|a_{p1}| = \max\{|a_{k1}| \mid k = 1, \dots, m\}.
\]

We are only interested in finding the index $p$. Let us introduce the following notation for this index:

\[
p := \arg\max\{|a_{k1}| \mid k = 1, \dots, m\}.
\]

Then we swap the 1st equation and the $p$th equation. This swap does not affect the solution at all. Let us still call the resulting matrix $(a_{ij})$.

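3. If $a_{11} \neq 0$, then we perform shearing row operations to eliminate all $a_{k1}$ for $k = 2, \dots, m$. For the second row,

\[
-\frac{a_{21}}{a_{11}}\,(a_{11}x_1 + a_{12}x_2 + \cdots + a_{1n}x_n = b_1) + (a_{21}x_1 + a_{22}x_2 + \cdots + a_{2n}x_n = b_2)
\;\Longrightarrow\;
0 + a'_{22}x_2 + \cdots + a'_{2n}x_n = b'_2,
\]

where

\[
a'_{22} = a_{22} - \frac{a_{21}}{a_{11}}a_{12}, \quad \cdots, \quad a'_{2n} = a_{2n} - \frac{a_{21}}{a_{11}}a_{1n}, \quad b'_2 = b_2 - \frac{a_{21}}{a_{11}}b_1.
\]

Let us denote this procedure by

\[
-\frac{a_{21}}{a_{11}} \times (1) + (2) \to (2').
\]

In terms of the augmented matrix, it looks like

\[
\left[\begin{array}{cccc|c}
a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\
a_{21} & a_{22} & \cdots & a_{2n} & b_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn} & b_m
\end{array}\right]
\to
\left[\begin{array}{cccc|c}
a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\
0 & a'_{22} & \cdots & a'_{2n} & b'_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
a_{m1} & a_{m2} & \cdots & a_{mn} & b_m
\end{array}\right]
\]

We can repeat the above procedure for the third row, and so on, up to the $m$th row:

\[
-\frac{a_{31}}{a_{11}} \times (1) + (3) \to (3'), \quad \cdots, \quad -\frac{a_{m1}}{a_{11}} \times (1) + (m) \to (m').
\]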

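Eventually, we arrive at

\[
\left[\begin{array}{cccc|c}
a_{11} & a_{12} & \cdots & a_{1n} & b_1 \\
0 & a'_{22} & \cdots & a'_{2n} & b'_2 \\
0 & a'_{32} & \cdots & a'_{3n} & b'_3 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
0 & a'_{m2} & \cdots & a'_{mn} & b'_m
\end{array}\right]
\]

4. If $a_{11} = 0$, then $a_{i1} = 0$ for all $i = 1, \dots, m$, because after the swap $a_{11}$ has the largest magnitude in the first column. The matrix looks like

\[
\left[\begin{array}{cccc|c}
0 & a_{12} & \cdots & a_{1n} & b_1 \\
0 & a_{22} & \cdots & a_{2n} & b_2 \\
\vdots & \vdots & \ddots & \vdots & \vdots \\
0 & a_{m2} & \cdots & a_{mn} & b_m
\end{array}\right].
\]

In this case, we go to the next entry of this row, that is, $a_{12}$. We repeat the above procedure to eliminate all entries below $a_{12}$, and so on. This finishes the procedure for the first row.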

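5. We continue the above elimination process for row 2, row 3, and so on, until there are no more entries to be eliminated. The resulting matrix looks like:

\[
\left[\begin{array}{ccccc|c}
\otimes & \times & \times & \times & \times & \times \\
0 & 0 & \otimes & \times & \times & \times \\
0 & 0 & 0 & \otimes & \times & \times \\
0 & 0 & 0 & 0 & \otimes & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\]

Such a matrix is said to be in echelon form (staircase). Suppose there are $r$ nonzero row vectors. We will see later that this is exactly the dimension of the subspace $\mathrm{Span}(A_1, \dots, A_m)$. We call $r$ the row rank of $A$.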

6. For each nonzero row, there is a nonzero leading entry (circled in the above figure). This leading entry is called the pivot of that row. Let us denote the column index of the pivot of the $i$th row by $j_p(i)$. The echelon form has the following properties:

(i) $j_p(i + 1) > j_p(i)$;

(ii) all entries below the pivot $a_{i,j_p(i)}$ are zeros;

(iii) rows with all zeros are at the bottom of the matrix.

The variable $x_{j_p(i)}$ is called a pivot variable; every other variable is called a free variable.
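The forward elimination steps above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name `forward_eliminate` is hypothetical, indices are 0-based, exact fractions replace floating point, and the input is a coefficient matrix (for an augmented matrix one would restrict the pivot search to the coefficient columns).

```python
from fractions import Fraction

def forward_eliminate(M):
    """Reduce M to echelon form using swaps and shears only.
    Returns (U, pivots), where pivots[i] is the 0-based pivot column of row i."""
    U = [[Fraction(a) for a in row] for row in M]
    m, n = len(U), len(U[0])
    pivots = []
    i = 0                                    # current row
    for j in range(n):                       # current column
        if i == m:
            break
        # p := argmax |u_kj| over the rows k = i, ..., m-1
        p = max(range(i, m), key=lambda k: abs(U[k][j]))
        if U[p][j] == 0:
            continue                         # column is zero below row i: move right
        U[i], U[p] = U[p], U[i]              # swap rows i and p
        for k in range(i + 1, m):            # shear away the entries below the pivot
            f = U[k][j] / U[i][j]
            U[k] = [a - f * b for a, b in zip(U[k], U[i])]
        pivots.append(j)
        i += 1
    return U, pivots

A = [[1, 2, 1],
     [2, 4, 0],
     [3, 6, 1]]
U, pivots = forward_eliminate(A)
r = len(pivots)                              # number of nonzero rows
```

For this `A` the pivot columns are `[0, 2]`, so $r = 2$, the first and third variables are pivot variables, and the second is free.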








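Backward substitution

1. We perform backward substitution on the above echelon matrix from row $r$ to row 1. The substitution uses the pivot coefficient $a_{i,j_p(i)}$ to eliminate all entries above it (i.e. $a_{k,j_p(i)}$, $k = i - 1, \dots, 1$).

\[
\left[\begin{array}{ccccc|c}
\otimes & \times & \times & \times & \times & \times \\
0 & 0 & \otimes & \times & \times & \times \\
0 & 0 & 0 & \otimes & \times & \times \\
0 & 0 & 0 & 0 & \otimes & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\to
\left[\begin{array}{ccccc|c}
\otimes & \times & \times & \times & 0 & \times \\
0 & 0 & \otimes & \times & 0 & \times \\
0 & 0 & 0 & \otimes & 0 & \times \\
0 & 0 & 0 & 0 & \otimes & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\to
\left[\begin{array}{ccccc|c}
\otimes & \times & 0 & 0 & 0 & \times \\
0 & 0 & \otimes & 0 & 0 & \times \\
0 & 0 & 0 & \otimes & 0 & \times \\
0 & 0 & 0 & 0 & \otimes & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\]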

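2. For each nonzero row $i$, $i = r, \dots, 1$, we divide the row by $a_{i,j_p(i)}$ so that all pivot coefficients become $a_{i,j_p(i)} = 1$. The resulting matrix has the form

\[
\left[\begin{array}{ccccc|c}
1 & \times & 0 & 0 & 0 & \times \\
0 & 0 & 1 & 0 & 0 & \times \\
0 & 0 & 0 & 1 & 0 & \times \\
0 & 0 & 0 & 0 & 1 & \times \\
0 & 0 & 0 & 0 & 0 & \times
\end{array}\right]
\]

Such a matrix is said to be in reduced echelon form. Let us denote it by

\[
\begin{bmatrix} C & d \end{bmatrix}
=
\left[\begin{array}{c|c}
-\ C_1^T\ - & d_1 \\
\vdots & \vdots \\
-\ C_r^T\ - & d_r \\
0 & d'
\end{array}\right]_{m \times (n+1)}
\]

Thus, the system $Ax = b$ is changed to an equivalent system whose augmented matrix is $[C\ d]$.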
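A minimal sketch of the backward substitution follows. The function name `back_substitute` is hypothetical. Two assumptions are worth stating: the sketch normalizes each pivot right before clearing the entries above it (a slightly different order from the two steps above, with the same result), and it expects a consistent augmented matrix, that is, one without a row of the form $0 \cdots 0 \mid d'$ with $d' \neq 0$.

```python
from fractions import Fraction

def back_substitute(U):
    """Turn an echelon-form matrix into reduced echelon form."""
    C = [[Fraction(a) for a in row] for row in U]
    for i in reversed(range(len(C))):
        # Locate the pivot (first nonzero entry) of row i, if any.
        j = next((c for c, a in enumerate(C[i]) if a != 0), None)
        if j is None:
            continue                               # zero row
        C[i] = [a / C[i][j] for a in C[i]]         # normalize the pivot to 1
        for k in range(i):                         # clear the entries above it
            f = C[k][j]
            C[k] = [a - f * b for a, b in zip(C[k], C[i])]
    return C

# Augmented echelon matrix of example (b); the last column is the right-hand side.
U = [[2, 4, 6, 8, -6, 4],
     [0, 0, 3, 9, 12, 3],
     [0, 0, 0, 5, -20, 5],
     [0, 0, 0, 0, 3, 6]]
C = back_substitute(U)
```

For this matrix the rows of `C` read $x_1 + 2x_2 = 74$, $x_3 = -34$, $x_4 = 9$, $x_5 = 2$.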


Gaussian elimination as an LU decomposition

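1. A matrix $L$ is called a lower triangular matrix if $\ell_{ij} = 0$ for all $i < j$.

2. A matrix $U$ is called an upper triangular matrix if $u_{ij} = 0$ for all $i > j$.

3. A shearing row operation corresponds to a transformation $A \mapsto \tilde{L}A$, where $\tilde{L}$ is a lower triangular matrix:

\[
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\tilde{\ell}_{21} & 1 & 0 & \cdots & 0 \\
0 & 0 & 1 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
0 & 0 & 0 & \cdots & 1
\end{bmatrix}
\begin{bmatrix}
a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\
a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\
a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & & & \ddots & \vdots \\
a_{m1} & a_{m2} & a_{m3} & \cdots & a_{mn}
\end{bmatrix}
=
\begin{bmatrix}
a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\
a_{21} + \tilde{\ell}_{21}a_{11} & a_{22} + \tilde{\ell}_{21}a_{12} & a_{23} + \tilde{\ell}_{21}a_{13} & \cdots & a_{2n} + \tilde{\ell}_{21}a_{1n} \\
a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & & & \ddots & \vdots \\
a_{m1} & a_{m2} & a_{m3} & \cdots & a_{mn}
\end{bmatrix}
\]

In terms of row vectors, it is

\[
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\tilde{\ell}_{21} & 1 & 0 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
0 & 0 & 0 & \cdots & 1
\end{bmatrix}
\begin{bmatrix}
-\ A_1^T\ - \\
-\ A_2^T\ - \\
\vdots \\
-\ A_m^T\ -
\end{bmatrix}
=
\begin{bmatrix}
-\ A_1^T\ - \\
-\ A_2^T + \tilde{\ell}_{21}A_1^T\ - \\
\vdots \\
-\ A_m^T\ -
\end{bmatrix}
\]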

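4. If we ignore the swapping, then the forward step of the Gaussian elimination transforms $A$ into an upper triangular matrix $U$ by a lower triangular matrix $\tilde{L}$:

\[
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\tilde{\ell}_{21} & 1 & 0 & \cdots & 0 \\
\tilde{\ell}_{31} & \tilde{\ell}_{32} & 1 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
\tilde{\ell}_{m1} & \tilde{\ell}_{m2} & \tilde{\ell}_{m3} & \cdots & 1
\end{bmatrix}
\begin{bmatrix}
a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\
a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\
a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & & & & \vdots \\
a_{m1} & a_{m2} & a_{m3} & \cdots & a_{mn}
\end{bmatrix}
=
\begin{bmatrix}
u_{11} & u_{12} & u_{13} & \cdots & u_{1n} \\
0 & u_{22} & u_{23} & \cdots & u_{2n} \\
0 & 0 & u_{33} & \cdots & u_{3n} \\
\vdots & & & & \vdots \\
0 & 0 & 0 & \cdots & u_{mn}
\end{bmatrix}
\]

This can be rewritten as

\[
\begin{bmatrix}
a_{11} & a_{12} & a_{13} & \cdots & a_{1n} \\
a_{21} & a_{22} & a_{23} & \cdots & a_{2n} \\
a_{31} & a_{32} & a_{33} & \cdots & a_{3n} \\
\vdots & & & & \vdots
\end{bmatrix}
=
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\ell_{21} & 1 & 0 & \cdots & 0 \\
\ell_{31} & \ell_{32} & 1 & \cdots & 0 \\
\vdots & & & \ddots &
\end{bmatrix}
\begin{bmatrix}
u_{11} & u_{12} & u_{13} & \cdots & u_{1n} \\
0 & u_{22} & u_{23} & \cdots & u_{2n} \\
0 & 0 & u_{33} & \cdots & u_{3n} \\
\vdots & & & &
\end{bmatrix}
\]

where

\[
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\ell_{21} & 1 & 0 & \cdots & 0 \\
\ell_{31} & \ell_{32} & 1 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
\ell_{m1} & \ell_{m2} & \ell_{m3} & \cdots & 1
\end{bmatrix}_{m \times m}
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
\tilde{\ell}_{21} & 1 & 0 & \cdots & 0 \\
\tilde{\ell}_{31} & \tilde{\ell}_{32} & 1 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
\tilde{\ell}_{m1} & \tilde{\ell}_{m2} & \tilde{\ell}_{m3} & \cdots & 1
\end{bmatrix}_{m \times m}
=
\begin{bmatrix}
1 & 0 & 0 & \cdots & 0 \\
0 & 1 & 0 & \cdots & 0 \\
0 & 0 & 1 & \cdots & 0 \\
\vdots & & & \ddots & 0 \\
0 & 0 & 0 & \cdots & 1
\end{bmatrix}_{m \times m}
\]

The decomposition

\[
A = LU \tag{2}
\]

is called the LU decomposition of a matrix. We can obtain $L$ from $\tilde{L}$ by a recursion formula.

5. If we include swapping, then there exists a permutation matrix $P$ such that

\[
PA = LU.
\]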
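The factorization $PA = LU$ can be sketched in Python as follows. This is an illustration under a simplifying assumption that the slides do not make: $A$ is square and invertible, so every column has a pivot. The names `lu_decompose` and `matmul` are hypothetical helpers, and the multipliers $\ell_{kj}$ are stored directly in $L$ as the elimination proceeds.

```python
from fractions import Fraction

def lu_decompose(A):
    """Return (P, L, U) with P A = L U for a square invertible matrix A.
    P is a permutation matrix, L is unit lower triangular, U is upper triangular."""
    n = len(A)
    U = [[Fraction(a) for a in row] for row in A]
    L = [[Fraction(int(i == j)) for j in range(n)] for i in range(n)]
    perm = list(range(n))                        # row order after the swaps
    for j in range(n):
        p = max(range(j, n), key=lambda k: abs(U[k][j]))
        if U[p][j] == 0:
            raise ValueError("matrix is singular")
        if p != j:
            U[j], U[p] = U[p], U[j]
            perm[j], perm[p] = perm[p], perm[j]
            # Swap the multipliers already stored in columns 0..j-1 of L.
            L[j][:j], L[p][:j] = L[p][:j], L[j][:j]
        for k in range(j + 1, n):
            L[k][j] = U[k][j] / U[j][j]          # multiplier l_kj
            U[k] = [a - L[k][j] * b for a, b in zip(U[k], U[j])]
    P = [[Fraction(int(perm[i] == j)) for j in range(n)] for i in range(n)]
    return P, L, U

def matmul(X, Y):
    """Plain matrix product of two lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*Y)] for row in X]

A = [[2, 1, 1],
     [4, 3, 3],
     [8, 7, 9]]
P, L, U = lu_decompose(A)
```

One can check the result by comparing `matmul(P, A)` with `matmul(L, U)`; both equal the row-permuted matrix with rows `[8, 7, 9]`, `[2, 1, 1]`, `[4, 3, 3]`.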

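Solving a linear system in a reduced echelon form

1. Recall the augmented matrix in reduced echelon form

\[
\begin{bmatrix} C & d \end{bmatrix}
=
\left[\begin{array}{c|c}
-\ C_1^T\ - & d_1 \\
\vdots & \vdots \\
-\ C_r^T\ - & d_r \\
0 & d'
\end{array}\right]_{m \times (n+1)}
\tag{3}
\]

2. The column indices $\{1, \dots, n\}$ are classified into pivot indices $P = \{j_p(1), \dots, j_p(r)\}$ and free indices $F = \{1, \dots, n\} \setminus P$. Let us rearrange the order of $\{x_1, \dots, x_n\}$ such that

\[
x_P = \begin{bmatrix} x_{j_p(1)} \\ x_{j_p(2)} \\ \vdots \\ x_{j_p(r)} \end{bmatrix} \in \mathbb{R}^r,
\qquad
x_F = \begin{bmatrix} x_{j_1} \\ x_{j_2} \\ \vdots \\ x_{j_{n-r}} \end{bmatrix} \in \mathbb{R}^{n-r},
\qquad j_k \in F, \quad j_1 < \cdots < j_{n-r}.
\]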

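In this order, all pivot columns are put at the front and the free-variable columns are moved to the rear. The reduced echelon form looks like

\[
\left[\begin{array}{c|c}
-\ C_1^T\ - & d_1 \\
-\ C_2^T\ - & d_2 \\
\vdots & \vdots \\
-\ C_r^T\ - & d_r
\end{array}\right]
=
\left[\begin{array}{ccccccc|c}
1 & 0 & \cdots & 0 & c_{1,j_1} & \cdots & c_{1,j_{n-r}} & d_1 \\
0 & 1 & \cdots & 0 & c_{2,j_1} & \cdots & c_{2,j_{n-r}} & d_2 \\
\vdots & & \ddots & \vdots & \vdots & & \vdots & \vdots \\
0 & 0 & \cdots & 1 & c_{r,j_1} & \cdots & c_{r,j_{n-r}} & d_r
\end{array}\right]
\]

The equations read

\[
x_{j_p(i)} + \sum_{j \in F} c_{i,j}x_j = d_i, \qquad i = 1, \dots, r.
\]

Thus,

\[
x_{j_p(i)} = d_i - \sum_{j \in F} c_{i,j}x_j, \qquad i = 1, \dots, r.
\]

The solution has the explicit form

\[
\begin{bmatrix} x_P \\ x_F \end{bmatrix}
=
\begin{bmatrix} d_P \\ 0 \end{bmatrix}
+ \sum_{j \in F} x_j \begin{bmatrix} -c_j \\ \delta_j \end{bmatrix}
\]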

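Here,

\[
d_P = \begin{bmatrix} d_1 \\ \vdots \\ d_r \end{bmatrix},
\qquad
c_j = \begin{bmatrix} c_{1,j} \\ \vdots \\ c_{r,j} \end{bmatrix},
\qquad
\delta_j = \begin{bmatrix} \delta_{j_1,j} \\ \vdots \\ \delta_{j_{n-r},j} \end{bmatrix},
\qquad j \in F.
\tag{4}
\]

The notation $\delta_{i,j}$ is called the Kronecker delta. It is defined as

\[
\delta_{i,j} =
\begin{cases}
1 & \text{if } i = j \\
0 & \text{if } i \neq j.
\end{cases}
\]

We rewrite the solution as

\[
x = x_p + \sum_{j \in F} x_j v_j, \qquad
x_p := \begin{bmatrix} d_P \\ 0 \end{bmatrix}, \qquad
v_j := \begin{bmatrix} -c_j \\ \delta_j \end{bmatrix}.
\tag{5}
\]

The list $\{v_j\}_{j \in F}$ is independent. Indeed, if there are coefficients $\{a_j \mid j \in F\}$ such that

\[
\sum_{j \in F} a_j v_j = 0,
\]

then the free-variable components give

\[
\sum_{j \in F} a_j \delta_j = 0.
\]

Since the vectors $\delta_j$, $j \in F$, are the standard unit vectors of $\mathbb{R}^{n-r}$, this forces $a_j = 0$ for all $j \in F$.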

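Example. Consider

\[
A = \begin{bmatrix}
1 & 1 & 2 & -1 & 0 & 1 \\
0 & 1 & 1 & 0 & 1 & 1 \\
0 & 0 & 0 & 1 & -1 & 1 \\
0 & 0 & 0 & 0 & 1 & 0
\end{bmatrix}
\]

This is a matrix in echelon form. The pivot and free indices are

\[
P = \{1, 2, 4, 5\}, \qquad F = \{3, 6\}.
\]

The reduced echelon matrix is

\[
C = \begin{bmatrix}
1 & 0 & 1 & 0 & 0 & 1 \\
0 & 1 & 1 & 0 & 0 & 1 \\
0 & 0 & 0 & 1 & 0 & 1 \\
0 & 0 & 0 & 0 & 1 & 0
\end{bmatrix}
\]

The homogeneous system $Cx = 0$ is

\[
\begin{cases}
x_1 + x_3 + x_6 = 0 \\
x_2 + x_3 + x_6 = 0 \\
x_4 + x_6 = 0 \\
x_5 = 0
\end{cases}
\]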

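This gives

\[
x_1 = -x_3 - x_6, \quad
x_2 = -x_3 - x_6, \quad
x_3 = x_3, \quad
x_4 = -x_6, \quad
x_5 = 0, \quad
x_6 = x_6.
\]

Or

\[
\begin{bmatrix} x_1 \\ x_2 \\ x_3 \\ x_4 \\ x_5 \\ x_6 \end{bmatrix}
= x_3 \begin{bmatrix} -1 \\ -1 \\ 1 \\ 0 \\ 0 \\ 0 \end{bmatrix}
+ x_6 \begin{bmatrix} -1 \\ -1 \\ 0 \\ -1 \\ 0 \\ 1 \end{bmatrix}
= x_3 v_1 + x_6 v_2.
\]

You can check that

\[
Av_i = 0, \quad Cv_i = 0, \quad i = 1, 2.
\]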
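The check can also be done mechanically. The sketch below builds one null-space vector per free column of a reduced echelon matrix; the helper names `null_space_basis` and `matvec` are hypothetical, column indices are 0-based, and the vectors are returned in the original variable order rather than in the rearranged order $[x_P; x_F]$.

```python
from fractions import Fraction

def null_space_basis(C, pivots):
    """Given a reduced echelon matrix C and its 0-based pivot columns,
    return one basis vector v_j of N(C) per free column j."""
    n = len(C[0])
    free = [j for j in range(n) if j not in pivots]
    basis = []
    for j in free:
        v = [Fraction(0)] * n
        v[j] = Fraction(1)                 # delta_j part: x_j = 1, other free variables 0
        for i, pj in enumerate(pivots):
            v[pj] = -Fraction(C[i][j])     # -c_j part: pivot variable of row i is -c_{i,j}
        basis.append(v)
    return basis

def matvec(M, x):
    """Matrix-vector product for a list of rows M."""
    return [sum(a * b for a, b in zip(row, x)) for row in M]

A = [[1, 1, 2, -1, 0, 1],
     [0, 1, 1, 0, 1, 1],
     [0, 0, 0, 1, -1, 1],
     [0, 0, 0, 0, 1, 0]]
C = [[1, 0, 1, 0, 0, 1],
     [0, 1, 1, 0, 0, 1],
     [0, 0, 0, 1, 0, 1],
     [0, 0, 0, 0, 1, 0]]
v1, v2 = null_space_basis(C, pivots=[0, 1, 3, 4])
```

Here `v1` and `v2` come out as $(-1, -1, 1, 0, 0, 0)$ and $(-1, -1, 0, -1, 0, 1)$, and `matvec(A, v1)`, `matvec(A, v2)`, `matvec(C, v1)`, `matvec(C, v2)` are all zero vectors.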

Geometric interpretation of the Gaussian elimination

1. The list of vectors $\{C_1, \dots, C_r\}$ constitutes a basis for $R(A^T)$.

Proof. The row vector operations (scaling, swapping, and shearing) transform

\[
A = \begin{bmatrix}
-\ A_1^T\ - \\
-\ A_2^T\ - \\
\vdots \\
-\ A_m^T\ -
\end{bmatrix}
\to
C := \begin{bmatrix}
-\ C_1^T\ - \\
\vdots \\
-\ C_r^T\ - \\
0
\end{bmatrix}.
\]

These row operations stay inside the row space $R(A^T) = \mathrm{Span}(A_1, \dots, A_m)$. By the lemma below, we get

\[
\mathrm{Span}(C_1, \dots, C_r) = \mathrm{Span}(A_1, \dots, A_m) = R(A^T).
\]

Lemma

Let $A_1, A_2 \in V$. Suppose $A_2' = a_2A_2 + a_1A_1$ with $a_2 \neq 0$. Then $\mathrm{Span}(A_1, A_2') = \mathrm{Span}(A_1, A_2)$.

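After rearranging the columns so that the pivot columns come first, the matrix $C$ has the form

\[
\begin{bmatrix}
-\ C_1^T\ - \\
-\ C_2^T\ - \\
\vdots \\
-\ C_r^T\ - \\
0 \\
\vdots \\
0
\end{bmatrix}_{m \times n}
=
\begin{bmatrix}
1 & 0 & \cdots & 0 & c_{1,j_1} & \cdots & c_{1,j_{n-r}} \\
0 & 1 & \cdots & 0 & c_{2,j_1} & \cdots & c_{2,j_{n-r}} \\
\vdots & & \ddots & \vdots & \vdots & & \vdots \\
0 & 0 & \cdots & 1 & c_{r,j_1} & \cdots & c_{r,j_{n-r}} \\
0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\
\vdots & & & \vdots & \vdots & & \vdots \\
0 & 0 & \cdots & 0 & 0 & \cdots & 0
\end{bmatrix}_{m \times n}
\tag{6}
\]

From this expression, it is easy to read off that $\{C_1, \dots, C_r\}$ is independent. We conclude that the Gaussian elimination provides an algorithm to construct a special basis $\{C_1, \dots, C_r\}$ for the subspace $\mathrm{Span}(A_1, \dots, A_m)$.

Row rank of $A$. The dimension $r := \dim R(A^T)$ is independent of the Gaussian elimination process. Any sequence of row operations gives the same number $r$. This number is called the row rank of $A$.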

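2. The list of vectors $\{v_j\}_{j \in F}$ constitutes a basis for $N(A)$, where

\[
v_j := \begin{bmatrix} -c_j \\ \delta_j \end{bmatrix}, \qquad
c_j = \begin{bmatrix} c_{1,j} \\ \vdots \\ c_{r,j} \end{bmatrix}, \qquad
\delta_j = \begin{bmatrix} \delta_{j_1,j} \\ \vdots \\ \delta_{j_{n-r},j} \end{bmatrix}, \qquad j \in F.
\]

Proof. The kernel is

\[
N(A) = \{x \in V \mid x \perp \mathrm{Span}(A_1, \dots, A_m)\} = \{x \in V \mid x \perp \mathrm{Span}(C_1, \dots, C_r)\}.
\]

We have seen that the general solution of $Ax = b$ has the expression (5):

\[
x = x_p + \sum_{j \in F} x_j v_j.
\]

When $b = 0$, we have $x_p = 0$, and we obtain

\[
N(A) = \mathrm{Span}\{v_j \mid j \in F\}.
\]

We have seen that $\{v_j\}_{j \in F}$ is independent. Thus, $\{v_j\}_{j \in F}$ is a basis for $N(A)$.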

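3. The vectors satisfy $C_i \perp v_j$ for $i \in P$ and $j \in F$.

We have seen this from the above expression for $v \in N(A)$. Alternatively, we can check this orthogonality directly. Suppose $j = j_1 \in F$. Then

\[
\begin{bmatrix}
-\ C_1^T\ - \\
-\ C_2^T\ - \\
\vdots \\
-\ C_r^T\ - \\
0 \\
\vdots \\
0
\end{bmatrix}
\begin{bmatrix}
-c_{1,j_1} \\
-c_{2,j_1} \\
\vdots \\
-c_{r,j_1} \\
1 \\
\vdots \\
0
\end{bmatrix}
=
\begin{bmatrix}
1 & 0 & \cdots & 0 & c_{1,j_1} & \cdots & c_{1,j_{n-r}} \\
0 & 1 & \cdots & 0 & c_{2,j_1} & \cdots & c_{2,j_{n-r}} \\
\vdots & & \ddots & \vdots & \vdots & & \vdots \\
0 & 0 & \cdots & 1 & c_{r,j_1} & \cdots & c_{r,j_{n-r}} \\
0 & 0 & \cdots & 0 & 0 & \cdots & 0 \\
\vdots & & & \vdots & \vdots & & \vdots \\
0 & 0 & \cdots & 0 & 0 & \cdots & 0
\end{bmatrix}
\begin{bmatrix}
-c_{1,j_1} \\
-c_{2,j_1} \\
\vdots \\
-c_{r,j_1} \\
1 \\
\vdots \\
0
\end{bmatrix}
= 0.
\]

This shows

\[
C_i^T v_{j_1} = 0 \quad \text{for all } i \in P.
\]

The proof of $C_i^T v_j = 0$ for the other $j \in F$ is similar.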

4. The set $\{C_i \mid i \in P\} \cup \{v_j \mid j \in F\}$ constitutes a basis in $V$ and

\[
V = N(A) \oplus R(A^T).
\]

Proof.

(a) We show $N(A) \cap R(A^T) = \{0\}$. Suppose $v \in N(A) \cap R(A^T)$. From $N(A) = R(A^T)^\perp$, we get $v \perp v$. This implies $v = 0$.

(b) We show $N(A) + R(A^T) = V$. By (a), $\dim(N(A) + R(A^T)) = \dim N(A) + \dim R(A^T)$. From $N(A) + R(A^T) \subset V$ and

\[
\dim V = |P| + |F| = \dim R(A^T) + \dim N(A),
\]

we get, by Proposition 1.1,

\[
V = N(A) + R(A^T).
\]

Proposition

Let $U \subset V$ be a subspace. If $\dim U = \dim V$, then $U = V$.

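5. $N(A)^\perp = R(A^T)$.

Proof.

(a) First we show $R(A^T) \subset N(A)^\perp$. Suppose $v \in R(A^T)$. This means that there is a $w \in W$ such that $v = A^Tw$. For any $u \in N(A)$, we have

\[
v \cdot u = (A^Tw) \cdot u = (A^Tw)^Tu = w^TAu = w \cdot (Au) = 0.
\]

Thus, $v \in N(A)^\perp$.

(b) Next, we show $N(A)^\perp \subset R(A^T)$. Suppose $v \in N(A)^\perp \subset V$. From $V = R(A^T) \oplus N(A)$, we can expand $v$ as

\[
v = \sum_{i \in P} \alpha_i C_i + \sum_{j \in F} \beta_j v_j.
\]

Since $v \in N(A)^\perp$, we have

\[
v \cdot v_k = 0 \quad \text{for all } k \in F.
\]

Because $C_i \perp v_k$, this says that $w := \sum_{j \in F} \beta_j v_j$ satisfies $w \cdot v_k = 0$ for all $k \in F$, hence $w \cdot w = 0$ and $w = 0$. Since $\{v_j\}_{j \in F}$ is independent, this leads to $\beta_k = 0$ for all $k \in F$. Thus,

\[
v = \sum_{i \in P} \alpha_i C_i \in R(A^T).
\]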

Fundamental theorem of linear algebra

Theorem (Fundamental theorem of linear algebra)

Let $A$ be an $m \times n$ matrix. Then the four fundamental subspaces $R(A)$, $R(A^T)$, $N(A)$ and $N(A^T)$ have the properties:

(1) The domain $V$ has the orthogonal decomposition

\[
V = R(A^T) \oplus N(A), \quad R(A^T) = N(A)^\perp, \quad N(A) = R(A^T)^\perp. \tag{7}
\]

(2) The range $W$ has the orthogonal decomposition:

\[
W = R(A) \oplus N(A^T), \quad R(A) = N(A^T)^\perp, \quad N(A^T) = R(A)^\perp. \tag{8}
\]

(3) Row rank of $A$ = column rank of $A$:

\[
\dim R(A^T) = \dim R(A). \tag{9}
\]

(4) The linear map $x \mapsto Ax$ is 1-1 and onto from $R(A^T)$ to $R(A)$.
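Property (3) can be illustrated numerically. The sketch below counts pivots by forward elimination in exact arithmetic and compares the result for $A$ and $A^T$; the helper names `rank` and `transpose` are hypothetical, and a single example is of course not a proof.

```python
from fractions import Fraction

def rank(A):
    """Number of pivots found by forward elimination (exact arithmetic)."""
    M = [[Fraction(a) for a in row] for row in A]
    m, n = len(M), len(M[0])
    r = 0
    for j in range(n):
        # First row at or below row r with a nonzero entry in column j.
        p = next((k for k in range(r, m) if M[k][j] != 0), None)
        if p is None:
            continue
        M[r], M[p] = M[p], M[r]
        for k in range(r + 1, m):
            f = M[k][j] / M[r][j]
            M[k] = [a - f * b for a, b in zip(M[k], M[r])]
        r += 1
    return r

def transpose(A):
    return [list(col) for col in zip(*A)]

A = [[1, 2, 3, 4],
     [2, 4, 6, 8],
     [1, 0, 1, 0]]
row_rank = rank(A)                    # dim R(A^T)
col_rank = rank(transpose(A))         # dim R(A)
nullity = len(A[0]) - row_rank        # dim N(A) = n - r, from V = R(A^T) ⊕ N(A)
```

For this $3 \times 4$ matrix both ranks equal 2, and the null space has dimension $4 - 2 = 2$.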

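Proof.

1. We have proven (1).

2. The proof of (2) is a duality argument. We simply replace $A$ by $A^T$ and use $(A^T)^T = A$ to get the result.

3. We prove (3). First, we claim that $\{AC_i\}_{i \in P}$ constitutes a basis for $R(A)$. Any $v \in V$ can be represented as

\[
v = \sum_{j \in F} a_j v_j + \sum_{i \in P} b_i C_i.
\]

Since $Av_j = 0$ for $j \in F$, we get

\[
Av = A\Big(\sum_{i \in P} b_i C_i\Big) = \sum_{i \in P} b_i AC_i.
\]

This shows $R(A) = \mathrm{Span}(\{AC_i\}_{i \in P})$. Next, we show that $\{AC_i\}_{i \in P}$ is independent. Suppose we have

\[
\sum_{i \in P} b_i AC_i = 0.
\]

Then

\[
A\Big(\sum_{i \in P} b_i C_i\Big) = 0 \;\Rightarrow\; \sum_{i \in P} b_i C_i \in N(A).
\]

But $C_i \in N(A)^\perp$ for $i \in P$, so $\sum_{i \in P} b_i C_i \in N(A) \cap N(A)^\perp = \{0\}$. Since $\{C_i\}_{i \in P}$ is independent, we get $b_i = 0$ for all $i \in P$. This shows that $\{AC_i\}_{i \in P}$ is a basis for $R(A)$.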

The consequence of this result is

\[
\dim R(A) = |P| = r.
\]

Recall that $\{C_i\}_{i \in P}$ is a basis for $R(A^T)$. Thus we obtain

\[
\dim R(A) = \dim R(A^T) = |P|.
\]

4. The restricted linear map

\[
A : R(A^T) \to R(A)
\]

is 1-1 and onto (check this yourself). For any $v = \sum_{i \in P} b_i C_i \in R(A^T)$, its image under $A$ is

\[
Av = \sum_{i \in P} b_i AC_i \in R(A).
\]

Corollary

The following statements hold and are equivalent:

(a) For any subspace $U \subset V$, it holds that

\[
V = U \oplus U^\perp. \tag{10}
\]

(b) For any subspace $U \subset V$, it holds that

\[
(U^\perp)^\perp = U. \tag{11}
\]

(c) If $U \subsetneq V$, then there exists a nonzero subspace $Z \subset V$ such that $U = Z^\perp$.

Proof.

(a) We show (a) by the fundamental theorem of linear algebra. Let us choose a basis $\{A_1, \dots, A_r\}$ in $U$, and define an $r \times n$ matrix

\[
A = \begin{bmatrix}
-\ A_1^T\ - \\
\vdots \\
-\ A_r^T\ -
\end{bmatrix}
\]

with $\{A_i^T\}_{i=1}^r$ being its row vectors. Then $U = R(A^T)$. From the fundamental theorem of linear algebra, we have

\[
V = R(A^T) \oplus N(A), \quad R(A^T) = U, \quad N(A) = R(A^T)^\perp = U^\perp.
\]

Thus, we get

\[
V = U \oplus U^\perp.
\]

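(a) ⇒ (b). First, (10) implies

\[
\dim U = \dim V - \dim U^\perp.
\]

Next, we apply (10) again with $U$ replaced by $U^\perp$ to get

\[
U^\perp \oplus (U^\perp)^\perp = V.
\]

This implies

\[
\dim (U^\perp)^\perp = \dim V - \dim U^\perp.
\]

The above two identities give

\[
\dim U = \dim (U^\perp)^\perp.
\]

On the other hand, we recall that $U \subset (U^\perp)^\perp$. This together with $\dim U = \dim (U^\perp)^\perp$ implies $U = (U^\perp)^\perp$.

(b) ⇒ (c). We choose $Z = U^\perp$. Then $Z \neq \{0\}$; otherwise $U = (U^\perp)^\perp = \{0\}^\perp = V$, contradicting $U \subsetneq V$. From $(U^\perp)^\perp = U$, we get $Z^\perp = U$.

(c) ⇒ (a). Suppose $U + U^\perp \subsetneq V$. Then we can find a nonzero subspace $Z$ such that $Z^\perp = U + U^\perp$. Then, for any $u \in Z$, we have

\[
u \perp U \quad \text{and} \quad u \perp U^\perp.
\]

The first relation says $u \in U^\perp$, so the second gives $u \perp u$. Thus, $u = 0$. This contradicts $Z \neq \{0\}$. Hence, $U + U^\perp = V$.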

Summary

Gaussian elimination performs row operations to transform $[A|b]$ into an equivalent but simpler system (a reduced echelon form).

The Gaussian elimination process is divided into two parts:

- forward elimination

- backward substitution

There are three kinds of row operations:

(1) scaling: $A_i \leftarrow \alpha A_i$, $\alpha \neq 0$,

(2) swapping: $A_i \leftrightarrow A_j$,

(3) shearing: $A_i' = A_i - \alpha A_j$, $\alpha \neq 0$.

The reduced equations read

\[
x_{j_p(i)} + \sum_{j \in F} c_{i,j}x_j = d_i, \qquad i = 1, \dots, r,
\]

which give solutions of the form

\[
\begin{bmatrix} x_P \\ x_F \end{bmatrix}
=
\begin{bmatrix} d_P \\ 0 \end{bmatrix}
+ \sum_{j \in F} x_j \begin{bmatrix} -c_j \\ \delta_j \end{bmatrix}.
\]

Theorem (Fundamental theorem of linear algebra)

Let $A$ be an $m \times n$ matrix. Then the four fundamental subspaces $R(A)$, $R(A^T)$, $N(A)$ and $N(A^T)$ have the properties:

(1) The domain $V$ has the orthogonal decomposition

\[
V = R(A^T) \oplus N(A), \quad R(A^T) = N(A)^\perp, \quad N(A) = R(A^T)^\perp. \tag{12}
\]

(2) The range $W$ has the orthogonal decomposition:

\[
W = R(A) \oplus N(A^T), \quad R(A) = N(A^T)^\perp, \quad N(A^T) = R(A)^\perp. \tag{13}
\]

(3) Row rank of $A$ = column rank of $A$:

\[
\dim R(A^T) = \dim R(A). \tag{14}
\]

(4) The linear map $x \mapsto Ax$ is 1-1 and onto from $R(A^T)$ to $R(A)$.