Chapter 5
Theory of Matrices
As before, F is a field. We use F[x] to represent the set of all polynomials of x with coefficients in F.
We use M m,n (F) and M m,n (F[x]) to denoted the set of m by n matrices with entries in F and F[x]
respectively. When m = n, we write M m,n as M n .
In this chapter we shall study seven RST (Reflective, Symmetric, Transitive) relations among ma- trices over F:
1. Equivalence over F: B = P AQ, where P and Q are invertible matrices over F;
2. Equivalence over F[x]: B = P AQ, where P and Q are invertible matrices over F[x];
3. Congruence: B = P T AP , where P is invertible;
4. Hermitian congruence: B = P ∗ AP , where F = C and P ∗ := ¯ P T and is invertible;
5. Similarity: B = P − 1 AP ;
6. Orthogonal similarity: B = P − 1 AP , where F = R and P − 1 = P T ; 7. Unitary similarity: B = P − 1 AP , where F = C and P − 1 = P ∗ .
All of the relations except the first two require the related matrices A and B to be square. Otherwise the products defining the relations are impossible.
The seven relations are in three categories: equivalent, congruent, and similar. For each of the seven relations there are two major problems:
(1) Obtaining a set of canonical matrices;
(2) Finding a necessary and sufficient condition for two matrices to obey the relation.
Equivalence over F originates from solving system of linear equations, by making change of variables and performing operations on equations. Equivalence over F[x] is primarily a tool for the study of similarity and the system of linear constant coefficient ordinary differential equations.
Similarity occurs in the determination of all matrices representing a common linear transformation, or alternatively, in finding basis such that a linear transformation has a simple form.
Congruence originates from non-singular linear substitution in quadratic forms and Hermitian forms.
It can be used to characterize quadratic surfaces.
Finally, if the linear substitution is required to preserve the length, congruence then becomes or- thogonal or unitary similarity.
143
5.1 RST Relation
Definition 5.1. Let S be a set. A relation on S is a collection R of ordered pairs from S × S. We say x relates y and write x ∼ y if (x, y) ∈ R. We say x does not relate y and write x 6∼ y if (x, y) 6∈ R.
A relation is called reflexive if x ∼ x for all x ∈ S;
A relation is called symmetric if x ∼ y implies y ∼ x;
A relation is called transitive if x ∼ y and y ∼ z implies x ∼ z;
An RST relation 1 is a reflexive, symmetric, and transitive relation.
Given an RST relation ∼ on S, for each x ∈ S, the set [[x]] := {y ∈ S | y ∼ x} is called the equivalent class of x.
Gives an RST relation, there are two fundamental problems:
1. finding a set of canonical forms, i.e., a set containing one and exactly one representative from each equivalent class.
2. finding convenient criteria to determine whether any given two elements are related or to find the canonical form which an arbitrarily given element relates.
Example 5.1. Consider Z, the set of all integers. We define a “mod(3)” relation by m ∼ n if m − n is an integer muptiple of 3.
This is an RST relation which divides Z is into three equivalent classes:
Z 0 := {3k | k ∈ Z}, Z 1 := {1 + 3k | k ∈ Z}, Z 2 := {2 + 3k | k ∈ Z}.
From each class, we pick-up one and only one representative, say 0 from Z 0 , 1 from Z 1 , and −1 from Z 2 . Then we obtain a set C = {0, 1, −1} of canonical forms. Each integer relates exactly one canonical form.
Criteria: An integer of the form a 1 · · · a k (in dicimal) relates the remainder of (a 1 + · · · + a k )/3. For example: n = 8230609. Then the remainder of (8 + 2 + 3 + 6 + 9)/3 is 2, so n ∈ Z 2 .
Exercise 1. Let S = M n (F), the set of all square matrices of rank n and entries in F. Which kind of relations (reflexive, symmetric, transitive, and/or RST) are the following?
1. A ∼ B if and only if B = 0I; e.g. R = {(A, 0I) | A ∈ S};
2. A ∼ B if and only if A = 0I; e.g. R = {(0I, B) | B ∈ S};
3. A ∼ B if and only if A = B; e.g. R = {(A, A) | A ∈ S};
4. A ∼ B for all A and B; e.g. R = {(A, B) | A, B ∈ S};
5. A ∼ B if and only if both A and B are invertible;
6. A ∼ B if and only if A − B has at least one row of zeros;
7. A ∼ B if and only if AB = 0I.
1
This is commonly called “equivalence relation”, but in this chapter such terminology would be confusing.
5.1. RST RELATION 145 Exercise 2. Demonstrate by examples that the three relations, reflexive, symmetric, and transitive, are independent to each other; namely, find examples of relations that satisfy (i) non of them, (ii) only one of them, (iii) only two of them, and (iii) all three of them.
Exercise 3. Suppose ∼ is an RST relation on a set S. Show that for every x, y ∈ S, either [[x]] = [[y]] or
[[x]] ∩ [[y]] = ∅.
5.2 Equivalence over F
Consider a linear system
n
X
j=1
a i j x j = b i , i = 1, · · · , m (5.1)
of m equations and n unknowns x 1 , · · · , x n . Here a i j , b i are given numbers (scalars) in F. This system can be put in a matrix form
Ax = b, A = (a i j ) m×n , x = (x j ) n×1 , b = (b i ) m×1 . In this form, the system may be regarded as having only one unknown, the vector x.
Two systems of m-linear equations in n unknowns x 1 , · · · , x n are called equivalent if every solution of one system is also a solution of the other, and vice versa. Certain simple operations are commonly used to convert a system to an equivalent one that may be easier to solve. For instance, it surely makes no difference when an equation is written first or last or in some intermediate position.
An elementary operation for a system of equations is one of the following three types:
1. Interchange two equations;
2. multiplication of an equation by an invertible scalar;
3. Addition to an equation by a scalar multiple of a different equation.
A system of equations Ax = b can be recorded by an augmented matrix G = (A b).
The elementary operations of equations correspond to the elementary row operations (defined below) on the augmented matrix.
Definition 5.2. 1. An elementary row operation is one of the followings:
(a) O ij : Interchange the ith and jth rows;
(b) O i (d) : Multiplication of the ith row by an invertible scalar d;
(c) O ij (c): Addition to the ith row by a scalar c multiple of the jth row, i 6= j.
2. An elementary matrix is a matrix obtained by performing an elementary row operation on an identity matrix I. There are three types of elementary matrices: E ij , E i (d) (d invertible), and E ij (c), obtained by performing O ij , O i (d), and O ij (c) on the identity matrix I, respectively.
3. A matrix is called in a Row-Reduced-Echelon-Form 2 or RREF, if
(a) every nontrivial row has 1 as its first non-zero entry, called a leading one;
(b) any leading one in a lower row occurs only to the right of its counterpart above it;
(c) In any column containing a leading one, all entries above and below the leading one are zero;
(d) all rows consisting entirely zeros, if any, occurs at the bottom of the matrix.
4. The row space of a matrix is the space spanned by all the row vectors of the matrix.
2