Diagonalisable matrices

In this section, we will look at some special classes of square matrices which are diagonalisable. We will also be dealing with matrices having complex entries and hence for a matrix $ A=[a_{ij}],$ recall the following definitions.

DEFINITION 6.3.1 (Special Matrices)  

  1. The matrix $ A^* = ( {\overline{a_{ji}}} )$ is called the conjugate transpose of the matrix $ A.$

    Note that $ A^* = {\overline{ A^{t}}} = {\overline {A}}^{t}.$

  2. A square matrix $ A$ with complex entries is called
    1. a Hermitian matrix if $ A^* = A.$
    2. a unitary matrix if $ A \; A^* = A^* A = I_n.$
    3. a skew-Hermitian matrix if $ A^* = - A.$
    4. a normal matrix if $ A^* A = A A^*.$
  3. A square matrix $ A$ with real entries is called
    1. a symmetric matrix if $ A^{t} = A.$
    2. an orthogonal matrix if $ A \;A^{t} = A^{t} A = I_n.$
    3. a skew-symmetric matrix if $ A^{t} = -A.$

Note that a symmetric matrix is always Hermitian, a skew-symmetric matrix is always skew-Hermitian and an orthogonal matrix is always unitary. Each of these matrices is normal. If $ A$ is a unitary matrix then $ A^* = A^{-1}.$

EXAMPLE 6.3.2  
  1. Let $ B= \begin{bmatrix}i & 1 \\ -1 & i \end{bmatrix}.$ Then $ B$ is skew-Hermitian.
  2. Let $ A = \frac{1}{\sqrt{2}}\begin{bmatrix}1 & i \\ i & 1 \end{bmatrix}$ and $ B = \begin{bmatrix}1 & 1 \\ -1 & 1 \end{bmatrix}.$ Then $ A$ is a unitary matrix and $ B$ is a normal matrix. Note that $ \sqrt{2} A$ is also a normal matrix.
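
The defining identities above are easy to test numerically. The following sketch (assuming Python with NumPy is available; the predicate names are ours, not library functions) checks the matrices of Example 6.3.2 against Definition 6.3.1.

```python
import numpy as np

def is_hermitian(A, tol=1e-12):
    # A equals its conjugate transpose: A* = A
    return np.allclose(A, A.conj().T, atol=tol)

def is_skew_hermitian(A, tol=1e-12):
    # A* = -A
    return np.allclose(A.conj().T, -A, atol=tol)

def is_unitary(A, tol=1e-12):
    # A A* = I (equivalent to A* A = I for square A)
    return np.allclose(A @ A.conj().T, np.eye(A.shape[0]), atol=tol)

def is_normal(A, tol=1e-12):
    # A* A = A A*
    return np.allclose(A.conj().T @ A, A @ A.conj().T, atol=tol)

# The matrices of Example 6.3.2
B = np.array([[1j, 1], [-1, 1j]])
A = (1 / np.sqrt(2)) * np.array([[1, 1j], [1j, 1]])

print(is_skew_hermitian(B))                 # True
print(is_unitary(A), is_normal(A))          # True True
print(is_normal(np.sqrt(2) * A),            # True: still normal,
      is_unitary(np.sqrt(2) * A))           # False: no longer unitary
```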

DEFINITION 6.3.3 (Unitary Equivalence)   Let $ A$ and $ B$ be two $ n \times n$ matrices. They are called unitarily equivalent if there exists a unitary matrix $ U$ such that $ A = U^* B U.$

Note that $ U^* = U^{-1}$ as $ U$ is a unitary matrix. So, $ A$ is unitarily similar to the matrix $ B.$

EXERCISE 6.3.4  
  1. Let $ A$ be a square matrix such that $ U A U^*$ is a diagonal matrix for some unitary matrix $ U$ . Prove that $ A$ is a normal matrix.
  2. Let $ A$ be any square matrix. Then $ A = \frac{1}{2}(A + A^*) + \frac{1}{2}(A - A^*),$ where $ \frac{1}{2}(A + A^*)$ is called the Hermitian part of $ A$ and $ \frac{1}{2}(A - A^*)$ is called the skew-Hermitian part of $ A.$
  3. Show that every square matrix can be uniquely expressed as $ A = S + i T,$ where both $ S$ and $ T$ are Hermitian matrices.
  4. Show that $ A - A^*$ is always skew-Hermitian.
  5. Does there exist a unitary matrix $ U$ such that $ U^{-1} A U = B,$ where
    $ A = \begin{bmatrix}1 & 1 & 4\\ 0 &2 & 2\\ 0&0&3 \end{bmatrix}$ and $ B = \begin{bmatrix}2 & -1 & 3 \sqrt{2}\\ 0 &1 & \sqrt{2}\\ 0&0&3 \end{bmatrix}?$ (A numerical screen for the necessary conditions is sketched after this list.)
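
For the last item, note that unitary equivalence preserves both the set of eigenvalues (being a similarity) and the sum of squares of the absolute values of the entries, i.e. the Frobenius norm (compare Remark 6.3.9 below). A minimal sketch of this necessary-condition screen, assuming NumPy:

```python
import numpy as np

A = np.array([[1, 1, 4],
              [0, 2, 2],
              [0, 0, 3]], dtype=float)
B = np.array([[2, -1, 3 * np.sqrt(2)],
              [0,  1, np.sqrt(2)],
              [0,  0, 3]])

# Unitary equivalence preserves eigenvalues and the Frobenius norm,
# so a mismatch in either would rule out such a U.
print(np.sort(np.linalg.eigvals(A)))   # [1. 2. 3.]
print(np.sort(np.linalg.eigvals(B)))   # [1. 2. 3.]
print(np.linalg.norm(A, 'fro')**2)     # 35.0
print(np.linalg.norm(B, 'fro')**2)     # 35.0
```

Both invariants agree here, so this screen does not rule out such a $ U$; it remains to construct one, or refute its existence, by hand.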

PROPOSITION 6.3.5   Let $ A$ be an $ n \times n$ Hermitian matrix. Then all the eigenvalues of $ A$ are real.

Proof. Let $ (\lambda, {\mathbf x})$ be an eigenpair. Then $ A {\mathbf x}= \lambda {\mathbf x}$ and $ A = A^*$ together imply

$\displaystyle {\mathbf x}^* A = {\mathbf x}^* A^* = (A {\mathbf x})^* = ({\lambda}{\mathbf x})^* = \overline{{\lambda}} {\mathbf x}^*.$

Hence

$\displaystyle \lambda {\mathbf x}^*{\mathbf x} = {\mathbf x}^* ({\lambda}{\mathbf x}) = {\mathbf x}^* (A {\mathbf x}) = ({\mathbf x}^* A) {\mathbf x} = ({\overline{\lambda}} {\mathbf x}^*) {\mathbf x} = {\overline{\lambda}} {\mathbf x}^* {\mathbf x}.$

But $ {\mathbf x}$ is an eigenvector and hence $ {\mathbf x}\neq {\mathbf 0},$ and so the real number $ \Vert{\mathbf x}\Vert^2 = {\mathbf x}^* {\mathbf x}$ is non-zero as well. Thus $ \lambda = {\overline{\lambda}}.$ That is, $ {\lambda}$ is a real number. $ \blacksquare$
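
A quick numerical illustration of Proposition 6.3.5 (a sketch, assuming NumPy): build a Hermitian matrix as $ M + M^*$ and observe that the computed eigenvalues are real up to round-off.

```python
import numpy as np

rng = np.random.default_rng(0)
M = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
A = M + M.conj().T                   # A* = A, so A is Hermitian

eigvals = np.linalg.eigvals(A)
print(np.max(np.abs(eigvals.imag)))  # ~1e-15: real up to round-off
```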

THEOREM 6.3.6   Let $ A$ be an $ n \times n$ Hermitian matrix. Then $ A$ is unitarily diagonalisable. That is, there exists a unitary matrix $ U$ such that $ U^* A U = D;$ where $ D$ is a diagonal matrix with the eigenvalues of $ A$ as the diagonal entries.

In other words, the eigenvectors of $ A$ form an orthonormal basis of $ {\mathbb{C}}^n.$

Proof. We will prove the result by induction on the size of the matrix. The result is clearly true if $ n=1.$ Let the result be true for $ n = k-1.$ We will prove the result for $ n = k.$ So, let $ A$ be a $ k \times k$ matrix and let $ (\lambda_1, {\mathbf x})$ be an eigenpair of $ A$ with $ \Vert {\mathbf x}\Vert = 1.$ We now extend the linearly independent set $ \{ {\mathbf x}\}$ to an orthonormal basis $ \{{\mathbf x}, {\mathbf u}_2, {\mathbf u}_3, \ldots, {\mathbf u}_k \}$ of $ {\mathbb{C}}^k$ (using Gram-Schmidt orthogonalisation).

As $ \{{\mathbf x}, {\mathbf u}_2, {\mathbf u}_3, \ldots, {\mathbf u}_k \}$ is an orthonormal set,

$\displaystyle {\mathbf u}_i^* {\mathbf x}= 0 \;\; {\mbox{ for all }} \; i = 2, 3, \ldots, k.$

Therefore, observe that for all $ i, \; 2 \leq i \leq k,$

$\displaystyle (A {\mathbf u}_i)^* {\mathbf x} = ({\mathbf u}_i^* A^*) {\mathbf x} = {\mathbf u}_i^* (A^* {\mathbf x}) = {\mathbf u}_i^* (A {\mathbf x}) = {\mathbf u}_i^* ({\lambda}_1 {\mathbf x}) = {\lambda}_1 ({\mathbf u}_i^* {\mathbf x}) = 0.$

Hence, we also have $ {\mathbf x}^* (A {\mathbf u}_i) = 0$ for $ 2 \leq i \leq k.$ Now, define $ U_1 = [ {\mathbf x}, \; {\mathbf u}_2, \; \ldots, \; {\mathbf u}_k ]$ (with $ {\mathbf x}, {\mathbf u}_2, \ldots, {\mathbf u}_k$ as the columns of $ U_1$). Then the matrix $ U_1$ is a unitary matrix and

$\displaystyle U_1^{*} A U_1 = U_1^* [ A {\mathbf x}\;\; A {\mathbf u}_2 \; \cdots \; A {\mathbf u}_k ] = \begin{bmatrix}{\mathbf x}^* \\ {\mathbf u}_2^* \\ \vdots \\ {\mathbf u}_k^* \end{bmatrix} [ {\lambda}_1 {\mathbf x}\;\; A {\mathbf u}_2 \; \cdots \; A {\mathbf u}_k ] = \left[\begin{array}{c|c} \lambda_1 & {\mathbf 0} \\ \hline {\mathbf 0} & B \end{array} \right],$
where $ B$ is a $ (k-1) \times (k-1)$ matrix. As $ A^* = A$, we get $ (U_1^{*} A U_1)^* = U_1^{*} A U_1$. This condition, together with the fact that $ {\lambda}_1$ is a real number (use Proposition 6.3.5), implies that $ B^* = B$. That is, $ B$ is also a Hermitian matrix. Therefore, by the induction hypothesis there exists a $ (k-1) \times (k-1)$ unitary matrix $ U_2$ such that

$\displaystyle U_2^{*} B U_2 = D_2 = {\mbox{diag}}(\lambda_2, \ldots, \lambda_k).$

Recall that the entries $ {\lambda}_i,$ for $ 2 \leq i \leq k,$ are the eigenvalues of the matrix $ B.$ We also know that two similar matrices have the same set of eigenvalues. Hence, the eigenvalues of $ A$ are $ \lambda_1, \lambda_2, \ldots, \lambda_k.$ Define $ U= U_1 \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2 \end{bmatrix}.$ Then $ U$ is a unitary matrix and
$\displaystyle U^{*} A U = \left( U_1 \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2 \end{bmatrix}\right)^{*} A \left(U_1 \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2 \end{bmatrix}\right) = \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2^{*} \end{bmatrix} \bigl( U_1^{*} A U_1 \bigr) \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2 \end{bmatrix} = \begin{bmatrix}{\lambda}_1 & {\mathbf 0}\\ {\mathbf 0}& U_2^{*} B U_2 \end{bmatrix} = \begin{bmatrix}{\lambda}_1 & {\mathbf 0}\\ {\mathbf 0}& D_2 \end{bmatrix}.$

Thus, $ U^{*} A U$ is a diagonal matrix with diagonal entries $ \lambda_1, \lambda_2, \ldots, \lambda_k,$ the eigenvalues of $ A.$ Hence, the result follows. $ \blacksquare$
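
In floating-point practice, Theorem 6.3.6 is what `numpy.linalg.eigh` computes for a complex Hermitian matrix: real eigenvalues in ascending order together with a unitary matrix of eigenvectors. A minimal check, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(1)
M = rng.standard_normal((5, 5)) + 1j * rng.standard_normal((5, 5))
A = M + M.conj().T                      # Hermitian

w, U = np.linalg.eigh(A)                # real eigenvalues w, unitary U
print(np.allclose(U.conj().T @ U, np.eye(5)))       # U is unitary
print(np.allclose(U.conj().T @ A @ U, np.diag(w)))  # U* A U = D
```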

COROLLARY 6.3.7   Let $ A$ be an $ n \times n$ real symmetric matrix. Then
  1. the eigenvalues of $ A$ are all real,
  2. the corresponding eigenvectors can be chosen to have real entries, and
  3. the eigenvectors also form an orthonormal basis of $ {\mathbb{R}}^n.$

Proof. As $ A$ is symmetric, $ A$ is also a Hermitian matrix. Hence, by Proposition 6.3.5, the eigenvalues of $ A$ are all real. Let $ ({\lambda}, \; {\mathbf x})$ be an eigenpair of $ A$ with $ {\mathbf x}\in {\mathbb{C}}^n.$ Then there exist $ {\mathbf y}, {\mathbf z}\in {\mathbb{R}}^n$ such that $ {\mathbf x}= {\mathbf y}+ i {\mathbf z}.$ So,

$\displaystyle A {\mathbf x}= {\lambda}{\mathbf x}\Longrightarrow A ({\mathbf y}+ i {\mathbf z}) = {\lambda}( {\mathbf y}+ i {\mathbf z}).$

Comparing the real and imaginary parts, we get $ A {\mathbf y}= {\lambda}{\mathbf y}$ and $ A {\mathbf z}= {\lambda}{\mathbf z}.$ Since $ {\mathbf x}\neq {\mathbf 0},$ at least one of $ {\mathbf y}$ and $ {\mathbf z}$ is non-zero. Thus, we can choose the eigenvectors to have real entries.

To prove the orthonormality of the eigenvectors, we proceed along the lines of the proof of Theorem 6.3.6. Hence, the readers are advised to complete the proof. $ \blacksquare$
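
The real case of Corollary 6.3.7 can be illustrated the same way: for a real symmetric input, `numpy.linalg.eigh` stays entirely over the reals, and the columns of the returned matrix form an orthonormal basis of $ {\mathbb{R}}^n.$ A sketch, assuming NumPy:

```python
import numpy as np

rng = np.random.default_rng(2)
M = rng.standard_normal((4, 4))
A = M + M.T                              # real symmetric

w, P = np.linalg.eigh(A)
print(w.dtype, P.dtype)                  # float64 float64: real throughout
print(np.allclose(P.T @ P, np.eye(4)))   # columns are orthonormal
print(np.allclose(P.T @ A @ P, np.diag(w)))
```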

EXERCISE 6.3.8  
  1. Let $ A$ be a skew-Hermitian matrix. Then all the eigenvalues of $ A$ are either zero or purely imaginary. Also, the eigenvectors corresponding to distinct eigenvalues are mutually orthogonal.
    [Hint: Carefully study the proof of Theorem 6.3.6.]
  2. Let $ A$ be an $ n \times n$ unitary matrix. Then
    1. the rows of $ A$ form an orthonormal basis of $ {\mathbb{C}}^n.$
    2. the columns of $ A$ form an orthonormal basis of $ {\mathbb{C}}^n.$
    3. for any two vectors $ {\mathbf x}, {\mathbf y}\in {\mathbb{C}}^{n \times 1},\;$ $ \langle A {\mathbf x}, A {\mathbf y}\rangle = \langle {\mathbf x}, {\mathbf y}\rangle.$
    4. for any vector $ {\mathbf x}\in {\mathbb{C}}^{n \times 1},\; $ $ \Vert A {\mathbf x}\Vert = \Vert {\mathbf x}\Vert.$
    5. for any eigenvalue $ \lambda$ of $ A,\;$ $ \vert \lambda\vert = 1.$
    6. the eigenvectors $ {\mathbf x}, {\mathbf y}$ corresponding to distinct eigenvalues $ {\lambda}$ and $ \mu$ satisfy $ \langle {\mathbf x}, {\mathbf y}\rangle = 0.$ That is, if $ ({\lambda}, {\mathbf x})$ and $ (\mu, {\mathbf y})$ are eigenpairs, with $ {\lambda}\neq \mu,$ then $ {\mathbf x}$ and $ {\mathbf y}$ are mutually orthogonal.
  3. Let $ A$ be a normal matrix. Then, show that if $ (\lambda, {\mathbf x})$ is an eigenpair for $ A$ then $ ({\overline{\lambda}}, {\mathbf x})$ is an eigenpair for $ A^*.$
  4. Show that the matrices $ A = \begin{bmatrix}4&4\\ 0&4 \end{bmatrix}$ and $ B = \begin{bmatrix}10&9 \\ -4&-2 \end{bmatrix}$ are similar. Is it possible to find a unitary matrix $ U$ such that $ A = U^* B U?$
  5. Let $ A$ be a $ 2 \times 2$ orthogonal matrix. Then prove the following:
    1. if $ \det (A) = 1,$ then $ A = \begin{bmatrix}\cos \theta & - \sin \theta \\ \sin \theta & \cos \theta \end{bmatrix}$ for some $ \theta, \;\; 0 \leq \theta < 2 \pi.$
    2. if $ \det A = -1,$ then there exists a basis of $ {\mathbb{R}}^2$ in which the matrix of $ A$ looks like $ \begin{bmatrix}1 & 0 \\ 0 & -1 \end{bmatrix}.$

      Or equivalently, $ A = \begin{bmatrix}\cos \theta & \sin \theta \\ \sin \theta & - \cos \theta \end{bmatrix}$ for some $ \theta, \;\; 0 \leq \theta < 2 \pi.$ In this case, prove that $ A$ reflects the vectors in $ {\mathbb{R}}^2$ about a line passing through the origin. Also, determine this line.

  6. Let $ A = \begin{bmatrix}2 & 1 & 1 \\ 1 & 2 & 1 \\ 1 & 1 & 2 \end{bmatrix}.$ Determine $ A^{301}$ .
  7. Let $ A$ be a $ 3 \times 3$ orthogonal matrix. Then prove the following:
    1. if $ \det (A) = 1,$ then $ A$ is a rotation about a fixed axis, in the sense that $ A$ has an eigenpair $ (1, {\mathbf x})$ such that the restriction of $ A$ to the plane $ {\mathbf x}^{\perp}$ is a two-dimensional rotation of $ {\mathbf x}^{\perp}.$
    2. if $ \det A = -1,$ then the action of $ A$ corresponds to a reflection through a plane $ P,$ followed by a rotation about the line through the origin that is perpendicular to $ P.$

Remark 6.3.9   In the previous exercise, we saw that the matrices $ A = \begin{bmatrix}4&4\\ 0&4 \end{bmatrix}$ and $ B = \begin{bmatrix}10&9 \\ -4&-2 \end{bmatrix}$ are similar but not unitarily equivalent, whereas unitary equivalence always implies similarity as $ U^* = U^{-1}.$ In numerical calculations, however, unitary transformations are preferred to general similarity transformations. The main reasons are:
  1. Exercise 6.3.8.2 implies that an orthonormal change of basis leaves unchanged the sum of squares of the absolute values of the entries, which need not be true under a non-orthonormal change of basis (see the sketch after this list).
  2. As $ U^* = U^{-1}$ for a unitary matrix $ U,$ unitary equivalence is computationally simpler: the inverse comes for free as the conjugate transpose.
  3. Also, in forming the conjugate transpose, no loss of accuracy due to round-off errors occurs.
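
The first point is easy to observe numerically: conjugating by an orthogonal matrix leaves the Frobenius norm unchanged, while a generic similarity does not. A small sketch, assuming NumPy (the random seed is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(3)
A = rng.standard_normal((4, 4))

# Orthonormal change of basis: Q from the QR factorisation is orthogonal.
Q, _ = np.linalg.qr(rng.standard_normal((4, 4)))
S = rng.standard_normal((4, 4))                  # generic invertible matrix

fro = lambda X: np.linalg.norm(X, 'fro')
print(fro(A), fro(Q.T @ A @ Q))                  # equal: norm is preserved
print(fro(A), fro(np.linalg.inv(S) @ A @ S))     # generally different
```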

We next prove Schur's Lemma and use it to show that normal matrices are unitarily diagonalisable.

LEMMA 6.3.10 (Schur's Lemma)   Every $ n \times n$ complex matrix is unitarily similar to an upper triangular matrix.

Proof. We will prove the result by induction on the size of the matrix. The result is clearly true if $ n=1.$ Let the result be true for $ n = k-1.$ We will prove the result for $ n = k.$ So, let $ A$ be a $ k \times k$ matrix and let $ (\lambda_1, {\mathbf x})$ be an eigenpair for $ A$ with $ \Vert {\mathbf x}\Vert = 1.$ Then the linearly independent set $ \{ {\mathbf x}\}$ can be extended, using the Gram-Schmidt orthogonalisation process, to an orthonormal basis $ \{{\mathbf x}, {\mathbf u}_2, {\mathbf u}_3, \ldots, {\mathbf u}_k \}$ of $ {\mathbb{C}}^k.$ Then $ U_1 = [ {\mathbf x}\; {\mathbf u}_2 \; \cdots \; {\mathbf u}_k ]$ (with $ {\mathbf x}, {\mathbf u}_2, \ldots, {\mathbf u}_k$ as the columns of the matrix $ U_1$) is a unitary matrix and
$\displaystyle U_1^{*} A U_1 = U_1^* [ A {\mathbf x}\;\; A {\mathbf u}_2 \; \cdots \; A {\mathbf u}_k ] = \begin{bmatrix}{\mathbf x}^* \\ {\mathbf u}_2^* \\ \vdots \\ {\mathbf u}_k^* \end{bmatrix} [ {\lambda}_1 {\mathbf x}\;\; A {\mathbf u}_2 \; \cdots \; A {\mathbf u}_k ] = \left[\begin{array}{c|c} \lambda_1 & * \\ \hline {\mathbf 0} & B \end{array} \right],$

where $ B$ is a $ (k-1) \times (k-1)$ matrix. By the induction hypothesis there exists a $ (k-1) \times (k-1)$ unitary matrix $ U_2$ such that $ U_2^{*} B U_2$ is an upper triangular matrix with diagonal entries $ \lambda_2, \ldots, \lambda_k,$ the eigenvalues of the matrix $ B.$ Observe that since the eigenvalues of $ B$ are $ \lambda_2, \ldots, \lambda_k,$ the eigenvalues of $ A$ are $ \lambda_1, \lambda_2, \ldots, \lambda_k.$ Define $ U= U_1 \begin{bmatrix}1 & {\mathbf 0}\\ {\mathbf 0}& U_2 \end{bmatrix}.$ Then check that $ U$ is a unitary matrix and $ U^{*} A U$ is an upper triangular matrix with diagonal entries $ \lambda_1, \lambda_2, \ldots, \lambda_k,$ the eigenvalues of the matrix $ A.$ Hence, the result follows. $ \blacksquare$
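
Schur's Lemma is available in floating point as the complex Schur decomposition, `scipy.linalg.schur` with `output='complex'`. A minimal check, assuming NumPy and SciPy:

```python
import numpy as np
from scipy.linalg import schur

rng = np.random.default_rng(4)
A = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))

# Complex Schur form: A = U T U* with U unitary, T upper triangular.
T, U = schur(A, output='complex')
print(np.allclose(U @ T @ U.conj().T, A))   # A = U T U*
print(np.allclose(np.tril(T, -1), 0))       # T is upper triangular
print(np.sort_complex(np.diag(T)))          # the eigenvalues of A
```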

EXERCISE 6.3.11  
  1. Let $ A$ be an $ n \times n$ real invertible matrix. Prove that there exists an orthogonal matrix $ P$ and a diagonal matrix $ D$ with positive diagonal entries such that $ A A^t = P D P^{-1}$ .
  2. Show that the matrices $ A = \begin{bmatrix}1 & 1 & 1\\ 0 & 2 & 1\\ 0 & 0 & 3 \end{bmatrix}$ and $ B = \begin{bmatrix}2 & -1 & \sqrt{2}\\ 0 & 1 & 0\\ 0 & 0 & 3 \end{bmatrix}$ are unitarily equivalent via the unitary matrix $ U = \frac{1}{\sqrt{2}} \begin{bmatrix}1 & 1 & 0\\ 1 & -1 & 0\\ 0 & 0 & \sqrt{2} \end{bmatrix}.$ Hence, conclude that the upper triangular matrix obtained in Schur's Lemma need not be unique.
  3. Show that normal matrices are unitarily diagonalisable.
    [Hint: Show that the matrix $ B$ in the proof of Schur's Lemma (Lemma 6.3.10) is also a normal matrix, and that an upper triangular matrix $ T$ with $ T^* T = T T^*$ has to be a diagonal matrix.]

    Remark 6.3.12 (The Spectral Theorem for Normal Matrices)   Let $ A$ be an $ n \times n$ normal matrix. Then the above exercise shows that there exists an orthonormal basis $ \{{\mathbf x}_1, {\mathbf x}_2, \ldots, {\mathbf x}_n \}$ of $ {\mathbb{C}}^n$ such that $ A {\mathbf x}_i = \lambda_i {\mathbf x}_i$ for $ 1 \leq i \leq n.$ (A numerical illustration is sketched after this exercise list.)

  4. Let $ A$ be a normal matrix. Prove the following:
    1. if all the eigenvalues of $ A$ are $ 0,$ then $ A = {\mathbf 0},$
    2. if all the eigenvalues of $ A$ are $ 1,$ then $ A = I.$
  5. Let $ A$ be an $ n \times n$ matrix. Prove that
    1. if $ A$ is Hermitian and $ {\mathbf x}^* A {\mathbf x}= 0$ for all $ {\mathbf x}\in {\mathbb{C}}^n$ then $ A = {\mathbf 0}.$
    2. if $ A$ is a real, symmetric matrix and $ {\mathbf x}^t A {\mathbf x}= 0$ for all $ {\mathbf x}\in {\mathbb{R}}^n$ then $ A = {\mathbf 0}.$

      Do these results hold for arbitrary matrices?
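
As the numerical illustration promised in Remark 6.3.12 (a sketch, assuming NumPy and SciPy): for a normal matrix the triangular factor in the Schur form is diagonal, so the columns of the unitary factor form an orthonormal eigenbasis of $ {\mathbb{C}}^n.$ The cyclic permutation matrix below is a convenient normal (indeed orthogonal) example.

```python
import numpy as np
from scipy.linalg import schur

# A cyclic permutation matrix: orthogonal, hence normal.
A = np.array([[0, 1, 0],
              [0, 0, 1],
              [1, 0, 0]], dtype=complex)
print(np.allclose(A @ A.conj().T, A.conj().T @ A))   # normal

# For a normal matrix the Schur triangular factor is diagonal.
T, U = schur(A, output='complex')
print(np.allclose(T, np.diag(np.diag(T))))           # T is diagonal
for i in range(3):
    # Each column of U is an eigenvector for the diagonal entry of T.
    print(np.allclose(A @ U[:, i], T[i, i] * U[:, i]))
```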

We end this chapter with an application of the theory of diagonalisation to the study of conic sections in analytic geometry and the study of maxima and minima in analysis.
