Cyclic subspaces

Section 7.5 Cyclic subspaces

Definition 7.5.1. (Principal vector and its order).

Let \(V\) be a finite-dimensional vector space over a field \(F\text{.}\) A nonzero vector \(e\in V\) is said to be a principal vector of an \(F\)-linear map \(T\colon V\to V\) with an eigenvalue \(\lambda\in F\) if there exists a natural number \(n\) such that \((T-\lambda\unit_V)^n(e)=0\text{.}\)

The least natural number \(m\) such that \((T-\lambda\unit_V)^m(e)=0\) is called the order of \(e\text{.}\)

A principal vector of \(T\) with an eigenvalue \(\lambda\) is also called a generalized eigenvector with eigenvalue \(\lambda\).

Remark 7.5.2.

An eigenvector is a principal vector of order \(1\text{.}\)

Example 7.5.3. (An illustrative example).

Let \(T\colon\R^2\to\R^2\) be a linear map corresponding to a matrix \(A=\begin{pmatrix}1\amp 1\\0\amp 1\end{pmatrix}\) (see Theorem 5.2.3). We have

\begin{equation*} (T-\unit_V)(e_1)=0\quad\text{and}\quad(T-\unit_V)(e_2)=e_1. \end{equation*}

Thus,

\begin{equation*} (T-\unit_V)^2(e_2)=0. \end{equation*}

Therefore, \(e_1\) is a principal vector of \(T\) with eigenvalue \(1\) and its order is \(1\text{.}\) While \(e_2\in\R^2\) is a principal vector of \(T\) with an eigenvalue \(1\text{,}\) and its order is \(2\text{.}\) Since vectors \(w_1=e_2,w_2=(T-\unit_V)(e_2)\) are \(\R\)-linearly independent, they form a basis of \(\R^2\text{.}\) Furthermore, since \(T(w_1)=w_1+w_2\) and \((T-\unit_V)(w_2)=(T-\unit_V)^2(e_2)=0\text{,}\) the matrix of \(T\) with respect to an ordered basis \((w_1,w_2)\) is

\begin{equation*} \begin{pmatrix}1\amp0\\1\amp1\end{pmatrix}. \end{equation*}

Definition 7.5.4. (Cyclic subspace).

Let \(V\) be a finite-dimensional vector space over a field \(F\text{.}\) Consider a nonzero (principal) vector \(e\in V\text{,}\) a scalar \(\lambda\in F\) such that \((T-\lambda\unit_V)^n(e)=0\text{.}\) Suppose that \(m\) is the least natural number such that \((T-\lambda\unit_V)^m(e)=0\text{.}\) The subspace of \(V\) generated by

\begin{equation*} e,(T-\lambda\unit_V)(e),(T-\lambda\unit_V)^2(e),\ldots,(T-\lambda\unit_V)^{m-1}(e) \end{equation*}

is called the cyclic subspace generated by \(e\).

Proposition 7.5.5.

Let \(V\) be a finite-dimensional vector space over a field \(F\) and let \(T\colon V\to V\) be an \(F\)-linear map. The cyclic subspace \(W\) generated by the principal vector \(e\) of order \(m\) with an eigenvalue \(\lambda\) is invariant under \(T\) and \(\dim_FW=m\text{.}\)

Proof.

We first compute the dimension of \(W\text{.}\) By Definition 7.5.4, the subspace \(W\) is spanned by

\begin{equation*} e,\;(T-\lambda\unit_V)(e),\;(T-\lambda\unit_V)^2(e),\ldots,(T-\lambda\unit_V)^{m-1}(e). \end{equation*}

Hence, \(\dim_FW\leq m\text{.}\) We show that \(\left\{(T-\lambda\unit_V)^i(e)\right\}_{i=0}^{m-1}\) is linearly independent. Suppose that

\begin{equation*} a_0e+a_1(T-\lambda\unit_V)(e)+\cdots+a_{m-1}(T-\lambda\unit_V)^{m-1}(e)=0. \end{equation*}

Since \((T-\lambda\unit_V)^m(e)=0\text{,}\) we get that \((T-\lambda\unit_V)^r(e)=0\) for any \(r\geq m\text{.}\) Hence, the linear map \((T-\lambda\unit_V)^{m-1}\) evaluated at the above expression gives

\begin{equation*} a_0(T-\lambda\unit_V)^{m-1}(e)=0. \end{equation*}

As \((T-\lambda\unit_V)^{m-1}(e)\neq 0\text{,}\) we have \(a_0=0\text{.}\) Therefore we are left with

\begin{equation*} a_1(T-\lambda\unit_V)(e)+\cdots+a_{m-1}(T-\lambda\unit_V)^{m-1}(e)=0. \end{equation*}

Now we apply \((T-\lambda\unit_V)^{m-2}\) to the above expression, and argueing as above, we obtain \(a_1=0\text{.}\) Continuing in this way we get that \(a_i=0\) for all \(i\text{.}\) This shows that \(\left\{(T-\lambda\unit_V)^i\right\}_{i=0}^{m-1}\) is a maximal linearly independent subset of \(W\text{,}\) i.e., it is a basis of \(W\text{.}\) Thus \(\dim_FW=m\text{.}\)

We now show that \(W\) is invariant under \(T\text{.}\) We rename the basis vectors of \(W\) obtained above as follows.

\begin{equation*} w_1=e,\;w_2=(T-\lambda\unit_V)(e),\;w_3=(T-\lambda\unit_V)^2(e),\;\ldots,\;w_m=(T-\lambda\unit_V)^{m-1}(e) \end{equation*}

Therefore we have

\begin{equation*} (T-\lambda\unit_V)(w_i)=w_{i+1}\quad\text{for}\;1\leq i\leq m-1\quad\text{and}\quad (T-\lambda\unit_V)(w_m)=0. \end{equation*}

In other words we have

\begin{equation*} T(w_i)=\lambda w_i+w_{i+1}\in W\quad\text{for}\;1\leq i\leq m-1\quad\text{and}\quad T(w_m)=\lambda w_m\in W. \end{equation*}

This shows that \(W\) is invariant under \(T\text{.}\)

Remark 7.5.6. (Matrix of a cyclic subspace).

We keep the notation of the above Proposition 7.5.5 and its proof. A matrix of \(T|_W\) with respect to the ordered basis \((w_1,w_2,\ldots,w_m)\) is the matrix with all diagonal entries \(\lambda\text{,}\) all lower diagonal entries \(1\text{,}\) and all other entries \(0\text{.}\)

Definition 7.5.7. (Jordan block).

The matrix of a cyclic subspace described in Remark 7.5.6 is called the Jordan block. Thus Jordan block obtained in Proposition 7.5.5 is the following lower triangular matrix.

\begin{equation*} J_\lambda=\begin{pmatrix}\lambda\amp 0\amp 0\amp\cdots\amp\cdots\amp 0\\1\amp\lambda\amp 0\amp\amp\amp 0\\0\amp 1\amp\lambda\amp\amp\amp\vdots\\\vdots\amp\amp\ddots\amp\ddots\amp\amp\vdots\\\vdots\amp\amp\amp\ddots\amp\lambda\amp 0\\0\amp 0\amp\cdots\amp\cdots\amp 1\amp \lambda\end{pmatrix} \end{equation*}

Remark 7.5.8.

The Jordan block of size \(1,2,3\) and \(4\) are, respectively

\begin{equation*} (\lambda),\;\begin{pmatrix}\lambda\amp\\1\amp\lambda\end{pmatrix},\;\begin{pmatrix}\lambda\amp \amp \\1\amp\lambda\amp \\\amp 1\amp\lambda\end{pmatrix},\;\begin{pmatrix}\lambda\amp\amp\amp\\1\amp\lambda\amp\amp\\\amp 1\amp\lambda\amp\\\amp\amp1\amp\lambda\end{pmatrix}. \end{equation*}

Lemma 7.5.9.

Let \(J_\lambda\in M_n(F)\) be a Jordan block.

The characteristic polynomial of \(J_\lambda\text{,}\) \(\chi(t)=(t-\lambda)^n\text{.}\)
The trace of \(J_\lambda\text{,}\) \(tr(J_\lambda)=n\lambda\) and the determinant of \(J_\lambda\text{,}\) \(\det(J_\lambda)=\lambda^n\text{.}\)
We have \(\chi(t)=t^n-tr(J_\lambda)+\cdots+(-1)^n\det(J_\lambda)\text{.}\)
The minimal polynomial of \(J_{\lambda}\) coincides with the characteristic polynomial of \(J_{\lambda}\text{.}\)

Proof.

This follows from the properties of determinant. The third statement is consequence of Lemma A.1.2. We now show the last statement. Suppose that \(J_\lambda\in M_r(F)\text{.}\) Note that \(J_\lambda=\lambda I_r+J_0\text{.}\) We have \(J_0^k=0\) for all \(k\geq r\text{.}\) Since \(\lambda I_r\cdot J_0=J_0\cdot\lambda I_r\) we have

\begin{equation*} \left(J_r-\lambda I_r\right)^k=J_0^k. \end{equation*}

By Proposition 7.2.20, the minimal polynomial divides the characteristic polynomial hence, we get the result.

Linear Algebra: Companion notes

Search Results:

Section 7.5 Cyclic subspaces

Definition 7.5.1. (Principal vector and its order).

Remark 7.5.2.

Example 7.5.3. (An illustrative example).

Definition 7.5.4. (Cyclic subspace).

Proposition 7.5.5.

Proof.

Remark 7.5.6. (Matrix of a cyclic subspace).

Definition 7.5.7. (Jordan block).

Remark 7.5.8.

Lemma 7.5.9.

Proof.