Lecture 11/12: Null Space, Range, and Dimension Theorem
Notes: So the null space is any vector that the transformation turns it into the zero vector. The range just like the definition is the set of the images of all the vectors in \(V\). If all the vectors have an image, then the range is equal to the codomain and that’s when the transformation is onto.
Example
\(N(T^b_a) = \{\)functions in \(C^0(\mathbf{R})\) whos average over \([a,b]\) is \(0\}\). In other words, it’s the set of all functions whose definite integral is 0.
\(R(T^b_a) = \mathbf{R}\). Here we want all functions whose definite integral is some constant/some real number. In other words, this will be the entire codomain (\(\mathbf{R}\)). We call this map \((T^b_a)\) onto.
Proof:
We need to verify the three subspaces properties. To prove that \(R(T)\) is a subspace:
- We need to prove that \(\bar{0}_W \in R(T)\). This means that we need a \(v \in V\) such that \(T(v) = \bar{0}_W\). Since \(T\) is linear then \(T(\bar{0}_V) = \bar{0}_W\). (We proved this in the previous lecture)
- We need \(R(T)\) to be closed under addition. Suppose \(w_1, w_2 \in R(T)\), then \(w_1 = T(v_1)\) and \(w_2 = T(v_2)\) for some \(v_1\) and \(v_2\). Therefore,
$$ \begin{align*} w_1 + w_2 &= T(v_1) + T(v_2) \\ &= T(v_1 + v_2) \ \text{because $$T$$ is linear} \end{align*} $$By definition of a vector space \(v_1 + v_2 \in V\), so the image of the sum \(w_1 + w_2 \in R(T)\) as required.
- We need \(R(T)\) to be closed under multiplication. Suppose \(w_1 \in R(T)\) and \(c\) is a scalar. We need \(cw_1 \in R(T)\). Since \(w_1 \in R(T)\), then there must be a vector \(v_1 \in V\) such that \(w_1 = T(v_1)\). Notice now,
$$ \begin{align*} w_1 &= T(v_1) \\ cw_1 &= cT(v_1) \\ &= T(cv_1) \ \text{because $$T$$ is linear} \end{align*} $$By definition of a vector space \(cv_1 \in V\), so the image of the product \(cv_1 \in R(T)\) as required.
From this, we conclude that the \(R(T)\) is a subspace of \(W\).
Proving that \(N(T)\) is a subspace is an exercise.
We know that \(N(T)\) is a subspace of \(V\). This means that \(\dim(N(T)) \leq \dim(V)\). This theorem tells us that the difference \(\dim(V) - \dim(N(T))\) is the dimension of \(R(T)\). Even if \(W\) is an infinite dimensional space, we know from linearity, it is finite dimensional. Typically, \(\dim(R(T))\) is called the rank of \(T\) and \(\dim(N(T))\) is called the nullity of \(T\).
Example
Consider the map \(T: \mathbf{R}^n \rightarrow \mathbf{R}^n\) where
The null space is the set of vectors where their images are the zero vector. This means that given some vector \(v=(x_1,...,x_n)\), we want all of \((x_1,...x_m)\) to be zero since we already know anything after \(m\) is zero by the definition of the map. Therefore,
Therefore, \(\dim(N(T)) = n - m\). For the range of \(T\),
From this we see that,
Proof:
Let \(T: V \rightarrow W\) be a linear transformation and \(V\) be finite dimensional. Let \(\dim(V) = n\). Let \(\dim(N(T)) = k\). We know that \(N(T)\) is a subspace of \(V\). Therefore, \(k \leq n\) (Theorem 1.11). Since \(\dim(N(T)) = k\), this means that any basis of the null space will have \(k\) elements. So let \(\beta_N = \{u_1,...,u_k\}\) be a basis for \(N(T)\).
(Note: Now that we have a basis for the null space and know its dimension, we need to somehow find a basis for the range to figure out its dimension as well so the plan is to go from the basis we just created to creating a basis for the range. To do this, we’re going to use the refinement and the replacement theorems)
Claim 1: We can extend the basis \(\beta_N\) by \(n-k\) vectors to form a basis of \(V\). To see this, let \(\beta = \{v_1,v_2,...,v_n\}\) be a basis for \(V\) (This is just a general basis. The basis that really we want is the one that would include the vectors in the \(\beta_N\)). We will use the replacement theorem by setting \(\mathcal{S} = \beta\) and \(\mathcal{U} = \beta_N\). These choices work because \(\beta\) generates \(V\) and \(\beta_N\) is a linearly independent subset of \(V\). The replacement theorem implies that \(k \leq n\) which we already know. It also implies that we can add \(n-k\) elements of \(\beta\) to \(\beta_N\) so that the resulting set generates \(V\). Label the newly added elements from \(\beta\) as \(v_1,v_2,...v_{n-k}\) (relabel if necessary). So now we have
Call this set \(\beta_V\). So \(\beta_V = \{u_1,...,u_k,v_1,...v_{n-k}\}\). We claim that \(\beta_V\) is a basis for \(V\). How do we know this? We know that it spans \(V\) by the replacement theorem above. Moreover, it has \(k+n-k = n\) elements which is the dimension of \(V\). To see that the vectors are linearly independent, suppose that they’re not. Since \(\beta_V\) generates \(V\), then by the refinement theorem, we can take away an element and still have a span that generates \(V\). But then if we do take an element out, this means that we’ll have \(n-1\) elements in \(\beta_V\). This is a contradiction since we need at least \(n\) elements to span \(V\). (TODO: What result did we use here exactly?)
(Note: so now we need to relate this to the dimension of the range)
Claim 2: we claim that \(\{T(v_1),...,T(v_{n-k})\}\) is a basis for \(R(T)\). If we prove this, then we’ll be done since,
So now we need to prove that \(\{T(v_1),...,T(v_{n-k})\}\) is a basis. To do so, we need to prove that it generates \(R(T)\) and so \(Span(\{T(v_1),...,T(v_{n-k})\}) = R(T)\). Additionally, we need to prove that the vectors are linearly independent. To see that it generates \((R)\), we know that \(R(T) = \{T(v) \ | \ v \in V\}\). Furthermore, we know that any vector \(v \in V\) an be written as a linear combinations in terms of the elements in \(\beta_V\) and we know that \(T\) is linear. So now we can re-write the definition as,
Next, to see that the set is linearly independent, we need to prove that the following equation
implies that \(a_1 = 0, ..., a_{n-k} = 0\). To see this, we’ll use the linearity of \(T\) where the image of a linear combination is the same as the linear combination of the images to get,
This equation if it holds, it tells us that \(a_1v_1 + ... + a_{n-k}v_{n-k}\) is in the null space \(N(T)\) by definition since it’s image is the zero vector. But we have a basis for the null space which is \(\beta_N\). So now this linear combination must also be in the span of \(\beta_N\). In other words
We claim that this must imply that \(a_1 = 0, ..., a_{n-k} = 0\). Why? Suppose not, then the linear combination \(a_1v_1 + ... + a_{n-k}v_{n-k}\) would be a linear combination of the elements \(u_1,...,u_k\) and so we can write
From the above we see that now we have a linear combination of all the \(v\) vectors and all the \(u\) vectors equal to the zero vector. But all the \(u\) and \(v\) vectors \(\{u_1,...,u_k,v_1,...,v_{n-k}\}\) are part of the basis \(\beta_V\) by construction. So any linear combination of these vectors equaling the zero vector must imply that the coefficients are zero because otherwise they are linearly dependent and this is a contradiction. \(\blacksquare\).
- \(T\) is 1-1.
- \(T\) is onto.
- \(rank(T) = \dim(V)\).
Note here that \(rank(T) = \dim(R(T))\).
References
- Math416 by Ely Kerman