Paul's Online Math Notes
     
 
Online Notes / Linear Algebra / Euclidean n-Space / Linear Transformations
Linear Algebra

You can navigate through this E-Book using the menu to the left. For E-Books that have a Chapter/Section organization each option in the menu to the left indicates a chapter and will open a menu showing the sections in that chapter. Alternatively, you can navigate to the next/previous section or chapter by clicking the links in the boxes at the very top and bottom of the material.

Also, depending upon the E-Book, it will be possible to download the complete E-Book, the chapter containing the current section and/or the current section. You can do this be clicking on the E-Book, Chapter, and/or the Section link provided below.

For those pages with mathematics on them you can, in most cases, enlarge the mathematics portion by clicking on the equation. Click the enlarged version to hide it.

In this section we’re going to take a look at a special kind of function that arises very naturally in the study of Linear Algebra and has many applications in fields outside of mathematics such as physics and engineering.  This section is devoted mostly to the basic definitions and facts associated with this special kind of function.  We will be looking at a couple of examples, but we’ll reserve most of the examples for the next section.

 

Now, the first thing that we need to do is take a step back and make sure that we’re all familiar with some of the basics of functions in general.  A function, f, is a rule (usually defined by an equation) that takes each element of the set A (called the domain) and associates it with exactly one element of a set B (called the codomain).  The notation that we’ll be using to denote our function is

 

 

 

 

When we see this notation we know that we’re going to be dealing with a function that takes elements from the set A and associates them with elements from the set B.  Note as well that it is completely possible that not every element of the set B will be associated with an element from A.  The subset of all elements from B that are associated with elements from A is called the range.

 

In this section we’re going to be looking at functions of the form,

 

 

In other words, we’re going to be looking at functions that take elements/points/vectors from  and associate them with elements/points/vectors from .  These kinds of functions are called transformations and we say that f maps  into .  On an element basis we will also say that f maps the element u from  to the element v from .

 

So, just what do transformations look like?  Consider the following scenario.  Suppose that we have m functions of the following form,

 

 

 

Each of these functions takes a point in , namely , and maps it to the number .  We can now define a transformation  as follows,

 

 

In this way we associate with each point  from  a point  from  and we have a transformation.

 

Let’s take a look at a couple of transformations.

 

 

Example 1  Given

            

define  as,

                                   

 

Note that the second form is more convenient since we don’t actually have to define any of the w’s in that way and is how we will define most of our transformations.

 

We evaluate this just as we evaluate the functions that we’re used to working with.  Namely, pick a point from  and plug into the transformation and we’ll get a point out of the function that is in .  For example,

                                                   

 

Example 2  Define  as .  A sample evaluation of this transformation is,

                                                        

 

Now, in this section we’re going to be looking at a special kind of transformation called a linear transformation.  Here is the definition of a linear transformation.

 

Definition 1  A function  is called a linear transformation if for all u and v in  and all scalars c we have,

                               

 

We looked at two transformations above and only one of them is linear.  Let’s take a look at each one and see what we’ve got.

 

Example 3  Determine if the transformation from Example 2 is linear or not.

 

Solution

Okay, if this is going to be linear then it must satisfy both of the conditions from the definition.  In other words, both of the following will need to be true.

                                          

 

                                  

 

In this case let’s take a look at the second condition.

                                           

 

The second condition is not satisfied and so this is not a linear transformation.  You might want to verify that in this case the first is also not satisfied.  It’s not too bad, but the work does get a little messy.

 

Example 4  Determine if the transformation in Example 1 is linear or not.

 

Solution

To do this one we’re going to need to rewrite things just a little.  The transformation is defined as  where,

                                                              

 

Now, each of the components are given by a system of linear (hhmm, makes one instantly wonder if the transformation is also linear…) equations and we saw in the first chapter that we can always write a system of linear equations in matrix form.  Let’s do that for this system.

 

                                     

 

Now, notice that if we plug in any column matrix x and do the matrix multiplication we’ll get a new column matrix out, w.  Let’s pick a column matrix x totally at random and see what we get.

                                                      

 

Of course, we didn’t pick x completely at random.  Notice that x we choose was the column matrix representation of the point from  that we used in Example 1 to show a sample evaluation of the transformation.  Just as importantly notice that the result, w, is the matrix representation of the point from  that we got out of the evaluation.

 

In fact, this will always be the case for this transformation.  So, in some way the evaluation  is the same as the matrix multiplication  and so we can write the transformation as

                                                                 

Notice that we’re kind of mixing and matching notation here.  On the left x represents a point in   and on the right it is a  matrix.  However, this really isn’t a problem since they both can be used to represent a point in .  We will have to get used to this notation however as we’ll be using it quite regularly.

 

Okay, just what were we after here.  We wanted to determine if this transformation is linear or not.  With this new way of writing the transformation this is actually really simple.  We’ll just make use of some very nice facts that we know about matrix multiplication.  Here is the work for this problem

                                    

                                                

 

So, both conditions of the definition are met and so this transformation is a linear transformation.

 

There are a couple of things to note here.  First, we couldn’t write the transformation from Example 2 as a matrix multiplication because at least one of the equations (okay both in this case) for the components in the result were not linear.

 

Second, when all the equations that give the components of the result are linear then the transformation will be linear.  If at least one of the equations are not linear then the transformation will not be linear either.

 

Now, we need to investigate the idea that we used in the previous example in more detail.  There are two issues that we want to take a look at. 

 

First, we saw that, at least in some cases, matrix multiplication can be thought of as a linear transformation.  As the following theorem shows, this is in fact always the case.

 

Theorem 1  If A is an  matrix then its induced transformation, , defined as,

                                                                

is a linear transformation.

 

Proof : The proof here is really simple and in fact we pretty much saw it last example.

 

 

 

 

 

So, the induced function, , satisfies both the conditions in the definition of a linear transformation and so it is a linear transformation.

Pf_Box

 

So, any time we do matrix multiplication we can also think of the operation as evaluating a linear transformation.

 

The other thing that we saw in Example 4 is that we were able, in that case, to write a linear transformation as a matrix multiplication.  Again, it turns out that every linear transformation can be written as a matrix multiplication.

 

Theorem 2  Let  be a linear transformation, then there is an  matrix such that  (recall that  is the transformation induced by A). 

 

The matrix A is called the matrix induced by T and is sometimes denoted as .

 

Proof : First let,

 

 

be the standard basis vectors for  and define A to be the  matrix whose ith column is .  In other words, A is given by,

 

 

 

Next let x be any vector from .  We know that we can write x in terms of the standard basis vectors as follows,

 

 

 

In order to prove this theorem we’re going to need to show that for any x (which we’ve got a nice general one above) we will have .  So, let’s start off and plug x into T using the general form as written out above.

 

 

 

Now, we know that T is a linear transformation and so we can break this up at each of the “+”’s as follows,

 

 

 

Next, each of the  ’s are scalars and again because T is a linear transformation we can write this as,

 

 

 

Next, let’s notice that this is nothing more than the following matrix multiplication.

 

 

 

But the first matrix nothing more than A and the second is just x and we when we define A as we did above we will get,

 

 

and so we’ve proven what we needed to.

Pf_Box

 

In this proof we used the standard basis vectors to define the matrix A.  As we will see in a later chapter there are other choices of vectors that we could use here and these will produce a different induced matrix, A, and we do need to remember that.  However, when we use the standard basis vectors to define A, as we’re going to in this chapter, then we don’t actually need to evaluate T at each of the basis vectors as we did in the proof.  All we need to do is what we did in Example 4, write down the coefficient matrix for the system of equations that we get by writing out each of the components as individual equations. 

 

Okay, we’ve done a lot of work in this section and we haven’t really done any examples so we should probably do a couple of them.  Note that we are saving most of the examples for the next section, so don’t expect a lot here.  We’re just going to do a couple so we can say we’ve done a couple.

 

Example 5  The zero transformation is the transformation  that maps every vector x in  to the zero vector in , i.e. .  The matrix induced by this transformation is the  zero matrix, 0 since,

                                                        

 

To make it clear we’re using the zero transformation we usually denote it by .

 

Example 6  The identity transformation is the transformation  (yes they are both  ) that maps every x to itself, i.e. .  The matrix induced by this transformation is the  identity matrix,  since,

                                                       

 

We’ll usually denote the identity transformation as  to make it clear we’re working with it.

 

So, the two examples above are standard examples and we did need them taken care of.  However, they aren’t really very illustrative for seeing how to construct the matrix induced by the transformation.  To see how this is done, let’s take a look at some reflections in .  We’ll look at reflections in  in the next section.

 

Example 7  Determine the matrix induced by the following reflections.

(a) Reflection about the x-axis.   [Solution]

(b) Reflection about the y-axis.   [Solution]

(c) Reflection about the line .   [Solution]

 

Solution

Note that all of these will be linear transformations of the form .

 

(a) Reflection about the x-axis.

 

Let’s start off with a sketch of what we’re looking for here.

Trans_Ex7_G1

So, from this sketch we can see that the components of the for the translation (i.e. the equations that will map x into w) are,

                                                                   

Remember that  will be the first component of the transformed point and  will be the second component of the transformed point.

 

Now, just as we did in Example 4 we can write down the matrix form of this system.

                                                        

So, it looks like the matrix induced by this reflection is,

                                                                  

[Return to Problems]

 

(b) Reflection about the y-axis.

 

We’ll do this one a little quicker.  Here’s a sketch and the equations for this reflection.

Trans_Ex7_G2

 

                                                                   

The matrix induced by this reflection is,

                                                                  

[Return to Problems]

 

(c) Reflection about the line .

 

Here’s the sketch and equations for this reflection.

Trans_Ex7_G3

                                                                    

The matrix induced by this reflection is,

                                                                    

[Return to Problems]

 

Hopefully, from these examples you’re starting to get a feel for how we arrive at the induced matrix for a linear transformation.  We’ll be seeing more of these in the next section, but for now we need to move on to some more ideas about linear transformations.

 

Let’s suppose that we have two linear transformations induced by the matrices A and B,  and .  If we take any x out of   will map x into .  In other words,  will be in  and notice that we can then apply  to this and its image will be in .  In summary, if we take x out of  and first apply  to x and then apply  to the result we will have a transformation from  to .

 

This process is called composition of transformations and is denoted as

 

Note that the order here is important.  The first transformation to be applied is on the right and the second is on the left.

 

Now, because both of our original transformations were linear we can do the following,

 

 

and so the composition  is the same as multiplication by BA.  This means that the composition will be a linear transformation provided the two original transformations were also linear.

 

Note as well that we can do composition with as many transformations as we want provided all the spaces correctly match up.  For instance with three transformations we require the following three transformations,

 

 

 and in this case the composition would be,

 

 

 

Let’s take a look at a couple of examples.

 

Example 8  Determine the matrix inducted by the composition of reflection about the y-axis followed by reflection about the x-axis.

Solution

First, notice that reflection about the y-axis should change the sign on the x coordinate and following this by a reflection about the x-axis should change the sign on the y coordinate.

 

The two transformations here are,

                 

The matrix induced by the composition is then,

                                     

 

Let’s take a quick look at what this does to a point.  Given x in  we have,

                                             

This is what we expected to get.  This is often called reflection about the origin.

 

Example 9  Determine the matrix inducted by the composition of reflection about the y-axis followed by another reflection about the y-axis.

 

Solution

In this case if we reflect about the y-axis twice we should end right back where we started.

 

The two transformations in this case are,

                 

The induced matrix is,

                                   

 

So, the composition of these two transformations yields the identity transformation.  So,

                                                      

and the composition will not change the original x as we guessed.


Online Notes / Linear Algebra / Euclidean n-Space / Linear Transformations

[Contact Me] [Links] [Privacy Statement] [Site Map] [Terms of Use] [Menus by Milonic]

© 2003 - 2009 Paul Dawkins