Paul's Online Math Notes
     
 
Online Notes / Linear Algebra / Vector Spaces / Orthonormal Basis
Linear Algebra

You can navigate through this E-Book using the menu to the left. For E-Books that have a Chapter/Section organization each option in the menu to the left indicates a chapter and will open a menu showing the sections in that chapter. Alternatively, you can navigate to the next/previous section or chapter by clicking the links in the boxes at the very top and bottom of the material.

Also, depending upon the E-Book, it will be possible to download the complete E-Book, the chapter containing the current section and/or the current section. You can do this be clicking on the E-Book, Chapter, and/or the Section link provided below.

For those pages with mathematics on them you can, in most cases, enlarge the mathematics portion by clicking on the equation. Click the enlarged version to hide it.

We now need to come back and revisit the topic of basis.  We are going to be looking at a special kind of basis in this section that can arise in an inner product space, and yes it does require an inner product space to construct.  However, before we do that we’re going to need to get some preliminary topics out of the way first.

 

We’ll first need to get a set of definitions out of way.

 

Definition 1  Suppose that S is a set of vectors in an inner product space. 

(a) If each pair of distinct vectors from S is orthogonal then we call S an orthogonal set.

(b) If S is an orthogonal set and each of the vectors in S also has a norm of 1 then we call S an orthonormal set.

 

Let’s take a quick look at an example.

 

Example 1  Given the three vectors ,  and  in  answer each of the following.

(a) Show that they form an orthogonal set under the standard Euclidean inner product for  but not an orthonormal set.   [Solution]

(b) Turn them into a set of vectors that will form an orthonormal set of vectors under the standard Euclidean inner product for .   [Solution]

 

Solution

(a) Show that they form an orthogonal set under the standard Euclidean inner product for  but not an orthonormal set.

 

All we need to do here to show that they form an orthogonal set is to compute the inner product of all the possible pairs and show that they are all zero.

                                         

So, they do form an orthogonal set.  To show that they don’t form an orthonormal set we just need to show that at least one of them does not have a norm of 1.  For the practice we’ll compute all the norms.

                                          

 

So, one of them has a norm of 1, but the other two don’t and so they are not an orthonormal set of vectors.

[Return to Problems]

 

 

(b) Turn them into a set of vectors that will form an orthonormal set of vectors under the standard Euclidean inner product for .

 

We’ve actually done most of the work here for this part of the problem already.  Back when we were working in  we saw that we could turn any vector v into a vector with norm 1 by dividing by its norm as follows,

                                                                     

This new vector will have a norm of 1.  So, we can turn each of the vectors above into a set of vectors with norm 1.

                                       

 

All that remains is to show that this new set of vectors is still orthogonal.  We’ll leave it to you to verify that,

 

and so we have turned the three vectors into a set of vectors that form an orthonormal set.

[Return to Problems]

 

We have the following very nice fact about orthogonal sets.

 

Theorem 1  Suppose  is an orthogonal set of non-zero vectors in an inner product space, then S is also a set of linearly independent vectors.

 

Proof : Note that we need the vectors to be non-zero vectors because the zero vector could be in a set of orthogonal vectors and yet we know that if a set includes the zero vector it will be linearly dependent.

 

So, now that we know there is a chance that these vectors are linearly independent (since we’ve excluded the zero vector) let’s form the equation,

 

 

 

and we’ll need to show that the only scalars that work here are , , … , .

 

In fact, we can do this in a single step.  All we need to do is take the inner product of both sides with respect to , , and then use the properties of inner products to rearrange things a little.

 

 

 

Now, because we know the vectors in S are orthogonal we know that  if  and so this reduced down to,

 

 

 

Next, since we know that the vectors are all non-zero we have  and so the only way that this can be zero is if .  So, we’ve shown that we must have , , … ,  and so these vectors are linearly independent.

Pf_Box

 

Okay, we are now ready to move into the main topic of this section.  Since a set of orthogonal vectors are also linearly independent if they just happen to span the vector space we are working on they will also form a basis for the vector space. 

 

Definition 2  Suppose that  is a basis for an inner product space.

(a) If S is also an orthogonal set then we call S an orthogonal basis.

(b) If S is also an orthonormal set then we call S an orthonormal basis.

 

Note that we’ve been using an orthonormal basis already to this point.  The standard basis vectors for  are an orthonormal basis.

 

The following fact gives us one of the very nice properties about orthogonal/orthonormal basis.

 

Theorem 2  Suppose that  is an orthogonal basis for an inner product space and that u is any vector from the inner product space then,

                                        

 

If in addition S is in fact an orthonormal basis then,

 

 

Proof : We’ll just show that the first formula holds.  Once we have that the second will follow directly from the fact that all the vectors in an orthonormal set have a norm of 1.

 

So, given u we need to find scalars , , … ,  so that,

 

 

 

To find these scalars simply take the inner product of both sides with respect to , .

 

 

 

Now, since we have an orthogonal basis we know that  if  and so this reduces to,

 

 

 

Also, because  is a basis vector we know that it isn’t the zero vector and so we also know that .  This then gives us,

 

 

 

However, from the definition of the norm we see that we can also write this as,

 

 

 

and so we’re done.

Pf_Box

 

What this theorem is telling us is that for any vector in an inner product space, with an orthogonal/orthonormal basis, it is very easy to write down the linear combination of basis vectors for that vector.  In other words, we don’t need to go through all the work to find the linear combinations that we were doing in earlier sections.

 

We would like to be able to construct an orthogonal/orthonormal basis for a finite dimensional vector space given any basis of that vector space.  The following two theorems will help us to do that.

 

Theorem 3  Suppose that W is a finite dimensional subspace of an inner product space V  and further suppose that u is any vector in V.  Then u can be written as,

                                                        

where  is a vector that is in W and is called the orthogonal projection of u on W and  is a vector in  (the orthogonal complement of W) and is called the component of u orthogonal to W.

 

Note that this theorem is really an extension of the idea of projections that we saw when we first introduced the concept of the dot product.  Also note that  can be easily computed from  by,

 

 

 

This theorem is not really the one that we need to construct an orthonormal basis.  We will use portions of this theorem, but we needed it more to acknowledge that we could do projections and to get the notation out of the way.  The following theorem is the one that will be the main workhorse of the process.

 

Theorem 4  Suppose that W is a finite dimensional subspace of an inner product space V.  Further suppose that W has an orthogonal basis  and that u is any vector in V then,

                                   

 

If in addition S is in fact an orthonormal basis then,

 

 

So, just how does this theorem help us to construct an orthogonal/orthonormal basis?  The following process, called the Gram-Schmidt process, will construct an orthogonal/orthonormal basis for a finite dimensional inner product space given any basis.  We’ll also be able to develop some very nice facts about the basis that we’re going to be constructing as we go through the construction process.

 

Gram-Schmidt Process

Suppose that V is a finite dimensional inner product space and that  is a basis for V.  The following process will construct an orthogonal basis for V, .  To find an orthonormal basis simply divide the  ’s by their norms.

 

Step 1 : Let .

 

Step 2 :  Let  and then define  (i.e.  is the portion of  that is orthogonal to  ).  Technically, this is all there is to step 2 (once we show that  anyway) since  will be orthogonal to  because it is in .  However, this isn’t terribly useful from a computational standpoint.  Using the result of Theorem 3 and the formula from Theorem 4 gives us the following formula for ,

 

 

 

Next, we need to verify that  because the zero vector cannot be a basis vector.  To see that  assume for a second that we do have .  This would give us,

 

 

But this tells us that  is a multiple of  which we can’t have since they are both basis vectors and are hence linearly independent.  So, .

 

Finally, let’s observe an interesting consequence of how we found .  Both  and  are orthogonal and so are linearly independent by Theorem 1 above and this means that they are a basis for the subspace  and this subspace has dimension of 2.  However, they are also linear combinations of  and  and so  is a subspace of   which also has dimension 2.  Therefore, by Theorem 9 from the section on Basis we can see that we must in fact have,

 

 

 

So, the two new vectors,  and , will in fact span the same subspace as the two original vectors,  and , span.  This is a nice consequence of the Gram-Schmidt process.

 

Step 3 : This step is really an extension of Step 2 and so we won’t go into quite the detail here as we did in Step 2.  First, define  and then define  and so  will be the portion of  that is orthogonal to both  and . We can compute  as follows,

 

 

 

Next, both   and  are linear combinations of  and  and so  can be thought of as a linear combination of , , and .  Then because , , and  are linearly independent we know that we must have .  You should probably go through the steps of verifying the claims made here for the practice.

 

With this step we can also note that because  is in the orthogonal complement of  (by construction) and because we know that,

 

 

from the previous step we know as well that  must be orthogonal to all vectors in .  In particular  must be orthogonal to  and .

 

Finally, following an argument similar to that in Step 2 we get that,

 

 

 

Step 4 : Continue in this fashion until we’ve found .

 

There is the Gram-Schmidt process.  Going through the process above, with all the explanation as we provided, can be a little daunting and can make the process look more complicated than it in fact is.  Let’s summarize the process before we go onto a couple of examples.

 

Gram-Schmidt Process

Suppose that V is a finite dimensional inner product space and that  is a basis for V then an orthogonal basis for V, , can be found using the following process.

 

 

 

 

 

 

 

To convert the basis to an orthonormal basis simply divide all the new basis vectors by their norm.  Also, due to the construction process we have

                           

and  will be orthogonal to  for .

 

Okay, let’s go through a couple of examples here.

 

Example 2  Given that , , and  is a basis of  and assuming that we’re working with the standard Euclidean inner product construct an orthogonal basis for .

 

Solution

You should verify that the set of vectors above is in fact a basis for .  Now, we’ll need to go through the Gram-Schmidt process a couple of times.  The first step is easy.

 

 

The remaining two steps are going to involve a little more work, but won’t be all that bad.  Here is the formula for the second vector in our orthogonal basis.

                                                         

and here is all the quantities that we’ll need.

                                                

 

The second vector is then,

                                          

 

The formula for the third (and final vector) is,

                                               

and here are the quantities that we need for this step.

                  

 

The third vector is then,

                             

 

So, the orthogonal basis that we’ve constructed is,

                       

 

You should verify that these do in fact form an orthogonal set.

 

Example 3  Given that , , and  is a basis of  and assuming that we’re working with the standard Euclidean inner product construct an orthonormal basis for .

 

Solution

First, note that this is almost the same problem as the previous one except this time we’re looking for an orthonormal basis instead of an orthogonal basis.  There are two ways to approach this.  The first is often the easiest way and that is to acknowledge that we’ve got a orthogonal basis and we can turn that into an orthonormal basis simply by dividing by the norms of each of the vectors.  Let’s do it this way and see what we get.

 

Here are the norms of the vectors from the previous example.

                                       

Note that in order to eliminate as many square roots as possible we rationalized the denominators of the fractions here.

 

Dividing by the norms gives the following set of vectors.

 

       

 

Okay that’s the first way to do it.  The second way is to go through the Gram-Schmidt process and this time divide by the norm as we find each new vector.  This will have two effects.  First, it will put a fair amount of roots into the vectors that we’ll need to work with.  Second, because we are turning the new vectors into vectors with length one the norm in the Gram-Schmidt formula will also be 1 and so isn’t needed.

 

Let’s go through this once just to show you the differences.

 

The first new vector will be,

                                       

 

Now, to get the second vector we first need to compute,

                                            

however we won’t call it  yet since we’ll need to divide by it’s norm once we’re done.  Also note that we’ve acknowledged that the norm of  is 1 and so we don’t need that in the formula.  Here is the dot product that we need for this step.

                                                               

 

Here is the new orthogonal vector.

                                    

Notice that this is the same as the second vector we found in Example 2.  In this case we’ll need to divide by its norm to get the vector that we want in this case.

                                

 

Finally, for the third orthogonal vector the formula will be,

                                                

and again we’ve acknowledged that the norms of the first two vectors will be 1 and so aren’t needed in this formula.  Here are the dot products that we’ll need.

                                           

The orthogonal vector is then,

           

Again, this is the third orthogonal vector that we found in Example 2.  Here is the final step to get our third orthonormal vector for this problem.

                                    

 

So, we got exactly the same vectors as if we did when we just used the results of Example 2.  Of course that is something that we should expect to happen here.

 

So, as we saw in the previous example there are two ways to get an orthonormal basis from any given basis.  Each has its pros and cons and you’ll need to decide which method to use.  If we first compute the orthogonal basis and the divide all of them at then end by their norms we don’t have to work much with square roots, however we do need to compute norms that we won’t need otherwise.  Again, it will be up to you to determine what the best method for you to use is.

 

Example 4  Given that , ,  and  is a basis of  and assuming that we’re working with the standard Euclidean inner product construct an orthonormal basis for .

 

Solution

Now, we’re looking for an orthonormal basis and so we’ve got our two options on how to proceed here.  In this case we’ll construct an orthogonal basis and then convert that into an orthonormal basis at the very end.

 

The first vector is,

                                                           

 

Here’s the dot product and norm we need for the second vector.

                                                

The second orthogonal vector is then,

                                        

 

For the third vector we’ll need the following dot products and norms

               

and the third orthogonal vector is,

                         

 

Finally, for the fourth orthogonal vector we’ll need,

                            

and the fourth vector in out new orthogonal basis is,

           

 

Okay, the orthogonal basis is then,

           

 

Next, we’ll need their norms so we can turn this set into an orthonormal basis.

                 

 

The orthonormal basis is then,

                                        

 

Now, we saw how to expand a linearly independent set of vectors into a basis for a vector space.  We can do the same thing here with orthogonal sets of vectors and the Gram-Schmidt process.

 

Example 5  Expand the vectors  and  into an orthogonal basis for  and assume that we’re working with the standard Euclidean inner product.

 

Solution

First notice that the two vectors are already orthogonal and linearly independent.  Since they are linearly independent and we know that a basis for  will contain 3 vectors we know that we’ll only need to add in one more vector.  Next, since they are already orthogonal that will simplify some of the work.

 

Now, recall that in order to expand a linearly independent set into a basis for a vector space we need to add in a vector that is not in the span of the original vectors.  Doing so will retain the linear independence of the set.  Since both of these vectors have a zero in the second term we can add in any of the following to the set.

                                

 

If we used the first one we’d actually have an orthogonal set without any work, but that would be boring and defeat the purpose of the example.  To make our life at least somewhat easier with the work let’s add in the fourth on to get the set of vectors.

                            

 

Now, we know these are linearly independent and since there are three vectors by Theorem 6 from the section on Basis we know that they form a basis for .  However, they don’t form an orthogonal basis.

 

To get an orthogonal basis we would need to perform Gram-Schmidt on the set.  However, since the first two vectors are already orthogonal performing Gram-Schmidt would not have any affect (you should verify this).  So, let’s just rename the first two vectors as,

                                             

and then just perform Gram-Schmidt for the third vector.  Here are the dot products and norms that we’ll need.

                

 

The third vector will then be,

                                   


Online Notes / Linear Algebra / Vector Spaces / Orthonormal Basis

[Contact Me] [Links] [Privacy Statement] [Site Map] [Terms of Use] [Menus by Milonic]

© 2003 - 2009 Paul Dawkins