David Austin
Grand Valley State University
david at merganser.math.gvsu.edu
Introduction
The topic of this article, the singular value decomposition, is one that should be a central part of an undergraduate mathematics major's linear algebra curriculum, yet it all too often slips through the cracks.
A singular value decomposition provides a convenient way of breaking a matrix, which perhaps contains some data we are interested in, into simpler, meaningful pieces. In this article, we will offer a geometric explanation of singular value decompositions and look at some of their applications.
The geometry of linear transformations
Let us begin by looking at some simple matrices, namely those with two rows and two columns. Our first example is the diagonal matrix

$$M = \begin{pmatrix} 3 & 0 \\ 0 & 1 \end{pmatrix}.$$

Geometrically, we may think of a matrix like this as taking a point $(x, y)$ in the plane and transforming it into another point using matrix multiplication:

$$\begin{pmatrix} 3 & 0 \\ 0 & 1 \end{pmatrix}\begin{pmatrix} x \\ y \end{pmatrix} = \begin{pmatrix} 3x \\ y \end{pmatrix}.$$
The effect of this transformation is shown below: the plane is horizontally stretched by a factor of 3, while there is no vertical change.
Now let's look at

$$M = \begin{pmatrix} 2 & 1 \\ 1 & 2 \end{pmatrix},$$
which produces this effect
It is not so clear how to describe simply the geometric effect of the transformation. However, let's rotate our grid through a 45 degree angle and see what happens.
Ah ha. We see now that this new grid is transformed in the same way that the original grid was transformed by the diagonal matrix: the grid is stretched by a factor of 3 in one direction.
This is a very special situation that results from the fact that the matrix $M$ is symmetric; that is, $M$ equals its own transpose, $M = M^T$.

Said with more mathematical precision, given a symmetric matrix $M$, we may find a set of orthogonal vectors $\mathbf{v}_i$ so that

$$M\mathbf{v}_i = \lambda_i \mathbf{v}_i,$$

where $\lambda_i$ is a scalar; in other words, the $\mathbf{v}_i$ are eigenvectors of $M$ with eigenvalues $\lambda_i$. Geometrically, this means that each vector $\mathbf{v}_i$ is simply stretched, and possibly reflected, when it is multiplied by $M$.
If we use the eigenvectors of a symmetric matrix to align the grid, the matrix stretches and reflects the grid in the same way that it does the eigenvectors.
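To make this concrete, here is a minimal numerical sketch, in Python with NumPy (neither appears in the original article), that computes the eigenvectors of the matrix above and checks that multiplication by $M$ merely stretches them:

```python
import numpy as np

# The symmetric matrix discussed above.
M = np.array([[2.0, 1.0],
              [1.0, 2.0]])

# For a symmetric matrix, eigh returns real eigenvalues (in ascending order)
# and orthonormal eigenvectors as the columns of the second output.
eigenvalues, eigenvectors = np.linalg.eigh(M)

print(eigenvalues)    # [1. 3.]
print(eigenvectors)   # columns point along the 45-degree directions

# Each eigenvector is simply stretched (or reflected) by its eigenvalue.
for lam, v in zip(eigenvalues, eigenvectors.T):
    assert np.allclose(M @ v, lam * v)
```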
The geometric description we gave for this linear transformation is a simple one: the grid is simply stretched in one direction. For more general matrices, we will ask if we can find an orthogonal grid that is transformed into another orthogonal grid. Let's consider a final example using a matrix that is not symmetric:
$$M = \begin{pmatrix} 1 & 1 \\ 0 & 1 \end{pmatrix}.$$

This matrix produces the geometric effect known as a shear.
It's easy to find one family of eigenvectors along the horizontal axis. However, our figure above shows that these eigenvectors cannot be used to create an orthogonal grid that is transformed into another orthogonal grid. Nonetheless, let's see what happens when we rotate the grid first by 30 degrees,
Notice that the angle at the origin formed by the red parallelogram on the right has increased. Let's next rotate the grid by 60 degrees.
Hmm. It appears that the grid on the right is now almost orthogonal. In fact, if we rotate the grid in the domain by an angle of roughly 58.28 degrees, both grids become orthogonal.
The singular value decomposition
This is the geometric essence of the singular value decomposition for $2\times 2$ matrices: for any $2\times 2$ matrix, we may find an orthogonal grid that is transformed into another orthogonal grid.
We will express this fact using vectors: with an appropriate choice of orthogonal unit vectors $\mathbf{v}_1$ and $\mathbf{v}_2$, the vectors $M\mathbf{v}_1$ and $M\mathbf{v}_2$ are also orthogonal.

We will use $\mathbf{u}_1$ and $\mathbf{u}_2$ to denote unit vectors in the direction of $M\mathbf{v}_1$ and $M\mathbf{v}_2$. The lengths of $M\mathbf{v}_1$ and $M\mathbf{v}_2$, denoted by $\sigma_1$ and $\sigma_2$, describe the amount that the grid is stretched in those particular directions; these numbers are called the singular values of $M$.

We therefore have

$$M\mathbf{v}_1 = \sigma_1\mathbf{u}_1, \qquad M\mathbf{v}_2 = \sigma_2\mathbf{u}_2.$$
We may now give a simple description of how the matrix $M$ treats a general vector $\mathbf{x}$. Since $\mathbf{v}_1$ and $\mathbf{v}_2$ are orthogonal unit vectors, we have

$$\mathbf{x} = (\mathbf{v}_1\cdot\mathbf{x})\,\mathbf{v}_1 + (\mathbf{v}_2\cdot\mathbf{x})\,\mathbf{v}_2.$$

This means that

$$M\mathbf{x} = (\mathbf{v}_1\cdot\mathbf{x})\,M\mathbf{v}_1 + (\mathbf{v}_2\cdot\mathbf{x})\,M\mathbf{v}_2 = (\mathbf{v}_1\cdot\mathbf{x})\,\sigma_1\mathbf{u}_1 + (\mathbf{v}_2\cdot\mathbf{x})\,\sigma_2\mathbf{u}_2.$$
Remember that the dot product may be computed using the vector transpose, $\mathbf{v}\cdot\mathbf{x} = \mathbf{v}^T\mathbf{x}$, which leads to

$$M\mathbf{x} = \mathbf{u}_1\sigma_1\mathbf{v}_1^T\mathbf{x} + \mathbf{u}_2\sigma_2\mathbf{v}_2^T\mathbf{x},$$

and therefore

$$M = \mathbf{u}_1\sigma_1\mathbf{v}_1^T + \mathbf{u}_2\sigma_2\mathbf{v}_2^T.$$

This is usually expressed by writing

$$M = U\Sigma V^T,$$

where $U$ is a matrix whose columns are the vectors $\mathbf{u}_1$ and $\mathbf{u}_2$, $\Sigma$ is a diagonal matrix whose entries are $\sigma_1$ and $\sigma_2$, and $V$ is a matrix whose columns are $\mathbf{v}_1$ and $\mathbf{v}_2$.
This shows how to decompose the matrix $M$ into the product of three matrices: $V$ describes an orthonormal basis in the domain, $U$ describes an orthonormal basis in the co-domain, and $\Sigma$ describes how much the vectors in $V$ are stretched to give the vectors in $U$.
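As a quick numerical check of this decomposition, the following sketch (Python/NumPy, an addition to the article's exposition) factors the shear matrix from the earlier example, recovers its singular values, and confirms the roughly 58.28 degree direction in the domain:

```python
import numpy as np

# The shear matrix from the example above.
M = np.array([[1.0, 1.0],
              [0.0, 1.0]])

# NumPy returns M = U @ np.diag(S) @ Vt with singular values in decreasing order.
U, S, Vt = np.linalg.svd(M)

print(S)  # approximately [1.618, 0.618]: the golden ratio and its reciprocal

# The first right singular vector v1 is the first row of Vt; its direction
# makes an angle of roughly 58.28 degrees with the horizontal axis.
v1 = Vt[0]
angle = np.degrees(np.arctan2(abs(v1[1]), abs(v1[0])))
print(angle)

# Verify the factorization M = U Σ V^T.
assert np.allclose(M, U @ np.diag(S) @ Vt)
```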
How do we find the singular value decomposition?
The power of the singular value decomposition lies in the fact that we may find it for any matrix. How do we do it? Let's look at our earlier example and add the unit circle in the domain. Its image is an ellipse whose major and minor axes define the orthogonal grid in the co-domain.

Notice that the major and minor axes of the ellipse are defined by $M\mathbf{v}_1$ and $M\mathbf{v}_2$; that is, they are the longest and shortest vectors $M\mathbf{x}$ as $\mathbf{x}$ ranges over the unit circle.

In other words, the function $|M\mathbf{x}|$ on the unit circle has a maximum at $\mathbf{v}_1$ and a minimum at $\mathbf{v}_2$. This reduces the problem to a fairly standard calculus problem: we wish to optimize a function over the unit circle. It turns out that the critical points of this function occur at the eigenvectors of the matrix $M^TM$. Since this matrix is symmetric, eigenvectors corresponding to different eigenvalues are orthogonal, and this gives us the family of orthogonal vectors $\mathbf{v}_i$.

The singular values are then given by $\sigma_i = |M\mathbf{v}_i|$, and the vectors $\mathbf{u}_i$ are defined to be unit vectors in the direction of $M\mathbf{v}_i$. But why are the vectors $M\mathbf{v}_i$ orthogonal?
To explain this, we will assume that $\sigma_i$ and $\sigma_j$ are distinct singular values. Remember that the vectors $\mathbf{v}_i$ are eigenvectors of $M^TM$, say with

$$M^TM\,\mathbf{v}_i = \lambda_i\mathbf{v}_i, \qquad M^TM\,\mathbf{v}_j = \lambda_j\mathbf{v}_j.$$

Let's begin by looking at the expression $M\mathbf{v}_i\cdot M\mathbf{v}_j$ and evaluating it in two ways. On one hand, writing the dot product with transposes gives

$$M\mathbf{v}_i\cdot M\mathbf{v}_j = \mathbf{v}_i^T M^TM\,\mathbf{v}_j = \mathbf{v}_i\cdot M^TM\,\mathbf{v}_j = \lambda_j\,\mathbf{v}_i\cdot\mathbf{v}_j.$$

On the other hand, we have

$$M\mathbf{v}_i\cdot M\mathbf{v}_j = M^TM\,\mathbf{v}_i\cdot\mathbf{v}_j = \lambda_i\,\mathbf{v}_i\cdot\mathbf{v}_j.$$

Therefore, $\lambda_i\,\mathbf{v}_i\cdot\mathbf{v}_j = \lambda_j\,\mathbf{v}_i\cdot\mathbf{v}_j$. Since $\lambda_i \neq \lambda_j$, this forces $\mathbf{v}_i\cdot\mathbf{v}_j = 0$, and consequently $M\mathbf{v}_i\cdot M\mathbf{v}_j = \lambda_j\,\mathbf{v}_i\cdot\mathbf{v}_j = 0$ as well. In other words, the vectors $\mathbf{v}_i$ are orthogonal, and so are the vectors $M\mathbf{v}_i$.
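The procedure just described can be carried out directly in a few lines. The sketch below (Python/NumPy, again an illustration rather than part of the original article) finds the orthogonal eigenvectors of $M^TM$, sets $\sigma_i = |M\mathbf{v}_i|$, and takes $\mathbf{u}_i$ to be unit vectors in the direction of $M\mathbf{v}_i$; it assumes all singular values are non-zero.

```python
import numpy as np

# The shear matrix from the running example.
M = np.array([[1.0, 1.0],
              [0.0, 1.0]])

# The v_i are orthogonal eigenvectors of the symmetric matrix M^T M;
# eigh returns them as the columns of V (eigenvalues in ascending order).
eigenvalues, V = np.linalg.eigh(M.T @ M)

# Reorder so that the largest singular value comes first.
order = np.argsort(eigenvalues)[::-1]
V = V[:, order]

# The singular values are the lengths of M v_i ...
sigma = np.linalg.norm(M @ V, axis=0)

# ... and the u_i are unit vectors in the direction of M v_i.
U = (M @ V) / sigma

# We have recovered a valid decomposition M = U Σ V^T.
assert np.allclose(M, U @ np.diag(sigma) @ V.T)
```

When a singular value is zero, the corresponding column of $U$ has to be chosen separately, which is one reason this naive construction is only a sketch.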
In practice, this is not the procedure used to find the singular value decomposition of a matrix since it is not particularly efficient or well-behaved numerically.
Another example
Let's now look at a singular matrix, that is, one whose determinant is zero.
The geometric effect of this matrix is the following:
In this case, the second singular value is zero, so that we may write

$$M = \mathbf{u}_1\sigma_1\mathbf{v}_1^T.$$

In other words, if some of the singular values are zero, the corresponding terms do not appear in the decomposition for $M$.
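The specific singular matrix from the figure is not reproduced here, but the same behavior can be seen with any rank-one matrix; the sketch below (Python/NumPy) uses one chosen purely for illustration:

```python
import numpy as np

# A rank-one (singular) matrix chosen for illustration:
# the second column is a multiple of the first.
M = np.array([[1.0, 2.0],
              [2.0, 4.0]])

U, S, Vt = np.linalg.svd(M)
print(S)  # the second singular value is (numerically) zero

# Only the term with a non-zero singular value contributes.
M_reconstructed = S[0] * np.outer(U[:, 0], Vt[0])
assert np.allclose(M, M_reconstructed)
```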
Data compression
Singular value decompositions can be used to represent data efficiently. Suppose, for instance, that we wish to transmit the following image, which consists of an array of $15\times 25$ black and white pixels.
Since there are only three types of columns in this image, as shown below, it should be possible to represent the data in a more compact form.
We will represent the image as a $15\times 25$ matrix $M$ whose entries are either 0, corresponding to a white pixel, or 1, corresponding to a black pixel.
If we perform a singular value decomposition on $M$, we find that there are only three non-zero singular values, $\sigma_1$, $\sigma_2$, and $\sigma_3$. Therefore, the matrix may be represented as

$$M = \mathbf{u}_1\sigma_1\mathbf{v}_1^T + \mathbf{u}_2\sigma_2\mathbf{v}_2^T + \mathbf{u}_3\sigma_3\mathbf{v}_3^T.$$
This means that we have three vectors $\mathbf{u}_i$, three vectors $\mathbf{v}_i$, and three singular values $\sigma_i$. Each term $\mathbf{u}_i\sigma_i\mathbf{v}_i^T$ requires $15 + 25 + 1 = 41$ numbers, so we may represent the matrix using only 123 numbers rather than the 375 that appear in the matrix. In this way, the singular value decomposition discovers the redundancy in the matrix and provides a format for eliminating it.
Why are there only three non-zero singular values? Remember that the number of non-zero singular values equals the rank of the matrix. In this case, we see that there are three linearly independent columns in the matrix, which means that the rank will be three.
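A sketch of this idea in Python/NumPy is shown below. The matrix here is a stand-in with three distinct column types rather than the article's actual image, but it illustrates how the rank shows up as the number of non-zero singular values and how few numbers the decomposition needs:

```python
import numpy as np

# A stand-in 15 x 25 "image" built from only three distinct column types,
# so its rank -- and its number of non-zero singular values -- is three.
col_a = np.array([1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 1, 1, 1, 1], dtype=float)
col_b = np.array([1, 0, 0, 0, 0, 1, 1, 1, 1, 1, 0, 0, 0, 0, 1], dtype=float)
col_c = np.ones(15)
M = np.column_stack([col_a] * 5 + [col_b] * 10 + [col_c] * 10)   # 15 x 25

U, S, Vt = np.linalg.svd(M)
print(np.sum(S > 1e-10))   # 3 non-zero singular values

# Keep only the three terms u_i * sigma_i * v_i^T; they reproduce M exactly.
k = 3
M_compact = U[:, :k] @ np.diag(S[:k]) @ Vt[:k]
assert np.allclose(M, M_compact)

# Storage: 3 * (15 + 25 + 1) = 123 numbers instead of 15 * 25 = 375.
```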
Noise reduction
The previous example showed how we can exploit a situation in which many singular values are zero. Generally speaking, the large singular values point to where the interesting information is. For example, imagine we have used a scanner to enter this image into our computer. However, our scanner introduces some imperfections (usually called "noise") into the image.
We may proceed in the same way: represent the data using a $15\times 25$ matrix and perform a singular value decomposition, obtaining singular values $\sigma_1 \geq \sigma_2 \geq \cdots \geq \sigma_{15}$. We find that the first three are much larger than the remaining twelve.
Clearly, the first three singular values are the most important, so we will assume that the others are due to the noise in the image and make the approximation

$$M \approx \mathbf{u}_1\sigma_1\mathbf{v}_1^T + \mathbf{u}_2\sigma_2\mathbf{v}_2^T + \mathbf{u}_3\sigma_3\mathbf{v}_3^T.$$
This leads to the following improved image.
Noisy image (left); improved image (right).
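Here is a minimal sketch of this denoising step (Python/NumPy), using a synthetic rank-three matrix with artificial noise in place of the article's scanned image:

```python
import numpy as np

rng = np.random.default_rng(0)

# A clean rank-three matrix standing in for the original image,
# plus a small amount of noise mimicking an imperfect scanner.
A = rng.integers(0, 2, size=(15, 3)).astype(float)
B = rng.integers(0, 2, size=(3, 25)).astype(float)
clean = A @ B                     # rank at most three
noisy = clean + 0.1 * rng.standard_normal(clean.shape)

U, S, Vt = np.linalg.svd(noisy)

# Keep only the three largest singular values and discard the rest as noise.
k = 3
denoised = U[:, :k] @ np.diag(S[:k]) @ Vt[:k]

# The truncated reconstruction is typically much closer to the clean matrix.
print(np.linalg.norm(noisy - clean), np.linalg.norm(denoised - clean))
```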
Data analysis
Noise also arises anytime we collect data: no matter how good the instruments are, measurements will always have some error in them. If we remember the theme that large singular values point to important features in a matrix, it seems natural to use a singular value decomposition to study data once it is collected.
As an example, suppose that we collect some data as shown below:
We may take the data and put it into a matrix:
$$\begin{pmatrix} -1.03 & 0.74 & -0.02 & 0.51 & -1.31 & 0.99 & 0.69 & -0.12 & -0.72 & 1.11 \\ -2.23 & 1.61 & -0.02 & 0.88 & -2.39 & 2.02 & 1.62 & -0.35 & -1.67 & 2.46 \end{pmatrix}$$
and perform a singular value decomposition. We find the singular values

$$\sigma_1 = 6.04, \qquad \sigma_2 = 0.22.$$
With one singular value so much larger than the other, it may be safe to assume that the small value of $\sigma_2$ is due to noise in the data and that this singular value would ideally be zero. In that case, the matrix would have rank one, meaning that all the data lies on the line defined by $\mathbf{u}_1$.
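As a brief sketch of that computation (Python/NumPy), we can recover the two singular values above and replace each data point by its projection onto the direction of $\mathbf{u}_1$:

```python
import numpy as np

# The 2 x 10 data matrix from above; each column is one data point.
M = np.array([
    [-1.03, 0.74, -0.02, 0.51, -1.31, 0.99, 0.69, -0.12, -0.72, 1.11],
    [-2.23, 1.61, -0.02, 0.88, -2.39, 2.02, 1.62, -0.35, -1.67, 2.46],
])

U, S, Vt = np.linalg.svd(M)
print(S)  # approximately [6.04, 0.22]

# Treat the small second singular value as noise: project every data point
# onto the line through the origin in the direction of u1.
u1 = U[:, 0]
projected = np.outer(u1, u1 @ M)   # the best rank-one approximation of M
print(projected)
```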
This brief example points to the beginnings of a field known as principal component analysis, a set of techniques that uses singular values to detect dependencies and redundancies in data.
In a similar way, singular value decompositions can be used to detect groupings in data, which explains why singular value decompositions are being used in attempts to improve Netflix's movie recommendation system. Ratings of movies you have watched allow a program to sort you into a group of others whose ratings are similar to yours. Recommendations may be made by choosing movies that others in your group have rated highly.
Summary
As mentioned at the beginning of this article, the singular value decomposition should be a central part of an undergraduate mathematics major's linear algebra curriculum. Besides having a rather simple geometric explanation, the singular value decomposition offers extremely effective techniques for putting linear algebraic ideas into practice. All too often, however, a proper treatment in an undergraduate linear algebra course seems to be missing.
This article has been somewhat impressionistic: I have aimed to provide some intuitive insights into the central idea behind singular value decompositions and then illustrate how this idea can be put to good use. More rigorous accounts may be readily found.
References:

- Gilbert Strang, Linear Algebra and Its Applications. Brooks Cole.
  Strang's book is something of a classic, though some may find it to be a little too formal.
- William H. Press et al., Numerical Recipes in C: The Art of Scientific Computing. Cambridge University Press.
  Authoritative, yet highly readable. Older versions are available online.
- Dan Kalman, A Singularly Valuable Decomposition: The SVD of a Matrix, The College Mathematics Journal 27 (1996), 2-23.
  Kalman's article, like this one, aims to improve the profile of the singular value decomposition. It also includes a description of how least-squares computations are facilitated by the decomposition.
- If You Liked This, You're Sure to Love That, The New York Times, November 21, 2008.
  This article describes Netflix's prize competition as well as some of the challenges associated with it.
David Austin
Grand Valley State University
david at merganser.math.gvsu.edu
Those who can access JSTOR can find at least one of the papers mentioned above there. For those with access, the American Mathematical Society's MathSciNet can be used to get additional bibliographic information and reviews of some of these materials. Some of the items above can be accessed via the ACM Portal, which also provides bibliographic services.