Recent Changes - Search:

Chapter Four: Determinants

Definition

Geometry of Determinants

Other Formulas

Topics

edit SideBar

(options)

Determinants

In the first chapter of this book we considered linear systems and we picked out the special case of systems with the same number of equations as unknowns, those of the form T\vec{x}=\vec{b} where T is a square matrix. We noted a distinction between two classes of T‘s. While such systems may have a unique solution or no solutions or infinitely many solutions, if a particular T is associated with a unique solution in any system, such as the homogeneous system \vec{b}=\zero, then

T is associated with a unique solution for every \vec{b}.

We call such a matrix of coefficients ‘nonsingular’. The other kind of T, where every linear system for which it is the matrix of coefficients has either no solution or infinitely many solutions, we call ‘singular’.

Through the second and third chapters the value of this distinction has been a theme. For instance, we now know that nonsingularity of an \nbyn{n} matrix T is equivalent to each of these:

  • a system T\vec{x}=\vec{b} has a solution, and that solution is unique;
  • Gauss-Jordan reduction of T yields an identity matrix;
  • the rows of T form a linearly independent set;
  • the columns of T form a basis for \Re^n;
  • any map that T represents is an isomorphism;
  • an inverse matrix T^{-1} exists.

So when we look at a particular square matrix, the question of whether it is nonsingular is one of the first things that we ask. This chapter develops a formula to determine this. (Since we will restrict the discussion to square matrices, in this chapter we will usually simply say ‘matrix’ in place of ‘square matrix’.)

More precisely, we will develop infinitely many formulas, one for

\nbyn{1} matrices, one for \nbyn{2} matrices, etc. Of course, these

formulas are related — that is, we will develop a family of formulas, a scheme that describes the formula for each size.

Definition

For \nbyn{1} matrices, determining nonsingularity is trivial.

\begin{pmatrix}
          a
     \end{pmatrix}

is nonsingular iff a \neq 0

The \nbyn{2} formula came out in the course of developing the inverse.

\begin{pmatrix}
           a  &b  \\
           c  &d
      \end{pmatrix}

is nonsingular iff ad-bc \neq 0

The \nbyn{3} formula can be produced similarly (see nearbyexercise).

\begin{pmatrix}
           a  &b  &c  \\
           d  &e  &f  \\
           g  &h  &i
     \end{pmatrix}

is nonsingular iff aei+bfg+cdh-hfa-idb-gec \neq 0

With these cases in mind, we posit a family of formulas, a, ad-bc, etc. For each n the formula gives rise to a determinant function \map{\det_{\nbyn{n}}}{\matspace_{\nbyn{n}}}{\Re} such that an \nbyn{n} matrix T is nonsingular if and only if \det_{\nbyn{n}}(T)\neq 0. (We usually omit the subscript because if T is \nbyn{n} then ‘\det(T)‘ could only mean ‘\det_{\nbyn{n}}(T)‘.)

Exploration

This subsection is optional. It briefly describes how an investigator might come to a good general definition, which is given in the next subsection.

The three cases above don’t show an evident pattern to use for the general

\nbyn{n} formula. We may spot that the \nbyn{1} term a has

one letter, that the \nbyn{2} terms ad and bc have two letters, and that the \nbyn{3} terms aei, etc., have three letters. We may also observe that in those terms there is a letter from each row and column of the matrix, e.g., the letters in the cdh term

$$
\begin{equation*}
   \begin{pmatrix}
          &    &c \\
      d           \\
          &h
   \end{pmatrix} 
\end{equation*}
$$

come one from each row and one from each column. But these observations perhaps seem more puzzling than enlightening. For instance, we might wonder why some of the terms are added while others are subtracted.

A good problem solving strategy is to see what properties a solution must have and then search for something with those properties. So we shall start by asking what properties we require of the formulas.

At this point, our primary way to decide whether a matrix is singular is to do Gaussian reduction and then check whether the diagonal of resulting echelon form matrix has any zeroes (that is, to check whether the product down the diagonal is zero). So, we may expect that the proof that a formula determines singularity will involve applying Gauss’ method to the matrix, to show that in the end the product down the diagonal is zero if and only if the determinant formula gives zero. This suggests our initial plan: we will look for a family of functions with the property of being unaffected by row operations and with the property that a determinant of an echelon form matrix is the product of its diagonal entries. Under this plan, a proof that the functions determine singularity would go, ‘’Where T\rightarrow\cdots\rightarrow\hat{T} is the Gaussian reduction, the determinant of T equals the determinant of \hat{T} (because the determinant is unchanged by row operations), which is the product down the diagonal, which is zero if and only if the matrix is singular’‘. In the rest of this subsection we will test this plan on the

\nbyn{2} and \nbyn{3} determinants that we know.

We will end up modifying the unaffected by row operations part, but not by much.

The first step in checking the plan is to test whether the \nbyn{2} and \nbyn{3} formulas are unaffected by the row operation of pivoting: if

$$
\begin{equation*}
   T \grstep{k\rho_i+\rho_j} \hat{T}
\end{equation*}
$$

then is \det(\hat{T})=\det(T)? This check of the \nbyn{2} determinant after the k\rho_1+\rho_2 operation

$$
\begin{equation*}
   \det(
       \begin{pmatrix}
         a     &b       \\
         ka+c  &kb+d    \\
      \end{pmatrix}
   )
   = a(kb+d)-(ka+c)b = ad-bc
\end{equation*}
$$

shows that it is indeed unchanged, and the other \nbyn{2} pivot k\rho_2+\rho_1 gives the same result. The \nbyn{3} pivot k\rho_3+\rho_2 leaves the determinant unchanged

$$
\begin{align*}
    \det(
      \begin{pmatrix}
         a    &b    &c    \\
         kg+d &kh+e &ki+f \\
         g    &h    &i
      \end{pmatrix}
    )
    &=\begin{array}[t]{@{}l@{}}
         a(kh+e)i+b(ki+f)g+c(kg+d)h \\
         \ \hbox{}-h(ki+f)a-i(kg+d)b-g(kh+e)c  
    \end{array}                                 \\
     &=aei + bfg + cdh - hfa - idb - gec
  \end{align*}
$$

as do the other \nbyn{3} pivot operations.

So there seems to be promise in the plan. Of course, perhaps the \nbyn{4} determinant formula is affected by pivoting. We are exploring a possibility here and we do not yet have all the facts. Nonetheless, so far, so good.

The next step is to compare \det(\hat{T}) with

\det(T) for the operation

$$
\begin{equation*}
   T \grstep{ {\rho}_i \leftrightarrow {\rho}_j } \hat{T}
\end{equation*}
$$

of swapping two rows. The \nbyn{2} row swap \rho_1\leftrightarrow\rho_2

$$
\begin{equation*}
     \det(
       \begin{pmatrix}
         c  &d \\
         a  &b
       \end{pmatrix}
     )
     = cb - ad
  \end{equation*}
$$

does not yield ad-bc. This \rho_1\leftrightarrow\rho_3 swap inside of a \nbyn{3} matrix

$$
\begin{equation*}
   \det(
     \begin{pmatrix}
        g  &h  &i \\
        d  &e  &f \\
        a  &b  &c
     \end{pmatrix}
   )
   = gec + hfa + idb - bfg - cdh - aei
\end{equation*}
$$

also does not give the same determinant as before the swap — again there is a sign change. Trying a different \nbyn{3} swap \rho_1\leftrightarrow\rho_2

$$
\begin{equation*}
   \det(
     \begin{pmatrix}
        d  &e  &f \\
        a  &b  &c \\
        g  &h  &i
     \end{pmatrix}
   )
   = dbi + ecg + fah - hcd - iae - gbf
\end{equation*}
$$

also gives a change of sign.

Thus, row swaps appear to change the sign of a determinant. This modifies our plan, but does not wreck it. We intend to decide nonsingularity by considering only whether the determinant is zero, not by considering its sign. Therefore, instead of expecting determinants to be entirely unaffected by row operations, will look for them to change sign on a swap.

To finish, we compare \det(\hat{T}) to \det(T) for the operation

$$
\begin{equation*}
   T \grstep{ k{\rho}_i } \hat{T}
\end{equation*}
$$

of multiplying a row by a scalar k\neq 0. One of the \nbyn{2} cases is

$$
\begin{equation*}
  \det(
    \begin{pmatrix}
      a   &b   \\
      kc  &kd
    \end{pmatrix}
  )
  = a(kd) - (kc)b
  =k\cdot (ad-bc)
\end{equation*}
$$

and the other case has the same result. Here is one \nbyn{3} case

$$
\begin{align*}
  \det(
  \begin{pmatrix}
    a    &b    &c   \\
    d    &e    &f   \\
    kg   &kh   &ki
  \end{pmatrix}
  )
   &= \begin{array}[t]{@{}l@{}}
         ae(ki) + bf(kg) + cd(kh)                \\
         \>- (kh)fa - (ki)db - (kg)ec  
      \end{array}                                      \\
   &= k\cdot(aei + bfg + cdh - hfa - idb - gec)
\end{align*}
$$

and the other two are similar. These lead us to suspect that multiplying a row by k multiplies the determinant by k. This fits with our modified plan because we are asking only that the zeroness of the determinant be unchanged and we are not focusing on the determinant’s sign or magnitude.

In summary, to develop the scheme for the formulas to compute determinants, we look for determinant functions that remain unchanged under the pivoting operation, that change sign on a row swap, and that rescale on the rescaling of a row. In the next two subsections we will find that for each n such a function exists and is unique.

For the next subsection, note that, as above, scalars come out of each row without affecting other rows. For instance, in this equality

$$
\begin{equation*}
    \det(
      \begin{pmatrix}
        3  &3  &9  \\
        2  &1  &1  \\
        5  &10 &-5
     \end{pmatrix}
    )
    =3 \cdot \det(
               \begin{pmatrix}
                 1  &1  &3  \\
                 2  &1  &1  \\
                 5  &10 &-5
               \end{pmatrix}
             )                                  
\end{equation*}
$$

the 3 isn’t factored out of all three rows, only out of the top row. The determinant acts on each row of independently of the other rows. When we want to use this property of determinants, we shall write the determinant as a function of the rows: ‘\det (\vec{\rho}_1,\vec{\rho}_2,\dots\vec{\rho}_n)‘, instead of as ‘\det(T)‘ or ‘\det(t_{1,1},\dots,t_{n,n})‘.The definition of the determinant that starts the next subsection is written in this way.

« | Table of Contents | »

Edit - History - Print - Recent Changes - Search - Links
Page last modified on 25 February 2006, at 06:12 PM