Positive Matrices as Generalizations of Positive Numbers
A conceptual intuition of positive matrices from which most core properties of definite matrices can be derived
This is the first part of what is hopefully a sequence of posts to help build intuition on linear algebra. I’m certainly not the first person to think of this - Gregory Gundersen has an extremely similar and arguably more intuitive post here, but I stumbled across the idea myself and was proud of it so I’m writing this here anyways.
Here’s an interesting, perhaps obvious property of positive numbers: when you multiply a positive number by any non-zero number , the result lies on the same side of the number line as the original number did. Let’s take the positive number as an example:
In other words, positive numbers can be associated with sign-preserving functions. To formalize that last part, a positive number corresponds to a linear map defined by (show for yourself this is linear!)
If you’ve taken a good linear algebra course, then you’ll know that matrices correspond to linear maps. Here’s where the crucial piece of intuition comes in: we view positive definite matrices as generalizations of the above characterization of positive numbers to . A positive definite matrix corresponds to a linear map that sends any non-zero vector to another vector that lies on the same half of the vector space as was.
How do we define “same half of the vector space as ”? With respect to itself! If you consider the set of all possible vectors that are orthogonal to , the entirety of this set takes the shape of a hyperplane which divides your vector space in half. Then the ‘side’ that is on is simply the set of vectors such that the inner product . From this and our conceptual generalization above, we can derive the first definition of positive definite matrices :
Definition 1:
In other words, we view the quadratic product as the inner product between and . Notice how in the positive number/1D case, the dot product reduces to multiplying scalars: for all , which indeed holds for positive .
Before we continue, we should note that this conceptual framework restricts positive matrices to being square matrices: in our 1D case, positive numbers mapped from to , and in higher dimensions it doesn’t really make sense to take inner products between vectors of different sizes. It should also follow (though not very obviously) that positive matrices must be symmetric. We know the quadratic product is a scalar, so it is equal to its transpose . Then we can set