The Unapologetic Mathematician

Mathematics for the interested outsider

The Matrix of a Linear System

As I wait for the iTunes store to be less busy so it can reauthorize my iPhone to work with the updated firmware, we can finally get back on track.

Let’s consider a system of linear equations. We’ll use the m variables x^1, x^2, and so on up to x^m; and we’ll let there be n equations. Let’s write these out:

a_1^1x^1 + a_2^1x^2 + ... + a_m^1x^m = y^1
a_1^2x^1 + a_2^2x^2 + ... + a_m^2x^m = y^2

a_1^nx^1 + a_2^nx^2 + ... + a_m^nx^m = y^n

Here the constant a_i^j are the coefficient of x^i in the jth equation, and y^j is the constant term on the right hand side of the jth equation.

But this is all but writing out exactly our matrix notation! We can take the above system and rewrite it as

\displaystyle\begin{pmatrix}a_1^1&a_2^1&\cdots&a_m^1\\a_1^2&a_2^2&\cdots&a_m^2\\\vdots&\vdots&\ddots&\vdots\\a_1^n&a_2^n&\cdots&a_m^n\end{pmatrix}\begin{pmatrix}x^1\\x^2\\\vdots\\x^m\end{pmatrix}=\begin{pmatrix}y^1\\y^2\\\vdots\\y^n\end{pmatrix}

Picking values for the variables x^i is the same as picking the components of a column vector x=x^ie_i. We can collect the right hand sides of all our equations into one column vector y=y^jf_j, and the coefficients give a (linear) formula for taking the values we choose for the variables and turning them into the n values on the right of our equations. That is, they define a linear map A:\mathbb{F}^m\rightarrow\mathbb{F}^n. We can thus rewrite our system in a more abstract notation as:

Ax=y

Suddenly it looks a lot more like the first — and simplest — linear equation we wrote down. But now we can’t just “divide by A” to solve it. We need heavier tools to manage this task, or even just to show when it can be managed at all! In short: we need linear algebra.

Incidentally, now we see why we indexed the variables with superscripts: because that’s how we wrote the components of a vector, and the variables are the components of a single vector variable. And if you’re still on the fence, I’ll note that physicists use superscripts all the time to index variables (for similar purposes), and they even do it when the equations aren’t all linear. Just try it. You’ll get used to it.

About these ads

July 11, 2008 - Posted by | Algebra, Linear Algebra

3 Comments »

  1. The justification that comes to my mind for using superscripts is that the function “the nth component of this vector” with respect to a given basis is a linear functional, so it belongs in the dual space. Similarly, a linear transformation is an element of V \otimes V^{*} – one superscript and one subscript.

    Comment by Qiaochu Yuan | June 7, 2009 | Reply

  2. [...] is the matrix of the system. If is the zero vector we have a homogeneous system, and otherwise we have an inhomogeneous [...]

    Pingback by The Meaning of the SVD « The Unapologetic Mathematician | August 18, 2009 | Reply

  3. [...] but it’s still useful for solving systems of linear equations. If we have a system written in matrix form [...]

    Pingback by Solving Equations with Gaussian Elimination « The Unapologetic Mathematician | September 2, 2009 | Reply


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

Join 392 other followers

%d bloggers like this: