The Matrix of a Linear System

As I wait for the iTunes store to be less busy so it can reauthorize my iPhone to work with the updated firmware, we can finally get back on track.

Let’s consider a system of linear equations. We’ll use the $m$ variables $x^1$ , $x^2$ , and so on up to $x^m$ ; and we’ll let there be $n$ equations. Let’s write these out:

$a_1^1x^1 + a_2^1x^2 + ... + a_m^1x^m = y^1$
$a_1^2x^1 + a_2^2x^2 + ... + a_m^2x^m = y^2$
…
$a_1^nx^1 + a_2^nx^2 + ... + a_m^nx^m = y^n$

Here the constant $a_i^j$ are the coefficient of $x^i$ in the $j$ th equation, and $y^j$ is the constant term on the right hand side of the $j$ th equation.

But this is all but writing out exactly our matrix notation! We can take the above system and rewrite it as

$\displaystyle\begin{pmatrix}a_1^1&a_2^1&\cdots&a_m^1\\a_1^2&a_2^2&\cdots&a_m^2\\\vdots&\vdots&\ddots&\vdots\\a_1^n&a_2^n&\cdots&a_m^n\end{pmatrix}\begin{pmatrix}x^1\\x^2\\\vdots\\x^m\end{pmatrix}=\begin{pmatrix}y^1\\y^2\\\vdots\\y^n\end{pmatrix}$

Picking values for the variables $x^i$ is the same as picking the components of a column vector $x=x^ie_i$ . We can collect the right hand sides of all our equations into one column vector $y=y^jf_j$ , and the coefficients give a (linear) formula for taking the values we choose for the variables and turning them into the $n$ values on the right of our equations. That is, they define a linear map $A:\mathbb{F}^m\rightarrow\mathbb{F}^n$ . We can thus rewrite our system in a more abstract notation as:

$Ax=y$

Suddenly it looks a lot more like the first — and simplest — linear equation we wrote down. But now we can’t just “divide by $A$ ” to solve it. We need heavier tools to manage this task, or even just to show when it can be managed at all! In short: we need linear algebra.

Incidentally, now we see why we indexed the variables with superscripts: because that’s how we wrote the components of a vector, and the variables are the components of a single vector variable. And if you’re still on the fence, I’ll note that physicists use superscripts all the time to index variables (for similar purposes), and they even do it when the equations aren’t all linear. Just try it. You’ll get used to it.

July 11, 2008 - Posted by John Armstrong | Algebra, Linear Algebra

3 Comments »

The justification that comes to my mind for using superscripts is that the function “the nth component of this vector” with respect to a given basis is a linear functional, so it belongs in the dual space. Similarly, a linear transformation is an element of $V \otimes V^{*}$ – one superscript and one subscript.

Comment by Qiaochu Yuan | June 7, 2009 | Reply
[…] is the matrix of the system. If is the zero vector we have a homogeneous system, and otherwise we have an inhomogeneous […]

Pingback by The Meaning of the SVD « The Unapologetic Mathematician | August 18, 2009 | Reply
[…] but it’s still useful for solving systems of linear equations. If we have a system written in matrix form […]

Pingback by Solving Equations with Gaussian Elimination « The Unapologetic Mathematician | September 2, 2009 | Reply

About this weblog

This is mainly an expository blath, with occasional high-level excursions, humorous observations, rants, and musings. The main-line exposition should be accessible to the “Generally Interested Lay Audience”, as long as you trace the links back towards the basics. Check the sidebar for specific topics (under “Categories”).

I’m in the process of tweaking some aspects of the site to make it easier to refer back to older topics, so try to make the best of it for now.

RSS Feeds

RSS - Posts
RSS - Comments
Feedback

Got something to say? Anonymous questions, comments, and suggestions at Formspring.me!
Subjects

Subjects
Archives

July 2008

M T W T F S S

1 2 3 4 5 6

7 8 9 10 11 12 13

14 15 16 17 18 19 20

21 22 23 24 25 26 27

28 29 30 31

« Jun Aug »
Search for:

The Unapologetic Mathematician

Mathematics for the interested outsider