This is part two of a four-part discussion of the idea behind how the Higgs field does its thing. Read Part 1 first.
Okay, now that we’re sold on the Lagrangian formalism you can rest easy: I’m not going to go through the gory details of any more variational calculus. I do want to clear a couple notational things out of the way, though. They might not all matter for the purposes of our discussion, but better safe than sorry.
First off, I’m going to use a coordinate system where the speed of light is 1. That is, if my unit of time is seconds, my unit of distance is light-seconds. Mostly this helps keep annoying constants out of the way of the equations; physicists do this basically all the time. The other thing is that I’m going to work in four-dimensional spacetime, meaning we’ve got four coordinates: , , , and . We calculate dot products by writing . Yes, that minus sign is weird, but that’s just how spacetime works.
Also instead of writing spacetime vectors, I’m going to write down their components, indexed by a subscript that’s meant to run from 0 to 3. Usually this will be a Greek letter from the middle of the alphabet like or . Similarly, instead of writing for the vector composed of the four spacetime derivatives of a field I’ll just write down the derivatives, and I’ll write instead of .
Along with writing down components instead of vectors I won’t be writing dot products explicitly. Instead I’ll use the common convention that when the same index appears twice we’re supposed to sum over it, remembering that the zero component gets a minus sign. That is, is the dot product from above. Similarly, we can multiply a matrix with entries by a vector to get ; notice how the summed index gets “eaten up” in the process.
Okay, now even without going through the details there’s a fair bit we can infer from general rules of thumb. Any term in the Lagrangian that contains a derivative of the field we’re varying is almost always going to be the squared-length of that derivative, and the resulting term in the variational equations will be the negative of a second derivative of the field. For any term that involves the plain field we basically take its derivative as if the field were a variable. Any term that doesn’t involve the field at all just goes away. And since we prefer positive second-derivative terms to negative ones, we usually flip the sign of the resulting equation; since the other side is zero this doesn’t matter.
So if, for instance, we have the following Lagrangian of a complex scalar field :
we get two equations by varying the field and its complex conjugate separately:
It may not seem to make sense to vary the field and its complex conjugate separately, but the two equations we get at the end are basically the same anyway, so we’ll let this slide for now. Anyway, what we get is a second derivative of set equal to times itself, which we call the “Klein-Gordon wave equation” for . Since the term gives rise to the term in the field equations, we call this the “mass term”.
In the case of electromagnetism in a vacuum we just have the electromagnetic fields and no charge or current distribution. We use the Faraday field to write down the Lagrangian
which gives rise to the field equations
or, equivalently in terms of the potential field :
The second equation just expresses a choice we can make to always consider divergence-free potentials without affecting the predictions of electromagnetism; the first equation looks like the Klein-Gordon equation again, except there’s no mass term. Indeed, we know that photons — the particles associated to the electromagnetic field — have no rest mass!
Turning back to the complex scalar field, we notice that there’s a certain symmetry to this Lagrangian. Specifically, if we replace and by
for any constant , we get the same result. This is important, and it turns out to be a clue that leads us — I won’t go into the details — to consider the quantity
This is interesting because we can calculate
where we’ve used the results of the Klein-Gordon equations. Since , this is a suitable vector field to use as a charge-current distribution; the equation just says that charge is conserved! That is, we can write down a Lagrangian involving both electromagnetism — that is, our “massless vector field” and our scalar field:
where is a “coupling constant” that tells us how important the “interaction term” involving both and is. If it’s zero, then the fields don’t actually interact at all, but if it’s large then they affect each other very strongly.