Momentum

One of the first thing I learned in classical physics is the concept of momentum as p=mv. Over time I learned that this was a very small part of the story. Angular momentum, energy, magnetic moment, spin, stress, and others kept creeping up over time. While position was a relatively strightfoward concept to understand, through coordinates, momentum had so many different forms that didn’t seem to be compatible with each other.

The purpose of this is to describe a general form of momentum that when applied to different circumstances it becomes the appropriate quantity we are used to. Furthermore, we will do it in such a way that this will lend itself to the intricate way that momentum becomes messed up in quantum mechanics.

We will use differential geometry as well as representation theory to build up the concept of the momentum map. We will then play around with Noether’s theorem and give some examples.

Phase space

This will be done in the context of Hamiltonian mechanics. However, the only functional difference from Hamiltonian Field theory is that we allow for mathematical structures that are less rigid than smooth manifolds (like infinite dimensional manifolds, or vector bundles with with uncountably dimensional fibers). A more complete exploration of this with the structures encountered in field theory can be found here. For now we will restrict ourselves in the mechanics case. A complete version of this discussion is found in the following.

  1. An Introduction to Riemannian Geometry
  2. Quantum Theory for Mathematicians
  3. Mathematical Methods of Classical Mechanics

Classical Mechanics Setting

Definition: A phase space is an even dimensional symplectic manifold (P,ω). Given any manifold M, we can define its canonical phase space π:TMM as its cotangent bundle together with the canonical symplectic or Liouville form ω=dθ where θΩ1(TM) is the symplectic potential, also known as the tautological one-form given for any αTM by

θα:=απ.

This looks super complicated so let's look at some familiar examples.

Example: (Euclidean Phase Spaces) Traditionally the configuration space of a particle (or a system of particles) is a point in some Euclidean space Rn. The canonical phase space for this configuration is TRnR2n, that we already know of. In particular consider some coordinate system in Rn given by (q1,q2,,qn) where qi:RnR are smooth coordinate maps.

Then we know that the set of smooth vector fields {Vi:=qi}i=0n forms a coordinate chart on TM since at every point qM we have that Viq:TqMR. So we can formally define a coordinate chart for TM like so:

αTM(qi(α),pi(α)):=(qiπ(α),Viπ(α)(α))

where we have used some abuse of notation to define the coordinate functions qi:TMR given by the pullback of the coordinates of Rnby π, as well as defined the coordinate functions pi:TMR given by the standard "velocity vectors."

In physics, we usually pick mVi instead of Vi for some positive mass m and we are going to see soon why this would be a more natural choice in this setting. But nothing, so far, forbids us from using the Vi basis.

Now we are ready to come up with the canonical symplectic form of TRn. In our coordinate chart we can calculate it like so. We know that since θΩ1(TM) it can be expressed, in our chart, as

θ=θidqi+θidpi.

where θi,θiC(TM). To find these components we can use the fact that

θ(qi)=θiθ(pi)=θi.

Using the definition we can calculate that for any αTM

θi(α)=α(π(qi))=α(qi)=α(Vi)=Vi,π(α)(α)=pi(α)

and similarly for θi

θi(α)=α(π(pi))=α(0)=0.

As a result we can conclude that the sumplectic potential takes the form

θ=pidqi,

implying that the Liouville is the familiar 2 form

ω=dθ=dpidqi.

This is a the canonical phase space structure of any Euclidean space. We will soon talk about a much more general way to describe the canonical form of a physical system which could often differ from the canonical form of the configuration space.

Example: (The Oscillator) To avoid introducing the horrid formalism in physics, we often describe the phase space of a pendulum as a R2 where one axis is the angle and the other is the angular momentum. This is a very nice thing to do (in fact it corresponds to the universal cover of the phase space) but we now have the tools to describe it completely.

A pendulum lives on a circle. In other words, the configuration space of a pendulum is S1. We know that the tangent bundle is TS1S1×R which makes sense for an intuitive point of view. In each angle the angular momentum can take any scalar value, but as the angle increases at some point it has to wrap around.

pendulum-phase-space

Here is a picture of how it looks locally around a point.

Hamiltonian Vector Fields

The point of phase space is that given a point, I can tell you exactly how you will flow over time. But there is no time axis here, and to be honest we don't want to include it. Instead we will depict the flow directly. The way we do this geometrically is with a vector field.

What we will develop here is a way to obtain that vector field from any Hamiltonian-like function. But since the Hamiltonian could be any function on our phase space, might as well generalize it for anything. Doing so we will get some pretty results concerning Lie brackets and so on.

Definition: For any smooth function fC(P) on a phase space (P,ω) its Hamiltonian vector field XfX(P) is a vector field such that

df=ιXfω,

where ιXf denotes the contraction by Xf. Namely, for any YX(f)

Yf=ω(Y,XF).

Proposition: For any fC(M) a Hamiltonian vector field exists and is unique.

Proof: Existence is guaranteed by existence of solutions of 1st order ODEs. Uniqueness is guaranteed by the nondegeneracy of ω as a symplectic form.

Example: (Hamiltonian vector field of the Hamiltonian) Let H:TRnR be a Hamiltonian in the Euclidean space we defined before. Then, its Hamiltonian vector field XH=Xiqi+Xipi can be calculated similarly to the symplectic potential θ. Namely, we know that

Hqj=ω(XH,qj)=dpidqi(XH,qj)=(dpidqidqidpi)(XH,qj)=dpidqi(XH,qj)=Xj

and similarly,

Hpj=ω(XH,pj)=dpidqi(XH,pj)=(dpidqidqidpi)(XH,pj)=dqidpi(XH,pj)=Xj.

Therefore the Hamiltonian vector field of the Hamiltonian is

XH=HpiqiHqipi.

The cool thing to notice is that the integral curves of this vector field follow Hamilton's equations of motion! Another cool thing that we notice is that the Hamiltonian vector field can be calculated by applying the following linear transformation to the gradient

XH=1(0110)H,

which is a rotation by π/2! So since the gradient of the Hamiltonian points perpendicular to the level curves, the Hamiltonian vector field is pointing parallel to them. This means that its integral curves (i.e. the physical paths in phase space) are precisely the level curves of the Hamiltonian! Which proves that the Hamiltonian is conserved. Here is an example in R2.

Hamiltonian Plot

In Grey is a 2D plot of a Hamiltonian, while in blue is a stream plot of its Hamiltonian vector field. In black is a stream plot of the gradient. We can see that the level curves of the Hamiltonian are the integral curves (streams) of its Hamiltonian vector Field.

Proposition: (Properties of Hamiltonian Vector Fields) Let f,gC(P) and aR. Then

  1. Xf+ag=Xf+aXg
  2. Xfg=gXf+fXg.

We will soon use these properties to show that we can convert C(P) into a Lie algebra. Another super interesting and useful theorem is the famous phase space conservation theorem.

Theorem: (Liouville's Theorem) For any fC(P) on some phase space (P,ω) the volume form is invariant along its Hamiltonian vector field. Namely,

LXfω=0,   fC(P).

Proof: In this formalism the proof is very simple. We use the following

LXfω=ιXfdω+d(ιXfω)

The first term vanishes because dω=ddθ=0 and the second term vanishes because d(ιXfω)=d(df)=0.

A very useful corollary in calculation is the following

Corollary: For any two Hamiltonian vector fields A,B we have that

[A,B]=Xω(A,B)

Poisson Bracket

With this insight we will introduce one of the most useful constructions of Hamiltonian mechanics. The Poisson bracket.

Definition: Given two smooth functions f,gC(P) on some phase space (P,ω) their Poisson bracket {,}:C(P)×C(P)C(P) is given by

{f,g}:=ω(Xg,Xf)=Xgf=Xfg.

We can think of the Poisson bracket as a measure of how similar the vector fields are in some sense. For example, if the functions are the same, then by the antisymmetry of ω, their Poisson bracket vanishes. But that is not the only case when they vanish. Let's examine some cases.

Since {f,g}=Xfg, the Poisson measures the change of g along the Hamiltonian vector field of f. This is naturally the opposite change of f along the Hamiltonian vector field of g which is verified by t the antisymmetry of the symplectic form. As a result we obtain the following familiar theorem.

Theorem: (Hamiltonian Conservation Laws) Given f,gC(P) on some phase space (P,ω) iff

{f,g}=0

then g is conserved on the level curves of f and vice versa.

Another very interesting theorem is the following.

Theorem: The Poisson bracket is a Lie bracket for C(P) converting it into a Lie algebra over R.

Proof: The corollary from the previous section can be rewritten as [Xf,Xg]=X{g,f}. Since we know that [,] is a Lie bracket for the Hamiltonian vector fields (closure of the Lie subalgebra is guaranteed by the corollary), we can apply it directly to show

0=[Xf,[Xg,Xh]]+[Xh,[Xf,Xg]]+[Xg,[Xh,Xf]]=[Xf,X{g,h}]+[Xh,X{f,g}]+[Xg,X{h,f}]=X{f,{g,h}}+X{h,{f,g}}+X{g,{h,f}}=X{f,{g,h}}+{h,{f,g}}+{g,{h,f}}0={f,{g,h}}+{h,{f,g}}+{g,{h,f}}.

Motion Using Lie Groups

We often think of motion in R3. We are in a position, a vector in R3 and then we pick a displacement, another vector in R3 to tell us where we will move to. We call that translation, and when a body is in motion it also translates with respect to a point. We also know that there is an associated momentum to translation.

Intuitively, we can think of the motion of an object as applying successive translations, and the momentum associated to translation telling us how the translations we apply change along the motion.

Translations are not the only way of moving though! We could also rotate, flip, expand, and all sorts of weird things. In this section we will generalize what we mean by motion in our configuration space using the language of Lie groups.

In the previous example, R3 is the Lie group of translations in the configuration space R3, and it acts on R3 by vector addition. We will generalize this for any Lie group now.

Lie Group Actions

The way groups transform a configuration space is through Lie group actions. We will quickly introduce them, and then move forward to the nicer topic of Fundamental vector fields.

Definition: Let M be a manifold and G a Lie group, then a Lie group action is a smooth group homomorphism

ρ:GDiff(M).

Given a group action ρ for every point pM the subgroup

Gp:={gGρ(g)(p)=p},

is called the group of stabilizers of p. Further more, the **orbit of **p is the set

Op=Gp=ρ(G)p:={ρ(g)(p)MgG}.

Finally, the oribit map of a point pM under the action of G is the smooth map

ϕp:GMgρ(g)(p).

Using a Lie group action, we can move the underlying configuration space. Here are some examples.

Example: (Euclidean Translation) Consider the action ρ:RnDiff(Rn) where in the standard coordinate system ψ=(q1,q2,,qn) it is given by

vR3ρ(v),

where for any pR3 we have

ρ(p)=(qi(p)+qi(v)).

This is the standard way we translate stuff. But what happens when the manifold is a bit curved? Then we have options on how to translate.

Example: (Euclidean Rotation) The set of rotations in Rn is describe by the group SO(n) that comes with its defining representation that maps an element of the group to the corresponding matrix.

Inducing Actions on the Cotangent Bundle

Ideally we would like to have a representation not only the configuration space, but also on the Phase space. This is something slightly harder to achieve. If the phase space is not canonical, then we have to figure it out by hand. However if we have an action on the configuration space and we want to lift it to an action on the phase space we absolutely can in a canonical way. Here is how to do it.

We start with formalizing the concept of an infinitesimal transformation. This is done by using the exponential map to induce a representation of the Lie algebra elements of the Group. A more in depth explanation is found here. In this section we will skip the details of the exponential map, and present the result with some useful properties.

Definition: A given a Lie group representation ρ:GDiff(M) a fundamental vector field X~X(M) on M related to a Lie algebra element Xg is a vector, such that at any point pM

X~p:=ϕp,e(X),

where ϕp:GM is the orbit map of pM under the action of G.

This means that for every Lie algebra element we can come up with a smooth vector field on M. It has interesting consequences. Perhaps the coolest of them is the following.

Theorem: Given a Lie group action of G on M, the map ψ:gX(M) given by XX~, is a Lie algebra homomorphism.

This is great news because we can now lift the group action to an action on the tensor bundle.

Theorem: Let ρ:G×MM be a Lie group action on a smooth manifold M. Then ρ^:G×TMTM given by

(g,α)ρ^(g)(α)=ρ(g1)α,

where ϕg1π(α) is the orbit map of g1π(α) and π:TMM is the canonical projection map of the cotangent bundle, is a Lie group action on TM. We call ρ^ the lift of ρ.

There are too many symbols on this theorem so here is a picture.

induced-representation

What we are essentially doing is using the representation to steal a one form form where we pick it up at p=π(α) to the new cotangent space gπ(α). But since we need to use a pullback for a form, there is a nice map to choose which is the one that takes us from gπ(α) to π(α). This map is ρ(g1), hence the pullback.

Symplectic Lie Group Actions

As a result, we now have a way to induce an action on the phase space, directly from the group. However, now that we are on phase space, we would like our action not to be anything! We would like to preserve the symplectic structure. Imagine doing a transformation in the base space that moves you to the left but for some reason switches your momentum to the right. This is not something physical so we need to be mindful of it. In other words we want our action to be symplectic.

Definition: Let M be a manifold and ρ:GDiff(M) is a Lie group action. Then ρ is symplectic iff ρ(g)ω=ω for all gG.

This has the following interesting consequences.

Corollary: ρ(g) is a symplectomorphism for all gG. Therefore the phase space volume is conserved under the action of G.

Lemma: The lifted group action to the tangent bundle together with the Liouville form is symplectic.

Proof: We need to show that ρ^(g)ω=ω for any gG. This is rather intricate but there is a nice proof here

The Momentum Map

We are now ready to talk about momentum! The construction that generalizes the concept of momentum is called the momentum map and it is another sort-of-natural construction that we attach to the representation.

Definition

Let's start with a small definition before we start writing examples and properties.

Definition: Given a symplectic Lie group action ρ:GDiff(M) on a symplectic manifold (M,ω), a momentum map μ:Mg for the action of G on M is a map such that for any Xg

dμX=ιX~ω.

Here μX:MR is notation for the smooth function on M given for all pM by μX(p)=μ(p)(X) where μ(p)g.

Lemma: (Momentum map on Tangent Bundle) Let ρ^:GAut(TM,ω) be the lifted Lie group action on a smooth manifold M. Then there exists a unique (up to constant) momentum map for this action given by

μX=ιX~θ,

where θ is the canonical symplectic potential.

Lemma: If a momentum map exists, then there also exists a momentum map that is G-equivariant, where G acts with the coajoint action (the dual of the adjoint action) on the Lie algebra.

The existence of such a momentum map is not guaranteed however. As a result we have this definition.

Definition: A symplectic Lie group action that admits an equivariant momentum map is called a Hamiltonian action.

Marsden–Weinstein Quotient

This is hard to visualize but it is an amazing tool to be able to play with. The momentum map attaches a momentum related to a Hamiltonian action on each point of the symplectic manifold. If the momentum is zero at that point, that means that this point is invariant under that action. Therefore, we can take an interesting quotient by removing the parts of the space that are not useful. This was done in 1974 by Marsden and Weinstein.

Before we write the theorem we need some extra properties.

Definition: A group action ρ:GAut(X) on some set X is called free if for some xX

gx=xg=e.

Furthermore, a Lie group action is proper if the map G×XX×X such that (g,x)(x,gx) is proper (i.e. the preimage of compact sets is compact).

These are niceness properties for some group actions. The first one says that the only element that fixes any point in the target space is the identity. This includes things like rotations and such. The second one says that essentially, no point from infinity is mapped nearby. Most interesting actions in physics are free and proper. The reason for introducing this is because these are properties necessary for taking the nice quotient I mentioned.

Proposition: Let (M,ω) be a symplectic manifold with a Hamiltonian Lie group action ρ:GDiff(M) with momentum map μ:Mg, then μ1(0) is invariant under G.

Proof: Since the momentum map is G-equivariant, then μ(gp)=gμ(p)=0. Therefore Gμ1(0)=μ1(0).

Now we are ready to write the theorem.

Theorem: (Symplectic Quotient Theorem) Let (M,ω) be a symplectic manifold with a Hamiltonian Lie group action ρ:GDiff(M) with momentum map μ:Mg, then if ρ|μ1(0):GDiff(μ1(0)) is free and proper, the quotient μ1(0)/G by the orbits of G is a symplectic manifold with symplectic form ω¯=ω|μ1(0) such that qω¯=ω. This quotient is known as the symplectic quotient of M by G and it is denoted by M//G.

Corollary: dimM//G=dimM2dimG.

So far this might not look like much, but physically this theorem allows us to isolate mechanical systems from one to the other. For example we can isolate the angular momentum from the position dependence to create a classical spin system, or we can isolate a system orbiting around a point and itself into two systems with their respective motions. Some cool examples are shown below.

Lie Algebra Representation

One of the most interesting results of this exploration is that we have come full circle. In particular, we are about to show that a Hamiltonian action of a Lie group preserves the structure of the Lie algebra in the momenta. As a result, all we really need to know to do Hamiltonian (and by extension quantum) mechanics is the structure of the underlying Lie algebra. This is an amazing result, so let's show it.

Theorem: Given a Hamiltonian representation ρ:GAut(M,ω) with momentum map μ, The map μ¯:gC(M) defined by Xμ¯(X):=μX is a Lie algebra representation.

Proof: We know already that μ is a linear map. The only thing we need to show is that it also preserves the structure of the Lie bracket. In other words

μ[X,Y]={μX,μY}.

To do so we will use the fact that ρ is symplectic. Since it is symplectic the momentum map follows

dμX=ιX~ωX~=XμX.

Therefore we can now do

Xμ[X,Y]=[X,Y]~=[X~,Y~]=[XμX,XμY]=X{μX,μY},

which implies the claim.

Examples

So what is momentum? So far all we can say is that momentum is a quantity associated with a Hamiltonian action on phase space. Let's do some examples.

Lie Group as Configuration Space

We will start with a general case that is very useful. The case where the configuration space is a Lie group.

Example: (Lie Group as a Configuration space) Let G be an n dimensional Lie group. Therefore, there is a canonical action on itself by left multiplication. The action of G is given by ρ:GDiff(G) where gLg where

Lg(h)=ghG,

is the left multiplication map on G which is smooth by construction. Using the Lemma from the previous section we can construct a Hamiltonian action on TGG×g as follows. For any αTG and gG we have

ρ^(g)(α)=ρ(g1)α.

Here is a really cool trick. We know that the fundamental vector fields of a basis of g, which in this case happen to be the left invariant ones, form a basis for each tangent space! Therefore we can write given a basis {Xi}i=0n of the Lie algebra g

α=αiLπ(α)1Ji,

where Jig is such that Ji(Xj)=δji.

As a result, we can calculate the lifted action by

ρ^(g)(α)=ρ(g1)α=αiLg1Lπ(α)1Ji=αiL(gπ(α))1Ji.

This is pretty cool! We now have the action in terms of the generators! In fact we can now calculate the momentum map. Consider a specific element of the Lie algebra Xg. Then from the previous lemma we know that for any point αTG

μX(α)=(ιX~θ)(α)=θα(X~)=α(πX~)=α(Lπ(α)X)=αiLπ(α)1Ji(Lπ(α)X)=αiJi(Lπ(α)1Lπ(α)X)=αiJi(X)

So really the momentum is the component of the Ji we just described!! That's really cool! In other words, we can fully define the momentum map as

J:TGg(α=αiLπ(α)1Ji)μ(α)=aiJi,

which implies that μ=Lπ().

While this is super cool, there is more coolness to be uncovered. In particular, each μXC(TG) is the component function of the momentum by moving in the Xg direction. It would be nice to calculate Poisson brackets.

We know from the theorem in this section that the fundamental vector fields are also Hamiltonian in this case. Therefore we can find which is the corresponding function that generates them.

Proposition: For any Yg we have that Y~=XμY.

This is awesome, because we can prove the following, really cool theorem

Theorem: For any X,Yg the following identity holds

{μX,μY}=μ[X,Y].

Proof: This is an application of the theorem we have proved previously.

This result is fascinating. It says that the component functions of the symmetry group maintain their Lie algebra relations! So we only need to identify the symmetry group in order to do physics with them on a phase space.

An application of this is the following.

Example: (Momentum of Translation) This is a direct application of what we did before where we can actually calculate the momentum map. Consider that our configuration space is the Lie group Rn with coordinates (q1,q2,,qn). Then we can actually calculate the momentum map of the self left action (i.e. translation).

We pick as generators of the Lie algebra

Vi=qiX(Rn).

Then we know that the symplectic potential is θ=pidqi, where pi(α)=Vi(α). Now can calculate the canonical momentum map like so.

μVi=ιV~iθ=pjιVidqj=pjδij=pi.

Therefore the canonical momentum is pi! And it has to follow the standard commutation relations which as a check we can do since we already know how to calculate them from the symplectic form.

0={pi,pj}=μ[Vi,Vj]=μ0=0.

Example: (Angular Momentum) Consider the configuration space to be G=SO(3). This is the Lie group where each element is an orientation of an object in R3. Then we can proceed with calculating the momentum.

Let's use the standard coordinate chart ψ:SO(3)R3 for SO(3) given by

p(θ1,θ2,θ3)

such that θi is the rotation around each axis. Then, we know that the Lie algebra elements corresponding to each rotation are the generators Ji with the familiar commutation relations

[Ji,Jj]=ϵijkJk.

In such a way that we can write every element of SO(3) as

p=eθiJi.

Now we are ready to calculate the momentum components in terms of the angles. Now we know that in the coordinates where pi(α)=θi(α) that we have seen before, the Liouville form is given by

ω=dpidqi=dθ=d(pidθi).

However, we can now write the vector fields θi in terms of J~i.

By calculating the canonical momentum map in this setting we have:

μJi=ιJ~iθ.

Example: (Classical Phase Space of a Spin) We want to create a Hamiltonian system that quantizes to a single spin system. The way we will do this is using the Symplectic quotient by the action of SO(3).

Consider the action SO(3)R3 given by its fundamental representation. If we consider the phase space TSO(3) the fundamental representation canonically lifts to a Hamiltonian action in TR3 as we have seen previously. Let's call the momentum map μ:TR3so(3) and find its kernel.

We know that

μX=ιX~θ=ιX~pidqi=piX~i=piϕ,e(X)i.

Now lets see what points are in its kernel. Consider the point x=(qi,pi)TR3, and the generator Jig then we have:

μJj(x)=piJj~xi=pTJi~q,

where J~i=ρ(Ji) is the well known representation of the generators of the Lie algebra. They are listed here

J1=(000001010)J2=(001000100)J3=(010100000).

As a result, we see that the full momentum map has three components given by

(μJ1,μJ2,μJ3)=q×p,

where q×p is the canonical cross product between q,pR3. Now all we need to do is to find the kernel of the cross product between position and momentum. This is all the points where q=λp for any λR are parallel. To be more precise

μ1(0)={(q,λq)TR3λR,qR3}.

This looks suspiciously like the diagonal of TR3, but it is not, in fact we can see it with the following proposition.

Proposition: μ1(0)R3×R.

Proof: Consider the linear map (q,λ)(q,λq).

Now all we need to do is to see whether the restricted action of SO(3) on μ1(0) is free and transitive.