Variational Calculus

Calculus of variations is used very often in physics, but often it is vibes based in order to derive a formula in a lecture. The purpose of these notes is to introduce the formalism of variational calculus in order to help when we encounter a completely unfamiliar action or path integral and we want to use variational techniques in a consistent way.

Function Spaces

Variational calculus is an attempt to generalize calculus constructions such as integration and differentiation on maps where the points the map from are functions and not numbers. Here we start by exploring basic properties of the structure of these function spaces.

Banach Spaces

We begin by introducing an algebraic structure that contains all the properties of real vector spaces we need while being general enough that it can be applied to other spaces like the space of spinors, orperators, etc.

Definition: A Banach space is a complete normed vector space. In other words it is a pair (B,||||) that consists of a vector space B and a positive definite, subadditive, absolutely homogeneous map ||||:BR, such that every Cauchy sequence in B has a converging subsequence in B under the topology induced by the norm.

In less fancy words this is a vector space with a distance such that if you follow any converging sequence you won't accidentally fall out of the vector space. These spaces are the setting of so much of calculus. In fact most of the times in physics we work wigivrn th functions that map a manifold to a Banach space.

Example: (Physics Fields interpreted as maps to Banach Spaces) A complex scalar field is, formally, a section of the complex line bundle L over some Lorentzian manifold M. By picking a global section of L we can identify every complex scalar field with a map of the form MC. Since C is vector isomorphic to R2, then C with the Eucledian norm is a Banach space. Therefore all complex scalar fields are functions to a Banach space.

This is also true for spinor fields for example. Using a similar argument, we can see that sections of the spin structure of a Lorentzian manifold, correspond to maps between the manifold and the vector space corresponding to a representation of some Clifford algebra. That vector space along with the attached spin invariant inner product forms a Banach space.

Banach spaces have some cool properties already. For example the existence of a norm can help us define differentiation without a rigid smooth structure on them. Here is how to do it.

Definition: Let V,W be Banach spaces, and UV be an open subset. Then a function f:UW is Frechet differentiable at xU if there exists a bounded linear operator A:VW such that

lim||h||V0||f(x+h)f(x)Ah||W||h||V=0.

Then A is the Frechet derivative of f at xU. We often write Dxf=Df(x):=A.

There are other nice properties here such as uniqueness or equivalence to the linear approximation in finite dimensional vector spaces. However, it is important to note that this is a very strong requirement, and as we will see we aim to relax this when talking about functional differentiation.

Function Spaces

Since Banach spaces appear in all the nice maps we want when studying physics it makes sense to understand them more abstractly. However, the real reason why we do so is the following super cool fact, which we introduce after this short definition.

Definition: Given two sets X,Y we define the set of all functions from XY as

YX:={f:XY}.

Here is the cool fact.

Theorem: Let X be a set and B a Banach space. Then the set of all functions BX is a Banach space under point-wise addition and scalar multiplication, and the supremum norm.

I love this! Not only did we bundle together all the spaces that we map to in physics, but also all the useful functions in physics belong to the same category of spaces. Therefore we only need to care about defining variational constructions in Banach spaces and we are done!

Before we move on we can introduce some nice properties of function spaces.

Proposition: Let X be a set and B a Banach space. If |X|N then BXB|X|.

Functionals

A functional is a term that we use when the word function is a ambiguous. Similarly to the words set, family, and collection in set theory. They are all sets, but depending on context we might have sets of sets, and sets of sets of sets, which might end up being cumbersome after some times.

Definition: Let X,Y be sets of functions, then a functional is a function f:XY. The set of all functionals form X to Y is YX.

There is no special property of functionals that makes them distinct from functions, but using function for both the map and its input could be confusing. Notice that distributions are functionals, and in fact we will use them a lot in our attempt at calculus.

Functional Differential

The well defined concept on functional derivatives is the functional differential, more so than the functional derivative itself. Here is how we define it.

Definition: let A,B be a Banach spaces, and F:AB be a function. The functional differential of F at aA is the linear map δF[a]:AB such that for any ϕA

δF[a](ϕ)=limϵ0F(a+ϵϕ)F(a)ϵ=ddϵF(a+ϵϕ).

We notice that if the Frechet derivative is defined for FBA then, δF[a] is the Frechet derivative at a!

Technically this is all we need in order to define functional derivatives, but lets first see some examples.

Example: Let M be a Lorentzian manifold and consider the Banach space B:=C0(M) of compactly supported functions on M. Then we can consider real functionals on B, ie. F:BR. We know that if F is continuous then it is a distribution (see distributions). Let F be of the form such that for any fB

F(f)=MLf=ML(x)f(x)dx,

where LB is some fixed function (One can recognize F as a regular distribution). Then its functional derivative at any point fB is given by the map δF[f] such that for any gB the following holds

δF[f](g)=MLg=F(g)δF[f]=F  fB.

Notation and Physics: In physics we often want something of the form δFδf that satisfies chain and product rule in order to call it a derivative. We can identify such an object in the following way. The functional derivative of F by f of a regular distribution F is the function δFδf such that

δF[f](g)=MδFδfg,

for any gB. Now we are almost done and we can show some cool properties that are very reminiscent of derivatives in this notation. Before we do so however, it would be nice to generalize this non regular distributions. But thankfully we have already developed a machinery that does that almost automatically.

Examples in Physics

Most of the applications of variation calculus appear in physics. Let's see how to define some of the most common objects.

Definition: Let (M,Σ,μ) be some measure space with Lebesgue measure μ, B a Banach space and L:BR. Then an action functional is a function S:BMR given for any γ:MB by

S(γ)=MLγdμ,

where we have used the Lebesgue integral. Additionally, L is called the Lagrangian.

Here is some setup for using this formalism.

Proposition: Given a smooth manifold M the set of sections Γ(TMTM) is a Banach space.

Proof: The set of such sections is the set where all the metrics live, so to effectively turn it into a Banach space, we need to find a norm on the following bundle TMTM. We define a norm on there in the form of a bundle metric, simply by taking a section of that bundle (such that it is symmetric, nondegenerate, and so on).

Now we can define an action functional on that manifold as such.

Proposition: Given a smooth Manifold M and a Lagrangian L:TMTMR one can define the following action S:ΓSym(TMTM)R for every gΓSym(TMTM)

S(g)=MgLg.

Proof: We have shown that S is a map of Banach spaces, and we know that smooth integration on a manifold defines a Lebesgue measure.

Now we are finally ready to define the stress energy tensor.

Theorem: (The definition of the stress energy tensor) The functional differential of the action S:ΓSym(TMTM)R above at gΓSym(TMTM) can be expressed for all δgΓSym(TMTM) as

δS[g](δg)=MgιTδg,

where TΓSym(TMTM) is a stress energy tensor (defined here), and ιTδg is the contraction of δg by T in components given by

ιTδg=Tμνδgμν.