Twisting Adventure
Twisted Hilbert space. “Oh just calculate the twisted Hilbert space by this defect.” “It’s just boundary conditions.” “You forgot to account for the twist fields.” “The operator at the endpoint of a TDL is a twisted field.” I swear to god after the first 10 minutes of learning defects you are bombarded with phrases like these as if talking about twisting Hilbert spaces is something you learned in Quantum 1.
Unlike most of the notes here, this chunk is meant to motivate the existence of twisted Hilbert spaces and use a constructive approach to obtain intuition and familiarity. The calculations here are rigorous (i.e. not handwavy except in the first section) but kept simple at the expense of generality. The point is to go through some constructive examples, and not to present a rigorous general framework, because there is no point chasing generality without a working understanding.
Twisting Adventure
Creating States
Space is a Circle
What is the ground state?
Interlude: States from Euclidean Path Integrals
Vacuum Using Path Integrals
State Operator Correspondence
Untwisted Free Fermion
Spinors in 2D
Spin Structures in 2D
Definitions
Counting the Spin Structures
Spin Structures for a Cylinder
A Path Integral for Free Fermions
Majorana Spinors and Kinetic Energy
Aside on Antisymmetry
The Path Integral Measure
Basic correlation Functions
Virasoro Primaries
Stress Tensor and Trace Anomaly
Vacuum
Twisted Free Fermion
An Equivalent Vacuum State?
Interpreting the Defect
Twisting the Path Integral
Decomposition Into Primaries
Creating States
We will start by exploring aspects of quantization of Conformal Field Theories. For us quantization will mean a way to obtain a the Hilbert space of a CFT using classical fields. Such a quantization could be the path integral formalism. Another could be canonical quantization. However, it is not necessarily the case that the Hilbert space of any CFT should be given by some quantization procedure, nor that different quantization procedures must give different Hilbert spaces. We will simply use it as a tool to build intuition and some basic examples from scratch.
Space is a Circle
One of the things we so often encounter in CFT is that the fields live on a circle. Yet, the space we are used to working in, almost without questioning it, is always something like an infinite line, so what gives? The answer presented in the big yellow book and others is
“To eliminate any infrared divergences, we compactify the space coordinate.”
To that I say “wtf?” Even if one doesn’t know about infrared divergences this sounds like “we do it on a circle because a line is harder.” This is honestly not far from what is actually happening, however the truth is backwards.
Sidenote: An infrared divergence in QFT is encountering an expression that diverges for low energies. Classically the energy of a field is proportional to its momentum (squared) so we often call an IR divergence a quantity that diverges at small Fourier modes, i.e. low momenta. For example the lowest momentum you can have on a line is
which corresponds to a field that is flat. If that field is nonzero, its integral diverges, which is an IR divergence.
It is true that there are no IR divergences if we compactify space. Roughly the reason is that the Hamiltonian operator will always converge for small modes since be integrating them over a compact manifold. The power of this is that any efect in
(that doesn’t account for IR divergences which we wanted to get rid of anyway) can be obtained by studying the equivalent theory in
.
That last sentence is where the conformal part comes in. Notice that inside a circle there is a hidden line! One can extract it using conformal invariance. The map is the stereographic projection, which is a (kind of) conformal transformation! pedantic sidenote: The true conformal transformation here is making the diameter of the circle larger while fixing one point as shown below.
So in essence some of the correlation functions of the line are already in the set of correlation functions of the circle and can be extracted using some minimal manipulation. But more to the point, one can show that the ones that aren’t there are correlation functions that contain evidence of IR divergences.
All this is to say that the spacelike slices we will be using for quantization are all going to be circles. How we connect them is going to be non trivial, but for now, they all have to be circles.
What is the ground state?
This is an incredibly confusing question in Quantum Field Theory, but luckily it is more tractable in a Conformal one. One way to think of a ground state is some kind of “empty” state.
In principle the emptiest state you have the more symmetric it is as a configuration. So we would intuitively expect that the conformal vacuum is the maximally symmetric state of the theory. This can be made precise later by saying that it is a state of vanishing conformal weight. That is, it transforms trivially under conformal transformations. But since we haven’t developed this formalism yet, there is another definition we can pursue.
Interlude: States from Euclidean Path Integrals
At its core a QFT is a machine that takes in a proxy of an experimental setup (that usually contains a kind of spacetime, some probes for the fields, data about how the fields are configured, etc.) and spits out a number which we interpret to be the correlation between the probes we have inserted. Schematically
We usuallt assume that any spacetime has no boundary. But often the main idea is to split the spacetime at some spacelike slice, define a Hilbert space there, and then write the correlation functions as inner products of the states in the Hilbert space. In fact there is a general theorem (the aptly named Hilbert space reconstruction theorem) that says that if you have correlation functions satisfying things like “locality”, etc. then there is a separable Hilbert space that gives rise to them as inner products.
Under this splitting though, we can rethink our
map. We can define the map
that roughly takes in a manifold
, a set of probes/data
on that manifold, and then it spits out a state on the Hilbert space that lives on the boundary of the manifold
All this is nice and fun in theory, but what happens in practice?
Wick (I think) observed that correlation functions can be analytically continued as functions that have a complex time coordinate. Not only that, but the analytic continuation is to literally plug in a complex time coordinate for in the path integral. This is not really relevant to show here, and to even make this statement in its full generality would be too involved. For our purposes though, this observation is enough to use the Path integral to define quantum states, aka the map
. Here is how.
As we said our spacelike slice is
so we want to define a Hilbert space on
. To do this, consider a 2 Dimensional manifold
such that its boundary
. Then if I give you an action, we better be able to find the largest set of manifolds that can accept it. In the case of the free boson this is relatively simple.
Definition: Let
be a 2-dimensional Riemannian manifold and
be a smooth function, then the free scalar field Lagrangian
is given by
This is pretty general and it is defined on all orientable 2D manifolds. Not only that, check this out.
Proposition: The free scalar field Lagrangian is Weyl invariant.
Proof: To show that we simply plug in a different metric
, where
. We notice that
So not only do we have a Lagrangian defined in a crap ton of manifolds, it is also conformally invariant! This is awesome. Now what we can do is defined an action.
Definition: Let
be a 2-dimensional Riemannian manifold and
be a smooth function, then the free scalar field action
is given by
Now the question is what is a large enough set of manifolds and functions where this
is well defined? The answer is roughly any manifold
with boundary conditions where
vanishes sufficiently fast. Yet, we can be more precise. Clearly when it comes to the set of Riemann surfaces, then
is defined there for any smooth
. But what happens once we move away from compactness? We need the dreaded physicist term “boundary conditions.” In other words, if I give you something like a cylinder you can still define an action on it as long as the field
is such that it converges. Let’s call
Also given a
we can define the set of fields on that
by
. Now we have all we need to talk about the path integral. Given a field
such that
as well as a measure
in
we define a vector in
as a linear map of the form
such that for any
where
. So heuristically, a state is a way to measure the number of field configurations that end in a specific configuration on the boundary. Different states might correspond to different measures. One subtle point is that it is not necessary that all the states are given by some path integral, but as we will see, when that prescription is available, we construct it such that at least a dense set of them is given by that.
Vacuum Using Path Integrals
After this large digression we can finally start thinking about our vacuum state. We have said that the intuition we want for our vacuum state is an “empty state.” We also said that this should correspond to some version of maximally symmetric. Now we can start being less handwavy.
We have seen before that this would corredpond to a finding a measure
that must be conformally invariant (since the state must be super-duper-symmetric). Unfortunately this is where our luck ends. It is hard to define such a measure. In a perfect world one would like to give enough conditions such as “locality, conformal invariance, convergence, etc.” and then have some version of measure theory spit out a unique
. Unfortunately, in almost all cases we can’t even have conformal invariance! We need to start using regulators, cutoffs, discretizations, or other regularization schemes that are not guaranteed to fix a unique measure. So bummer.
That said, there are theories where the measure does in fact become so crazily constrained that we get the closest thing to a “unique vacuum measure.” We will see the example of the free fermion in the next section where we will work everything out in detail. So, for now, assume that we have a mechanism for choosing it, and in the next section we will actually write it down.
Yet, picking a measure is only part of the story. We will assume that this measure is local, conformally covariant in the sense that it is invariant only up to terms that depend on the conformal anomaly (which we have explored here), and well defined on the manifolds we will consider. If we impose that the vacuum is
invariant, which is the Witt algebra, the one generated by rigid conformal transformations, we will see that we can create a measure where there is exactly one such state. In that case we will define that state to be
In other words, we will intuitively use as the vacuum the state corresponding to the “empty” unit disk. We will see this in the example of the free Fermion, but there will also be a more rigorous follow up as to why we should always be able to do this.
State Operator Correspondence
Now let's assume that we have a unique vacuum
. We will use conformalness to show one of the most fantastic results which is known as the operator state correspondence. First of all, in any QFT if we know the set of operators we pick an operator
and create corresponding states by
Effectively we are modifying the integration measure of the CFT by "inserting" the operator to it. In physics we call such a measure modification an operator insertion.
The nontrivial result of conformal magic is that while every operator corresponds to a state, this works the other way around. Here is how we can use states to construct local operators. Let's first consider if we have a state
constructed the way we did above, how can we find the operator
we used to generate it.
One idea would be to find the operator
which would be the operator that we need to multiply the vacuum state with in order to obtain the state
. In principle, one can define vacuum states for an arbitrary disk
of radius
. Because of conformal invariance we can see something cool. Since
and
are isomorphic, so are the sets of fields we can add define on top of them i.e.
. There is an isomorphism which is a conformal map
such that any
can be be mapped bijectively to
. However, notice that the action
is conformally invariant. So effectively the vacuum state at any radius remains unchanged on how it acts on fields of the circle.
This means that we can always define a vacuum state on any disk. Now we can see the following identity form the properties of
. Namely
In this case
is an operator of Hilbert spaces. What we can do, is define symbolically by taking
by taking
.
so we obtain the defining identity
By the way, we often write this as
Using a translation operator
for some
we can define the operator
at any point like so
This exercise was almost tautological. What is fantastic in a CFT is that it is possible to play this game for a dense number of states! In other words, if we know the operators we know the states, and if we know the states we know the operators. The main ingredient we need to show this is this.
Theorem: Consider the set of primary fields and descendants
. If their asymptotic states, namely states of the form
where
, generate a dense subspace
in
, then for every state
there exists a local operator
such that
This might seem tautological at first, but it is quite subtle. The subtlety lies in the fact that a random state in that dense subspace is a (possibly infinite) linear combination of descendant asymptotic states. The fact that we can take the limit out is the significant result. This is a statement about how well behaved the fields in a conformal field theory are.
Untwisted Free Fermion
Ok enough with wishy washy stuff. Time to do a detailed, rigorous example of a nice CFT. The one for a free fermion. As we have motivated above, we want to consider a free fermion living on a circle. This introduces the first subtlety. While on a line, there is only one way to write down a fermion (principal bundles on
are all trivial) on the circle there are more! So before we even start with a quantization procedure let's first examine the classical physics of a fermion on a circle.
Note: This section (except the last subsection) is not strictly needed in order to understand how to obtain a Hilbert space from a path integral in general. Its purpose is to define the simplest path integral that we know of so it dives into the nuances of fermionic path integrals and spin structures. With a working knowledge of the Majorana path integral quantization in 2D this section can be skipped. It is still fun and satisfying to work out a full description of a QFT. We don't often have the luxury of working with one.
Spinors in 2D
What we want to build up to is a path integral prescription that we are able to define on any
dimensional Riemannian manifold. One subtlety that we have to consider is that there might be multiple inequivalent ways to define a spinor (and by extension a fermion) on different manifolds. We need to find a way to take them into account.
Let's start by asking what is a spinor a bit more carefully. In physics a Dirac spinor is a vector in the defining representation of
where
is the dimension of spacetime. This is useless, so let's be more careful. For
we have that the complex Clifford algebra is
. So in a very basic sense a Dirac spinor is a vector in
where a vector in
can act on it through the gamma matrices.
Let's be a bit more concrete.
Definition: A 2-dimensional complex Dirac spinor is an element in
which carries the following action of
called the Clifford multiplication
where
, and
is a linear map to
. We can pick the Clifford representation to be
Now we can write the chirality element as element as
The next thing we need to work out is the orthochronous spin group. The orthochronous spin group is the set of invertible elements of the Clifford generated by even combinations of elements of each sector. Here a cool result.
Lemma: The spin group
is isomorphic to
.
Proof: The definition of
group is that each element is a product of even number of Clifford matrices associated to vectors
such that
. Now since the vectors only have two components we notice that if two vectors on the circle
are an angle
apart, then
In addition we know that by definition
Therefore we can solve that
where
squares to negative 1.
This simplifies our life so much because it simplifies what our spin structure can be. But for now let's check something about double covers. We know that the spin group must be the double cover of the group of rotations
let's make sure.
Theorem: There exists a double covering map
.
Proof: The squaring map
is a double cover of the form
, since both groups are isomorphic to
we are done.
However we can do better! We can explicitly find the map. In fact let's do it now. We know that there is a canonical action of the
group on
such that for any vector
and any
This may look unmotivated, but here are a couple of other ways to see this. We can write this as
Yet this still looks unmotivated. Here is the true place from where this came from. We can first define the left inverse of
which is the map that satisfies
, essentially takes the gamma matrices to the original vector they correspond to. Then we have that
This form will be useful in what we want to prove later. But the point is, we can now use this group action to define the map
such that for any vector
Let's show that The fact that this is a double cover comes directly from
. Notice that since
is the
rotation and we already know that every rotation in
can be written as
we are done!
Corollary: Consider the following group isomorphisms
and
where
and
. Then
. In other words, the following diagram commutes.
In other other words,
is the trivial double cover of
!
Spin Structures in 2D
We are not just interested in isolated spinors! We are interested in spinor fields over 2D manifolds. The whole point of doing this is noticing since the tangent space of any 2D manifold is
there is a natural action of the vector fields on the manifold to the kind of spinors we can add them. We want to create our spinor bundle such that it respects this. The way to do this is by picking a spin structure. Let's recall an abstract definition first and then we will make it progressively more tangible.
Definitions
Definition: Let
be a 2 dimensional Riemannian manifold. Then a spin structure on
is a a principal
bundle
with a double cover of principal bundles
, where
is the
reduction of the frame bundle of
such that the following diagram commutes.
where the dotted arrows are the principal actions of the structure groups to their bundles.
This certainly needs unpacking. However, what it is saying is that the spin structure is a double cover of the
bundle of our space such that the action of the
group is compatible with the double cover.
The first level of simplification here is that all of these objects are all
bundles since
. So in some loose sense defining a spin structure is related to the question of writing down a theory of electromagnetism on
. However, what we are interested as of now is asking, how many different (i.e. not bundle isomorphic) bundles we can have with structure group
on
.
Counting the Spin Structures
Well, if
then we know already that there is only one principal bundle we can have, because all
principal bundles are trivial. But what happens in more complicated spaces?
We are in luck again, check out the following lemma.
Lemma: Any two Riemann surfaces are conformally related iff they have the same genus.
This means that if two Riemann surfaces have the same genus, then there exists a conformal diffeomorphism between them! This is fantastic news as long as the spin structure is invariant under conformal transformations.
Lemma: A diffeomorphism of manifolds induces a
-equivariant bundle isomorphism of spin structures.
This says that if we have two manifolds that are diffeomorphic and one has a spin structure, then we can define a spin structure on the diffeomorphic manifold and vice versa. So diffeomorphic manifolds admit the same spin structures. If there was a structure on one that wasn't on the other, then their diffeomorphism would induce it.
Corollary: The number of admissible spin structures on a Riemann surface only depends on the genus.
WOW! AMAZING! Now let's do better than that and try to solve it. There is this theorem that I will not prove because it uses some super specific characteristic classes of smooth manifolds, but it will be useful for this.
Theorem: All orientable Riemannian manifolds
admit a spin structure. There is a bijection between the set of isomorphism classes of spin structures and
.
Proof(esque): The first part of the theorem is originally stated as: A manifold admits a spin structure of its second Stiefel-Whitney class vanishes, and all orientable Riemannian manifolds have that property. The second part, which is the most interesting for our purposes says that that cohomology group numbers the spin structures. The way to start showing this is to realize that the spin group (being a double cover) is a central extension of
like so
where the left arrow in our case maps
to the poles of
. This means that any two principal bundles must "differ" by a
bundle which boils down to calculating (up to principal bundle isomorphism) what are the kind of principal
bundles we can add on
, but they are classified by the number of nontrivial singular
cycles on
which is given by
.
As a brief aside to
valued cohomology groups since there is no such thing as a
valued form, we can't use de Rham cohomology directly. For our purposes here it suffices to say that
With this as our definition we can see something cool.
Lemma: Let
be a genus
compact Riemann surface. Then
.
Proof: It is well known that for such surfaces
This is proved using the Seifert-Van Kampen theorem on the polygonal representation of
. Then we need to find the homomorphisms between that and
. There trick here is to realize that
where
is the abelianization of the group
. This is true because the preimage of any homomorphism of
belongs to the Abelian part of
. However, the abelianization of
is simply
since we impose commutation relations for all pairs of
. Therefore we conclude that
since for every such homomorphism we need to make the choice of putting the generator to
or
.
This is nice! But we are not done, because the cylinder is not a compact Riemann surface! So let's try to work this out.
Lemma: The disk with
holes
has
.
Proof: The disk with
holes is homotopic to a wedge sum of
circles, therefore its fundamental group is the free group of
elements, and its abelianization is
. Using the logic of the previous proof the cohomology group is then
.
Spin Structures for a Cylinder
So while we are expecting that a disk has trivial cohomology group, therefore only 1 available spin structure, the cylinder has 2 available spin structures! In physics we call the two Ramond and Nuveu-Scwartz boundary conditions. Let's derive them now, and see where the name boundary conditions comes from.
Assume that we have a spin structure on our surface
. Then, we still haven't defined spinor fields. These are formally sections of associated vector bundles to the spin structure under the Dirac spinor representation. Before we even do this, here a super nice property of spin bundles.
Lemma: Let
be a spin structure on a Riemann surface
with covering map
. Then given any contractible char
and an associated vielbein
. Then there are precisely two local sections
such that
.
Proof: We can think of the vielbein
as placing
into
. Then we knoe that
is the double cover, so we should be able to find
precisely twice in there under the covering map
. Each possible disconnected sheet defines a section of
over
.
This means, that if we have chosen a vielbein (a set of orthogonal local coordinate vector fields on
) there is precisely two gauges that we can use to describe our spinor fields that are compatible with our vielbein. Another way to read the previous lemma is that given a choice of gauge in our spin structure no only we can associate a unique vielbein to it, but also we can find it's "opposite" gauge. Let's be more specific.
Definition: Given a spin structure
over a Riemann surface
and the dirac spinor representation
a spinor bundle
is given by the associated complex vector bundle (or
-twisted bundle)
Sections of
are called spinor fields.
Notice though that in the setting that we are all these bundles are
bundles! So things are not as dire as we would have expected. Also remember that
matrices are defined with respect to a vielbein
which is equal to
for some gauge
.
To be more precise, let
be a spinor field,
be convex chart, and
be gauges associated to a vielbein
on
. Then we can write a local section in the trivialization as
for some maps
(which btw is what we call spinors in physics, but here we will call local spinor fields). Now let's see the Clifford multiplication. Let
, then locally we can write it as
where
. Therefore we can write the Clifford multiplication as
It follows that
.
Proposition: These maps satisfy
.
Proof: We know that
must satisfy that for any two points
given by
for some
then
We also know that there must be a map between
and
. This can be given by an element of
such that
. Therefore we can write
However we know that
. Therefore we have that
Now let's find
. We know that
since
is a homomorphism and
. So what should the preimage be? We can now use the map directly and find that
for some
so without loss of generality we find that
. However, since
is forced to be a double cover,
so the only possible answer is
. Notice that this would be true for any dimension.
Ok, so what now? Our goal is to try and find a way to systematically obtain the possible spin structures we can add on a Riemann surface. To motivate this, let's study the possible spin structures on
. From our classification we are expecting two, one of which is trivial.
Theorem: The two inequivalent (up to spin structure isomorphism) spin structures on
are given by
where
are defined for any
as
Proof: First of all lets notice that these two are indeed spin structures. One way to prove this is to guess them and then show that under spin structure isomorphism they are inequivalent. So since we have two inequivalent ones we have found them all. Below is a more constructive proof that avoids using that fact (since we didn't actually prove it before).
Step 1: (There is a unique oriented frame bundle) We know that the spin structure is a double cover of the oriented frame bundle of the cylinder. But since we can find a global vielbein the cylinder is parallelizable which means that there frame bundle is unique. Now we know that for each such vielbein.
Step 2: (Any
principal bundle on
is trivial)
Notice that there is a global trivialization of
, and since
this means that
. Since also
we can find a bundle isomorphism between
and
, which means that the spin structure is trivial as a principal
bundle.
Step 3: (Spin structure map is a map from the base space to the fiber up to diffeomorphism)
Using the axioms of the spin structure we must be able to find a map
such that the following commutes
I fully acknowledge that this is overcomplicated, in 2 steps it will simplify a lot. In other words, what this diagram implies is that we the bundle double cover
must obey the property that for any
What is nice is that the map
is completely defined by its restriction on
. By the definition of a bundle automorphism, there must exist a diffeomorphism
such that
So up to diffeomorphism of the base space all we are really interested in classifying are the smooth functions
.
Step 4: (Constraining using Homotopy)
Now notice that if two such maps are equivalent up to homeomorphism, then they must be equivalent up to homotopy. In other words given a diffeomorphism of the base space
, the maps
and
are homotopic. Therefore we can first classify the maps
up to homotopy. But since
is homotopy equivalent to the circle, we have restricted the kind of maps to the maps from
which we know are classified by
as
for some
. Now we need to restrict more to find the diffeomorphism classes. As a result any inequivalent spin structure is of the form
Step 5: (Constraining using Dehn twists)
It might seem that we have completely constrained our spin structures, but we haven't considered all possible spin structure isomorphisms. We use a similar argument to the one above to consider bundle automorphisms. This time we find that up to diffeomorphism we only need to consider the maps
The logic is identical but now we restrict ourselves to maps of the form
such that under composition of
they are identity. Notice two spin structures
are isomorphic if there is a principal bundle isomorphism
such that
. Therefore since
for some
is a bundle isomorphism we can use repeated Dehn twists to find equivalences between
. In other words
iff
for some
. Now we can calculate
We have finally found our
grading! We have that
, which finally fully classifies our spin structures.
Definition: The spin structure
is known as the Ramond boundary condition while the spin structure
is known as the Nuveu-Scwartz boundary condition.
The previous theorem simply filled in the details of physics textbooks. We can restate the above result in terms of the spin holonomy on the Dirac spinor bundle.
To do this recall the following definition.
Definition: Given a spinor bundle
over a Riemann surface
associated to a spin structure
, the canonical spin connection
such that for any local section
,
where
where
is the spin connection defined by
where
is the Levi-Civita connection 1-form of
.
Phew! This is a lot, but what is particularly nice on the cylinder is that it is flat, so
hehe. In that case, the spin connection is the literal exterior derivative, which is at least nice.
Proposition: For any spinor field
and any simple nontrivial loop
on the cylinder we have that
where
is the parallel transport along
with respect to the spin connection.
Proof: Let's pick
to be our simple loop
, anyway the holonomy in invariant under the choice of loop, and consider a flat lift
of
on the frame bundle. Then given a spin structure, we can associate a local gauge
such that
. Then the parallel transport along
is given by
Now since
is a flat lift we have that in the trivialization
. Then under the Ramond structure we have that
. Here we see that
which proves the first part. for the other one we see that
for some
, where
. Now what we notice is that
The fact that the section
is shown using the quotient construction of the associated vector bundle, but I will not do it explicitly because COME ON! This proves the second part.
Let's generalize a bit. What happens in arbitrary Riemann surfaces? Well what we saw was that in 2D the classification basically boiled down to uncovering two boundary conditions per non-contractible cycle. In fact this makes sense with our original statement. We said that the spin structures are classified by
When we see this we are like: wait a second! What this group classifies is all the ways you can take the noncontractible cycles of
and label them with a
. That
is the holonomy of the spin connection corresponding to that spin structure! So in essence we can rephrase our classification of spin structures to a statement about boundary conditions of the fermion fields around non-contractible cycles using the method above.
A Path Integral for Free Fermions
Wow this was so much work, but what we have done is pretty remarkable. We have not only found a way to consistently describe all the possible ways we can have a spinor living on a Riemann surface
, but also found a way to parameterize them! So now the task of "summing over all possible fermion fields" which is something we have to do in our path integral is no longer as daunting.
Majorana Spinors and Kinetic Energy
The next step is to define an action. And in order to do that we need a
-invariant inner product to write things like kinetic energy and whatnot. In our search to find it we find out these really cool results summed up in the following theorem.
Theorem: The Dirac spinor representation
for
for
admits a real structure, i.e. it admits a complex
equivariant complex antilinear map
such that
. Additionally the same representation splits into two isomorphic eigenspaces
of the chirality element
.
Proposition: For
the complex structure is given by
defined for any
by
where
is the complex conjugate of
.
Proof: We know that
and
. Also we know that
Since every element in
is written as
for some
we have that
.
The proofs of the statments in the previous theorem are here but honestly they are not as needed, since we are only going to do everything in the 2D spinor representation anyway.
Definition: An element
is called Majorana if
and Weyl if
. For any spinor, we call
its Majorana conjugate or Charge conjugate. The restriction of the Dirac spinor representation to its real (Majorana) subspace is called the Majorana spinor representation.
This is an interesting split. In physics we sometimes care about the physics of Dirac spinors, or of their Majorana subspaces. In particular, a very interesting CFT arises when we consider our spinor representation the Majorana spinor representation. This doesn't change anything from the discussion above as it is general enough to work with any representation of
. That said from now on we will restrict our attention to Majorana spinors.
Definition: A Dirac form is a nondegenerate
-bilinear form
on a Dirac representation
of
such that for any
,
and
we have that
.
The dual space element
is known as the Dirac conjugate spinor.
A Dirac form is the closes we can get to a hermitian inner product that is
invariant. Notice that the only thing left for it to be a Hermitian form is positive definiteness, which honestly is the first thing we are ok with giving up.
Proposition: On
with the
representation the only Dirac form is defined for any
by
Proof: Property 1 of the definition implies equivariance for
matrices, this means that
. The next thing we want is conjugate symmetry and nodegeneracy which means that
for some invertible complex linear map
. Plugging in the first thing we find that
but also this should equal
. From this we find that
for some
.
Fantastic! Now that we can measure the length of spinors we can finally measure their kinetic energy by defining a Lagrangian. The only step missing is to extend the Dirac form into a bundle inner product which is done canonically the way we expect so let's go ahead and write down the Lagrangian!
Definition: Let
be a Riemann surface and
be the spinor bundle associated to a
structure on
. Then the massless free Dirac Lagrangian is given by\
where
is the Dirac operator given by
such that for any vielbein
The dirac operator is effectively a trace over a vielbein of the covariant derivative of a spinor. In local coordinates (in flat
) it is given by
When you're not flat you have to add some Christoffel symbols but whatever we are working on a flat signature anyway because of our previous exploration that all Riemann surfaces are conformally flat. Another thing to notice is that the Dirac operator swaps the chirality of the spinor, in other words it maps you between the Weyl subspaces.
We can now define the action
of a spinor field
by
So what we have now is the
part of our path integral! WOHOO! Now for the hard part, how do we put a measure on the set of spinor fields of
? Here is where grassmann variables come in.
Aside on Antisymmetry
So far all of our fields were complex valued, however what we would like is to impose an extra condition to the operators in the quantum field theory side. We would like them to be antisymmetric. Usually we could achieve this by giving an antisymmetric structure on
where we can build our spinors with antisymmetric components that have some intrinsic wedge product instead of a scalar product. That would be a much more complicated space to find spinor representations in but there is a way we can treat antisymmetric spinors (or fermions) using the language of spinors that we have already developed. Particularly by noticing that with the exception of the Dirac form, there was no point in which we exchanged two spinor components for the antisymmetry to be apparent.
So welcome to the wonderful world of Grassmann numbers. Honestly the rigorous picture of this is quite confusing it involves somehow defining a tensor product with an algebra that contains uncountably many generators so I am a bit iffy. Going through Quantum Fields and Strings (chapter 1 lol) there is a rigorous treatment of the extension of our spinor bundle in the language of sheves.
So we will use results from the cast down to a schematic language we can understand.
Definition: A commutative Grassmann algebra or superalgebra
is a
graded vector space over
with a multiplication
such that for any
where
is the
degree of
.
Proposition: Let
be a commutative superalgebra generated by
elements of odd degree. This means that each element
has a unique decomposition to
where
,
, i.e. the antisymmetric product product of all the
, and
is the powerset of
.
Here is an interesting observation. We can think of the superalgebra generated by a set of odd generators
as the set of polynomials in
with antisymmetric generators given by
.
Therefore what we should do is to define the integral of such polynomials. Let
then we know that
We will define the Berezin integral as follows.
Definition: The Berezin integral of
in some commuting superalgebra generated by a set of anticommuting generators
is given by
where
is the term in the usual decomposition above.
This measure simply extracts the coefficient of the higest order (if it exists) in the polynomial
. For example
The nice thing about this measure is that it is very course.
Intuition: Once we think of Grassmann numbers as formal polynomials of anticommuting variables that really transforms our idea of the Berezin integral. The Berezin integral is a way to "integrate" the polynomial over all possible "values" the anticommuting variables can take, even though it doesn't strictly makes sense to assign a value to the generators.
One interesting class of superalgebras is
which is given by
. Then the next most interesting one is
. One more small aside, when we refer to “complex Grassmann numbers”, we are referring to the Grassmann algebra with generating set
where there are twice as many Grassmann numbers, but now we have the conjugation involution to switch between the generators. We are now ready to define fermion fields!
To do this carefully we think about scalar fields. Scalar fields are usually smooth functions on our manifold. One way to represent a smooth function would be to assign a value at every point, however, what we have just observed is that it doesn't really makes sense to assign an antisymmetric value at a point.
Yet, we can think of real functions as polynomials. Let's consider
for a second, i.e. the set of smooth functions on
. As a Sobolev space we know that it is separable, i.e. we can find a countable basis for its elements. Let's call that basis
. Then every point
can be given by
for
. So in some sense, the polynomial
given by
generates all the possible elements of
simply by plugging in real numbers for the variables
.
Notice that at the end of the day, all we want to do in Quantum field theory is not to pick out a specific
, but rather to add a measure to all of them, and using the polynomial
we have essentially parameterized all of
. As a result, if we "integrate"
over
we would integrate over
. This is amazing news for our spinor fields.
In the same way we defined all commuting spinor fields by providing a polynomial that effectively parameterizes them, we can define all anticommuting spinor fields by providing a polynomial
with antisymmetric generators. That polynomial is going to be an element of a Grassmann algebra that has values in modes. Here is this statement a bit more precisely.
Definition: Given a spinor bundle
, we say that
parameterizes the Fermionic fields (or antisymmetric spinor fields) if there exists a basis
of spinor fields such that
, where
is the corresponding Grassmann odd generator.
It doesn't matter that we can't actually plug in values to
to obtain different Fermion fields, because we already have a notion to integrate over all of them using the Berezin integral. The previous definition leads to a natural follow up definition for Grassmann algebras.
Definition: Given a Grassmann algebra
generated by
, an element
is called generating iff it contains a single monomial for each generator. If
it is called th generating unit. If
for some separable algebra
then the algebra generator with respect to a countable basis
is the generating element
where
is an antisymmetric generator of
.
So with this definition
is the algebra generator of
with respect to the basis
. Btw the intuition of a generating element is things like the vector
in
.
The Path Integral Measure
The key word of this is mode expansions. Here is the high level description of this story. We find a countable basis of sections in
for some spinor bundle and create a set of modes called
,then we will find the corresponding generating polynomial for spinors and then use the Berezin integral as our path integral.
Note for pendantic Mathematicians: But how do you even find a set of countable modes over the smooth functions on your space? Blah blah blah! Ok this is the biggest pet peeve of mathematicians when they see path integrals. They think that the set we are trying to put a measure on is the actual set of
functions. Then someone says that it is
which does have a countable basis, but then proceeds to using non
functions. Here is the deal, the true answer is: We don’t care to measure either of these sets!
WAAAAT. Yes! What we do in fact measure is the following. Every lagrangian we write down HAS to have a kinetic term. In some sense this is a “quadratic term,” i.e. something that can be written as
where
is a field and
is a linear map of fields. What we are interested in measuring is the domain of
. To give some examples, in the free Boson lagrangian
, this has a wonderful basis of eigenvalues that are bounded and whatnot. In this case of the spinor field
which as as its domain pairs of analytic functions, which can be spanned using Laurent series. The true assumption that goes on in the path integral is that the domain of our kinetic energy operator is bounded. So with that out of the way, let’s actually calculate things.
Here is a definition.
Definition: Let
be a Riemann surface, then the free massless Majorana Fermion path integral without insertions
is given by
where
is the set of spin bundles associated to the possible spin structures (up to spin bundle isomorphism),
is the set of modes for fermions in
, i.e. a countable basis for
, and
is the Grassmann algebra generator of
for the basis
.
I AM SO HAPPY RIGHT NOW! We have a way to write the path integral for the spinor theory on an arbitrary Riemann surface! Thing is this can mean many things so let's try to work it out on some small way. Let’s start working out some modes.
Example: (Modes on the cylinder) This is getting obscure so it is high time we work out an example. Let’s find the modes for spinors on the cylinder
. What we will find (luckily) will be similar enough to the modes one obtains during second quantization which is at least satisfying.
We will work with Majorana spinors so let’s write down the set of majorana sections of a spinor bundle
over
as
. These are defined by
where
is the real structure of the spinor representation that is extended. Specifically, let’s use a set of isothermal coordinates for the complexified tangent space to define the vielbein
where
is the coordinate along the circle, and also let’s assume that we have a metric such that our circumference is
. This looks pretty already, but wait until you see how nicely we can write down the majorana spinor representations. In fact we know that
. So solving we obtain that
for some
. WOW! So let’s write out our sections in this gauge. Let
be one of the lifts of
to the spin structure of
, then we know that for any majorana spinor field
there exists a function
such that
This is awesome let’s goo! Now we need to specify
and by extension the spin structure in order to find modes. Let’s start with the Ramond sector where the holonomy of the spin connection is trivial. To calculate
we notice that under the action of
the vielbein
is given by
. So its simplest lift is itself
. Perfection. So we need to solve some kind of eigenvalue equation like
for
. Notice that since
can be defined globally we are looking for a function
which implies the boundary condition that
.
The way it is written right now it is begging us to take Fourier series in
since we have a periodic function, so let’s write
I’m in love. However the rest is not that nice. We don’t have an equally nice basis of Fourier modes for
. But we know that all these smooth functions have countable bases, for example the Hermite polynomials. Let’s pick a basis
to write
Eh not as cool, but at least we can now write a basis of fermions by assuming that
are Grassmann valued. Specifically we will define the set of Ramond modes to be
The next question is what happens in the Nouveau-Schwartz spin structure. There we are slightly less fortunate. The reason is that our spin structure doesn’t allow us to lift to a global vielbein. Namely we have the equation
which has solutions
which isn’t a function over
. But likeeeee it’s only one point where this is discontinuous in. So what if we define our functions on the double cover and then take one of the partial inverses down to the circle? This would correspond to the chart of the circle with one of the poles removed. then we can get the remaining point by switching charts.
Now having moved to the double where
, we have that
. This means that we can write it in modes
This appears because this circle has twice the radius, and the holonomy constraint implies that only the odd modes appear. Taking the restriction for
we are back on the circle with the north pole removed, but this was enough to fully define our modes everywhere, since we can take the limit for the last point. Therefore the Neveau-Schwartz modes are
There is a much nicer example where these are so much better defined but on the torus. For now we continue on the cylinder.
Now let's examine some cool consequences of the Berezin integral of the generating polynomials specifically.
Proposition: Let
be a generating in
and
a linear map of generators in
, then
Where
is the generator with the complex conjugate coefficients but in the algebra
, that we think as the complex conjugate of
. More precisely we have that
where
are the monomials in
.
Proof: First let's notice that for any linear map of generators
, the following identity holds
This is a standard property for top forms in an exterior algebra casted in this context, one can show this by unpacking the wedge product into an antisymmetric sum with Levi-Civita symbols. We know that since
contains only monomials for each generator in
there must be a unitary transformation
such that
Then we have that
As a result, we can calculate that for the polynomial
the term of the form
can be calculated by the expansion of the exponential as
Then we can calculate the top term to be
Realizing that the integral is
we have shown the thing.
Notice that in the above we have defined
with respect to the norm of
. In finite dimensional vector spaces this would be
, one can alternatively think of having an inner product such that
, this is how we generalize unitary transformations for infinite dimensional vector spaces. The reason for doing the above was to evaluate the path integral.
Corollary: The path integral with no insertions for the free fermion on a Riemann surface evaluates to
where
is the determinant on a vector space with a basis the of modes
of a spinor bundle
.
This is so amazing, I am so happy right now. However, this path integral only gives us numbers, in other words it can only offer a way to calculate correlation functions for closed manifolds. If we want to evaluate states we need to use the prescription we have developed in the first part of these notes. We will do this below.
Basic correlation Functions
The first thing we want to calculate is the correlation functions of our spinor fields. So let's try it. Let's do this on the sphere
. This has genus
so it only has the trivial spin structure which is particularly nice. What is even nicer is that if we work in a chart
that doesn't have the poles we have something conformally equivalent to a cylinder with a Ramond spin structure. What we have also seen is that for the complex coordinates
we can find a gauge
where a spinor field
is given by
where
. In this notation the action is written as
Now what we need is a mode expansion as well as the following lemma.
Lemma: Let
be the generating unit generator for a Grassmann algebra
generated by
, and
be a linear map of generators, then for any
Proof: This is a corollary of the previous proposition. Basically, we don't get the top form out of the exponential we get the second to last one and then wedge it with
.
Using the formula above we can calculate the correlation function of the spinors if we write the action in the form
. Here is how we can now cheat a bit using the above lemma. The correlation function is defined by
now we expect this to be equal to something like
having written our action appropriately. But in general it might not be true that the ratio of these two terms converges. The thing that breaks things a bit is the fact that we have used
and
in our notation. It seems as if we have used an uncountable number of degrees of freedom to define this correlation function. But here is where the generating functional comes to our rescue!
Definition: The generating functional for a free Majorana fermion is defined for a field parameterization
where
for a set of modes over the spin bundle of
.
Theorem: The generating functional of a free Majorana fermion is given by
where
is the convolution, and
is the distribution over the
such that
.
Proof: Here is a sketch. We will show that the path integral measure is invariant under shifts. In other words if we add to it a Grassmann variable that we are not integrating over then the Berezin integral should remain invariant.
Let
be a set of generators for a commutative superalgerba and
be the remaining odd generators. Now let
be a polynomial in
. Now consider the polynomial
where each generator
is replaced with
for some
and denote
the set of shifted generators. Now notice the following:
So the Berezin integral is invariant under shifts. This is quite nice. Now notice that the thing that appears in the exponential looks like
Now let
and we can write
Since the Berezin integral is invariant under the shift that created
we have that
This is a nice way of regularizing our path integral. However, I did cheat a bit. The shift invariance of the Berezin integral was not proved for uncountably many generators. However there is a systematic way of doing it, it would just involve a lot more work. One way is to take a finite set of generators and then take the limit to infinity of the difference
. This must be zero for each finite
. Anyway moving on we now only need to find
! Once we do we have all the correlators.
Lemma: For the free Majorana fermion on
the Green's function
such that
in a global gauge induced by the coordinates
is given by
Proof: The proof lies on the fact that
To show this we take a holomorphic function
and integrate over a disk
that contains
. Then we have
for an antiholomorphic function the integral is zero.
So we are done!
Corollary: The following is true for the correlation functions
.
Here we are looking at something remarkable. If we have
, classically we expect this to vanish because of antisymmetry. However, in quantum language it seems to blow up! What this means is that there is some ambiguity in the corresponding operator.
Remember that we are never actually evaluating these at points, rather we are thinking them as distributions. To put this in the language of modes, we should be able to reconstruct each mode by integrating against whatever expression we write down for
over something like
. So since only integrals of
matter we shouldn't be surprised that it might be possible that our expression for the product of two fields might not yield a finite result. We can still (very consistently) use this expression to calculate integrals of two fermions.
Sometimes we want though to obtain a field at a specific point by combining two. This would lead to the following distribution.
Definition: Let
be local fields then their normal ordered product is given by
In the language of operator valued distributions this has a more precise definition, but whatever, we will not waste our days and nights on this. Essentially this says that if A and B are singular when they collide remove their singularity. Notice that this doesn't mean that the normal ordered field does not have any other singularities.
Virasoro Primaries
We are about to see why we did this for free Majorana Fermions. The answer is going to be that there are only 3 primaries. Let's explore that. We will first calculate them, and then show that they are the only primaries. In order to do that we need to extract the conformal algebra. To do it we first need our stress tensor.
Stress Tensor and Trace Anomaly
Lemma: Let
be a spin bundle over a parallelizable Riemann surface
. The stress tensor
for a free massless Majorana spinor
is given by
where the volume form is
.
Proof: We know that the Lagrangian is
so what we want to figure out is how it changes under spacetime transformations. We also know that the spin connection is flat which means that
Therefore if the volume form is
we have
In isothermal coordinates
we can write the spinors as
, and in that gauge the stress tensor reduces to
From now on we will refer to the stress tensor components in this gauge like so
Very quickly we encounter an issue. We have already calculated the following
which means that if we try to take
in order to define
then
diverges. This is scary, but sometimes we have said that we don't really care about other divergences so long as the integrals work out well. The truth is though that there isn't a reason we should expect that what we calculated as
in the classical theory should be a meaningful quantity in the quantum theory.
The real meaning of the stress tensor is to take in a "spacetime symmetry" and spit out a corresponding conserved current. The way this is done is as follows. A spacetime symmetry is usually furnished by the action of a Lie group which for every element of its Lie algebra induces a corresponding vector field on the manifold. In physics we call that an infinitesimal transformation. In practice the stress tensor takes this vector field (we contract it against it) and returns a conserved current
(i.e.
) generated by our fields. This is the intuition we want to preserve in the quantum theory.
But there is an immediate issue. What we observe as classical physics (defined using our classical action
) such as conserved currents and symmetries, are only things that we know for a fact hold on average or more precisely at large scales. Why should we expect that the same symmetries should hold in the small quantum world? In fact insisting on it is what causes our singularity.
So how can we move on? The idea is to refine what we mean by symmetries only holding on large scales. Suppose that we have a fundamental vector field
that is a representation of some Lie algebra element for an action on our spacetime. Whatever we define as
it should have the property that
should be a "conserved current on average". The classical definition of a conserved current is that
. However,
is not an object that exists in our quantum theory. What does exist is averages of
with respect to other fields obtained using our generating functional. So a more precise restatement of conserved current is to describe how it acts on the rest of the fields.
Definition: Consider a Lie group
and a smooth group action
on the Riemann surface
. Then for any
we define the fundamental vector field
. That is a vector field that describes how the points move on
under the action of
.
Proposition: An action
induces a representation
of
to
, the set of sections of a spinor bundle over
. This representation satisfies that for any
and spinor field
where
is the spin connection.
Proof: This is the representation induced by the pullback of sections under the inverse map, i.e. the parallel transport of sections of
under the spacetime transformation induced by
.
Usually we choose the generators to be Hermitian which implies that we define the related vector fields as
, such that the matrix exponentials remain unitary operators. This is a matter of convention. In that case our formula is rewritten as
This is the physicist convention for an infinitesimal transformation of a field.
Ok but like why do we even care about this? Because of integration by parts. We know that by definition of the stress tensor the following is true
Ok cool, but isn't the whole point of saying that the current is conserved that this integral is zero? Not necessarily, we define the stress tensor to have this property irrespective of the group action we choose. The reason why it is always possible to do this is a result of variational calculus and it is found here.
Warning: There is no reason that
is unique. In fact it isn't, we can add total derivatives to it all day and it will serve the same purpose. Classically we can also add any term that vanishes because of the equations of motion. The key point is that equations of motion in the quantum theory don't always make sense! We consider (i.e. integrate over in the path integral) field configurations that are not solutions of the equations of motion so some classically equivalent stress tensors will stop being equivalent.
This is the case with our stress tensor
. So how can we uncover the right prescription? A better stress tensor would be one that satisfies a quantum version of
.
Theorem: (Ward Identity) The following identity holds where
parameterizes the antisymmetric spinors, and
is some vector field
or in local form
Proof: We will use the path integral to derive this. Consider a vector field
and some
. Then we know that the Berezin integral is invariant under shifts so it would be invariant under the parallel transport of
. Therefore
Since this must be true for all
the second term must vanish.
Ok we have derived our condition! this is exactly what we want to pick our
to satisfy. In other words we want to pick a
such that we can take the integral outside of the expectation value. Turns out picking the normally ordered
actually works. I will not show this because we have shown so many things here, but it is a direct calculation. So from now on the holomorphic part of the stress tensor is going to be given by
Vacuum
We have motivated previously that the vacuum would be the state defined on a disk with no insertions. Since the disk has trivial fundamental group there is only one spin structure we can add and that is the trivial one. To calculate modes we will use that the fields must have periodic boundary conditions to obtain that our fields must be comprised of appropriate Ramond modes that we have calculated above.
However, we want to figure out what we want to calculate exactly. We could calculate the functional that one uses to obtain the state directly, but this is not useful. In fact here is a symbolic expression for it.
where
is the set of modes that match
on
. I mean... cool? This is sort of useless because we have nothing to compare it to! A way to treat this is to consider it as a the first state
that we have discovered so far. One interesting thing we could do is to figure out its conformal weight. Ideally we expect it to be zero. So let's try.
We can generalize this stress tensor for any field by plugging in
the spinor generating polynomial. Now we are ready to show that we have the vacuum of the theory.
Theorem: The conformal weight of
is
.
Proof: We need a way to write down
and
, the easiest way is to do this with modes. We want a set of Ramond modes for the disk. Let's consider the basis
The basis has the property that
and
So now we can calculate the modes of
by plugging in the Fermion field generator
, where
are Grassmann odd generators. But before we do so, it is worth remembering that the operators
satisfy
so perhaps we can now simplify the modes that appear on
. Namely we have that
however we notice that the right part is holomorphic, which implies that only holomorphic terms can appear in this sum. Therefore it must be true that
. So let's adopt the notation
and
.
We have defined
after much though and consideration. So let's write it in terms of modes.
In particular we want to calculate the Hamiltonian.