Hamiltonian systems with constraints
Audience: graduate students with prior exposure to Lagrangian/Hamiltonian mechanics and basic field theory.
Goal: learn the Dirac–Bergmann treatment of constrained Hamiltonian systems and see it in action for Maxwell theory, the Proca (massive vector) field, and general relativity (ADM).
1. Why constraints appear
Section titled “1. Why constraints appear”In ordinary (regular) Lagrangian mechanics, the Legendre map
is invertible because the Hessian
has . One can solve and define the Hamiltonian .
A constrained system arises when the Hessian is singular: . Then the momenta are not independent; they satisfy relations
called primary constraints. The symbol (“weak equality”) means the relation holds on the constraint surface in phase space, but you should not use it inside Poisson brackets until you have computed them.
A helpful geometric picture is this: the equations define a submanifold of phase space, the constraint surface. A weak equality is an equality only after restricting to this submanifold. For example, even if , the bracket may be nonzero and can generate a physically meaningful motion along or away from the constraint surface. This is why one computes Poisson brackets first and imposes constraints afterward.
In gauge theories the singularity is not an accident: it reflects redundancy in the description (gauge symmetry). In massive theories (e.g. Proca) singularity can also occur because some variables are nondynamical multipliers even without gauge symmetry.
2. Dirac–Bergmann algorithm in a nutshell
Section titled “2. Dirac–Bergmann algorithm in a nutshell”2.1 Canonical Poisson brackets for fields
Section titled “2.1 Canonical Poisson brackets for fields”For canonical fields ,
For general functionals and , this means
This formula is often safer than manipulating unsmeared delta functions directly.
2.2 Total Hamiltonian and consistency
Section titled “2.2 Total Hamiltonian and consistency”Given a Lagrangian density :
-
Define canonical momenta
Relations among that do not determine velocities are primary constraints .
-
Canonical Hamiltonian (Legendre transform where possible)
-
Total Hamiltonian
where are Lagrange multipliers enforcing the primary constraints.
-
Consistency conditions Require constraints be preserved under time evolution:
This may:
- produce secondary constraints, tertiary, etc.; and/or
- fix some multipliers .
Continue until closure.
A practical checklist is:
- write all primary constraints before solving for any remaining velocities;
- build , not only ;
- impose for every new constraint;
- separate equations that produce new constraints from equations that determine multipliers;
- stop only when every constraint is preserved and every relevant multiplier is either fixed or remains arbitrary because of gauge freedom.
2.3 First-class vs second-class
Section titled “2.3 First-class vs second-class”Let be the complete set of constraints.
-
First-class constraint: for all .
These generate gauge transformations (redundancies). -
Second-class constraint: the matrix (kernel)
is (functionally) invertible on the constraint surface.
These do not generate gauge; they simply remove phase-space directions.
For many gauge systems, an individual first-class constraint is best understood as one member of a gauge-generator chain. The full gauge transformation is generated by a tuned combination of primary, secondary, and sometimes higher-stage constraints, with time derivatives of the gauge parameter. Maxwell theory in §3.6 is the simplest example.
2.4 What happens to Lagrange multipliers?
Section titled “2.4 What happens to Lagrange multipliers?”A common but slightly dangerous slogan is:
First-class constraints leave Lagrange multipliers arbitrary, whereas second-class constraints determine them.
The correct version is a little more precise.
The total Hamiltonian contains arbitrary multipliers only for primary constraints,
where the are primary. Once the full set of constraints has been found, consistency requires
These equations can do three things: generate new constraints, determine some combinations of the multipliers , or impose no new condition.
The practical rule is:
- A multiplier along a primary first-class direction remains arbitrary before gauge fixing. This arbitrariness is the Hamiltonian form of gauge freedom.
- A multiplier along a primary second-class direction is fixed by consistency, provided the constraint-bracket matrix has the appropriate constant rank and inverse on the second-class sector.
- After imposing gauge-fixing conditions, first-class constraints are converted into second-class pairs, and the formerly arbitrary multipliers are fixed by preserving the gauge conditions.
Two caveats are worth keeping in mind. First, secondary constraints do not come with independent multipliers in the ordinary total Hamiltonian, although one sometimes introduces an extended Hamiltonian for formal purposes. Second, words like “always” assume a regular system: no changing-rank constraint matrix, no unresolved boundary zero modes, and no hidden reducibility relations.
Maxwell and Proca illustrate the distinction sharply. In Maxwell theory, the multiplier in front of remains arbitrary until a gauge is chosen. In Proca theory, the same-looking primary constraint belongs to a second-class pair, and consistency fixes .
2.5 Dirac bracket (for second-class constraints)
Section titled “2.5 Dirac bracket (for second-class constraints)”If are second-class and is invertible with inverse ,
Then for all , so you may set strongly after switching to Dirac brackets.
Here is an inverse kernel, meaning
If the inverse requires solving an elliptic equation, as in Coulomb gauge, one must specify boundary conditions. Zero modes of the operator are not harmless bookkeeping details: they may represent residual gauge transformations or global charges.
2.6 Counting physical degrees of freedom
Section titled “2.6 Counting physical degrees of freedom”Let be the number of configuration fields per space point (so phase space has dimension ). Let be the number of first-class constraints and the number of second-class constraints (per point, in an appropriate local sense). Then
Equivalently, in phase space:
2.7 A finite-dimensional warm-up example
Section titled “2.7 A finite-dimensional warm-up example”Before the field-theory examples, it is useful to see the algorithm without delta functions. Consider
with configuration variables . The momenta are
Thus
is a primary constraint. Solving , the canonical Hamiltonian is
The total Hamiltonian is . Consistency of the primary constraint gives
so there is a secondary constraint
The two constraints have vanishing Poisson bracket,
so they are first-class. The arbitrary multiplier reflects the gauge freedom
under which is invariant. This toy model mirrors Maxwell theory: one primary constraint plus one secondary constraint together represent one gauge function.
3. Worked example I: Maxwell theory (massless spin-1)
Section titled “3. Worked example I: Maxwell theory (massless spin-1)”3.1 Lagrangian and 3+1 split
Section titled “3.1 Lagrangian and 3+1 split”Take vacuum Maxwell in flat space:
With our signature,
Define
Then
3.2 Canonical momenta and primary constraint
Section titled “3.2 Canonical momenta and primary constraint”Define
There is no in , hence
For ,
Thus is the electric field.
3.3 Canonical Hamiltonian
Section titled “3.3 Canonical Hamiltonian”Compute the Hamiltonian density
Solve :
Since ,
Integrating by parts (dropping boundary terms),
so the canonical Hamiltonian is
3.4 Total Hamiltonian and Gauss constraint
Section titled “3.4 Total Hamiltonian and Gauss constraint”Add the primary constraint with multiplier :
Preserve :
This yields the secondary constraint
i.e. Gauss’s law in vacuum.
No further constraints arise; the multiplier remains undetermined.
To see explicitly why the algorithm closes, use the Hamilton equation
Then
because is antisymmetric while is symmetric. Thus Gauss’s law is automatically preserved.
3.5 Constraint algebra and first-class nature
Section titled “3.5 Constraint algebra and first-class nature”Using canonical brackets,
Thus and are first-class.
The basic Hamilton equations are also instructive:
The first equation is just the definition . The second says that the time evolution of is arbitrary; this is the Hamiltonian trace of gauge freedom.
3.6 Gauge generator: one gauge function, two constraints
Section titled “3.6 Gauge generator: one gauge function, two constraints”A Maxwell gauge transformation is controlled by one spacetime function :
Thus
The appropriate Hamiltonian generator can be written, using Castellani’s algorithm, as
Then
so the canonical generator reproduces the spacetime gauge symmetry.
The important point is that and are not associated with two independent gauge functions. They are the two members of one primary-secondary chain:
The arrow means that preserving the primary constraint produces the secondary constraint:
If one tried to use only Gauss’s law,
then
This is not the full gauge transformation for time-dependent . The primary term is needed to transform correctly.
3.7 Degrees of freedom and the need for gauge fixing
Section titled “3.7 Degrees of freedom and the need for gauge fixing”We have configuration fields , so the full phase space
has dimension per spatial point. There are two first-class constraints and no second-class constraints:
Therefore
the two transverse photon polarizations.
A complete canonical gauge fixing of the full phase space usually adds two equal-time gauge conditions, for example
Together,
form a second-class set, assuming the relevant Laplacian has no problematic zero modes after boundary conditions are specified. The pair removes the nondynamical scalar-potential sector, while removes the longitudinal spatial sector.
If one treats as a Lagrange multiplier from the beginning and works only with , then there is only one first-class constraint, , and one spatial gauge condition such as is enough. This is why different books sometimes appear to count gauge conditions differently.
3.8 Optional: explicit reduction in Coulomb gauge
Section titled “3.8 Optional: explicit reduction in Coulomb gauge”The nontrivial part of the canonical reduction is the spatial longitudinal sector. To see the “constraint + gauge” removal explicitly, decompose
Then Gauss’s constraint is , which removes the longitudinal momentum (up to boundary conditions). The longitudinal coordinate is removed by gauge.
If you impose Coulomb gauge , then form a second-class pair:
which is invertible (as an operator) after specifying boundary conditions. The reduced Hamiltonian becomes a Hamiltonian for the transverse fields only:
With this gauge choice, the Dirac bracket of the unreduced variables projects onto the transverse part:
where is the transverse projector. This formula makes the two physical photon polarizations visible directly in the bracket.
3.9 Lorenz gauge vs canonical gauge conditions
Section titled “3.9 Lorenz gauge vs canonical gauge conditions”In covariant field theory one often says, “impose the Lorenz gauge to eliminate gauge freedom.” The spelling is Lorenz, after Ludvig Lorenz; “Lorentz gauge” is a common misnomer.
The Lorenz condition is one spacetime differential condition,
With the conventions of this page,
so Lorenz gauge says
But in the Hamiltonian formulation,
where is the arbitrary multiplier of the primary constraint . Therefore Lorenz gauge is, canonically,
It is not simply one equal-time condition on the phase-space variables ; it is a velocity-dependent condition, or equivalently a condition that fixes the multiplier .
This reconciles the two common statements:
- In the full canonical phase space, complete gauge fixing is often represented by two equal-time conditions, e.g.
- In the covariant Lagrangian formulation, Lorenz gauge is one spacetime condition imposed on entire histories .
These are different gauge choices. The temporal-plus-Coulomb choice is noncovariant and stronger than Lorenz gauge. Lorenz gauge is covariant, but it leaves residual transformations
with
because
Thus Lorenz gauge fixes the gauge only after one also specifies boundary conditions or removes zero modes of .
For a plane wave with , Lorenz gauge gives
which reduces four components to three. A residual transformation shifts
and removes one more unphysical component. Thus
physical polarizations, in agreement with the Hamiltonian count.
3.10 Maxwell path integral: where did the two canonical gauge conditions go?
Section titled “3.10 Maxwell path integral: where did the two canonical gauge conditions go?”The naive Lagrangian path integral
overcounts gauge-equivalent configurations and has a noninvertible kinetic operator. The Faddeev—Popov procedure chooses one spacetime gauge condition, for instance
and inserts
For Lorenz gauge,
so
(up to an irrelevant sign convention, often written as ). In abelian Maxwell theory this determinant is independent of , so ghosts decouple.
Averaging over with a Gaussian gives the familiar covariant -gauge action,
For finite , this does not impose sharply; it weights violations of the gauge condition. The sharp Landau-gauge limit is .
The apparent mismatch with the Hamiltonian statement is resolved as follows. The Hamiltonian, phase-space version of a completely gauge-fixed path integral has the schematic Faddeev—Senjanovic form
For temporal plus Coulomb gauge,
one gets
Here the two equal-time gauge conditions are visible explicitly.
By contrast, the covariant Lorenz path integral fixes one spacetime gauge function on whole field histories. Its Faddeev—Popov operator is second order in time, so with appropriate boundary conditions it controls both pieces of Cauchy data
These are the covariant-history counterpart of the two equal-time canonical gauge directions associated with
Another way to say the same thing: once the covariant gauge-fixing term
is added, the gauge-fixed Lagrangian contains a contribution. The original primary constraint is no longer a constraint of the gauge-fixed quadratic Lagrangian; the gauge quotient has already been implemented by the Faddeev—Popov construction.
4. Worked example II: Proca (massive spin-1)
Section titled “4. Worked example II: Proca (massive spin-1)”4.1 Lagrangian and sign conventions
Section titled “4.1 Lagrangian and sign conventions”With our signature , a convenient Proca Lagrangian that leads to a positive-energy Hamiltonian is
The field equation is
and taking gives the on-shell constraint
4.2 3+1 split
Section titled “4.2 3+1 split”Using , we get
Therefore
The crucial difference from Maxwell is the term. The field still has no velocity, but it is no longer a pure Lagrange multiplier: the secondary constraint will solve for rather than impose a gauge-generating Gauss law.
4.3 Canonical momenta and primary constraint
Section titled “4.3 Canonical momenta and primary constraint”Exactly as in Maxwell,
and
4.4 Canonical Hamiltonian
Section titled “4.4 Canonical Hamiltonian”Compute
One finds
Integrating by parts,
4.5 Total Hamiltonian and secondary constraint
Section titled “4.5 Total Hamiltonian and secondary constraint”Total Hamiltonian:
Preserve :
So the secondary constraint is
4.6 Second-class structure (no gauge symmetry)
Section titled “4.6 Second-class structure (no gauge symmetry)”Compute the constraint bracket:
Thus are second-class: there is no gauge symmetry. Consistency now fixes the multiplier rather than leaving it arbitrary.
It is useful to see what “fixes the multiplier” means. Using
we get
Therefore
Combining this with gives the Hamiltonian version of the Proca transversality condition:
Unlike Maxwell theory, no arbitrary gauge function remains.
4.7 Eliminating and the reduced Hamiltonian
Section titled “4.7 Eliminating A0A_0A0 and the reduced Hamiltonian”The constraint is algebraic in :
Substitute into the Hamiltonian. The -dependent terms combine as
Hence the reduced Hamiltonian is
which is manifestly bounded below.
For reference, the second-class matrix is ultralocal:
Its inverse exists only for . This is why the massless limit is structurally singular in the constrained-Hamiltonian sense: as , the second-class pair turns into the first-class Maxwell chain.
The spatial canonical bracket is unchanged by the Proca Dirac bracket,
while is no longer independent; equivalently,
consistent with .
4.8 Degrees of freedom
Section titled “4.8 Degrees of freedom”Here , , . Thus
corresponding to helicities of a massive spin-1 particle.
4.9 Optional: Stückelberg trick and “gauge vs second-class”
Section titled “4.9 Optional: Stückelberg trick and “gauge vs second-class””Introduce a scalar and replace
Then the Proca mass term becomes gauge invariant under
The theory is now gauge invariant (first-class constraints reappear), but it contains an extra field . After gauge fixing (e.g. ) you recover Proca with 3 physical DOF. This is a useful conceptual bridge: second-class constraints can be viewed as gauge-fixed first-class systems (under appropriate extensions).
5. General relativity as a constrained Hamiltonian system (ADM)
Section titled “5. General relativity as a constrained Hamiltonian system (ADM)”The Hamiltonian formulation of GR is the prototype of a field theory with:
- singular Lagrangian (lapse and shift are nondynamical),
- first-class constraints (encoding diffeomorphism invariance),
- a nontrivial constraint algebra with structure functions (hypersurface deformation algebra).
We sketch the derivation carefully enough to see where each ingredient comes from.
5.1 Einstein–Hilbert action and 3+1 split
Section titled “5.1 Einstein–Hilbert action and 3+1 split”Start from the Einstein–Hilbert action (with cosmological constant )
The boundary term (e.g. Gibbons–Hawking–York) ensures a well-posed variational principle when fixing the induced metric on the boundary.
Assume spacetime is foliated by spacelike hypersurfaces , with coordinates on each . The spacetime metric can be written in ADM form:
where:
- is the induced spatial metric on ,
- is the lapse,
- is the shift (with ).
Useful identities:
where .
Geometrically, is the proper time separation between neighboring slices along the unit normal, while tells how the spatial coordinates slide within the next slice. Thus lapse and shift describe how the foliation is threaded through spacetime; they are not local propagating gravitational degrees of freedom.
5.2 Extrinsic curvature and kinematics
Section titled “5.2 Extrinsic curvature and kinematics”Define the covariant derivative compatible with : .
The extrinsic curvature of embedded in spacetime is
This shows explicitly that appears linearly in , while and do not appear at all.
5.3 ADM Lagrangian density
Section titled “5.3 ADM Lagrangian density”A standard result of the Gauss–Codazzi decomposition (up to total derivatives absorbed by ) is:
Therefore, dropping total derivatives already accounted for by boundary terms, the ADM Lagrangian density is
5.4 Canonical momenta
Section titled “5.4 Canonical momenta”The canonical momentum conjugate to is
Since enters only through , and
one finds
Taking the trace gives
You can invert to express in terms of :
The combination
is the inverse-DeWitt-supermetric contraction of the momentum. Its minus sign in the trace direction is the origin of the familiar “conformal factor” indefiniteness of the gravitational kinetic term. This subtlety does not spoil the constraint analysis, but it is one reason canonical GR looks very different from a collection of ordinary positive-energy scalar fields.
Primary constraints: because and do not appear in ,
Thus,
are primary constraints.
5.5 Canonical Hamiltonian and the ADM constraints
Section titled “5.5 Canonical Hamiltonian and the ADM constraints”The canonical Hamiltonian is
where should be expressed using
A key computation uses integration by parts:
(up to boundary terms). After rewriting in terms of , one arrives at the standard ADM form
Here is a boundary term (e.g. ADM energy for asymptotically flat spacetimes). The bulk constraint densities are:
Momentum (diffeomorphism) constraint
Hamiltonian (scalar) constraint
(again, up to convention-dependent signs/factors).
5.6 Total Hamiltonian and secondary constraints
Section titled “5.6 Total Hamiltonian and secondary constraints”The total Hamiltonian adds the primary constraints:
Preserving gives
hence the Hamiltonian constraint
Preserving gives
hence the momentum constraints
No new independent constraints appear beyond these (for pure GR); instead, consistency fixes nothing because the theory is gauge invariant (diffeomorphisms).
5.7 Smeared constraints and the constraint algebra
Section titled “5.7 Smeared constraints and the constraint algebra”It is cleaner to use smeared functionals:
Then (schematically) the Poisson brackets close as:
This is the hypersurface deformation algebra (often called “Dirac algebra”). It is not a Lie algebra with constant structure constants; it has structure functions involving .
The last bracket is the most characteristic one. The “structure coefficient” is , which is itself a phase-space variable. Therefore the algebra is not an ordinary Lie algebra of constraints with constant coefficients; it is the algebra of deformations of embedded hypersurfaces.
The closure implies that and are first-class (together with the primary constraints ).
5.8 Gauge interpretation: diffeomorphisms
Section titled “5.8 Gauge interpretation: diffeomorphisms”- generates spatial diffeomorphisms on :
- generates normal deformations of the hypersurface (time reparametrizations / refoliations).
A precise mapping between and spacetime diffeomorphisms requires care, because the algebra closes with structure functions; nonetheless, the standard viewpoint is that these first-class constraints encode the redundancy under diffeomorphisms.
As in Maxwell theory, the complete spacetime-diffeomorphism generator also includes the primary constraints conjugate to lapse and shift. The secondary constraints and act on the canonical geometry , while the primary constraints control how and transform.
5.9 GR counterpart of “one gauge function, two constraints”
Section titled “5.9 GR counterpart of “one gauge function, two constraints””Maxwell theory has one gauge function and one primary-secondary first-class chain:
The ADM analogue is that spacetime diffeomorphisms have four descriptors, which can be decomposed relative to the foliation into one normal deformation and three tangential deformations,
In the full ADM phase space, these are associated with four primary-secondary chains:
Thus the full set
contains eight first-class constraints, but they correspond to four spacetime gauge functions, not eight independent gauge functions.
Schematically, the diffeomorphism generator has the same architecture as Maxwell:
The dots are not cosmetic. In full GR the correct generator contains additional lapse-, shift-, and structure-function-dependent terms. This complication reflects the hypersurface deformation algebra rather than an ordinary Lie algebra with constant structure constants. Conceptually, however, the parallel with Maxwell is clear: the primary constraints transform the nondynamical variables , while the secondary constraints transform the canonical geometry .
If and are treated from the beginning as Lagrange multipliers rather than canonical variables, one usually discusses only the four secondary constraints and . This is analogous to treating as a multiplier in Maxwell theory and focusing only on Gauss’s law in the spatial phase space.
5.10 Degrees of freedom of GR
Section titled “5.10 Degrees of freedom of GR”In 3+1D, the configuration variable is a symmetric tensor: per point.
Constraints:
- primary: (1), (3),
- secondary: (1), (3).
Altogether there are 8 constraints; however, the standard DOF counting for GR focuses on the true canonical pair and treats as multipliers.
On the phase space:
- phase-space dimensions per point,
- there are independent first-class constraints .
Thus
These are the two polarizations of the graviton (gravitational waves) in 4D.
If instead you include and as canonical variables, then there are configuration variables and first-class constraints . The same formula gives
This agrees with the reduced count above; the lapse and shift sector contributes no physical local degrees of freedom.
5.11 Linearized check (TT gauge intuition)
Section titled “5.11 Linearized check (TT gauge intuition)”Linearize around Minkowski: . In harmonic gauge one can reduce to transverse-traceless (TT) components satisfying a wave equation. The TT condition removes gauge redundancy and constraints, leaving two propagating modes—consistent with the Hamiltonian count above.
Details: harmonic gauge ⇒ TT wave equation and the “2 polarizations” count
Start from
Infinitesimal diffeomorphisms act as a gauge symmetry:
It is convenient to use the trace-reversed field
The harmonic (Lorenz) gauge condition is
In this gauge, the vacuum linearized Einstein equations simplify to
So the radiative degrees propagate as massless waves.
Harmonic gauge does not fully fix the gauge: it is preserved by residual transformations with
For a plane wave , the field equation gives and the gauge condition gives transversality . Using the residual gauge freedom one can impose the stronger TT conditions (for a wave moving along ):
The only nonzero components then live in the block transverse to the propagation direction and satisfy
A standard basis is
i.e. the two gravitational-wave polarizations.
This is the linearized counterpart of the Hamiltonian counting: the constraints remove non-propagating components and the gauge symmetry quotients out redundancies, leaving two physical modes (in 4D), exactly as in §5.10.
6. Conceptual comparison: Maxwell vs Proca vs GR
Section titled “6. Conceptual comparison: Maxwell vs Proca vs GR”| Theory | Nondynamical variables | Constraint class | Gauge functions/chains | Multiplier behavior | Physical DOF in 3+1D |
|---|---|---|---|---|---|
| Maxwell | , first-class | one function: | arbitrary until gauge fixing | ||
| Proca | , second-class | none | fixed by consistency: | ||
| GR/ADM | , , , first-class in full phase space | four diffeomorphism descriptors: , | lapse/shift multipliers arbitrary until coordinate gauge fixing |
The moral is that “a variable has no velocity” is only the beginning. The decisive question is whether preservation of its primary constraint leaves a multiplier arbitrary, as in gauge theories, or fixes it, as in Proca.
6.1 Why first-class “removes more” than second-class
Section titled “6.1 Why first-class “removes more” than second-class”A single first-class constraint does two things:
- it restricts to the constraint surface, and
- it generates a gauge flow—points along that flow are physically equivalent.
Therefore each first-class constraint removes two phase-space dimensions (one for the surface, one for the orbit), while each second-class constraint removes only one.
- Maxwell: constraints are first-class gauge symmetry 2 physical DOF.
- Proca: constraints are second-class no gauge symmetry 3 physical DOF.
- GR: constraints are first-class diffeomorphism redundancy 2 physical DOF in 4D.
6.2 Nondynamical fields and constraints
Section titled “6.2 Nondynamical fields and constraints”Both Maxwell and Proca have because has no time derivative.
The difference is what happens next:
- Maxwell: preservation produces Gauss law (first-class).
- Proca: preservation produces (second-class with ).
In GR, lapse and shift are nondynamical: . Their preservation produces Hamiltonian/momentum constraints, all first-class.
6.3 Canonical gauge fixing vs covariant gauge fixing
Section titled “6.3 Canonical gauge fixing vs covariant gauge fixing”The Maxwell example is the cleanest place to see a general lesson.
In canonical language, gauge fixing is imposed on a time slice. If the full phase space contains , then a complete canonical gauge choice naturally supplies two equal-time conditions, because the gauge generator involves both and .
In covariant path-integral language, one fixes the spacetime gauge orbit of the single function by one spacetime differential condition such as . The price is that the Faddeev—Popov operator is , and residual zero modes or boundary conditions must be handled. The same pattern appears in linearized gravity: harmonic gauge is one four-component spacetime condition, but it leaves residual diffeomorphisms until further conditions or boundary data are specified.
7. Exercises
Section titled “7. Exercises”-
Maxwell with sources: Add to the Maxwell Lagrangian and show that Gauss’s law becomes . Discuss which parts of the constraint structure change.
-
Coulomb gauge Dirac bracket: In Maxwell theory, impose Coulomb gauge and compute the Dirac bracket for the transverse fields.
-
Proca Dirac bracket: Treat and as second-class and compute the Dirac bracket for and .
-
ADM constraint algebra: Verify at least one of the ADM bracket relations using smeared constraints and integration by parts.
-
GR DOF in dimensions: Generalize the ADM DOF count to spacetime dimensions and show that the number of propagating graviton DOF is .
-
Lorenz gauge and residual gauge freedom: Show explicitly that is preserved by gauge transformations satisfying . For a plane wave, use this residual freedom to reduce three Lorenz-gauge components to two physical polarizations.
-
Hamiltonian vs Lagrangian gauge fixing: Starting from the phase-space Maxwell path integral, impose and and compute the determinant . Compare it with the covariant Faddeev—Popov determinant in Lorenz gauge.
Hints and checkpoints
-
Maxwell with sources. With the interaction in these conventions, check the sign of the term in . Gauge consistency requires current conservation if is prescribed externally.
-
Coulomb gauge Dirac bracket. Use the second-class pair
The constraint matrix contains up to signs. Its inverse is the Green’s function of the Laplacian, and the final bracket should be the transverse projector shown in §3.8.
-
Proca Dirac bracket. Use
The inverse of the constraint matrix is algebraic because the nonzero entry is . Check that and keep their canonical bracket, while becomes a dependent variable.
-
ADM constraint algebra. The easiest bracket to verify first is . Show that acts by Lie derivative on and , then use the commutator of Lie derivatives:
-
GR DOF in dimensions. Let the spatial dimension be . The spatial metric has independent components, and there are first-class secondary constraints. Therefore
-
Lorenz residual freedom. Under ,
For a null plane wave, take , so . The residual shift removes the pure-gauge polarization.
-
Path-integral determinants. With
the gauge-fixing matrix is block diagonal, with entries and up to signs. The Lorenz determinant is different because it fixes a spacetime history rather than an equal-time phase-space representative.
8. References for further study
Section titled “8. References for further study”- P. A. M. Dirac, Lectures on Quantum Mechanics (constrained Hamiltonian systems).
- M. Henneaux & C. Teitelboim, Quantization of Gauge Systems (modern, systematic).
- K. Sundermeyer, Constrained Dynamics (classic).
- R. M. Wald, General Relativity (ADM, constraints, canonical structure).
- E. Poisson, A Relativist’s Toolkit (3+1 tools, extrinsic curvature).
- Arnowitt–Deser–Misner (ADM) original papers/lectures (historical source).
- 梁灿彬, 周彬. 微分几何入门与广义相对论(下册)第二版. 科学出版社.