ferromagnet

(1)

Effective Lagrangians for ferromagnets

In the preceding note, I explained how to construct an effective Lagrangian for Nambu– Goldstone (NG) bosons of a spontaneously broken internal symmetry in an arbitrary, not necessarily relativistic, system. In this text, I complement the general discussion by a detailed analysis of one concrete example: the ferromagnet. I will in particular show how different field parameterizations can lead to seemingly very different Lagrangians. In order that this text is self-contained, I first review the general construction of the effective Lagrangian without any proofs. Afterwards, I work out its application to ferromagnets.

Overview of the general construction

The degrees of freedom of the low-energy effective field theory (EFT) are the NG fields πa(x), which represent coordinates on the coset space of broken symmetry, denoted as _G/_H. In the following, I will use the notation Ti,j,... for unspecified generators ofG, Tρ,σ,... for generators of

H, and Ta,b,... for the remaining, broken generators. It is convenient to couple the EFT to a

background gauge field, Ai

µ, for each generator Ti. These allow one to determine correlation

functions of the conserved currents of the theory. At sufficiently low energies, the effective action can be organized in a covariant gradient expansion in the joint number of derivatives acting on the NG fields and of gauge fields. Owing to the lack of Lorentz invariance, spatial and temporal derivatives have to be counted separately. Denoting the part of the effective Lagrangian with s spatial and t temporal derivatives as L(s,t), the expansion takes the form

L=L(0,1)₊_L(0,2)₊_L(2,0)₊_{· · ·}_{, assuming in addition rotational invariance. The first terms of}

the expansion can then be written as

L(0,1) ₌_c

a(π) ˙πa+ei(π)Ai0, L

(0,2) ₌ 1

2g¯ab(π)D0π

a_D

0πb, L(2,0) =−

1

2gab(π)Drπ

a_D

rπb, (1)

where Dµπa = ∂µπa −Aiµhai(π) is a covariant derivative of the NG field. Each part of the

Lagrangian is invariant, possibly up to a surface term, with respect to the simultaneous gauge transformation of the NG and gauge fields,

δπa(x) = i(x)ha_i(π), δAi_µ(x) =∂µi(x) +fjki A j µ(x)

k

(x). (2)

Here i _{are the infinitesimal parameters of the transformation and} _fi

jk are the structure

con-stants of the symmetry algebra. The gauge invariance allows one to determine the form of the coupling functionsca(π), ei(π), gab(π) and ¯gab(π) in terms of a few unknown parameters. First,

define the element of the coset space _G/_H as U(π) = eiπa_T

a _{and subsequently the} Lie-algebra-valued Maurer–Cartan (MC) form as ω(π) = ωa(π) dπa = Tiωia(π) dπa = −iU(π)

−1_d_U₍_π_).

Finally, define a bilinear form on the Lie algebra of _G as Tr(TiTj) = Ωij. It is assumed to

be nondegenerate but not necessarily proportional to δij as is often the case, and not even

positive-definite. It can be used to raise and lower indices of generators as in Ti _{= Ω}ij_T j. The

coupling functions can then be cast as

gab(π) =gcd(0)ωca(π)ω d

b(π), ei(π) = ej(0)

U(π)−1TiU(π)

j

, ca(π) = −ei(0)ωai(π),

(3) and analogously for ¯gab(π). The effective Lagrangian is now fully specified in terms of the

(2)

and ei(0), associated with charge densities in the ground state. In order to be compatible with

G-invariance, these have to satisfy further constraints,

f_σac gcb(0) +fσbc gac(0) = 0, fσji ei(0) = 0, (4)

and similarly for ¯gab(0).

NG fields and transformation rules

In a ferromagnet, the symmetry-breaking pattern is given by _G=_O(3) and _H=_O(2). The generators can be chosen so that the structure constants are fully antisymmetric and equal to

f_jki =εijk. The O(3) conserved charges correspond to the three components of total spin, and the external gauge field Ai

0 then has the meaning of a magnetic field, namely Ai0 =µBi, where

µis the magnetic moment of a single spin.

The _O(3)/_O(2) coset space is parameterized by two coordinates, π1,2_{, representing the NG}

degrees of freedom. However, the transformation properties with respect to the _O(3) rotations are much more easily expressed in terms of a unit three-vector, ~n(π). The precise mapping between these two descriptions of the NG field will be discussed later. In terms of ~n, an infinitesimal _O(3) rotation with the set of parameters ~ acts as δ~n =~n×~, or δinj = εijknk.

Without knowing the precise form of the function πa₍_~_n_{), we can now easily show that this}

transformation rule has the correct Lie-algebraic structure,

δiδjπa−δjδiπa=δi

∂πa ∂nkεjk`n

`

−(i↔j) = ∂π

a

∂nkεjk`εi`sn s₊

XX XX

XX

XX_X

X

∂2πa ∂nk_∂nsεjk`n

`_ε

istnt−(i↔j)

= ∂π

a

∂nkn s

(δjsδik−

H

H H

δijδks)−(i↔j) =

∂πa

∂nin

j₋ ∂πa

∂njn i

(5)

= ∂π

a

∂nsn t_ε

ijkεkst =εijkδkπa.

This is equivalent to the differential equation for the Killing vectorsha

i(π), derived by Leutwyler,

dihaj −djhai = fijkhak, where di = hai ∂

∂πa. Our transformation rule for ~n(π) is thus compatible with the general expression (2).

In order to be able to use the solution (3) for the coupling functions, we next have to find the relation between the matrix variableU(π) and the vector~n(π). To that end, we first define the matrix variable N(π) =Tini(π). Under a transformation by the rotation g =ei

i_T

i_{, we expect} this to change naturally as N → gN g−1_{. To check that this agrees with the above-defined}

transformation rule for~n(π), it is sufficient to consider an infinitesimal rotation,

Tjδnj =δN = i[iTi, N] =−injεijkTk =i(εijknk)Tj, (6)

which reproduces the rule δinj = εijknk. Let us now choose without lack of generality the

ground state as~n0 = (0,0,1), that is, N0 =T3. The relation between πa and ~n can be fixed, at

least in the vicinity of the ground state, by setting

N(π) = U(π)N0U(π)−1. (7)

The broken generators are then Ta with a = 1,2. The whole argument could be reversed

(3)

Effective coupling functions from the coset construction

All the expressions in Eq. (3) are independent of the choice of representation for the generators

Ti. However, from now on I will use the two-dimensional representation of O(3) so that I can

take advantage of some special properties of the Pauli matrices. Since the normalization is fixed by the structure constants, we have to set Ti =σi/2 so that Tr(TiTj) = 1₂δij, and hence

Ti ₌_σ i.

Let us first determine the metric gab(π). From the condition (4) one easily finds that gab(0)

must be proportional to the unit matrix. (This is true by Schur’s lemma whenever the broken generators span an irreducible representation of the unbroken subgroup _H.) We can thus set

gab(0) =F2δab. Given the normalization of the generators and Eq. (3), the kinetic term for the

NG fields can be expressed, in the absence of gauge fields, as

gab(π) dπadπb =gcd(0)ωca(π)ω d b(π) dπ

a_d_πb ₌_F2_δ

cdωc(π)ωd(π) = 2F2Tr[ω⊥(π)ω⊥(π)], (8)

where ω⊥(π) is the component of the MC form in the subspace of broken generators. This can be extracted using the fact that the Pauli matrices anticommute with each other, namely as

ω⊥= 1₂(ω−σ3ωσ3). We thus obtain

gab(π) dπadπb =F2Tr

ω(π)ω(π)−ω(π)σ3ω(π)σ3

=F2TrdU(π) dU−1(π)−ω(π)σ3ω(π)σ3

.

(9) With Eq. (7), the trace can be cast as 2 TrdN(π) dN(π) = d~n(π)·d~n(π). The spatial part of the kinetic term in absence of external gauge fields thus takes the form in accord with Eq. (1),

L(2,0) ₌₋1

2F

2_∂

r~n(π)·∂r~n(π). (10)

If needed, background gauge fields can be reintroduced by replacing ordinary derivatives with covariant ones. This form of the kinetic term could of course have been guessed from the linear transformation rule for~n(π). Using instead the general solution (3) shows that it is unique.

The temporal kinetic termL(0,2) _{takes the same form. However, it is not needed at the leading}

order of the derivative expansion, since in the case of ferromagnets, the termL(0,1) _{is nontrivial.}

What we need to do now is therefore to find the functions ei(π) andca(π). Let us start with

the former. First, the condition (4) implies that e1(0) = e2(0) = 0. Let us denote e3(0) = Σ;

this represents the magnetization in the ground state. From Eq. (3), we then obtain

ei(π) = Σ Tr

T3U(π)−1TiU(π)

= Σ TrTiU(π)T3U(π)−1

= Σ TrTiN(π) = Σni(π). (11)

This establishes the expected result that the background magnetic field couples to the total spin of the system though a simple scalar product. The remaining unknown coupling, containing the time derivative of the NG field, also follows from Eq. (3), ca(π) dπa =−ei(0)ωi(π) =−Σω3(π).

From the definition of the MC form, this becomes after some manipulations

ca(π) ˙πa = iΣ Tr

T3U(π)−1∂0U(π)

= iΣ

Z 1

0

dλTrT3U(λπ)−1(i ˙πaTa)U(λπ)

=−Σ

Z 1

0

dλTrπ˙aTaU(λπ)T3U(λπ)−1

'Σ

Z 1

0

dλ ~π·~n˙(λπ).

(12)

(4)

dependence on the choice of the orientation of ~n0 has now dropped. Since we know how ~n

depends on πa _{and thus on} _λ_{, we can now in principle integrate over} _λ _{directly. However, to}

keep the index-free notation, I use a different approach. Since ~n is normalized to unity, ˙~n is perpendicular to it, so we can write~π·~n˙ = (~n×~π)·(~n×~n˙). Next, we express using Eq. (7)

Ti[~n(λπ)×~π]i = [iπaTa, N(λπ)] =∂λN(λπ), (13)

whence ~n(λπ)×~π = ∂λ~n(λπ), which allows us to eliminate explicit π-dependence from the

action. Altogether, the L(0,1) _{part of the Lagrangian thus acquires the form}

L(0,1) _{= Σ}

Z 1

0

dλ ∂λ~n(λπ)·

~

n(λπ)×~n˙(λπ)+µΣB~ ·~n(π). (14)

Together with Eq. (10), this defines the low-energy effective Lagrangian of ferromagnets in external magnetic fields. Its structure is determined by symmetry up to two unknown low-energy coupling constants, Σ and F. The former is given directly by the magnetization of the ground state, while the latter can be extracted for instance from the dispersion relation of spin waves, discussed below.

Topological nature of the

c

a

(

π

) ˙

π

a

term

The Lagrangian L(0,1) _{was derived before by Leutwyler by a straightforward integration of the}

differential equation for ca(π), that is, hbi(∂bca−∂acb) = ∂aei. The auxiliary variable λ there

appears as an additional spacetime dimension, similarly to the construction of the Wess–Zumino term in chiral Lagrangians. Although the derivation based on Eq. (3) is more straightforward as well as completely general, it is illuminating to use this analogy to gain a deeper insight in the structure of this term. Let us rewrite Eq. (14), for simplicity in a vanishing magnetic field, formally as

L(0,1)

B=0 = Σ

Z 1

0

dλ ∂λm~(λ, π)·

~

m(λ, π)×m~˙ (λ, π). (15)

Herem~(λ, π) with 0 ≤λ≤1 defines a smooth path on the unit sphere such thatm~(0, π) =~n0

and m~(1, π) = ~n(π) for all values of πa_{. Eq. (14) corresponds to} _m_~₍_{λ, π}_{) =} _~_n₍_λπ_{). However,}

I will now show that the action of the theory is independent of the choice of the path. In other words, by choosing any other path, the Lagrangian will at most change by a total time derivative.

Let us calculate the variation of the Lagrangian under an infinitesimal shiftδ ~m(λ, π). From the boundary conditions at λ= 0,1, this variation must satisfy δ ~m(0, π) = 0 andδ ~m(1, π) = δ~n(π) for all πa. We thus obtain

δL(0_B₌₀,1) = Σ

Z 1

0

dλ∂λδ ~m·(m~ ×m~˙) +∂λm~ ·(δ ~m×m~˙) +∂λm~ ·(m~ ×δm~˙)

. (16)

The middle term vanishes since due to the constraint|m~|= 1, all vectors∂λm, δ ~~ m,m~˙ lie in the

two-dimensional plane perpendicular to m~. In the last term, we “integrate by parts in time” and in the first term with respect to λ,

δL(0_B₌₀,1) = Σ[δ ~m·(m~ ×m~˙)]1₀+ Σ

Z 1

0

dλ

((((

(((

hhh_hhh

h

−δ ~m·(m~ ×∂λm~˙)−∂λm~˙ ·(m~ ×δ ~m) +∂0[∂λm~ ·(m~ ×δ ~m)]

=δ~n·(Σ~n×~n˙) + Σ∂0

Z 1

0

(5)

When we merely vary the pathm~(λ, π) but keep the end points fixed,δ~n =~0 and the action does not change. Since the unit sphere is simply connected, any such path can be smoothly deformed into any other, which implies the above claim that the action is completely independent of the choice of the path. The integration over λ can be carried out explicitly by choosing that path simply as m~(λ, π) = λn1₍_π₎_{, λn}2₍_π₎_,p

1−λ2_[(_n1₍_π₎₎2_{+ (}_n2₍_π₎₎2_]

. Writing the integrand in Eq. (15) as ˙m~ ·(∂λm~ ×m~), we obtain straightforwardly

L(0,1)

B=0= Σ(n

2_n_˙1₋_n1_n_˙2₎

Z 1

0

dλ_p λ

1−λ2₊_λ2₍_n3₎2. (18)

Upon integration over λ the full L(0,1) _becomes

L(0,1) ₌ Σ

1 +n3₍_π₎

n2(π) ˙n1(π)−n1(π) ˙n2(π)+µΣB~ ·~n(π). (19)

This representation of the Lagrangian still does not depend explicitly on the coordinates πa. On the other hand, as emphasized above, it is somewhat ambiguous; it can be shifted by a total time derivative upon a different choice of the path m~(λ, π).

Equation of motion

In order to derive the equation of motion for the NG mode in the ferromagnet, the magnon, directly using the variable~n(π), we have to further add to the Lagrangian the term 1₂α(1−~n2), where α is the Lagrange multiplier for the constraint |~n|= 1. The variation of δL(0,1) _{can be}

read off Eq. (17) as the coefficient at δ~n. The equation of motion then acquires a neat form

Σ~n×~n˙ +µΣB~ +F2∆~n=α~n, (20)

known as the Landau–Lifshitz equation. Upon expanding in small fluctuations around the ground state~n0, this gives a dispersion relation for the magnon as E(k) =µB+F

2

Σk 2_.

The relation between the two forms for the effective Lagrangian, Eqs. (14) and (19), is nontrivial. As the last point of this text, I therefore wish to demonstrate explicitly that despite different appearance, they lead to the same equation of motion. I do so by showing that the variation of the Lagrangian (19) reproduces Eq. (17) for n1 _and _n3_{; the case of} _n2 _{is obtained from} _n1

by a simple exchange of the labels 1 ↔ 2. Since we use a different parameterization of the Lagrangian, we have to modify the Lagrange multiplier as well to, say, β. The variation of

L(0,1) _{in Eq. (19) with respect to} _n3 _{then reads}

δ3L(0,1) =δn3

− Σ (1 +n3₎2(n

2

˙

n1−n1n˙2) +µΣB3−βn3

. (21)

Note that if we redefine the Lagrange multiplier as

β =α+ Σ 2 +n

3

(1 +n3₎2(n

2_n_˙1₋_n1_n_˙2₎_, ₍₂₂₎

the variation of the Lagrangian takes the formδ3L(0,1) =δn3[−Σ(n2n˙1−n1n˙2) +µΣB3−αn3],

consistent with Eq. (17). The equation of motion for n1_{, on the other hand, follows from the}

variation (up to a total time derivative)

δ1L(0,1) =δn1

−2Σ n˙

2

1 +n3 + Σ

n2_n_˙3

(1 +n3₎2 +µΣB

1₋_βn1

(6)

Finally, we have to use the same redefinition of the Lagrange multiplier as in Eq. (22). After some manipulations, we obtain δ1L(0,1) =δn1(Ξ +µΣB1−αn1), where

Ξ = −Σn

1_{(2 +}_n3₎

(1 +n3₎2 (n 2

˙

n1−n1n˙2)−2Σ n˙

2

1 +n3 + Σ

n2n˙3 (1 +n3₎2

= Σ

(1 +n3₎2

n

(2 +n3)−n2 n1n˙1

| {z }

−n2_n_˙2₋_n3_n_˙3

+(n1)2n˙2−2(1 +n3) ˙n2+n2n˙3o

= Σ

(1 +n3₎2

n

˙

n2(2 +n3)(n1)2+ (2 +n3)(n2)2−2(1 +n3)

| {z }

=(2+n3_)[1₋₍_n3₎2_]₋₂₍₁₊_n3₎₌₋_n3₍₁₊_n3₎2

+ ˙n3(2 +n3)n2n3+n2

| {z }

=n2₍₁₊_n3₎2

o

= Σ(n2n˙3−n3n˙2),

(24)