Approximating relational observables by absolute quantities : a quantum accuracy-size trade-off

(1)

quantum accuracy-size trade-off.

White Rose Research Online URL for this paper: http://eprints.whiterose.ac.uk/99491/

Version: Accepted Version

Article:

Busch, Paul orcid.org/0000-0002-2559-9721, Loveridge, Leon Deryck and Miyadera, Takayuki (2016) Approximating relational observables by absolute quantities : a quantum accuracy-size trade-off. Journal of Physics A: Mathematical and Theoretical. 185301. ISSN 1751-8113

https://doi.org/10.1088/1751-8113/49/18/185301

[email protected] https://eprints.whiterose.ac.uk/ Reuse

Items deposited in White Rose Research Online are protected by copyright, with all rights reserved unless indicated otherwise. They may be downloaded and/or printed for private study, or other acts as permitted by national copyright laws. The publisher or other rights holders may allow further reproduction and re-use of the full text version. This is indicated by the licence information on the White Rose Research Online record for the item.

Takedown

If you consider content in White Rose Research Online to be in breach of UK law, please notify us by

(2)

arXiv:1510.02063v3 [quant-ph] 24 Mar 2016

quantities: a quantum accuracy-size trade-off

Takayuki Miyadera

Department of Nuclear Engineering, Kyoto University, Kyoto daigaku-katsura, Nishikyo-ku, Kyoto, Japan 615-8540

E-mail: [email protected]

Leon Loveridge

Department of Computer Science, University of Oxford, Wolfson Building, Parks Rd, Oxford, UK OX1 3QD

Paul Busch

Department of Mathematics, University of York, Heslington, York, UK. YO10 5DD

Abstract. _{The notion that any physical quantity is defined and measured relative to}

a reference frame is traditionally not explicitly reflected in the theoretical description of physical experiments where, instead, the relevant observables are typically represented as “absolute” quantities. However, the emergence of the resource theory of quantum reference frames as a new branch of quantum information science in recent years has highlighted the need to identify the physical conditions under which a quantum system can serve as a good reference. Here we investigate the conditions under which, in quantum theory, an account in terms of absolute quantities can provide a good approximation of relative quantities. We find that this requires the reference system to be large in a suitable sense.

Published in: J. Phys. A: Math. Theor. 49_{185301 (2016)}

(3)

1. Introduction

Symmetry plays a fundamental role in constructing and understanding physical theories [1, 2]. It also constrains the relationship between theoretical terms in a given formalism and the world those terms are used to describe. In a theory with (say, a gauge) symmetry, the quantities which may be measured (the observables) must be invariant with respect to the relevant symmetry transformations [3, 4]. Yet, it appears that in certain circumstances, measurements of symmetric systems can be well described in terms of non-invariant quantities. Typically, the invariant quantities are relative observables of a system plus reference, whereas the non-invariant quantities provide a simplified, “absolute” description of the situation in terms of the variables of the system alone.

Problems of this kind appear naturally in investigations of the universality of quantum mechanics and the interface between quantum and classical systems. For instance, in [5–8] it is argued that the notion of time appears from stationary observables in a composite system of the system and the clock. Another example, closer to the type of situation we will consider, arises in the theory of superconductivity: though the “absolute” phase in a superconductor does not represent an observable quantity, the Josephson effect shows that the relative phase between two systems can be measured, since the relative phase operator is gauge (phase-shift) invariant. If one of these systems is large in a suitable sense, the fluctuation of the phase can be neglected and the phase can be treated effectively as a classical quantity. In this case the (statistics of a) relative phase measurement can be well approximated by the statistics of a non-invariant phase observable of a subsystem. The large system is then viewed as a (classical) reference system.

The possibility of such an approximation is considered in various forms in the literature and is accompanied by various interpretations. In [3], a review on quantum reference frames and their use as resources for information processing, it is shown that any self-adjoint operator A can be “relativised” to give an (invariant) observable U₍_A₎

acting in the Hilbert space of a larger system composed of the original system and a reference system (henceforth we use the term system-plus-reference). Then A may approximate U₍_A_{) well if the reference system has a localisable phase-like quantity [4].}

Ais viewed as an “approximator” for the invariant observableU₍_A_{) in the high reference}

phase localisation limit. The large reference requirement also appears in the Wigner-Araki-Yanase theorem [9–15] as a large spread in the apparatus part of a conserved quantity, and as an asymmetric resource state in [16], where the measurement of a non-invariant operator is viewed as being simulated by the measurement of the non-invariant observable of system-plus-reference. The symmetry constraint and reference-system size also arise in work concerning superselection rules [3, 4, 17, 18].

(4)

The purpose, therefore, of this paper, is to prove that the operational “distance” between an arbitrary effect (operator) of some quantum system and the restriction of a symmetry-invariant effect of system-plus-reference depends explicitly on the spread of the symmetry generator in the state of the reference system. For good approximation, the reference frame system must be large. Our approach is quantitative and has a clear operational meaning, applies to unsharp as well as sharp observables, and holds for general continuous symmetry groups. We find, moreover, that the trade-off relations established are rooted in the uncertainty relations for certain incompatible quantities, and that therefore the size-versus-accuracy trade-offs are genuinely quantum constraints. After reviewing standard background material in Section 2, we briefly introduce in Section 3 the notion of “restriction” of operators on the system-plus-reference to operators on the system, which arises by fixing a state of the reference. Section 4 contains the relativisation map introduced in [4], accompanied by a discussion of the role of reference system localisation. Section 5 brings our main result: a trade-off between precision and size. This is given as a quantitative relation for the distance between arbitrary and restricted invariant effects; we show that the distance may be made small only when the reference state is delocalised with respect to the symmetry generator or, where applicable, localised with respect to its conjugate quantity. The origin of the trade-off as arising from quantum incompatibility is discussed, and we conclude in Section 6 with a summary and remarks on future work.

2. Mathematical preliminaries

We briefly present some mathematical concepts and notation used throughout the paper. Any Hilbert space _H we consider will be finite-dimensional and we write dim_H for the dimension of _H. The standard inner product is denoted _{h · | · i}:_{H × H →} C _{(taken to} be linear in the second argument). The algebra of bounded linear operatorsA:_{H → H} will be denoted by _L(_H), with the positive operators (A_≥0) given as those A_{∈ L}(_H) for which _hϕ_|A_|ϕ_{i ≥}0 for allϕ _{∈ H}(ifA₋B _≥0 we write A_≥B orB _≤A). States in H, (equivalently, on _L(_H)) are given by the positive operators ρin _L(_H) for which the trace tr [ρ] = 1 (equivalently, positive linear functionalsω : _L(_H)_→C _with _ω₍1) = 1),

the convex set of which is written _S(_H). The extreme elements of _S(_H)—the pure states—satisfy ρ2 ₌_ρ_{, and are given by rank one projections, or simply as unit vectors} in _H.

An observable E is a normalised, positive operator valued measure (pom) E :F →

L(_H) on some measurable space (Ω,_F) (so that E(X) _≥ 0, E(Ω) = 1, and E satisfies

the additivity property of measures). Any operator E(X) in the range of E is an effect, that is, a positive operator A for which 0_≤A_≤ 1; we write E(H) for the (convex) set

(5)

quantified by Dω(A, B) := _|ω(A)₋ω(B)_|. A (state-independent) distance function D is then obtained by taking the supremum over all states:

D(A, B) := sup ω

Dω(A, B) = sup ω |

ω(A)₋ω(B)_| =_kA₋B_k.

Since ω(A) andω(B) are probabilities, D has a clear operational meaning [20].

Let G be a group. A system is said to possess G as a symmetry (group) if there is a projective (anti-)unitary representation _{U(g)_}g∈G of G on H. That is, the U(g) are (anti-)unitary operators satisfyingU(g)U(g′_{) =}_µ₍_{g, g}′₎_U₍_gg′_{) with coefficients}

|µ(g, g′₎_| _{= 1 The projective representation induces a} _∗_{-automorphism} _αg _: _L₍_H₎ _→

L(_H) byαg(A) =U(g)AU(g)∗_{, which satisfies}_αg_◦_αg′ =αgg′. In this paper, we consider a finite-dimensional connected Lie group G, in which case each U(g) is automatically unitary. We denote by g the Lie algebra of G. There exists a neighbourhood O of e (the unit of G) such that every element g _∈O is uniquely expressed as g =exp(n) with n _∈ g. Once we fix n ∈ g, we can define a line g(s) = exp(sn) with a variable real parameter s. This satisfies g(s)g(t) = g(s+t) for small enough _|s_| and _|t_|. U(g) can be set so as to satisfy U(g(s))U(g(t)) =U(g(s+t)). Then Stone’s theorem on unitary representations of one-parameter Abelian groups applies, to obtain U(esn_{) =}_eisN _with some self-adjoint operator N (a generator) on the Hilbert space. We say an effect A is G-invariant if U(g)AU(g)∗ ₌_A _{for all} _g _∈ _G_{. This naturally implies [}_{N, A}_{] = 0 for} each generator N. Similarly, a stateρ is (G-)invariant ifU(g)∗_ρU₍_g_{) =}_ρ _{for all} _g _∈_G_, equivalently if [N, ρ] = 0, or if ω =ωρ : _L(_H) _→ C _satisfies _ωρ₍_A_{) =} _ωρ₍_U₍_g₎_AU₍_g₎∗₎ for all A_{∈ L}(_H) and g _∈G.

Given Hilbert spaces _H and _K (as “output” and “input space”, respectively), we consider channels in the Heisenberg picture. A linear map Λ :_L(_H)_{→ L}(_K) is called a

channelif (i) Λ(1) =1and (ii) Λ⊗id:L(H)⊗C

d_{→ L}₍_K₎_⊗_Cd_{is positive for all}_d_(that

is, Λ is completely positive). In the Schr¨odinger picture a state ρ_{∈ S}(_K) is mapped to Λ_∗(ρ)_{∈ S}(_H), which is defined by tr

ρΛ(A)

= tr

Λ_∗(ρ)A

for allA_{∈ L}(_H). If a group Ghas (projective) representationsU1, U2 on_Hand_K, respectively, a channel Λ is called (G-)covariant if it satisfies Λ U1(g)AU1(g)∗

=U2(g)Λ(A)U2(g)∗ _{for all} _A _{∈ L}₍_H_{) and}

g _∈G. G-covariant channels map invariant effects to invariant effects.

3. Restriction map

Let us consider a system _S and a reference _R with Hilbert spaces _H_S and _H_R, respectively. The Hilbert space of the total system (system-plus-reference) is _H := HS ⊗ HR. As a preliminary observation we note that given a self-adjoint A∈ L(H)≡

(6)

through which one can “view” effects of_H_S. More generally, fixing a stateω_R _{∈ S}(_H_R) we define the restriction map ΓωR :L(H)→ L(HS) via

ΓωR(A⊗B) =AωR(B),

extended by linearity to the whole of _L(_H). For any ω_R, ΓωR is unital and completely positive. Of course, ΓωR(A⊗1) = A. Conversely, any channel Γ : L(H) → L(H

S) satisfying Γ(A_⊗1) =Ahas a unique state ω

R onHR satisfying Γ(A⊗B) =AωR(B).‡ Physically, ΓωR may be understood as providing a reduced description in terms of S alone, contingent upon the state of the reference being ω_R. The Schr¨odinger picture is helpful to see the physical meaning of the restriction map. It maps a state ω on the system to ω _◦Γ = ω _⊗ω_R. Thus Γ describes the usual “tracing out” operation for product states. Note that the restriction map is not G-covariant in general for non-invariant states ω_R.

We now state our problem. Let G (a connected Lie group) be a symmetry of the system and the reference; that is, there exist projective unitary representations U_S and U_Ron_H_S and_H_Rrespectively, with the tensor product representation writtenU_S_⊗U_R (i.e., (U_S_⊗U_R)(g) =U_S(g)_⊗U_R(g) for eachg _∈G). For each (possibly non-invariant) A _{∈ E}(_H_S), how close can Γ(E) be to A by choosing a globally invariant E _{∈ E}(_H) (that is, an E for which (U_S(g)_⊗U_R(g))E(U_S(g)_⊗U_R(g))∗ ₌ _E _{for all} _g _∈ _G_{) and} restriction map Γ?

We begin by considering the special case of invariant E obtained by relativisation, before addressing the general case.

4. Relativisation map

In [4] it is shown that poms, and therefore self-adjoint operators and effects, can be

“relativised” to give associated invariant (with respect to some chosen groups) poms,

self-adjoint operators and effects in a new Hilbert space. For instance, position relativises to relative position, angle to relative angle, and phase (as apom) to relative phase. We

therefore view the relativisation procedure as giving rise to relative observables and effects even when defined on arbitrary poms (or effects). Here we study relativisation

for the case (of representations of) G = U(1) (otherwise called the phase group) to give general quantitative tradeoff relations between (in)accuracy and reference system localisation.

Specifically, let N_S and N_R denote number operators acting in_H_S and _H_R. Thus, for example, N_S is a self-adjoint operator on_H_S with orthonormal basis of eigenvectors {|n_i}n∈I ⊂ HS, I = {0, ..., N0 −1}, so that NS = Pnn|nihn| ≡

P

nnPn. We define

‡ To see this, we introduce a sesquilinear map L(H)× L(H) → L(HS) by hX, Yi := Γ(X∗Y)− Γ(X∗

)Γ(Y). It satisfies a Cauchy-Schwarz-type inequalilty: hX, Y_ihY, X_{i ≤ kh}Y, Y_ikhX, X_ias shown in Section 5 (Lemma 3). Since hA ⊗1, A⊗1i = 0, we have hA⊗1,1⊗Bi = 0. It implies

Γ(A⊗B) =AΓ(1⊗B) and Γ(A⊗B) = Γ(1⊗B)Afor all AandB. Thus Γ(1⊗B) is proportional

(7)

N := N_S + N_R on _H = _H_S _{⊗ H}_R, and denote the associated unitary groups by U_S(θ) =eiNSθ, U

R(θ) =eiNRθ and U(θ) =eiN θ =US(θ)⊗UR(θ);θ ∈[−π, π).

Stipulating that only invariant, or relative effects are observable [4], we may then only measure those effects E _{∈ E}(_H) for which U(θ)EU(θ)∗ ₌ _E _{or, equivalently,} [E, N] = 0. We may construct relative effects in _E(_H) =_E(_H_S _{⊗ H}_R) out of arbitrary effects in_E(_H_S); first we introduce the linear mapping (an adapted and generalised form of that given in [3]) U_:_L₍_H_S₎_{→ L}₍_H_{) (see below for the full definition):}

U₍_A_{) =}

Z π

−π

U_S(θ)AU_S(θ)∗_⊗F(dθ).

Here F is a covariant phase pom on the reference (i.e., acting in H_R), defined as any pom on (the Borel algebraB [−π, π) of) [−π, π) for which

U_R(θ)F(X)U_R(θ)∗ =F(X∔_θ₎_,

where ∔ _{denotes addition modulo 2}_π_.

Here we give the definition of U₍_A_{) and investigate its properties. For a covariant}

phase pomF there exists [22] a uniquely determined positive operator T ≥0 satisfying

for all measurable subsets X,

F(X) = 1 2π

Z

X

dθU_R(θ)T U_R(θ)∗,

with 1 2π

Rπ

−πdθ UR(θ)T UR(θ)∗ = 1. That is, for each state ωR, there exists a positive

smooth function fωR(θ) :=

1

2πωR(UR(θ)T UR(θ)∗) satisfying ωR(F(X)) =

R

Xdθ fωR(θ). We denote the absolutely continuous measureω_R_◦FbyµF

ωR, so thatµ F

ωR(dθ) = fωR(θ)dθ.

U₍_A_{) is defined by}

U₍_A_{) =} 1

2π

Z π

−π

dθU_S(θ)AU_S(θ)∗_⊗U_R(θ)T U_R(θ)∗.

It is easy to see that U₍_A_{) is} _G_{-invariant and becomes an effect for any effect} _A_.

Again, the Schr¨odinger picture is helpful to see the physical meaning of U₍_A_{). In}

the Schr¨odinger picture, the state must be invariant while arbitrary effects are allowed. In this picture the simplest (thus product) relative effect has the formA_⊗T. Switching to the Heisenberg picture, we obtain U₍_A_).

ΓωR(U(A)) is given by

ΓωR(U(A)) =

Z

µF

ωR(dθ)US(θ)AUS(θ) ∗_.

Quantifying the distance between an effect A _{∈ E}(_H_S) and a restriction of a relativised version, viz. ΓωR(U(A)), amounts to estimating the following quantity:

D A,ΓωR U(A)

=

Z π

−π µF

ωR(dθ) e

iθNS_Ae−iθNS

−A

.

For 0 _≤ ǫ < 1, the overall width at confidence level 1₋ǫ of the measure µF

ωR is defined by

Wǫ(µF

ωR) := inf

w∃θ : µF_ω_R I(θ, w)

(8)

where _I(θ, w) denotes the closed interval of width w _≤ 2π centred at θ in [₋π, π), understood as a circle. It can be shown (see, e.g., [23, Chapter 12]) that the above infimum is actually a minimum. Hence, due to the absolute continuity of F, there is a θ0 such that µF

ωR I(θ0, Wǫ(µ F

ωR)

= 1₋ǫ. We call _I(θ0, Wǫ(µF

ωR)) an ǫ-support of the measure µF

ωR (noting that it need not be unique). In addition, we define another quantity W0

ǫ(µFωR)—the overall width around 0—by

W_ǫ0(µF

ωR) := inf

w µF_ω

R I(0, w)

≥1₋ǫ . Of course, W0

ǫ(µFωR)≥Wǫ(µ F

ωR) holds. Note that this inf can also be replaced by min. That is, we have

µF

ωR I(0, W

0

ǫ(µFωR))

= 1₋ǫ. (2)

Proposition 1. Let U _{be a relativisation map and} _Γ_ω_R _{a restriction map. For an}

arbitrary effect A and 0_≤ǫ <1, it holds that

D A,ΓωR(U(A))

≤

[N_S, A]

1

2W 0

ǫ(µFωR)(1−ǫ) +πǫ

.

Proof. Letθ _∈[₋π, π). d

dθe iNSθ

Ae−iNSθ

=eiNSθ

i[N_S, A]e−iNSθ

is used to obtain

eiNSθ_Ae−iNSθ

−A=

Z θ

0

dθ′eiNSθ′_i_[_N

S, A]e−iNSθ ′

.

Taking the norm yields

eiNSθAe−iNSθ−A ≤ |θ|

[N_S, A] .

Thus,

D A,ΓωR(U(A))

≤

[N_S, A]

Z π

−π µF

ωR(dθ)|θ|

=[N_S, A]

Z Wǫ0(µ

F

ω_R)/2

−W0

ǫ(µFω_R)/2

µF

ωR(dθ)|θ|

+

Z π

W0

ǫ(µFω_R)/2

µF_ωR(dθ)|θ|+

Z −Wǫ0(µFω_R)/2

−π

µF_ωR(dθ)|θ|

!

≤[N_S, A]

1₂W_ǫ0(µF_ω_R)(1−ǫ) +πǫ

,

where we used Eq. (2).

Therefore, we may conclude that for ω_R with strong localisation aroundθ = 0, the discrepancy between A and ΓωR(U(A)) is small.

Anω_Rwhich has small overall width but large overall width around 0 does not give good approximation of A by U₍_A_{). However, we can construct a deformed quantity} U_θ₀₍_A_{) for}_θ₀ _∈_[₋_{π, π}_{) as}

U_θ

0(A) = (1⊗U

R(θ0))U(A)(1⊗U

(9)

which is G-invariant. Recall that for 0 _≤ ǫ < 1, each ω_R has θ0 such that µF

ωR(I(θ0, Wǫ(µ F

ωR))) = 1−ǫ. For such a θ0, we have

D A,ΓωR(Uθ0(A))

≤

[N_S, A]

1

2Wǫ(µ

F

ωR)(1−ǫ) +πǫ

.

In the following we show that the high localisation ofω_R is necessary to haveAwell approximated by ΓωR(U(A)). In this illustration we assume that NS has eigenstates|0i and _|1_i with respective eigenvalues 0 and 1. We consider a particular effect A defined by A = 1₂ _|1_ih1_|+_|0_ih0_|+_|0_ih1_|+_|1_ih0_|

. The effect A does not commute with N_S. We show that the magnitude of difference between A and ΓωR(U(A)) is related to the localisation property indeed.

Proposition 2. For A= 1

2 |1ih1|+|0ih0|+|0ih1|+|1ih0|

, it holds that

D A,ΓωR(U(A))

≥ ₂ǫ1₋cos 1₂W_ǫ0(µF

ωR)

Proof. As eiNSθ_Ae−iNSθ−_A= 1

2

|0_ih1_|(e−iθ₋_{1) + (}_eiθ ₋₁₎_|₁_ih₀_|

holds, we have

D A,ΓωR(U(A))

= 1 2

|0ih1|c+c|1ih0|

= 1₂|c|,

where c:=R

µF

ωR(dθ)(1−e

iθ_{). This}_|_c_| _{is estimated for}_t_∈_R _as

|c_{| ≥}Re c=

Z

µF

ωR(dθ)(1−cosθ)

=

Z

1−cosθ>t µF

ωR(dθ)(1−cosθ) +

Z

1−cosθ≤t µF

ωR(dθ)(1−cosθ)

≥t_·µF

ωR {θ|1−cosθ > t}

=t

1₋µF

ωR {θ|1−cosθ ≤t}

.

We put t = 1₋cos 1 2W

0 ǫ(µFωR)

to obtain (using again (2))

t

1₋µF

ωR {θ|1−cosθ≤t}

=

1₋cos 1 2W

0 ǫ(µFωR)

ǫ.

Thus we can see that as Wǫ0(µFωR) becomes large so does the discrepancy

D A,ΓωR(U(A))

.

General effects can be treated in essentially the same manner. Suppose A does not commute with N_S = P

nnPn, where each Pn is a projection. A can be decomposed into two parts, as A = P

nPnAPn +

P

n6=mPnAPm, and therefore eiNSθAe−iNSθ−A=P

n>m((ei(n−m)−1)PnAPm+ (e−i(n−m)−1)PmAPn) holds. Putting cnm:=R

µF

ωR(dθ)(1−e

i(n−m)θ_{), we have}

D A,ΓωR(U(A))

= X n>m

cnmPnAPm+cnmPmAPn

.

(10)

Pm_|m_i=_|m_iso that_|φ_i:= _√1

2(|ni+|mi) satisfieshφ| (cnmPnAPm+cnmPmAPn)|φi= 1

2(cnmhn|A|mi+cnmhm|A|ni) =|cnm|| hn|A|mi | 6= 0. Thus we have a bound, D A,ΓωR(U(A))

≥ |cnm_{|| h}n_|A_|m_{i |}. |cnm_| is bounded as

|cnm_{| ≥}Recnm _≥

Z n−πm

− π n−m

µFωR(dθ)(1−cos(n−m)θ)

≥t_·µFωR

{θ_|1₋cos(n₋m)θ > t_{} ∩}[₋ π n₋m,

π n₋m]

.

For sufficiently large ǫ satisfying W0

ǫ(µFωR) ≤

2π

n−m, we put t = 1−cos n−m

2 W 0 ǫ(µFωR)

to obtain

t_·µF_ωR

{θ_|1₋cos(n₋m)θ > t_{} ∩}[₋ π n₋m,

π n₋m]

=

1₋cos

n₋m

2 W

0 ǫ(µFωR)

ǫ.

Thus D A,ΓωR(U(A))

can be bounded by a similar inequality in which W0

ǫ(µFωR) plays a role. In the following we treat only the special effect A = 1

2(|0ih0|+|1ih1|+|0ih1|+ |1_ih0_|) for simplicity.

In [4] it is shown that a large reference frame allows for good phase localisation around 0. Now we show that to attain good localisation in the sense of small overall width, a large Hilbert space dimension for the reference is needed.

Lemma 1. The overall width W0

ǫ(µFωR) of µ F

ωR around 0 satisfies, for any ǫ≥0, 1₋ǫ_≤ dimHR

2π W 0 ǫ(µFωR).

Proof. For anyX _⊂[₋π, π) and for any stateω_R, we haveω_R(F(X))_≤tr[F(X)], where tr denotes the trace in _H_R. Now for any eigenstates _|n_i ofN_R, we have _hn_|F(X)_|n_i= |X_|/2π and so tr[F(X)] = dim_H_R_|X_|/2π. We put X = [₋W0

ǫ(µFωR)/2, W

0

ǫ(µFωR)/2] to obtain the result (using once more the equality (2)).

Note that this estimate is not tight. For instance, for ǫ= 0 we obtain W0

0(µFωR)≥

2π

dimHR. However, one can show that for any stateW

0

0(µFωR) = 2π. That is, no state can be strictly localised [24]. In fact, if we were to assume that a stateω_R could be strictly localised, there would exist an open setX _⊂[₋π, π) for which ω_R(F(X)) = 0. Defining f(θ) := ω_R eiNRθ

F(X)e−iNRθ _{and noting that the Hilbert space is finite dimensional,} this function can be analytically continued to the whole complex plane. This analytic function, by assumption, satisfiesf(θ) = 0 on some open interval ofR_{. This means that} f(θ) = 0 on the whole plane, which contradicts ω_R F([₋π, π))

= 1.

Proposition 3. Let A = 1

2 |1ih1|+|0ih0|+|0ih1|+|1ih0|

. For any ǫ_≥0,

D A,ΓωR(U(A))

≥ 1₂

1₋cos

π 1−ǫ dim_H_R

(11)

Proof. We use Lemma 1 and Proposition 2 to obtain cos W0

ǫ(µFωR)/2

≤ cos (π(1₋ǫ)/dim_H_R).

In particular for ǫ= 1₂, we obtain

D A,ΓωR(U(A))

≥ 1 4

1₋cos

π 2 dim_H_R

,

which is a non-trivial bound for dim_H_R <_∞.

Let us estimate the overall width around 0 in terms of the standard deviation of the number operator ∆ωRNR := (ωR(N

2

R)−ωR(NR)2)1/2. We obtain the following bound for W0

ǫ(µFωR).

Lemma 2. Assume W0

ǫ(µFωR) ≤ π and ∆ωRNR ·W

0

ǫ(µFωR) ≤

π

2. Then Wǫ0(µFωR) is

bounded as

cos ∆ωRNR·W

0 ǫ(µFωR)

≤pǫ(1₋ǫ) +√ǫ.

Proof. Forǫ satisfying pǫ(1₋ǫ) +√ǫ_≥1 the claim is trivial and therefore we assume otherwise. In the following W0

ǫ(µFωR) is denoted by W

0

ǫ for notational simplicity. As stated above, the definition of the overall width around 0 implies ω_R(F(_I(0, W0

ǫ))) = tr[ρRF(_I(0, W0

ǫ))] = 1 − ǫ, where ρR is a density operator representing ωR, i.e.,

ω_R(A) = tr[ρ_RA] for allA_{∈ L}(_H_R). Now we define shifted states (as density operators) of ρ_R by ρ_R,θ := e−iNRθρReiNRθ for −π ≤ θ ≤ π. Let us consider the fidelity [25, 26] between ρ_R and ρ_R,−W0

ǫ.

The fidelity between two states ρ0 and ρ1 is defined by F(ρ0, ρ1) := trh ρ1/2₀ ρ1ρ1/2₀ 1/2i

, which takes values in [0,1]. It quantifies the closeness of two states such that F(ρ0, ρ1) = 1 holds if and only if ρ0 = ρ1. We note that the fidelity has another representation [27],

F(ρ0, ρ1) = inf

E:pom

X

x

tr[ρ0E({x})]1/2tr[ρ1E({x})]1/2,

where inf is taken over all discrete poms.

With _I(0, W0

ǫ)c := [−π, π)\ I(0, Wǫ0), we have tr

ρRF(_I(0, W0 ǫ)c)

= ǫ. Further, noting that the “shifted” (modulo 2π) set _I(0, W0

ǫ)c −Wǫ0 = I(−Wǫ0, Wǫ0)c contains I(0, W0

ǫ) \ {−Wǫ0/2} (since Wǫ0 ≤ π), we also obtain tr

ρ_R,−W0

ǫF(I(0, W

0 ǫ)c)

= tr

ρ_RF(_I(₋W0

ǫ, Wǫ0)c)

≥tr

ρ_RF(_I(0, W0 ǫ))

= 1₋ǫ, and thus tr[ρ_R,−W0

ǫF(I(0, W

0 ǫ))]≤ ǫ. Therefore we have

F(ρ_R, ρ_R,−W0

ǫ)≤tr[ρRF(I(0, W

0

ǫ))]1/2tr[ρR,−W0

ǫF(I(0, W

0 ǫ))]1/2 + tr[ρ_RF(_I(0, W0

ǫ)c)]1/2tr[ρR,−W0

ǫF(I(0, W

0 ǫ)c)]1/2 ≤pǫ(1₋ǫ) +√ǫ.

The Mandelstam-Tamm uncertainty relation [28, 29] claims that for an arbitrary normalized vector _|φ_{i ∈ H}_R and θ with ∆φN_R _{· |}θ_{| ≤} π/2, _|

φ_|e−iNRθ

|φ

| ≥ cos (∆φN_R_·θ), where ∆φN_R = _hφ_|N2

R|φi − hφ|NR|φi2

1/2

(12)

between general states ρ0 and ρ1 is also given as F(ρ0, ρ1) = sup_|hφ0_|φ1_i| where sup is taken over all the possible purifications of ρ0 and ρ1, the Mandelstam-Tamm inequality is applied to purified states to give [30],

F(ρ_R, ρ_R,−θ)≥cos (∆ωRNR·θ).

Thus we obtain

cos ∆ωRNR·W

0 ǫ

≤pǫ(1₋ǫ) +√ǫ.

Theorem 1. Let A be an effect defined by A = 1₂(_|0_ih0_|+_|1_ih1_|+_|0_ih1_|+_|1_ih0_|). For

ω_R satisfying ∆ωRNR <

1 6,

D(A,ΓωR(U(A))) > 1 32.

For ω_R satisfying∆ωRNR ≥

1

6, it holds D(A,ΓωR(U(A))) ≥

1 32

1₋cos

π 12∆ωRNR

.

Proof. We put ǫ = 1

16. There are three possibilities: (i) W 0

1 16

(µF

ωR) ≤ π and ∆ωRNR · W

0

1 16

(µF

ωR) >

π

2, (ii) W 0

1 16

(µF

ωR) ≤ π and ∆ωRNR · W

0

1 16

(µF

ωR) ≤

π 2, (iii) W0

1 16

(µF

ωR)> π. (i) implies ∆ωRNR >

1

2. In this case, Proposition 2 gives

D(A,ΓωR(U(A))) ≥ 1 32

1₋cos

π 4∆ωRNR

. (3)

Under condition (ii) lemma 2 applies. Forǫ= ₁₆1, pǫ(1₋ǫ) +√ǫ_≤2√ǫ= 1₂ holds and thus

∆ωRNR·W

0

1 16(µ

F

ωR)≥

π

6 (4)

follows. It implies ∆ωRNR≥

1 6 and

D(A,ΓωR(U(A))) ≥ 1 32

1₋cos

π 12∆ωRNR

.

Combining (i) and (ii), we observe that ∆ωRNR <

1

6 implies W 0

1 16

(µF

ωR)> π, thus (iii). In this case, Proposition 2 is applied to show that

D(A,ΓωR(U(A))> 1

32. (5)

This completes the proof.

(13)

There are various reasons for going beyond these results. U _{relies on covariant}

phases, and does not yield all invariant observables in _H, immediately pointing to the need to understand the general case beyond the particular construction. We now present a model-independent trade-off between the operational measure of distance D(Γ(E), A) of an arbitrary effectA from the restriction of an invariant effectE, and the size of the reference system.

5. General argument

In this section we provide a general operational trade-off relation. We consider an arbitrary effect A _{∈ E}(_H_S) and a relative (invariant) effect E _{∈ E}(_H_S _{⊗ H}_R) = _E(_H) and try to bound D(A,ΓωR(E)). Once more there is a symmetry (Lie) groupG acting (via projective unitary representations) both in _L(_H_S) and _L(_H_R), and we recall that for each element n in the Lie algebra g of G, there exist corresponding self-adjoint operators N_S and N_R such that for sufficiently small _|s_|, αS

ens(A) = e

iNSsAe−iNSs and

αR

ens(A) =e

iNRsAe−iNRs.

For notational simplicity we write Γ : _L(_H) _{→ L}(_H_S) for a channel of the form Γ = ΓωR for some ωR. The following is our main result.

Theorem 2. Recall that V(A) = _kA ₋A2_k_, _D₍_{A, B}_{) =} _k_A₋_B_k_, _N

S and NR are

number operators on _H_S and _H_R respectively, and ω_R _{∈ S}(_H_R). Then the following inequality holds:

[A, N_S]

≤2D Γ(E), A

kN_S_k+ 2 ω_R(N_R2)₋ω_R(N_R)21/2

2D(Γ(E), A) +V(A)1/2

.

This inequality shows that good proximity between a symmetric, relative observable and a non-symmetric absolute quantity requires a large spread in the reference system’s number operator.

Before we prove the theorem, we recall some relevant results concerning channels Λ : _L(_H) _{→ L}(_K). In the application of the proof, we will specify Λ = Γ : L(_H_S _{⊗ H}_R)_{→ L}(_H_S).

Define a sesquilinear mapping _h·,_·i:_L(_H)_{× L}(_H)_{→ L}(_K) by

hA, B_i:= Λ(A∗_B₎₋_Λ(_A∗_)Λ(_B₎_.

This mapping satisfies _hA, A_{i ≥}0 and_hA, B_i∗ ₌_h_{B, A}_i _{for all} _{A, B} _{∈ L}₍_H_{). Thus this} can be regarded as an “operator-valued inner product”. It satisfies a Cauchy-Schwarz inequality proven by Janssens [33], which we recapitulate here as we expect it to be useful for future steps in this line of investigation.

Lemma 3. Consider the map _h·,_·i defined above. For any A, B _{∈ L}(_H) , it satisfies

(14)

Proof. We consider a (not necessarily minimal) Stinespring representation (_H′_{, V}_{) of} the channel Λ. That is, _H′ _{is a Hilbert space and} _V _:_{K → H ⊗ H}′ _{is an isometry such} that Λ(A) =V∗₍_A_⊗

1

H′)V holds. By introducing the operatorξ :=√1−V V∗, we can

represent the map as

hA, B_i=V∗(A∗ _⊗1

H′)ξ∗ξ(B⊗1

H′)V.

We then have

hA, B_ihB, A_i=V∗(A∗ _⊗1

H′)ξ∗ξ(B⊗1

H′)V V∗(B∗⊗1

H′)ξ∗ξ(A⊗1

H′)V.

As ξ(B _⊗1

H′)V V∗(B∗⊗1

H′)ξ∗ ≥0, it follows that

V∗(A∗_⊗1H′)ξ

∗_ξ₍_B_⊗

1H′)V V

∗₍_B∗_⊗

1H′)ξ

∗_ξ₍_A_⊗

1H′)V

≤ kξ(B_⊗1

H′)V V∗(B∗⊗1

H′)ξ∗kV∗(A∗⊗1

H′)ξ∗ξ(A⊗1

H′)V.

Using the C∗ _{property of the norm, we obtain}

kξ(B_⊗1

H′)V V∗(B∗⊗1

H′)ξ∗k=kV∗(B∗⊗1

H′)ξ∗ξ(B⊗1

H′)Vk=khB, Bik,

which proves the lemma.

The following lemma is an immediate consequence of the previous result.

Lemma 4. For any A, B _{∈ L}(_H),

khA, B_ik2 _{≤ kh}B, B_ikkhA, A_ik.

We now have the following result.

Lemma 5. If A, B _{∈ L}(_H) satisfy [A, B] = 0, then

k[Λ(A),Λ(B)]_{k ≤ k}Λ(A∗A)₋Λ(A)∗Λ(A)_k1/2_kΛ(BB∗)₋Λ(B)Λ(B)∗_k1/2

+_kΛ(AA∗)₋Λ(A)Λ(A)∗_k1/2_kΛ(B∗B)₋Λ(B)∗Λ(B)_k1/2. (6)

Proof. Appealing again to the sesquilinear mapping _hA, B_i, we write

hA∗_{, B}_i_{= Λ(}_AB₎₋_Λ(_A_)Λ(_B₎_, _h_B∗_{, A}_i_{= Λ(}_BA₎₋_Λ(_B_)Λ(_A₎_.

Thus for A, B _{∈ L}(_H) satisfying [A, B] = 0, it holds that

[Λ(A),Λ(B)] =_hB∗, A_{i − h}A∗, B_i.

By Lemma 4, the result is proved.

(15)

Proof. Define ǫ := Γ(E)₋A; we wish to estimate _kǫ_k = D(Γ(E), A). Now we have [A, N_S] = [Γ(E), N_S] + [N_S, ǫ] and thus

k[A, N_S]_{k ≤ k}[Γ(E), N_S]_k+_k[N_S, ǫ]_k. (7) We provide a bound for each term. The second term of the right-hand side is easily bounded as _k[N_S, ǫ]_{k ≤}2_kN_S_kkǫ_k.

By assumption we have [E, N] = 0, and therefore we may apply Lemma 5 in order to bound the first term on the right hand side of (7). Note that Γ(N) =N_S+ω_R(N_R)1.

We thus obtain

[Γ(E), N_S]

=

[Γ(E),Γ(N)]

≤2

Γ(E2)−Γ(E)2

1/2

Γ(N2)−Γ(N)2

1/2

= 2Γ(E2)−Γ(E)2

1/2

ω_R(N_R2)₋ω(N_R)21/2

. (8)

We now bound_kΓ(E2₎₋_Γ(_E₎2_k_{. Using Γ(}_E2₎₋_Γ(_E₎2 _≥_{0 (two-positivity) and}_E2 _≤_E_, we obtain

Γ(E2)₋Γ(E)2 _≤Γ(E)₋Γ(E)2 = (ǫ+A)₋(ǫ+A)2

=ǫ₋ǫ2 ₋ǫA₋Aǫ+A₋A2.

Thus it holds that

kΓ(E2)₋Γ(E)2_{k ≤ k}ǫ₋ǫ2₋ǫA₋Aǫ_k+V(A) =

[A, ǫ] + (1−2A−ǫ)ǫ

+V(A)

≤[A, ǫ]

+

1−(A+ Γ(E))

ǫ+V(A).

0 _≤ A _≤ 1 gives

[A, ǫ]

≤ kǫk. In fact, [A, ǫ]

= sup_k_φ_k₌₁| hφ|i[A, ǫ]|φi |

can be bounded by Robertson’s uncertainty relation as _{| h}φ_|i[A, ǫ]_|φ_{i |} _≤

2

q

hφ_|A2_|_φ_{i − h}_φ_|_A_|_φ_i2_k_ǫ_k_withq_h_φ_|_A2_|_φ_{i − h}_φ_|_A_|_φ_i2 _≤ 1

2, and 0≤A+ Γ(E)≤ 21 gives

1−(A+ Γ(E))

≤1. Thus we obtain

Γ(E2)−Γ(E)2

≤2kǫk+V(A). (9)

Putting (9) in (8), we obtain

k[Γ(E), N_S]_{k ≤}2(2_kǫ_k+V(A))1/2(ω_R(N_R2)₋ω(N_R)2)1/2. (10)

Thus, (7) can be bounded as

[A, N_S]

≤2kN_Skkǫk+ 2 2kǫk+V(A) 1/2

ω_R(N_R2)₋ω(N_R)21/2

.

This completes the proof.

(16)

Corollary 1. It holds that

[A, N_S]

≤2D(Γ(E), A)kN_Sk+ 2kN_Rk 2D(Γ(E), A) +V(A) 1/2

.

Corollary 2. For a sharp effect (projection) A it holds that

[A, N_S]

≤2D(Γ(E), A)kN_Sk+ 2

√

2 ω_R(N_R2)₋ω_R(N_R)21/2

D(Γ(E), A)1/2,

Thus to attain a good measurement a large reference frame is required, since it is only then that ω_R(N2

R)−ωR(NR)2 can be large.

Example 1. (G=U(1)). In section 4 we studied the particular mapU _for _G₌_U₍₁₎

and showed that a large reference frame is needed for U₍_A_{) to well approximate the}

effect A = 1

2(|0ih0|+|1ih1|+|0ih1|+|1ih0|). Below we estimate the lower bound of the distance D(Γ(E), A) for an arbitrary invariantE. Since A acts only on a subspace spanned by _{|0_i,_|1_i}, we introduce a projection operator P = _|0_ih0_|+_|1_ih1_|. We first note that for an arbitrary invariantE, we have

D Γ((P _⊗1)E(P ⊗1)), A

≤D Γ(E), A

.

In fact, since Γ((P _⊗1)E(P ⊗1)) = PΓ(E)P holds,§ D Γ((P ⊗1)E(P ⊗1)), A

=

P(Γ(E)−A)P

≤ D(Γ(E), A) follows. Thus it suffices to consider only E satisfying

(P_⊗1)E(P⊗1) =E. For such an E,ǫ:= Γ(E)−A satisfies [N

S, ǫ] = [P NSP, ǫ]. Thus

[N_S, ǫ]

in (7) can be bounded as [N_S, ǫ]

≤ kǫk (where positivity of N_S is used).

Compared to the direct application of Corollary 2, this gives the tighter bound 1

2 ≤D(ΓωR(E), A) + 2

√

2(ω_R(N_R2)₋ω_R(N_R)2)1/2D(ΓωR(E), A)

1/2_,

which shows a trade-off relation between the accuracy and the size of the reference frame.

Discussion. _{The necessity of the large reference frame, in the sense of large fluctuation}

ω_R(N2

R)−ωR(NR)2, can be interpreted in terms of the uncertainty relation for joint measurability [20]. E commutes with N := N_S +N_R, and one may consider joint measurements of E and N where these observables may be regarded as approximators to A and N_S, respectively. The aim is to obtain outcome distributions of E that are close to those of A. Since A and N_S do not commute, this closeness entails, due to the joint measurement uncertainty relation, that the outcome distribution of N should contain less “information” on N_S. In other words, the (hypothetical) measurement of N must be a highly inaccurate measurement of N_S. This large approximation error can be achieved only by the initial uncertainty of N_R. Put another way, for A to be a good “absolute” representative of E in the context of a joint measurement ofE and N, by the uncertainty relation for A and N_S, N_S cannot well approximate N, and this discrepancy between N_S and N is afforded by a large spread in N_R.

We also note that symmetry constraints of the form we have considered are relevant in the framework of resource theories (e.g., [16], [34]), wherein a reference state ω_R

(17)

is viewed as a resource if it is not invariant under U(1) symmetry transformations. The asymmetric state is seen in [16] as allowing for the simulation of non-invariant statistics of_S under the constraint of symmetric operations of_S+_R, and the degree of asymmetry may be quantified and exploited in the Wigner-Araki-Yanase theorem [34]. The connection between asymmetry and localisation, as we have it, is clear: highly localised reference states, with respect to a phase conjugate to number, are highly asymmetric under phase shifts. The exact equivalence, should there be one, between localisation as allowing non-invariant quantities to represent relative ones on the one hand, and asymmetry as a resource for good measurements in Wigner-Araki-Yanase-type scenarios on the other, remains a task to be investigated.

6. Conclusion

Through model considerations and generic trade-off relations we have shown that the possibility of the traditional tacit reduction of relative quantities to “absolute” quantities is contingent upon the size of the reference system and a judicious choice of reference (system) state. An obstruction to complete specification of subsystem statistics, or rather to subsystem quantities being used to approximate the corresponding invariant quantities, arises due to the incompatibility of subsystem quantities.

The necessity of a large uncertainty in the symmetry generator in a given reference state for good approximation is an essentially quantum feature. In classical mechanics, all observables commute and all (pure) states are well localised with respect to all classical quantities. Provided that the classical reference system admits a faithful action of the symmetry group, this reference system is sufficient for the “absolute” quantities to perfectly represent the relative ones, with no constraint on the values of other quantities at all.

The analysis presented here constitutes a first step towards a comprehensive, operational understanding of the role of (quantum) reference systems in the description of quantum experiments, and suggests a number of further avenues of exploration, for example the physically relevant case of non-Abelian symmetries, approximation of invariant observables by “absolute” phase-space quantities, strength of Bell inequality violation for finite-size reference systems, and many more.

Acknowledgements Many thanks to Tom Bullock for a careful reading of an earlier version of this manuscript. TM acknowledges JSPS KAKENHI (grant no. 15K04998). LL acknowledges support under the grantQuantum Mathematics and Computation (no. EP/K015478/1).

References

[1] E. P. Wigner, 1967 Symmetries and Reflections (Bloomington and London: Indiana University Press)

(18)

[3] S. D. Bartlett, T. Rudolph and R. W. Spekkens, 2007. Reference frames, superselection rules, and quantum information, Rev. Mod. Phys.79_{, 555.}

[4] L. Loveridge, P. Busch and T. Miyadera, 2016. Quantum measurements and symmetry, in preparation.

[5] D. N. Page, W. K. Wootters, 1983. Evolution without evolution: Dynamics described by stationary observables, Phys. Rev. D27_{, 2885.}

[6] G. J. Milburn, D. Poulin, 2006. Relational time for systems of oscillators, Int. J. Quantum Inform.,

04_{, 151.}

[7] R. Brunetti, K. Fredenhagen, M. Hoge, 2010. Time in quantum physics: from an external parameter to an intrinsic observable, Found. Phys.40_{, 1368.}

[8] V. Giovannetti, S. Lloyd, L. Maccone, 2015. Quantum time, Phys. Rev. D92_{, 045033.}

[9] E. Wigner, 1952. Die Messung quantenmechanischer Operatoren, Z. Phys.133_{, 101.}

[10] E. P. Wigner, 1952. Measurement of quantum mechanical operators. Translation of [5] by P. Busch; http://arxiv.org/abs/1012.4372.

[11] H. Araki and M.M. Yanase, 1960. Measurement of quantum mechanical operators, Phys. Rev.

120_{, 622.}

[12] M. M. Yanase, 1961. Optimal measuring apparatus, Phys. Rev.123_{, 666.}

[13] T. Miyadera and H. Imai, 2006. Wigner-Araki-Yanase theorem on distinguishability, Phys. Rev. A74_{, 024101.}

[14] L. Loveridge and P. Busch 2011. “Measurement of quantum mechanical operators” revisited, Eur. Phys. J. D.62_{, 2.}

[15] P. Busch and L. Loveridge 2011. Position measurements obeying momentum conservation, Phys. Rev. Lett.106_{, 110406.}

[16] I. Marvian and R. W. Spekkens, 2012. An information-theoretic account of the Wigner-Araki-Yanase theorem, arXiv:1212.3378.

[17] G. C. Wick, A. S. Wightman, and E. P. Wigner, 1952. Intrinsic parity of elementary particles, Phys. Rev.88_{, 101.}

[18] Y. Aharonov and L. Susskind, 1967. Charge superselection rule, Phys. Rev.155_{, 1428.}

[19] P. Busch, 2009. On the sharpness and bias of quantum effects, Found. Phys.39_{, 712-730.}

[20] T. Miyadera and H. Imai, 2008. Heisenberg’s uncertainty principle for simultaneous measurement of positive-operator-valued measures, Phys. Rev. A78_{, 052119.}

[21] L. Polterovich, 2014. Symplectic geometry of quantum noise, Commun. Math. Phys.327_{, 481-519.}

[22] E. B. Davies, 1976.Quantum Theory of Open Systems, Academic Press.

[23] P. Busch, P. Lahti, J.-P. Pellonpää and K. Ylinen,Quantum Measurement, Springer (forthcoming). [24] P. Busch, P. Lahti, J.-P. Pellonpää and K. Ylinen, 2001. Are number and phase complementary

observables?, J. Phys. A34_{, 5923-5935,}

[25] C. A. Fuchs, C. M. Caves, 1994. Ensemble-dependent bounds for accessible information in quantum mechanics, Phys. Rev. Lett.73_{, 3047.}

[26] M. Nielsen, I. Chuang, 2000. Quantum Computation and Quantum Information, Cambridge University Press.

[27] H. Barnum, C. M. Caves, C. A. Fuchs, R. Jozsa, B. Schumacher, 1996. Noncommuting mixed states cannot be broadcast, Phys. Rev. Lett.76_{, 2818.}

[28] L. I. Mandelshtam, I. E. Tamm, 1945. The uncertainty relation between energy and time in nonrelativistic quantum mechanics, J. Phys. U.S.S.R.9_{, 249.}

[29] P. Busch, 2008. The time-energy uncertainty relation, inTime in Quantum Physics, Springer. [30] T. Miyadera, 2015. Energy-time uncertainty relations in quantum measurements,

arXiv:1505.03707.

[31] P. Busch, M. Grabowki, P. Lahti, 1997.Operational Quantum Physics, Springer, 2nd ed.

[32] T. Miyadera, 2011. Uncertainty relations for joint localizability and joint measurability in finite-dimensional systems, J. Math. Phys.52_{, 072105.}

(19)