This site uses cookies. By continuing to use this site you agree to our use of cookies. To find out more, see our Privacy and Cookies policy. Close this notification
Paper The following article is Open access

Imprecise probability for non-commuting observables

Published 13 August 2015 © 2015 IOP Publishing Ltd and Deutsche Physikalische Gesellschaft
, , Focus on Quantum Thermodynamics Citation Armen E Allahverdyan 2015 New J. Phys. 17 085005 DOI 10.1088/1367-2630/17/8/085005

1367-2630/17/8/085005

Abstract

It is known that non-commuting observables in quantum mechanics do not have joint probability. This statement refers to the precise (additive) probability model. I show that the joint distribution of any non-commuting pair of variables can be quantified via upper and lower probabilities, i.e. the joint probability is described by an interval instead of a number (imprecise probability). I propose transparent axioms from which the upper and lower probability operators follow. The imprecise probability depend on the non-commuting observables, is linear over the state (density matrix) and reverts to the usual expression for commuting observables.

Export citation and abstract BibTeX RIS

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

1. Introduction

Non-commuting observables in quantum mechanics do not have a joint probability [15] (see appendix A.1 for a reminder). This is the departure point of quantum mechanics from classical probabilistic theories [6]; it lies in the core of all quantum oddities. There are various quasi-probabilities (e.g., Wigner function) which have features of joint probability for (loosely defined) semiclassical states [79, 13]. Quasi-probabilities do have two problems: (i) they (must) get negative for a class of quantum states, thereby preventing any probabilistic interpretation for them1 . (ii) Even if the quasi-probability is positive on a certain state, it is not unique, i.e. there can be other (equally legitimate) quasi-probability that is positive (and has other expected features of probability) on this state, e.g. there are Wigner function, P-function, Terletsky–Margenau–Hill function etc [15, 16]. Despite of the drawbacks, quasi-probabilities do have many applications [712], since they still possess certain features of joint probability, e.g. they reproduce the marginals [2, 4, 79, 12]. In particular, there are applications in equilibrium quantum statistical mechanics, where the Wigner quasi-probability and the Terletsky–Margenau–Hill function [15, 16] are routinely employed for studying equilibrium relations in the semi-classical domain2 [17]. Applications in non-equilibrium quantum statistical mechanics are even more known, since whole chapters of open-system dynamics are written in terms of quasi-probabilities; see e.g. [10]. Another application of the Terletsky–Margenau–Hill quasi-probability for quantum non-equilibrium thermodynamics was proposed recently in the context of fluctuations of work and fluctuation theorems [12].

As a possible alternative to quasi-probabilities, one can relax the requirement that the sought joint probability correctly reproduces the marginals3 . This is done when studying joint measurements of non-commuting variables [4, 5, 2126]. Such measurements have to be approximate, since they operate on an arbitrary initial state [4, 5]. They produce positive probabilities for the measurement results, but it is not clear to which extent these probabilities are intrinsic [27], i.e. to which extent they characterize the system itself, and not approximate measurements employed. Alternatively, one can consider two consecutive measurements of the non-commuting observables [28, 29]. These two-time probabilities do not (generally) qualify for the joint probability of the non-commuting observables; see appendix A.2.

It is assumed that the sought joint probability is linear over the state (density matrix). If this condition is skipped, there are positive probabilities that correctly reproduce marginals for non-commuting observables [20, 24], e.g. simply the product of two marginals [13]. However, they do not reduce to the usual form of the joint quantum probability for commuting observables4 ; hence their physical meaning is unclear [13].

The statement on the non-existence of joint probability concern the usual precise and additive probability. This is not the only model of uncertainty. It was recognized since early days of probability theory [49] that the probability need not be precise: instead of being a definite number, it can be a definite interval [5154]; see [55] for an elementary introduction5 .

Instead of a precise probability for an event E, the measure of uncertainty is now an interval $[\underline{p}(E),\bar{p}(E)]$, where $0\leqslant \underline{p}(E)\leqslant \bar{p}(E)$ are called lower and upper probabilities, respectively. Qualitatively, $\underline{p}(E)$ ($1-\bar{p}(E)$) is a measure of a sure evidence in favor (against) of E. The event E is surely more probable than E', if $\underline{p}(E)\geqslant \bar{p}(E^{\prime} )$. The usual probability is recovered for $\underline{p}(E)=\bar{p}(E)$. Two different pairs $[\underline{p}(E),\bar{p}(E)]$ and $[\underline{p}^{\prime} (E),\bar{p}^{\prime} (E)]$ can hold simultaneously (i.e. they are consistent), provided that $\underline{p}^{\prime} (E)\leqslant \underline{p}(E)$ and $\bar{p}^{\prime} (E)\geqslant \bar{p}(E)$ for all E. In particular, every imprecise probability is consistent with6 $\underline{p}^{\prime} (E)=0$, $\bar{p}^{\prime} (E)=1.$

It is not assumed that for all E there is a true (precise, but unknown) probability that lies in $[\underline{p}(E),\bar{p}(E)]$. This assumption is frequently (but not always [34]) made in applications7 [52, 53], and it did motivate the generalized Kolmogorovian axiomatics of imprecise probability [54]; see appendix B.1. Imprecise joint probabilities in quantum mechanics are to be regarded as fundamental entities, not reducible to a lack of knowledge. They do need an independent axiomatic ground.

My purpose here is to propose a transparent set of conditions (axioms) that lead to quantum lower and upper joint probabilities. They depend only on the involved non-commuting observables (and on the quantum state).

The next section discusses previous attempts to introduce imprecise probability in quantum physics. Section 3 recalls standard linear algebra notations employed in this work. Section 4 describes physical conditions that are imposed on the sought imprecise probability. Section 5 outlines the main linear-algebra tool (CS-representation for projection operators) that is employed for finding the imprecise probability operators. Details of this representation are outlined in appendix C. Section 6 presents the main result: formulas for upper and lower probability operators. Their detailed derivation can be followed in appendix D. Several physical features of these operators are discussed in section 7 and also in appendices E and F. Section 8 discusses upper and lower probabilities for coordinate and momentum. I summarize in the last section. There I also mention several open problems related to this research.

2. Previous work

In 1967 Prugovecki tried to describe the joint probability of two non-commuting observables in a way that resembles imprecise probabilities [30]. But his expression was not correct, since it still can be negative [13]; cf footnote 1 and see also [18] in this context.

In 1991 Suppes and Zanotti proposed a local upper probability model for the standard setup of Bell inequalities (two entangled spins) [31]; see also [32, 33]. The formulation was given in the classical event space of hidden variables, and it is not unique even for the particular case considered. It violates classical observability conditions for the imprecise probability [31, 34, 54]. In particular, no lower probability exists in this scheme. Despite of such drawbacks, the pertinent message of [31] is that one should attempt at quantum applications of the upper probabilities that go beyond its classical axioms.

More recently, Galvan attempted to empoy (classical) imprecise probabilities for describing quantum dynamics in configuration space [37]. For a general discussion on quantum versus classical probabilities see [38].

3. Notations

All operators (matrices) live in a finite-dimensional Hilbert space ${\mathbb{H}}$. For two Hermitean operators Y and Z, $Y\geqslant Z$ means that all eigenvalues of $Y-Z$ are non-negative, i.e. $\langle \psi | (Y-Z)\psi \rangle \geqslant 0$ for any $| \psi \rangle \in {\mathbb{H}}$. The direct sum $Y\oplus Z$ of two operators refers to the following block-diagonal matrix:

$\mathrm{ran}(Y)$ is the range of Y (set of vectors $Y| \psi \rangle $, where $| \psi \rangle \in {\mathbb{H}}$). $I$ is the unity operator of ${\mathbb{H}}$. $\mathrm{ker}(Y)$ is the subspace of vectors $| \phi \rangle $ with $Y| \phi \rangle =0$.

${I}_{n}$ and 0n are the n × n unity and zero matrices, respectively.

In the direct sum of two sub-spaces, ${\mathbb{H}}\oplus {\mathbb{G}}$ it is always understood that ${\mathbb{H}}$ and ${\mathbb{G}}$ are orthogonal. The vector sum of (not necessarily orthogonal) sub-spaces ${\mathbb{A}}$ and ${\mathbb{B}}$ will be denoted as ${\mathbb{A}}+{\mathbb{B}}$. This space is formed by all vectors $| \psi \rangle +| \phi \rangle $, where $| \psi \rangle \in {\mathbb{A}}$ and $| \phi \rangle \in {\mathbb{B}}$.

4. Axioms for quantum imprecise probability

Existing axioms for imprecise probability are formulated on a classical event space with usual notions of con- and disjunction and complemention [51, 5154]; see appendix B for a reminder. For quantum probability it is natural to start from a Hilbert space and introduce upper and lower probabilities as operators. The axioms below require only the most basic feature of upper and lower probability and demand its consistency with the quantum joint probability whenever the latter is well-defined.

The usual quantum probability can be defined over (Hermitean) projectors $P={P}^{2}$ [39, 40]. A projector generalizes the classical notion of characteristic function. Each $P$ uniquely relates to its eigenspace $\mathrm{ran}(P)$. $P$ refers to a set of Hermitean operators $\{{\mathcal{P}}\}$:

Equation (1)

$P$ is a projector to an eigenspace of ${\mathcal{P}}$ or to a direct sum of such eigenspaces, i.e. $P$ refers to an eigenvalue of ${\mathcal{P}}$ or to a union of several eigenvalues. The quantum (precise and additive) probability to observe $P=1$ is $\mathrm{tr}[\rho P]$, where the density matrix $0\leqslant \rho \leqslant I$ defines the quantum state [4, 5, 39, 40].

Let $Q$ be another projector which refers to the set $\{{\mathcal{Q}}\}$ of observables. Generally, $[P,Q]\ne 0$. Given the density matrix ρ, we seek upper and lower joint probabilities of $P$ and $Q$ (i.e. of the corresponding eigenvalues of ${\mathcal{P}}$ and ${\mathcal{Q}}$ ):

Equation (2)

where $\underline{\omega }(P,Q)$ and $\bar{\omega }(P,Q)$ are Hermitean operators. Note that the upper and lower probabilities in (2) are assumed to have the usual Born's form, as far as their dependence on ρ is concerned.

We impose the following conditions (axioms):

Equation (3)

Equation (4)

Equation (5)

Equation (6)

Equation (7)

Equation (2) implies that $\underline{p}$ and $\bar{p}$ depend on $\{{\mathcal{P}}\}$ and $\{{\mathcal{Q}}\}$ only through $P$ and $Q$. This non-contextuality feature holds also for the ordinary (one-variable) quantum probability [43, 44]. Provided that the operators $\underline{\omega }$ and $\bar{\omega }$ are found, $\underline{p}$ and $\bar{p}$ can be determined in the usual way of quantum averages.

Conditions (3) stem from $0\leqslant \underline{p}(\rho ;P,Q)\leqslant \bar{p}(\rho ;P,Q)\leqslant 1$ that are demanded for all density matrices ρ. Equation (4) is the symmetry condition necessary for the joint probability. Equation (5) is reversion to the commuting case. In particular, (5) ensures $\underline{\omega }(P,0)=\bar{\omega }(P,0)=0$ and

Equation (8)

Since $Q=I$ means that $Q$ is anywhere, (8) is the reproduction of the marginal probability. The latter cannot be recovered by summation, since the very probability model is not additive.

For $[P,Q]=0$ the joint probability is $\mathrm{tr}(\rho {QP})=\mathrm{tr}(\rho {PQ})$. This expression is well-defined (i.e. positive, symmetric and additive) also for $[\rho ,Q]=0$ or $[\rho ,P]=0$ (but not necessarily $[P,Q]=0$). If $[\rho ,Q]=0$, one obtains $\mathrm{tr}(\rho {QP})$ by measuring $Q$ (ρ is not disturbed) and then $P$. Alternatively, one can obtain it by measuring the average of an Hermitean observable $\displaystyle \frac{1}{2}({PQ}+{QP})$. Thus (5), (6) demands that $\bar{p}(\rho ;P,Q)$ and $\underline{p}(\rho ;P,Q)$ are consistent with the joint probability $\mathrm{tr}(\rho \;{PQ})$, whenever the latter is well-defined.

Finally, equation (7) means that $\omega (P,Q)$ ($\omega =\underline{\omega },\bar{\omega }$) can be measured simultaneously and precisely with $P$ or with $Q$ (on any quantum state), a natural condition for the joint probability (operators)8 .

If there are several candidates satisfying (3)–(7) we shall naturally select the ones providing the largest lower probability and the smallest upper probability.

5. CS-representation

This representation will be our main tool. Given the projectors $P$ and Q, Hilbert space ${\mathbb{H}}$ can be represented as a direct sum [4547] (see appendix C)

Equation (9)

where the sub-space ${{\mathbb{H}}}_{\alpha \beta }$ of dimension ${m}_{\alpha \beta }$ is formed by common eigenvectors of $P$ and $Q$ having eigenvalue α (for $P$) and β (for Q). Depending on $P$ and $Q$ every sub-space can be absent; all of them can be present only for $\mathrm{dim}{\mathbb{H}}\geqslant 6$. Now ${{\mathbb{H}}}_{11}=\mathrm{ran}(P)\cap \mathrm{ran}(Q)$ is the intersection of the ranges of $P$ and $Q$. ${\mathbb{H}}^{\prime} $ has even dimension 2 $m$ [46, 47], this is the only sub-space in (9) that is not formed by common eigenvectors of $P$ and $Q$. There exists a unitary transformation

Equation (10)

so that $\hat{P}$ and $\hat{Q}$ get the following block-diagonal form related to (9) [46]:

Equation (11)

Equation (12)

where C and S are invertible square matrices of the same size holding

Equation (13)

Now $\mathrm{ran}(P^{\prime} )$ and $\mathrm{ran}(Q^{\prime} )$ are sub-spaces of ${\mathbb{H}}^{\prime} $. One has $C=\mathrm{cos}T$ and $S=\mathrm{sin}T$, where T is the operator analogue of the angle between two spaces. ${{\mathbb{H}}}_{{m}_{\alpha \beta }}$ are absent, if P and Q do not have any common eigenvector. This, in particular, happens in $\mathrm{dim}({\mathbb{H}})=2$.

6. The main result

Note that if (3)–(7) holds for $P$ and Q, they hold as well for $\hat{P}$ and $\hat{Q}$, because $\omega (P,Q)={U}^{\dagger }\omega (\hat{P},\hat{Q})U$ for $\omega =\underline{\omega },\bar{\omega }$. Appendix D shows how to get $\bar{\omega }(\hat{P},\hat{Q})$ and $\underline{\omega }(\hat{P},\hat{Q})$ from (3)–(7) and (11), (12):

Equation (14)

Equation (15)

Let $g(P,Q)=g(Q,P)$ be the projector onto intersection $\mathrm{ran}(P)\cap \mathrm{ran}(Q)$ of $\mathrm{ran}(P)$ and $\mathrm{ran}(Q)$. We now return from (10), (14) and (15) to original projectors $P$ and $Q$ (see appendix D) and obtain the main formulas:

Equation (16)

Equation (17)

For $[P,Q]=0$, $g(P,Q)={PQ}$, and we revert to $\underline{\omega }(P,Q)=\bar{\omega }(P,Q)={PQ}$. Note that $[P,{(P-Q)}^{2}]=[Q,{(P-Q)}^{2}]=0$.

7. Physical meaning of upper and lower probability operators

When looking for a joint probability defined over two projectors $P$ and $Q$ one wonders whether it is just not some (operator) mean of $P$ and $Q$. For ordinary numbers $a\geqslant 0$ and $b\geqslant 0$ there are three means: arithmetic $\frac{a+b}{2}$, geometric $\sqrt{{ab}}$ and harmonic $\frac{2{ab}}{a+b}$. Now (16) is precisely the operator harmonic mean of $P$ and $Q$ [57]

Equation (18)

where ${A}^{-}$ is the inverse of A if it exists, otherwise it is the pseudo-inverse; see appendix E for various representations of $\underline{\omega }(P,Q)$ and $\bar{\omega }(P,Q)$. More familiar formula is

Equation (19)

The intersection projector $g(P,Q)$ appears in [3943]. It was stressed that $g(P,Q)$ cannot be a joint probability for non-commutative $P$ and $Q$ [21]. Its meaning is clear by now: it is the lower probability for $P$ and $Q$. Note that

Equation (20)

since two different rays ($P$ and $Q$) cross only at zero. Thus, $g(P,Q)$ is non-zero for $[P,Q]\ne 0$, only if $\mathrm{tr}(P)\geqslant 2$ (or $\mathrm{tr}(Q)\geqslant 2$). I consider this as a natural features of the quantum lower probability, because the classical case—where the lower probability is expected to be non-zero and close to the upper probability—can be generically reached due to the coarse-graining, i.e. due to $\mathrm{tr}(P)$ (or $\mathrm{tr}(Q)$, or both) being sufficiently larger than 1.

Let us now turn to $\bar{\omega }$. The transition probability between two pure states is determined by the squared cosine of the angle between them: $| \langle \psi | \phi \rangle {| }^{2}={\mathrm{cos}}^{2}{\theta }_{\phi \psi }$. Equation (14) shows that $\bar{\omega }(P,Q)$ depends on ${C}^{2}={\mathrm{cos}}^{2}\;T$, where T is the operator angle between $\hat{P}$ and $\hat{Q}$. Note from (11), (12) that the eigenvalues λ of ${PQ}$, which hold $0\lt \lambda \lt 1$ are the eigenvalues of C2, and $| $ as seen from (14)—they are also (doubly-degenerate) eigenvalues of $\bar{\omega }(P,Q)$. Thus we have a physical interpretation not only for $\mathrm{tr}({PQ})$ (transition probability), but also for eigenvalues of ${PQ}$ (${PQ}$ and ${QP}$ have the same eigenvalues).

Equations (10), (14) and (15) imply that the upper and lower probability operators can be measured simultaneously on any state (cf (7)):

Equation (21)

The operator $\bar{\omega }(P,Q)-\underline{\omega }(P,Q)$ quantifies the uncertainty for joint probability, the physical meaning of this characteristics of non-commutativity is new.

Appendix G calculates the upper and lower probabilities for several examples.

Note that the conditional (upper and lower) probabilities are straighforward to define, e.g. (cf (2)): $\bar{p}(\rho ;P| Q)=\bar{p}(\rho ;P,Q)/\mathrm{tr}(\rho Q)$.

The distance between two probability intervals $[\underline{p},\bar{p}]$ and $[\underline{p}^{\prime} ,\bar{p}^{\prime} ]$ can be calculated via the Haussdorff metric [56]

Equation (22)

which nullifies if and only if $\underline{p}=\underline{p}^{\prime} $ and $\bar{p}=\bar{p}^{\prime} $, and which reduces to the ordinary distance $| p-p^{\prime} | $ for usual (precise) probabilities.

Let us see when we can use the notion of 'surely more probable'. Now

Equation (23)

means that the pair of projectors $({P}_{1},{Q}_{1})$ is surely more probable (on ρ) than $(P,Q)$; see appendix G for examples. Note from (16), (17) that if

Equation (24)

holds for $\omega =\underline{\omega }$, then it also hods for $\omega =\bar{\omega }$ (and vice versa). Though in a weaker sense than (23), (24) means that $P$ and $Q$ together is more probable than neither of them together (which is the pair $(I-P,I-Q)$). equations (23), (24) are examples of comparative (modal) probability statements; see [36] in this context.

Further features of $\underline{\omega }$ and $\bar{\omega }$ are uncovered when looking at a monotonic change of their arguments; see appendix F. Appendix G discusses concrete examples that illustrate these features. Yet another example is provided below.

8. Coordinate and momentum

Coordinate x and momentum p operators, $[x,p]={\rm{i}}$ (${\hslash }=1$) is the most known example of non-commutativity in quantum mechanics. Hence I shall illustrate the upper and lower probability for this example. In the (one-dimensional) x-representation, x-operator amounts to multiplication, while $p=-{\rm{i}}\displaystyle \frac{{\rm{d}}}{{\rm{d}}x}$. For intervals $x\in X=({X}_{1},{X}_{2})$ and $p\in Y=({Y}_{1},{Y}_{2})$ the corresponding projectors read in the coordinate representation

Equation (25)

where ${\chi }_{X}(x)=\theta (x-{X}_{1})\theta ({X}_{2}-x)$ is the characteristic function of interval X. Recall that QY and PY are linked via the Fourier transform:

Equation (26)

The first thing to note is that if X and Y are finite intervals, then $g({Q}_{X},{P}_{Y})=0$, i.e. ${P}_{Y}\phi =\phi $ and ${Q}_{X}\phi =\phi $ lead to $\phi =0$. This is a well-known result in the Fourier analysis; see, e.g. [6163]. The simplest way to show it is to note (from (26)) that ${Q}_{Y}{{\mathcal{F}}}^{-}\phi $ has a finite support, hence ${{\mathcal{F}}}^{+}{Q}_{Y}{{\mathcal{F}}}^{-}\phi =\phi $ is analytic. On the other hand, ϕ should have a finite support. Thus $\phi =0$. This argument extends to the case, where (say) Y is semi-infinite, e.g. $Y=({Y}_{1},\infty )$, while X differs from $(-\infty ,\infty )$ by a finite (or semi-infinite) interval [63]. Indeed, now $[{P}_{Y}\phi ](x)=\frac{1}{\sqrt{2\pi }}\int {\rm{d}}\xi \;{{\rm{e}}}^{{\rm{i}}x\xi }[{Q}_{Y}{{\mathcal{F}}}^{-}\phi ](\xi )$ is analytic for $\mathrm{Im}\;x\gt 0$, while from ${Q}_{X}\phi =\phi $ it follows that $\phi (x)$ is zero in a finite interval at least.

Thus, for finite (or at least one semi-infinite) intervals X and Y the lower probability for the joint distribution of the coordinate and momentum is zero (cf (16) )

Equation (27)

However, if both X and Y are finite intervals, $g(I-{Q}_{X},I-{P}_{Y})\ne 0$, e.g. the above analiticity argument does not work. Moreover, $g(I-{Q}_{X},I-{P}_{Y})$ has a discrete spectrum, and its range is infinite-dimensional [61, 62]. We shall avoid this complication by looking at those case, where (at least) one of X and Y is semi-infinite. Then (27) still holds, while the upper probability operator $\bar{\omega }({Q}_{X},{P}_{Y})$ reduces to (cf (17))

Equation (28)

and is straightforward to calculate via (25). Several examples of the upper probability calculated from (28) are presented in figure 1.

Figure 1.

Figure 1. The upper probability $\langle {\phi }_{1}| \bar{\omega }({Q}_{X},{P}_{Y})| {\phi }_{1}\rangle $, where $\bar{\omega }({Q}_{X},{P}_{Y})$ is given by (28) with $X=({X}_{1},\infty )$ and $Y=[-0.5,0.5]$ (red curve), $Y=[-1,1]$ (blue curve), $Y=[-1.5,1.5]$ (green curve), $Y=[-2,2]$ (black curve). Here ${\phi }_{1}(x)={(4/\pi )}^{1/4}x{{\rm{e}}}^{-{x}^{2}/2}$ is the wave-function of the first excited level for the harmonic oscillator with Hamiltonian $H=({p}^{2}+{x}^{2})/2$. This example of ${\phi }_{1}$ is chosen, because the Wigner function for the excited states of the harmonic oscillator is negative [8] and thus cannot serve for probabilistic reasoning. Note that for small values of X1 (left plateau on the figure), $\langle {\phi }_{1}| \bar{\omega }({Q}_{X},{P}_{Y})| {\phi }_{1}\rangle $ tends to $\langle {\phi }_{1}| {P}_{Y}| {\phi }_{1}\rangle $, while for large values of X1 it tends to $1-\langle {\phi }_{1}| {P}_{Y}| {\phi }_{1}\rangle $. For the eigen-functions of $H=({p}^{2}+{x}^{2})/2$, the (marginal) distributions of the coordinate and momentum are equal, e.g. $\langle {\phi }_{1}| {P}_{X}| {\phi }_{1}\rangle =\langle {\phi }_{1}| {Q}_{X}| {\phi }_{1}\rangle $.

Standard image High-resolution image

9. Summary and open problems

The main message of this work is that while joint precise probability for non-commuting observables does not exist, there are well-defined operator expressions for upper and lower imprecise probabilities. They are not additive, but otherwise they do satisfy a number of reasonable conditions: positivity, reproduction of correct marginals, direct observability via quantum averages, consistency with the (effectively) commuting case, where the joint probability is well-defined etc.

Several open questions are suggested by this research. First of all, it is not clear what is the suitable way of defining averages over the imprecise probability. This would be necessary for defining various correlation functions. Recall that the average $\langle X\rangle $ of a random variable X that has a precise probability is defined via two conditions (see e.g. [60]): linearity, $\langle {aX}+{bY}\rangle =a\langle X\rangle +b\langle Y\rangle $, and monotonicity, $X\leqslant Y$ implies $\langle X\rangle \leqslant \langle Y\rangle $. Presumably, these conditions are to be modified for imprecise probability; in addition the sought average should reduce to the usual one when averaging over a single observable. This question should be clarified before the imprecise probability can be efficiently applied in quantum statistical mechanics.

Another open issue relates to the point (see (6)) that whenever the joint probability $\mathrm{tr}(\rho {PQ})$ for non-commuting projectors $P$ and $Q$ is well defined due to e.g. $[\rho ,P]=0$ (see the discussion after (8)), the upper and lower probabilities $\bar{p}(\rho ;P,Q)$ and $\underline{p}(\rho ;P,Q)$ are merely consistent with the exact probability $\mathrm{tr}(\rho {PQ})$, but are not equal to it, which would be a more desired outcome. It is thus not completely clear whether the found imprecise probabilities cannot be made more precise by looking at more general conditions (axioms), e.g. those that involve a nonlinear dependence on the density matrix ρ; cf (2). Such a dependence might however impede the direct observability of imprecise probabilities; the resulting issues need further investigations.

In a more remote perspective, one can ask about the joint imprecise probability of 3 (and more) non-commuting observables. In contrast to the previous two open problems, where the progress looks to be feasible, this is a difficult problem, because no analogue of the CS-representation for 3 (or more) non-commuting projectors seems to exists; see however [64].

Acknowledgments

I thank KV Hovhannisyan for discussions. I was supported by COST network MP1209.

Appendices

All 6 appendices can be read independently from each other.

Appendices A–C recall, respectively, the no-go statements for the joint quantum probability, generalized axiomatics for the imprecise probability and the CS-representation. This material is not new, but is presented in a focused form, adapted from several different sources.

Appendix D contains the derivation of the main result, while appendices E and F demonstrate various feature of quantum imprecise probability.

Appendix G illustrates it with simple physical examples.

Appendix A.: Non-existence of (precise) joint probability for non-commuting observables

A.1. The basic argument

Given two sets of non-commuting Hermitean projectors:

Equation (A1)

Equation (A2)

we are looking for non-negative operators ${\Pi }_{{ik}}\geqslant 0$ such that for an arbitrary density matrix ρ

Equation (A3)

These relations imply

Equation (A4)

Now the second (third) relation in (A4) implies $\mathrm{ran}({\Pi }_{{ik}})\subseteq \mathrm{ran}({Q}_{i})$ ($\mathrm{ran}({\Pi }_{{ik}})\subseteq \mathrm{ran}({P}_{k})$). Hence $\mathrm{ran}({\Pi }_{{ik}})\subseteq \mathrm{ran}({Q}_{i})\cap \mathrm{ran}({P}_{k})$.

Thus, if $\mathrm{ran}({Q}_{i})\cap \mathrm{ran}({P}_{k})=0$ (e.g. when Pk and Qi are one-dimensional (1D)), then ${\Pi }_{{ik}}=0$, which means that the sought joint probability does not exist.

If $\mathrm{ran}({Q}_{i})\cap \mathrm{ran}({P}_{k})\ne 0$, then the largest ${\Pi }_{{ik}}$ that holds the second and third relation in (A4) is the projection $g({P}_{k},{Q}_{i})$ on $\mathrm{ran}({Q}_{i})\cap \mathrm{ran}({P}_{k})=0$. However, the first relation in (A4) is still impossible to satisfy (for $[{P}_{i},{Q}_{k}]\ne 0$), as seen from the superadditivity feature (F1):

Equation (A5)

A.2. Two-time probability (as a candidate for the joint probability)

Given (A1), (A2), we can carry out two successive measurements. First (second) we measure a quantity, whose eigen-projections are $\{{P}_{k}\}$ ($\{{Q}_{i}\}$). This results to the following joint probability for the measurement results (ρ is the density matrix)

Equation (A6)

Likewise, if we first measure $\{{Q}_{i}\}$ and then $\{{P}_{k}\}$, we obtain a quantity that generally differs from (A6):

Equation (A7)

If we attempt to consider (A7) (or (A6)) as a joint additive probability for Pi and Qk, we note that (A7) (and likewise (A6)) reproduces correctly only one marginal:

Equation (A8)

One can attempt to interpret the mean of (A6), (A7)

Equation (A9)

as a non-additive probability. This object is linear over ρ, symmetric (with respect to interchanging Pk and Qi), non-negative, and reduces to the additive joint probability for $[{P}_{k},{Q}_{i}]=0$. The relation $\mu (\rho ;{P}_{k},I)=\mathrm{tr}(\rho {P}_{k})$ can be interpreted as consistency with the correct marginals (once $\mu (\rho ;{P}_{k},{Q}_{i})$ is regarded as a non-additive probability, there is no point in insisting that the marginals are obtained in the additive way).

However, the additive joint probability $\mathrm{tr}(\rho {P}_{k}{Q}_{i})$ is well-defined also for $[\rho ,{P}_{k}]=0$ (or for $[\rho ,{Q}_{i}]=0$). If $[\rho ,{P}_{k}]=0$ holds, $\mu (\rho ;{P}_{k},{Q}_{i})$ is not consistent with $\mathrm{tr}(\rho {P}_{k}{Q}_{i})$, i.e. depending on ρ, Pk and Qi both

Equation (A10)

are possible.

To summarize, the two-time measurement results do not qualify as the additive joint probability, first because they are not unique (two different expressions (A6) and (A7) are possible), and second because they do not reproduce the correct marginals. If we take the mean of two expressions (A6) and (A7) and attempt to interpret it as a non-additive probability, it is not compatible with the joint probability, whenever the latter is well-defined.

Appendix B.: Axioms for classical imprecise probability

B.1. Generalized Kolmogorov's axioms

Given the full set of events Ω, $\bar{p}(.)$ and $\underline{p}(.)$ defined over sub-sets $A,B,\ldots $ of Ω (including the empty set $\{0\}$) satisfy [5254]:

Equation (B1)

Equation (B2)

Equation (B3)

Equation (B4)

Equation (B5)

where $\Omega -A$ includes all elements of Ω that are not in A, and where $A\cap B$ means intersection of two sets; $A\cap B=\{0\}$ holds for elementary events.

Here are some direct implications of (B1)–(B5)

Equation (B6)

Equation (B7)

Equation (B7) follows directly from (B4). Equation (B6) follows from (B4), (B3). Next relation:

Equation (B8)

which, in particular, implies

Equation (B9)

To derive (B8), note that (B4), (B3) imply $\bar{p}(\Omega -A-B)\leqslant \bar{p}(\Omega -A)-\underline{p}(B)$ or $\bar{p}(\Omega -A)\geqslant \underline{p}(B)+\bar{p}(\Omega -A-B)$, which is the first inequality in (B8). The second inequality is derived via (B5), (B3).

The following inequality generalizes the known relation of the additive probability theory

Equation (B10)

To prove (B10), we denote $A^{\prime} =A-A\cap B$, which means $A^{\prime} \cap B=\{0\}$. Now

Equation (B11)

Equation (B12)

where in (B11) (resp. in (B12)) we applied the first (resp. the second) inequality in (B8).

Note that the (non-negative) difference $\Delta p(A)=\bar{p}(A)-\underline{p}(A)$ between the upper and lower probabilities also holds the super-additivity feature (cd (B5))

Equation (B13)

Employing (B8) one can derive [58] for arbitrary A1 and A2:

Equation (B14)

Equation (B15)

Equation (B16)

B.2. Joint probability

The joint probabilities of A and B are now defined as

Equation (B17)

Employing the distributivity feature

Equation (B18)

which holds for any triple ${A}_{1},{A}_{2},{A}_{3}$, we obtain from (B4), (B5) for $B\cap C=\{0\}$

Equation (B19)

Equation (B20)

B.3. Dominated upper and lower probability

The origin of (B1)–(B5) can be related to the simplest scheme of hidden variable(s) [52]. One imagines that there exists a precise probability ${P}_{\theta }(A)$, where the parameter θ is not known. Only the extremal values over the parameter are known:

Equation (B21)

which satisfy (B1)–(B5).

However, it is generally not true that (B1)–(B5) imply the existence of a precise probability ${P}_{\theta }(A)$ that holds (B21) [53].

Appendix C.: Derivation of the CS-representation

C.1. The main theorem

Let ${{\mathbb{Q}}}^{\prime }$ and ${{\mathbb{P}}}^{\prime }$ are two subspaces of Hilbert space ${\mathbb{H}}^{\prime} $ that hold (${{{\mathbb{Q}}}^{\prime }}^{\perp }$ is the orthogonal complement of ${{\mathbb{Q}}}^{\prime }$)

Equation (C1)

The simplest example realizing (C1) is when ${{\mathbb{Q}}}^{\prime }$ and ${{\mathbb{P}}}^{\prime }$ are 1D subspaces of a two-dimensional (2D) ${\mathbb{H}}^{\prime} $.

Let $\hat{Q}^{\prime} $ and $\hat{P}^{\prime} $ be projectors onto ${{\mathbb{Q}}}^{\prime }$ and ${{\mathbb{P}}}^{\prime }$ respectively. Now $I-\hat{Q}^{\prime} $ is the projector of ${{{\mathbb{P}}}^{\prime }}^{\perp }$, and let $g(\hat{P}^{\prime} ,\hat{Q}^{\prime} )$ be the projector ${{\mathbb{Q}}}^{\prime }\cap {{\mathbb{P}}}^{\prime }$. Employing the known formulas (see e.g. [47])

Equation (C2)

we get from (C1)

Equation (C3)

which means that $\mathrm{dim}\;{\mathbb{H}}^{\prime} $ should be even for (C1) to hold9 .

Here is the statement of the CS-representation [46]: after a unitary transformation $\hat{Q}^{\prime} $ and $\hat{P}^{\prime} $ can be presented as

Equation (C4)

where all blocks in (C4) have the same dimension m.

To prove (C4), note that $\hat{Q}^{\prime} $ and $\hat{P}^{\prime} $ can be written as (cf (C3))

Equation (C5)

Next, let us show that

Equation (C6)

Since $\hat{Q}^{\prime} \hat{P}^{\prime} (I-\hat{Q}^{\prime} )=\left(\begin{array}{cc}{0}_{m} & B\\ {0}_{m} & {0}_{m}\end{array}\right)$, we need to show that for any $| \psi \rangle \in {{{\mathbb{Q}}}^{\prime }}^{\perp }$, $\hat{Q}^{\prime} \hat{P}^{\prime} | \psi \rangle =0$ means $| \psi \rangle =0$. Indeed, we have $\hat{P}^{\prime} | \psi \rangle \in {{{\mathbb{Q}}}^{\prime }}^{\perp }$, which together with ${{{\mathbb{Q}}}^{\prime }}^{\perp }\cap {{\mathbb{P}}}^{\prime }=0$ (see (C1)) leads to $| \psi \rangle =0$. Equation (C6) implies that there is the well-defined polar decomposition ($\hat{B}$ is Hermitean, while V is unitary)

Equation (C7)

We transform as

Equation (C8)

where $K={V}^{\dagger }V^{\prime} U$. We shall now employ the fact that the last matrix in (C8) is a projector:

Equation (C9)

The first and second relations in (C9) show that $[K,\hat{B}]=[L,\hat{B}]=0$. Then the third relations produces $\hat{B}(K+L-1)=0$. Since $\hat{B}\gt 0$ (due to $\mathrm{ker}(B)=0$), we conclude that $K+L=1$. The rest is obvious.

C.2. Joint commutant for two projectors

Given (C4), we want to find matrices that commute both with $\hat{P}^{\prime} $ and $\hat{Q}^{\prime} $ [46]. Matrices that commute with $\hat{Q}^{\prime} $ read

Equation (C10)

Employing (C4), we get that (C10) commutes with $\hat{P}^{\prime} $ if

Equation (C11)

Equation (C12)

Equation (C13)

Since C and S are invertible, (C11), (C12) imply that $[X,S]=[X,C]=[Y,S]=[Y,C]=0$. And then (C13) implies that X = Y. Hence

Equation (C14)

C.3. General form of the CS representation

The above derivation of (C4) assumed conditions (C1). More generally, the Hilbert space ${\mathbb{H}}$ can be represented as a direct sum [4547]

Equation (C15)

where the sub-space ${{\mathbb{H}}}_{\alpha \beta }$ of dimension ${m}_{\alpha \beta }$ is formed by common eigenvectors of $P$ and $Q$ having eigenvalue α (for $P$) and β (for Q). Depending on $P$ and $Q$ every sub-space can be absent; all of them can be present only for $\mathrm{dim}{\mathbb{H}}\geqslant 6$. Now ${{\mathbb{H}}}_{11}=\mathrm{ran}(P)\cap \mathrm{ran}(Q)$. ${\mathbb{H}}^{\prime} $ has even dimension $2m$ [46, 47], this is the only sub-space that is not formed by common eigenvectors of $P$ and $Q$.

After a unitary transformation

Equation (C16)

$\hat{P}$ and $\hat{Q}$ get the following block-diagonal form that is related to (C15) [46] and that generalizes (C4):

Equation (C17)

Equation (C18)

where C and S are invertible square matrices of the same size holding

Equation (C19)

${\mathbb{H}}^{\prime} $ refers to P' and Q'. If $\hat{P}$ and $\hat{Q}$ do not have any common eigenvector, $\hat{P}=P^{\prime} $ and $\hat{Q}=Q^{\prime} $.

Appendix D.: Derivation of the main result (equations (16), (17) of the main text)

We start with representation (C17), (C18) and axioms (3)–(7) of the main text. These axioms hold for $\hat{P}$, $\hat{Q}$ and $\hat{\rho }=U\rho {U}^{\dagger }$ (see (C16)) instead of $P$, $Q$ and ρ, because $\omega (\hat{P},\hat{Q})={U}^{\dagger }\omega (P,Q)U$ for $\omega =\underline{\omega },\bar{\omega }$ (recall that $\underline{\omega }(\hat{P},\hat{Q})$ and $\bar{\omega }(\hat{P},\hat{Q})$ are Taylor expandable). Hence we now search for $\underline{\omega }(\hat{P},\hat{Q})$ and $\bar{\omega }(\hat{P},\hat{Q})$.

The block-diagonal form (C17), (C15) remains intact under addition and multiplication of $\hat{P}$ and $\hat{Q}$. Hence $\underline{\omega }(\hat{P},\hat{Q})$ and $\bar{\omega }(\hat{P},\hat{Q})$ have the block-diagonal form similar to (C17), where the diagonal blocks are to be determined. Let now ${\Pi }_{\alpha \beta }$ be the projector on ${{\mathbb{H}}}_{\alpha \beta }$. We get [$\alpha ,\beta =0,1$]

Equation (D1)

Hence condition (5) of the main text implies [for $\omega =\underline{\omega },\bar{\omega }$ and ${\omega }^{\prime }=\underline{\omega }^{\prime} ,\bar{\omega }^{\prime} $]

Equation (D2)

Aiming to apply (6) of the main text, we write down (C17) explicitly as

Equation (D3)

The most general density matrix $\hat{\rho }$ that commutes with $\hat{Q}$ reads (in the same block-diagonal form)

Equation (D4)

Now $\hat{\rho }\hat{Q}=\hat{Q}\hat{\rho }$ is seen from the fact that after permutations of rows and columns, $\hat{Q}$ and $\hat{\rho }$ become ${I}_{m+{m}_{11}+{m}_{10}}\oplus {0}_{m+{m}_{01}+{m}_{00}}$ and $a\oplus b$, respectively. Note that ${a}_{{ii}}\geqslant 0$ and ${b}_{{ii}}\geqslant 0$.

Equations (D2)–(D4) imply

Equation (D5)

Equation (D6)

Equation (D7)

Condition (7) (of the main text) and (C14) imply

Equation (D8)

Recall condition (6) of the main text. It amounts to (D5) $\geqslant $ (D6) that should hold for arbitrary aik and bik. Hence we deduce: ${\underline{\omega }}_{22}^{\prime }=0$ and hence ${\underline{\omega }}_{11}^{\prime }={\underline{\omega }}_{12}^{\prime }=0$. Likewise, (D7) $\geqslant $ (D5) leads to ${\bar{\omega }}_{11}^{\prime }={\bar{\omega }}_{22}^{\prime }={C}^{2}$, ${\bar{\omega }}_{12}^{\prime }=0$; recall that we want the smallest upper probability. Now (4) (of the main text) holds, since

Equation (D9)

Thus (cf (C15))

Equation (D10)

Equation (D11)

Now $g(\hat{P},\hat{Q})=g(\hat{Q},\hat{P})$ is the projector onto $\mathrm{ran}(\hat{P})\cap \mathrm{ran}(\hat{Q})$. To return from (D10), (D11) to original projectors $P$ and Q, we note via (C17), (C18) (recall that ${\Pi }_{\alpha \beta }$ is the projector onto ${{\mathbb{H}}}_{\alpha \beta }$):

Equation (D12)

Equation (D13)

Equation (D14)

Equation (D15)

Equation (D16)

We act back by U, e.g. $g(\hat{P},\hat{Q})={U}^{\dagger }g(P,Q)U$, and get finally

Equation (D17)

Equation (D18)

Appendix E.: Various representations of upper and lower probability operators

E.1. Representations for the upper probability operator

Let us turn into a more detailed investigation of (D18). Note from (C17), (C18) and (D12) that $I-g(I-P,I-Q)$ is the projector to $\mathrm{ran}(P)+\mathrm{ran}(Q)$, where $\mathrm{ran}(P)+\mathrm{ran}(Q)$ is the vector sum of two sub-spaces. Note the following representation [48]:

Equation (E1)

where ${A}^{-}$ is the pseudo-inverse of Hermitean A, i.e. if $A=V(a\oplus 0){V}^{\dagger }$ (where V is unitary: ${{VV}}^{\dagger }=I$), then ${A}^{-}=V({a}^{-1}\oplus 0){V}^{\dagger }$.

The third equality in (E1) is the obvious feature of the pseudo-inverse. The first equality in (E1) follows from the fact that ${(P+Q)}^{-}(P+Q)$ is the projector on $\mathrm{ran}(P+Q)$ and the known relation [48]:

Equation (E2)

Employing ${(P-Q)}^{2}=I-{(I-P-Q)}^{2}$, $\bar{\omega }(P,Q)$ can be presented as a function of $P+Q$ (cf (E1)):

Equation (E3)

Note another representation for the projector to $\mathrm{ran}(P)+\mathrm{ran}(Q)$ [40]

Equation (E4)

where $I-g(I-P,I-Q)$ equals to the minimal Hermitean operator A that holds two conditions after $| $.

E.2. Representations for the lower probability operator

Let us first show that the projector $g(P,Q)$ into $\mathrm{ran}(P)\cap \mathrm{ran}(Q)$ holds [57]

Equation (E5)

where ${A}^{-}$ is the pseudo-inverse of A (cf (E1)).

The last equality in (E5) follows from the fact that $(Q+P){(P+Q)}^{-}$ is the projector to $\mathrm{ran}(P)+\mathrm{ran}(Q)$ (see (E1), E2)), which then leads to $P(Q+P){(P+Q)}^{-}=(Q+P){(P+Q)}^{-}\;P$.

The first equality in (E5) is shown as follows. Let $| \psi \rangle \in \mathrm{ran}(P)\cap \mathrm{ran}(Q)$. Then using (E5):

Thus, $\mathrm{ran}(2P{(P+Q)}^{-}Q)\supseteq (\;\mathrm{ran}(P)\cap \mathrm{ran}(Q))$. On the other hand, $\mathrm{ran}(2P{(P+Q)}^{-}Q)\subseteq \mathrm{ran}(P)$ and $\mathrm{ran}(2P{(P+Q)}^{-}Q)\subseteq \mathrm{ran}(Q)$, where the first relation follows from the implication: if $| \psi \rangle \not\in{\in }\mathrm{ran}(P)$, then $| \psi \rangle \not\in{\in }\mathrm{ran}(2P{(P+Q)}^{-}Q)$.

There are two other (more familiar) representations of $g(P,Q)$ (see e.g. [40, 48]):

Equation (E6)

Equation (E7)

Equation (E6) can be interpreted as a result of (infinitely many) successive measurements of $P$ and $Q$. Equation (E7) should be compared to (E4).

Yet another representation is useful in calculations, since it explicitly involves a 2 × 2 block-diagonal representation [57]:

Equation (E8)

Equation (E9)

where ${P}_{11}$, ${P}_{12}$, ${P}_{21}$ and ${P}_{22}$ are, respectively, ${n}_{1}\times {n}_{1}$, ${n}_{1}\times {n}_{2}$, ${n}_{2}\times {n}_{1}$, ${n}_{2}\times {n}_{2}$ matrices.

E.3. Direct relation between the eigenvalues of $P-Q$ and ${PQ}$

We can now prove directly (i.e. without employing the CS representation) that there is a direct relation between the eigenvalues of $\bar{\omega }(P,Q)$ and ${PQ}$. Let $| x\rangle $ be the eigenvector of Hermitean operator $P-Q$:

Equation (E10)

where $-1\leqslant \lambda \leqslant 1$ is the eigenvalue. Multiplying both sides of (E10) by $P$ (by Q) and using ${P}^{2}=P$ (${Q}^{2}=Q$) we get

Equation (E11)

which then implies

Equation (E12)

Thus $P| x\rangle $ ($Q| x\rangle $) is an eigenvector of ${PQ}$ (${QP}$) with eigenvalue $1-{\lambda }^{2}$.

As seen from (E11), the 2D linear space $\mathrm{Span}(P| x\rangle ,Q| x\rangle )$ formed by all superpositions of $P| x\rangle $ and $Q| x\rangle $ remains invariant under action of both $\hat{P}$ and $\hat{Q}$. Together with $\mathrm{tr}(P-Q)=0$ this means that if (E10) holds, then $P-Q$ has eigenvalue $-\lambda $ with the eigen-vector living in $\mathrm{Span}(P| x\rangle ,Q| x\rangle )$.

Further details on the relation between ${PQ}$ and $P-Q$ can be looked up in [59].

Appendix F.: Additivity and monotonicity

We discuss here the behavior of $\underline{\omega }(P,Q)$ and $\bar{\omega }(P,Q)$ (given by (D17), (D18)) with respect to a monotonic change of their arguments. For two projectors $Q^{\prime} $ and Q, $Q^{\prime} \geqslant Q$ means $Q^{\prime} =Q+K$, where ${K}^{2}=K$ and ${QK}=0$. Now (D17), (D18) and (E7) imply that $\underline{\omega }(P,Q)$ is operator superadditive

Equation (F1)

Likewise, $\bar{\omega }(P,Q)$ is operator subadditive, but under an additional condition:

Equation (F2)

They are the analogues of classical features (B4) and (B5), respectively. Note that (F1) and (B4) are valid under the same conditions, since ${QK}=0$ is the analogue of $A\cap B=\{0\}$. In that sense the correspondence between (F2) and (B5) is more limited, since $Q+K=I$ is more restrictive than ${QK}=0$.

We focus on deriving (F1), since (F2) is derived in the same way. Note from (E7) that $g(P,Q)\leqslant Q$ and $g(P,K)\leqslant K$ imply $g(P,Q)+g(P,K)\leqslant Q+K$. Since ${QK}=0$, $g(P,Q)+g(P,K)\leqslant P$. Using (E7) for $g(P,Q+K)$ we obtain (F1).

Note as well that both $\bar{\omega }(P,Q)$ and $\underline{\omega }(P,Q)$ are monotonous [$\omega =\underline{\omega },\bar{\omega }$]:

Equation (F3)

Equation (F3) for $\omega =\underline{\omega }$ follows from (F1). For $\omega =\bar{\omega }$ it is deduced as follows (cf (E4), (E7)):

Equation (F4)

Let us now discuss whether (F2) can hold under the same condition ${QK}=0$ as (F1). Now

Equation (F5)

amounts to

Equation (F6)

First of all note that for $Q+K=1$ and ${QK}=0$ we get $(I-Q)(I-K)=0$ and (F6) does hold for the same reason as (F1).

For ${QK}=0$, equation (F6) is invalid in 3D space (as well as for larger dimensional Hilbert spaces). Indeed, let us assume that $Q$ and $K$ are 1D:

Equation (F7)

Given $I-P$ as

Equation (F8)

we get (cf (E9))

where ${a}_{33}^{-}$ is the pseudo-inverse of a33.

Likewise

Now $g(I-P,I-Q-K)=0$, since $I-Q-K$ is a 1d projector. We can now establish that generically

Equation (F9)

(let alone (F6)), because the difference has both positive and negative eigenvalues.

The message (F9) is that the function $I-g(I-P,I-Q)$ is not sub-additive.

Now consider (F5), (F6), but under additional condition that ${PK}=0$. Now (F6) amounts to

Equation (F10)

which holds as equality since $\mathrm{ran}(K)\subseteq \mathrm{ran}({P}^{\perp })\cap \mathrm{ran}({Q}^{\perp })$.

Appendix G.: Upper and lower probabilities for simple examples

G.1. 2D Hilbert space

It should be clear from (D10), (D11) that in 2D Hilbert space, any lower probability operator $\underline{\omega }(P,Q)$ is zero (since two rays overlap only at zero), while the upper probability operator $\bar{\omega }(P,Q)=\bar{p}(\rho ;P,Q)$ just reduces to the transition probability (i.e. to a number) $\mathrm{tr}({PQ})$. Thus for the present case both $\bar{p}$ and $\underline{p}$ do not depend on ρ.

G.2. Spin 1

G.2.1. Projectors

The 3 × 3 matrices for the spin components read

Equation (G1)

Now ${P}_{\pm 1,0}^{a}$ for $a=x,y,z$ are the 1D projectors to the eigenspace with eigenvalues ± 1 or 0 of La:

Equation (G2)

Equation (G3)

Equation (G4)

where the zero components are orthogonal to each other:

Equation (G5)

Other overlaps are simple as well $(\alpha \ne \beta )$

Equation (G6)

Given two projectors P and Q, we defined $g(P,Q)$ as the projector on $\mathrm{ran}(P)\cap \mathrm{ran}(Q)$. For calculating $g(P,Q)$ we employ (E5).

G.2.2. Fine-grained joint probabilities for ${P}_{\pm 1,0}^{z}$ and ${P}_{\pm 1,0}^{x}$

Here are upper probability operators for joint values of ${P}_{\pm 1,0}^{z}$ and ${P}_{\pm 1,0}^{x}$:

Equation (G7)

Equation (G8)

Equation (G9)

Equation (G10)

Equation (G11)

Since ${P}_{\pm 1,0}^{z}$ and ${P}_{\pm 1,0}^{x}$ are 1D projectors, all the lower probability operators nullify. Equation (G7) means that the precise probability of Pz0 and Px0 is zero; cf (G5).

We now get from (G7)–(G11)

Equation (G12)

This matrix is larger than $I$, since its eigenvalues are $\frac{8}{3}$, $\frac{5}{3}$ and $\frac{5}{3}$.

Note from (C17), (C18) that for 3 × 3 matrices $\mathrm{dim}{{\mathcal{H}}}^{\prime }=2$, while $\mathrm{dim}{{\mathcal{H}}}_{11}=1$ (if this sub-space is present at all). Hence the eigenvalues of $\bar{\omega }$ relate to transition probabilities (G6). Indeed, the eigenvalues of matrices in (G8), (G9) (resp. in (G10), (G11)) is $(\frac{1}{4},\frac{1}{4},0)$ (resp. $(\frac{1}{2},\frac{1}{2},0)$). Hence the maximal probability interval $[\frac{1}{4},0]$ that can be generated by (G8), (G9) is smaller than the maximal interval $[\frac{1}{2},0]$ generated by (G10), (G11). As an example, let us take the upper probabilities generated on eigenstates of Ly ($\epsilon ,\eta ,\chi =1,0,-1$ ):

Equation (G13)

G.2.3. Coarse-grained joint probabilities for ${P}_{\pm 1,0}^{z}$ and ${P}_{\pm 1,0}^{x}$

Let us now turn to joint probabilities, where the lower probability is non-zero

Equation (G14)

Equation (G15)

Equation (G16)

Equation (G17)

Equation (G18)

Now $\bar{\omega }-\underline{\omega }$ for (G14), (G15) has eigenvalues $(\frac{1}{4},\frac{1}{4},0)$, while for for (G16), (G17) this matrix has eigenvalues $(\frac{1}{2},\frac{1}{2},0)$ (the last case (G18) refers to the commutative situation). Hence the probabilities for (G16), (G17) are more uncertain.

Next, let us establish whether certain combinations can be (surely) more probable than others. Note that

Equation (G19)

Once there is (one) positive eigenvalue, there is a class of states ρ for which

Equation (G20)

i.e. ${P}^{x}=0\;\mathrm{or}\;-1$ and ${P}^{z}=0\;\mathrm{or}\;1$ is more probable than ${P}^{x}=0\;\mathrm{or}\;1$ and ${P}^{z}=0\;\mathrm{or}\;1$. Note that

Equation (G21)

Such examples can be easily continued, e.g.

Equation (G22)

Footnotes

  • Negative probabilities were not found to admit a direct physical meaning [14] (what can be less possible, than the impossible?). In certain cases what seemed to be a negative probability was later on found to be a local value of a physical quantity, i.e. physically meaningful, but not a probability [14]. Mathematical meaning of negative probability is discussed in [18, 19].

  • One should stress here that the usage of quasi-probabilities in statistical mechanics is frequently implicit, but is nevertheless essential. For instance, the routine introduction of symmetrized correlators of non-commuting variables [17] implies an implicit choice of the underlying Terletsky–Margenau–Hill quasi-probability, because the symmetrized correlators are the 'real' correlators with respect to this quasi-probability. This point is seen in the standard quantum fluctuation–dissipation theorem [17].

  • Employing instead the unbiasedness: the averages of the non-commuting quantities are reproduced correctly [22].

  • Given two projectors $P$ and $Q$ and state $\rho $, this product is $\mathrm{tr}(\rho P)\mathrm{tr}(\rho Q)$, while the correct form for ${PQ}={QP}$ is $\mathrm{tr}(\rho {PQ})$.

  • Ellsberg's paradox is an example in psychology, where the ordinary probability theory does not apply, while imprecise probabilities can be used fruitfully for explaining experimental results on human decision making [50].

  • This 'nothing is known' situation cannot be represented by usual probabilities, the simplest example showing that imprecise probabilities can model types of uncertainty that are not captured by the precise model.

  • The assumption is legitimate in statistics, where one bounds the unknown (additive) probability via a finite number of observations [52]. It is not forbidden in subjective theories, where one aims at quantifying an uncertain human opinion via probabilities [53].

  • Without condition (7), I was not able to fix the upper probability operator, i.e. without (7) there are many operator candidates that are not consistent with each other, i.e. not related by operator analogues of larger or smaller.

  • Equation (C3) can be derived by noting that ${{{\mathbb{Q}}}^{\prime }}^{\perp }\cap {{\mathbb{P}}}^{\prime }=0$ implies $\mathrm{ker}(\hat{Q}^{\prime} \hat{P}^{\prime} )=0$. Indeed, if $Q| p\rangle =0$, where $| p\rangle \in {{\mathbb{P}}}^{\prime }$, then ${{{\mathbb{Q}}}^{\prime }}^{\perp }\cap {{\mathbb{P}}}^{\prime }\ne 0$. Hence $| p\rangle =0$. Let us mention for completeness that $\mathrm{ran}(\hat{Q}^{\prime} \hat{P}^{\prime} )\cap {{{\mathbb{Q}}}^{\prime }}^{\perp }=0$. Indeed, let us assume that $| f\rangle \in {{\mathbb{Q}}}^{\prime }$, $| g\rangle \in {{\mathbb{P}}}^{\prime }$ and $\langle f| \hat{Q}^{\prime} g\rangle =0$. Then $\langle \hat{Q}^{\prime} f| g\rangle =\langle f| g\rangle =0$. The last relation means that either f = 0, or ${{{\mathbb{Q}}}^{\prime }}^{\perp }\cap {{\mathbb{P}}}^{\prime }\ne 0$, which contradicts to (C1).

Please wait… references are loading.
10.1088/1367-2630/17/8/085005