Structure of minimum-error quantum state discrimination

Joonwoo Bae

doi:10.1088/1367-2630/15/7/073037

1. Introduction

In the early days, when quantum systems were first applied to communication between distant parties, in particular photonic systems, which define physical limits on the sources of optical communication, pioneering works investigated how quantum systems could be exploited for information processing and incorporated into the framework of classical information theory [1–3]. One of the most fundamental tasks for such applications is to formalize how classical messages can be retrieved from quantum states. For various purposes, different designs of measurements were then introduced and studied: for instance, measurement settings optimized for accessible information [4], for unambiguous discrimination [5–9], for maximizing the confidence of guesses about certain states [10], or for minimizing the average error when making guesses about given states [1–3]. Recently, there have also been approaches in which errors in discrimination are minimized for a pre-determined rate of inconclusive results [11, 12]. All these are fundamental and practical in quantum information theory and its applications, and are useful tools for investigating the foundational aspects of quantum theory; see also the reviews in [13–17].

Among the different strategies in quantum state discrimination, here we are interested in minimum-error (ME) discrimination. ME discrimination optimizes the measurement so that, once a quantum state is given from a set of known quantum states, one makes the correct guess about the state with minimal error on average. This has a number of applications in quantum information tasks; it is also of fundamental interest since it shows the ultimate limit in the identification of a given state among a specified set of states. What is needed is a general method of finding the optimal measurement for arbitrarily given sets of quantum states, that is, a measurement that achieves the minimal average error in the state discrimination. Despite the efforts devoted so far, however, analytic solutions are known only for restricted classes of quantum states, e.g. cases in which the states possess certain symmetries. The lack of a solution in most cases is partly because ME discrimination has largely been understood as an optimization problem, for which it is generally hard to obtain analytic solutions; few other approaches to ME discrimination are known. The problem is important in its own right; moreover, the lack of a general method for state discrimination has been, and still is, a potentially significant obstacle to further investigation in both quantum information theory and quantum foundations.

To be precise about known results, no general analytic method of solving ME state discrimination has yet been found. The known general method is a numerical approach via semidefinite programming, which returns the solution efficiently, in polynomial time [18–21]. Among the cases in which ME discrimination is known in analytic form, two-state discrimination is the only one where no symmetry is assumed among the given quantum states. The result was shown in 1976 and is called the Helstrom bound [3]. Apart from this, if no symmetry exists among the given states, no analytic solution is known, for instance, in the next simplest case of three arbitrary states, or even in the simpler case of three qubit states. Otherwise, ME discrimination is known for cases in which the given quantum states have certain symmetries, such as the geometrically uniform structure, see [3, 4, 19, 22–27].

Note also that, although general solutions are lacking, the necessary and sufficient conditions that characterize optimal discrimination were obtained when the problem was introduced [1, 2]. That is, parameters giving optimal discrimination satisfy the conditions and conversely any parameters fulfilling the conditions can construct optimal discrimination. They have been used to run a numerical algorithm [18].

The purpose of this work is to show the general structure existing in ME discrimination, namely, the relations among the optimal measurement, the guessing probability and other properties that are useful for finding them analytically. The main idea is to view ME discrimination from different angles and approaches, such as relations of fundamental principles, convex optimization frameworks or their generalization called the complementarity problem, together with the corresponding formalisms. We show that all these can be equivalently summarized into the so-called complementarity problem in the context of convex optimization, see e.g. [28]. The approach amounts to a direct analysis of the optimality conditions. In this way, more parameters are involved than in the originally given problem; the advantage, however, is that the generic structure of the given optimization problem is fully exploited. Applying the structure and the method of the complementarity problem, we provide a geometric formulation of ME discrimination. Thus, once the geometry of the given quantum states is clear, the formulation can be exploited and one can straightforwardly find the solution, i.e. the optimal measurement and the guessing probability. As an example where the state geometry is clear, we illustrate that ME discrimination is completely solved for an arbitrary set of qubit states given with equal prior probabilities. All these provide alternative, analytic methods of solving ME discrimination, apart from numerical optimization.

The rest of the paper is structured as follows. In section 2, we begin with the problem definition and an introduction to ME discrimination. We also provide an operational interpretation of the guessing probability. We obtain optimality conditions for ME discrimination from different approaches and finally put them equivalently into the framework of a complementarity problem. From the optimality conditions, we show the general structure of optimal parameters in ME discrimination, such as the optimal measurement and the guessing probability. In this way, we provide a geometric formulation of ME discrimination. In section 3, based on the general structure, we present the following general properties. We construct equivalence classes of sets of quantum states in terms of ME discrimination, such that for the sets in the same class, ME discrimination is characterized in an equivalent way. We also show, conversely, how a set of states can be generated within an equivalence class; for the generated states, ME discrimination is then already characterized by the equivalence class. We finally present various equivalent forms of the guessing probability, according to the different approaches from which the general structure and the optimality conditions are derived, and provide their operational meanings. In section 4, we apply all these to qubit states, for which the quantum state geometry is clear from the Bloch sphere. We show how the geometric formulation can be applied and illustrate that all problems of ME discrimination of qubit states with equal prior probabilities can be solved analytically by using the state geometry. In section 5, we summarize the results.

2. Minimum-error discrimination and optimality conditions

In this section, let us introduce the problem of ME discrimination of quantum states and fix notations. We then analyze the optimality conditions that completely characterize optimal parameters, such as optimal measurement and the guessing probability. We review optimality conditions derived from different approaches and show that they are equivalent.

2.1. Problem definition and probability-theoretic preliminaries

Suppose that there is a device which generates different quantum states. The device has N buttons, and pressing one of them, say x, generates a state ρ_x. Let q_x denote the probability that button x is pressed; this defines the a priori probability that ρ_x is produced by the device. We summarize the state generation by {q_x,ρ_x}^N_x=1. Once generated, a state is sent to a measurement device, which is prepared to make the best guess about which button has been pressed in the preparation. This corresponds to the task of distinguishing N quantum states via measurement devices.

Let P(x|y) denote the probability that when state ρ_y has been prepared, one concludes from measurement outcomes that state ρ_x is given. Or equivalently, as the conclusion follows from measurement outcomes, it is the probability that an output port x is clicked when a button y is pressed in the preparation. Then, in the ME discrimination, we are interested in minimizing the probability of making errors on average, or equivalently to have maximal probability to make correct guesses about input states, called the guessing probability, ${P}_{\mathrm {guess}} = \max _M \sum _{{x}} q_{{x}} P({x} | {x})$ where the maximization runs over measurement settings M.

Measurement on quantum systems is described by positive-operator-valued measures (POVMs): a set of positive operators {M_x ⩾ 0}^N_x=1 that fulfills $\sum _{{x} = 1}^{N} M_{{x}} = I$ . For given states {q_x,ρ_x}^N_x=1, let {M_x}^N_x=1 denote the POVM elements for quantum state discrimination. When state ρ_y is actually given, the probability that a detection event happens in measurement M_x is given by P(x|y) = tr[ρ_yM_x]. The guessing probability can therefore be written as

$\begin{equation} {P}_{\mathrm{guess}} = \max_M \sum_{{x}=1}^{N} q_{{x}} P({x} | {x}) = \max_{\{ M_{{x}} \}_{{x}=1}^{N}} \sum_{{x} = 1 }^{N} q_{{x}} \mathrm{tr}[M_{{x}}\rho_{{x}}] \end{equation} \tag{ 1 }$

with the maximization over all POVMs. From the postulates of quantum theory, non-orthogonal quantum states cannot be perfectly discriminated, i.e. P_guess < 1.
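As a concrete illustration of equation (1), the following minimal sketch evaluates the success probability ∑_x q_x tr[M_x ρ_x] for qubit states in the Bloch representation and compares a fixed measurement with the two-state optimum. The parametrization ρ_x = (I + v_x·σ)/2 and M_x = w_x(I + n_x·σ), the function names, and the example states (|0⟩ and |+⟩ with equal priors) are assumptions made here for illustration only; the reference value is the Helstrom bound for two states, 1/2 + ‖q_0ρ_0 − q_1ρ_1‖_1/2.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def is_povm(povm, tol=1e-12):
    """Check M_x >= 0 and sum_x M_x = I for M_x = w_x (I + n_x . sigma)."""
    positive = all(w >= -tol and dot(n, n) <= 1 + tol for w, n in povm)
    identity_part = sum(w for w, _ in povm)
    pauli_part = [sum(w * n[i] for w, n in povm) for i in range(3)]
    return (positive and abs(identity_part - 1) < tol
            and all(abs(c) < tol for c in pauli_part))

def success_probability(priors, bloch_vectors, povm):
    """sum_x q_x tr[M_x rho_x], using tr[M_x rho_x] = w_x (1 + n_x . v_x)."""
    return sum(q * w * (1 + dot(n, v))
               for q, v, (w, n) in zip(priors, bloch_vectors, povm))

def helstrom_bound(q0, v0, q1, v1):
    """1/2 + ||q0 rho0 - q1 rho1||_1 / 2 for two qubit states."""
    # q0 rho0 - q1 rho1 = h0 I + h . sigma has eigenvalues h0 +/- |h|.
    h0 = (q0 - q1) / 2
    h = [(q0 * a - q1 * b) / 2 for a, b in zip(v0, v1)]
    norm_h = math.sqrt(dot(h, h))
    trace_norm = abs(h0 + norm_h) + abs(h0 - norm_h)
    return 0.5 + 0.5 * trace_norm

priors = [0.5, 0.5]
states = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]   # Bloch vectors of |0> and |+>

# A projective measurement along z (not optimal for these states).
z_measurement = [(0.5, (0.0, 0.0, 1.0)), (0.5, (0.0, 0.0, -1.0))]
# Projective measurement along the direction of v0 - v1.
s = 1 / math.sqrt(2)
helstrom_measurement = [(0.5, (-s, 0.0, s)), (0.5, (s, 0.0, -s))]

p_z = success_probability(priors, states, z_measurement)
p_opt = success_probability(priors, states, helstrom_measurement)
p_bound = helstrom_bound(priors[0], states[0], priors[1], states[1])
```

In this example the z measurement succeeds with probability 3/4, whereas the measurement along v_0 − v_1 reaches the Helstrom value 1/2 + √2/4, illustrating that the maximization in equation (1) is over the choice of POVM.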

The ME discrimination described in the above can be rephrased in a probability-theoretical way as follows. The preparation applies random variable X, which corresponds to a button pressed, and the measurement shows the other random variable Y for guessing about X with some probabilities. The distinguishability of different probabilistic systems can be, in general, measured by the variational distance of probabilities. Then, the distance from the uniform distribution is particularly useful to describe and interpret the guessing probability. The distance of a probability distribution of random variable P_X from the uniform distribution, which is 1/N in this case, is denoted by d(P_X) or simply d(X) and defined as

$\begin{equation} d(X) = \frac{1}{2} \sum_{{x}=1}^{N} \left | P_{X}( {x} ) - \frac{1}{N} \right|. \end{equation} \tag{ 2 }$

Let d(X|Y ) denote the distance of random variable X given Y from the uniform.

Lemma 2.1 ([29]). The guessing probability about X given Y can be expressed by

$\begin{equation} P_{\mathrm{guess}} := P_{\mathrm{guess}}(X|Y) = \frac{1}{N} + d(X |Y). \end{equation} \tag{ 3 }$

Proof. Note that the guessing probability is $P_{\mathrm {guess}} = \sum _{{y}=1}^{N} P_{Y}({y}) P_{X|Y} ({y}|{y})$ . It is then straightforward to write the distance, as follows:

$\begin{eqnarray*} d(X|Y) & = & \sum_{{y}=1}^{N} P_{Y}({y}) d(X| Y = {y})\\ & = & \frac{1}{2} \sum_{{y}=1}^{N} P_{Y}({y}) \sum_{{x}=1}^{N} \left| P_{X|Y} ({x} | {y}) -\frac{1}{N} \right|. \end{eqnarray*}$

As the probability of making correct guesses is not smaller than the random guess, we can assume that P_X|Y(y|y) ⩾ 1/N for all y = 1,...,N and P_X|Y(x|y) ⩽ 1/N for x ≠ y and this does not lose any generality. Thus it follows that, when Y =y,

$\begin{eqnarray*} \sum_{{x}=1}^{N} \left| P_{X|Y} ({x} | {y}) -\frac{1}{N} \right| &=& P_{X|Y} ({y}|{y}) -\frac{1}{N} - \sum_{{x} \neq{y}} \left(P_{X|Y}({x} | {y}) -\frac{1}{N}\right) \\ &=& 2 P_{X|Y} ({y} | {y}) -\frac{2}{N}. \end{eqnarray*}$

From these two relations, it holds that

$\begin{eqnarray*} &&d(X|Y) = P_{\mathrm{guess}} - \sum_{{y}=1}^{N} P_{Y}({y}) \frac{1}{N} = P_{\mathrm{guess}} - \frac{1}{N} \end{eqnarray*}$

and thus equation (3) is shown. □

This shows that the guessing probability about random variable X given Y corresponds to the distance of the probability P_X|Y from the uniform distribution. As only input and output random variables are taken into account, the expression in equation (3) generally works for any physical system, either quantum or classical, employed from preparation to measurement. Then, the distance d(X|Y ) depends on the physical systems employed between preparation and measurement. Once quantum systems are applied, the distance d(X|Y ) must be expressed in terms of properties or relations of given quantum states. The expression for quantum systems is shown in lemma 3.2.
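Lemma 2.1 can be checked numerically on a small classical example. The sketch below uses a hypothetical three-outcome distribution, chosen so that the assumption made in the proof holds (P_X|Y(y|y) ⩾ 1/N and P_X|Y(x|y) ⩽ 1/N for x ≠ y); the numbers and variable names are illustrative and do not come from the text.

```python
def distance_from_uniform(p):
    """Variational distance of a distribution p from the uniform one, equation (2)."""
    n = len(p)
    return 0.5 * sum(abs(px - 1.0 / n) for px in p)

N = 3
p_y = [0.5, 0.3, 0.2]            # P_Y(y)
# Row y holds the conditional distribution P_{X|Y}(. | y).
p_x_given_y = [
    [0.6, 0.2, 0.2],
    [0.3, 0.4, 0.3],
    [0.1, 0.1, 0.8],
]

# Guessing probability as used in the proof: sum_y P_Y(y) P_{X|Y}(y|y).
p_guess = sum(p_y[y] * p_x_given_y[y][y] for y in range(N))

# Conditional distance d(X|Y) = sum_y P_Y(y) d(X | Y = y).
d_x_given_y = sum(p_y[y] * distance_from_uniform(p_x_given_y[y])
                  for y in range(N))

lemma_rhs = 1.0 / N + d_x_given_y   # right-hand side of equation (3)
```

Here both sides of equation (3) evaluate to 0.58 up to floating-point rounding.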

2.2. Optimality conditions

As is shown in equation (1), optimal discrimination is obtained with POVM elements that maximize the probability of making correct guesses. There have been different approaches to characterize optimal measurement. Here, we review known results and show that they are equivalent to one another.

2.2.1. Optimality conditions from analytic derivation

The necessary and sufficient conditions that POVMs must satisfy to fulfill optimal discrimination have been obtained from the very beginning when the problem was introduced in [1, 2]. For given states {q_x,ρ_x}^N_x=1 to discriminate among, POVM elements {M_x}^N_x=1 achieve the guessing probability if and only if they satisfy

$\begin{equation} M_{{x}} (q_{{x}} \rho_{{x}} - q_{{y}} \rho_{{y}}) M_{{y}} = 0 \quad \forall {x}, \;{y}=1,\ldots, N, \end{equation} \tag{ 4 }$

$\begin{equation} \sum_{{x} =1 }^{N} q_{{x}} \rho_{{x} } M_{{x} } - q_{{y}} \rho_{{y}} \geqslant 0 \quad \forall\; {y}=1,\ldots, N. \end{equation} \tag{ 5 }$

That is, for a state discrimination problem, if one finds a POVM that satisfies the two conditions above, then optimal discrimination is immediately obtained. Note also that for a given problem of state discrimination, the optimal measurement is generally not unique.
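Conditions (4) and (5) are straightforward to test for a candidate POVM. As a minimal sketch, the code below checks them for three qubit "trine" states with equal priors and the candidate elements M_x = (2/3)ρ_x; this example, the helper functions and the tolerance are assumptions chosen for illustration, and plain Python lists are used for the real 2×2 matrices to keep the sketch self-contained.

```python
import math

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(a, b):
    return [[a[i][j] + b[i][j] for j in range(2)] for i in range(2)]

def sub(a, b):
    return [[a[i][j] - b[i][j] for j in range(2)] for i in range(2)]

def scale(c, a):
    return [[c * a[i][j] for j in range(2)] for i in range(2)]

def max_abs(a):
    return max(abs(a[i][j]) for i in range(2) for j in range(2))

def min_eigenvalue(a):
    """Smallest eigenvalue of a real symmetric 2x2 matrix."""
    mean = (a[0][0] + a[1][1]) / 2
    return mean - math.hypot((a[0][0] - a[1][1]) / 2, a[0][1])

N = 3
q = [1 / 3] * N
# Pure states cos(t)|0> + sin(t)|1> with t = 0, pi/3, 2*pi/3.
angles = [k * math.pi / 3 for k in range(N)]
rho = [[[math.cos(t) ** 2, math.cos(t) * math.sin(t)],
        [math.cos(t) * math.sin(t), math.sin(t) ** 2]] for t in angles]
M = [scale(2 / 3, r) for r in rho]      # candidate POVM elements

# Completeness: sum_x M_x = I.
total = [[0.0, 0.0], [0.0, 0.0]]
for m in M:
    total = add(total, m)
completeness_error = max_abs(sub(total, [[1.0, 0.0], [0.0, 1.0]]))

# Condition (4): M_x (q_x rho_x - q_y rho_y) M_y = 0 for all x, y.
cond4_residual = max(
    max_abs(matmul(matmul(M[x], sub(scale(q[x], rho[x]), scale(q[y], rho[y]))),
                   M[y]))
    for x in range(N) for y in range(N))

# Condition (5): sum_x q_x rho_x M_x - q_y rho_y >= 0 for all y.
gamma = [[0.0, 0.0], [0.0, 0.0]]
for x in range(N):
    gamma = add(gamma, scale(q[x], matmul(rho[x], M[x])))
cond5_min_eig = min(min_eigenvalue(sub(gamma, scale(q[y], rho[y])))
                    for y in range(N))

p_success = gamma[0][0] + gamma[1][1]   # tr[sum_x q_x rho_x M_x]
```

Both conditions hold up to rounding for this candidate, so by the statement above it is optimal, with success probability 2/3.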

2.2.2. Optimality condition from convex optimization

The optimality conditions can also be derived from a different context, the formalism of convex optimization [21]. The optimization problem in equation (1) with the convex and affine constraints on POVMs can be written as

$\begin{eqnarray} \max& & \sum_{{x} =1}^{N} q_{{x}}\, \mathrm{tr}[M_{{x}} \rho_{{x}}] \nonumber \\ \mathrm{s.t.} ~\mathrm{} & & \sum_{{x} =1 }^{N} M_{{x}} = I\quad \mathrm{and}\quad M_{{x}} \geqslant 0\quad \forall\; {x} =1,\ldots, N. \end{eqnarray} \tag{ 6 }$

For convenience, we call equation (6) the primal problem. Note that the primal problem in the above is feasible.

To derive its dual problem, let us first construct the Lagrangian $\mathcal {L}$ as follows:

$\begin{eqnarray*} \mathcal{L}(\{M_{{x}} \}_{{x}=1}^{N}, \{\sigma_{{x}} \}_{{x}=1}^{N},K) &=& \sum_{{x}=1}^{N} q_{{x}}\, \mathrm{tr}[M_{{x}} \rho_{{x}}] + \sum_{{x}=1}^{N} \mathrm{tr}[\sigma_{{x}} M_{{x}}] + \mathrm{tr}\left[K \left(I - \sum_{{x}=1}^{N} M_{x}\right)\right] \\ &=& \sum_{{x}=1}^{N} \mathrm{tr}[M_{{x}} (q_{{x}} \rho_{{x}} +\sigma_{{x}} -K ) ] + \mathrm{tr}\,{K} , \end{eqnarray*}$

where {σ_x}^N_x=1 are positive-semidefinite Hermitian operators and K is a Hermitian operator. The operators {σ_x}^N_x=1 and K are called dual parameters. To derive the dual problem, we first maximize the Lagrangian over the primal parameters {M_x}^N_x=1. If q_xρ_x + σ_x − K has a positive eigenvalue for some x, the maximization is unbounded, since the Lagrangian may go to +∞. Thus, we require q_xρ_x + σ_x − K ⩽ 0 for all x, where the equality holds if and only if the Lagrangian is maximized. Combined with the other constraint that σ_x ⩾ 0 for all x, the constraint can finally be rewritten as K ⩾ q_xρ_x for all x. This also shows that the operator K must be positive. Since the Lagrangian is maximized, we now have

$\begin{eqnarray*} &&\max_{\{ M_{{x}} \}_{{x}=1}^{N}} \sum_{{x} =1}^{N} q_{{x}}\, \mathrm{tr}[M_{{x}} \rho_{{x}}] \leqslant \max_{\{ M_{{x}} \}_{{x}=1}^{N}} \mathcal{L}(\{M_{{x}} \}_{{x}=1}^{N}, \{\sigma_{{x}} \}_{{x}=1}^{N},K) \vert_{q_{{x}} \rho_{{x}} +\sigma_{{x}} -K \leqslant 0}= \mathrm{tr}\, K, \end{eqnarray*}$

which is called the weak duality.

The dual problem to equation (6) is therefore obtained as follows:

$\begin{eqnarray} \min & & \mathrm{tr}[K] \nonumber \\ \mathrm{s.t.} ~\mathrm{} & & K \geqslant q_{{x}} \rho_{{x}}\quad \forall~{x}=1,\ldots, N. \end{eqnarray} \tag{ 7 }$

Putting optimal discrimination problems into this convex optimization framework, the solution, i.e. the guessing probability, is returned efficiently in polynomial time. In general, the solutions of the primal and dual problems in convex optimization are not necessarily the same. What holds in general is weak duality, shown above: the solution of the dual problem is larger than or equal to that of the primal problem. In this problem of state discrimination, it turns out that the solutions of both problems coincide with each other, which follows from the so-called constraint qualification. This property is called strong duality. Thus, we have

$\begin{eqnarray*} &&{P}_{\mathrm{guess}} = \max_{ \{ M_{{x}} \}_{{x}=1}^{N} } \sum_{{x} =1}^{N} q_{{x}}\, \mathrm{tr}[M_{{x}} \rho_{{x}}] = \min_{K\geqslant q_{{x}} \rho_{{x}} } \mathrm{tr}[K]. \end{eqnarray*}$

That is, the guessing probability is obtained by solving either the primal or the dual problem.
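Weak and strong duality can be illustrated on a two-state qubit example. In the sketch below, every Hermitian qubit operator is written as c_0 I + c·σ, whose eigenvalues are c_0 ± |c|; the example states (|0⟩ and |+⟩ with equal priors), the helper names, and the candidate K = q_1ρ_1 + (q_0ρ_0 − q_1ρ_1)_+, where (·)_+ denotes the positive part, are assumptions made for illustration rather than constructions taken from the text.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def norm(a):
    return math.sqrt(dot(a, a))

def scaled_state(q, v):
    """Coordinates (c0, c) of q*rho = c0 I + c . sigma, rho = (I + v . sigma)/2."""
    return q / 2, [q * x / 2 for x in v]

def is_dual_feasible(k0, k, priors, states, tol=1e-12):
    """Check K >= q_x rho_x for all x, i.e. the constraint of problem (7)."""
    for q, v in zip(priors, states):
        c0, c = scaled_state(q, v)
        diff = [a - b for a, b in zip(k, c)]
        if (k0 - c0) - norm(diff) < -tol:   # smallest eigenvalue of K - q rho
            return False
    return True

priors = [0.5, 0.5]
states = [(0.0, 0.0, 1.0), (1.0, 0.0, 0.0)]   # |0> and |+>

# A feasible but non-optimal dual point: K = I/2, with tr K = 1.
loose_k0, loose_k = 0.5, [0.0, 0.0, 0.0]
loose_value = 2 * loose_k0

# Candidate optimum: K = q1 rho1 + positive part of (q0 rho0 - q1 rho1).
a0, a = scaled_state(priors[0], states[0])
b0, b = scaled_state(priors[1], states[1])
d0, d = a0 - b0, [x - y for x, y in zip(a, b)]
nd = norm(d)
d_hat = [x / nd for x in d] if nd > 0 else [0.0, 0.0, 0.0]
lam_plus, lam_minus = max(d0 + nd, 0.0), max(d0 - nd, 0.0)
pos0 = (lam_plus + lam_minus) / 2
pos = [(lam_plus - lam_minus) / 2 * x for x in d_hat]
k0, k = b0 + pos0, [x + y for x, y in zip(b, pos)]
dual_value = 2 * k0                            # tr K

# Primal value of the projective measurement along +/- d_hat.
signs = [1.0, -1.0]
primal_value = sum(q * 0.5 * (1 + sgn * dot(d_hat, v))
                   for q, v, sgn in zip(priors, states, signs))
```

The feasible point K = I/2 only gives the upper bound 1, in line with weak duality, whereas the candidate K is feasible and its trace coincides with the primal value 1/2 + √2/4, as strong duality requires.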

Apart from solving either the primal or the dual problem, a third approach in the convex optimization, which we are mainly going to consider here, is the so-called complementarity problem that directly analyzes the constraints characterizing the optimal parameters in both primal and dual problems and finds all of the optimal parameters [30]. As both the primal and dual parameters from both convex constraints are taken into account, the approach itself is not considered to be more efficient in numerics than primal or dual problems. Its advantage, however, lies in the fact that the general structure existing in an optimization problem is fully exploited.

Those constraints which characterize optimal solutions are called optimality conditions and can be written as the so-called Karush–Kuhn–Tucker (KKT) conditions. In general, KKT conditions are only necessary since solutions of primal and dual problems can be unequal. As we have mentioned, from the constraint qualification the strong duality holds and consequently the KKT conditions in this case are also sufficient. The list of KKT conditions for the discrimination problem is, then, constraints in equations (6) and (7) and two more conditions in the following.

Lemma 2.2 (Optimality conditions). For a set of states {q_x,ρ_x}^N_x=1, the optimal ME discrimination is characterized by a symmetry operator denoted by K^⋆ and complementary states {r^⋆_x,σ^⋆_x}^N_x=1, where r^⋆_x ⩾ 0, which satisfy the following for x = 1,...,N:

$\begin{equation} \mathrm{(symmetry}~ \mathrm{operator)}\quad K^{\star} = q_{{x}} \rho_{{x}} + r_{{x}}^{\star} {\sigma}_{{x}}^{\star}, \end{equation} \tag{ 8 }$

$\begin{equation} \mathrm{(orthogonality)} \quad r_{{x}}^{\star} \,\mathrm{tr}[M_{{x}}^{\star}{\sigma}_{{x}}^{\star}] = 0, \end{equation} \tag{ 9 }$

where POVM elements {M^⋆_x}^N_x=1 in the above are an optimal measurement giving the guessing probability.

Conversely, for given states {q_x,ρ_x}^N_x=1, any set of parameters K, {r_x,σ_x}^N_x=1 and {M_x}^N_x=1 fulfilling the KKT conditions, equations (6)–(9), characterizes the optimal state discrimination. If parameters satisfy the KKT conditions, they are optimal and we denote them by K^⋆, {r^⋆_x,σ^⋆_x}^N_x=1 and {M^⋆_x}^N_x=1 throughout.

Note that, with optimal parameters, the guessing probability is given by P_guess = tr[K^⋆] from the dual problem in equation (7).

Let us explain how one can obtain two conditions in equations (8) and (9). In the context of convex optimization, the condition of symmetry operator in equation (8) follows from the Lagrangian stability: $\nabla _{M_{{x}}}\mathcal {L}=0$ for all x. The orthogonality condition in equation (9) is obtained from the complementary slackness [21]. In state discrimination, therefore, those parameters satisfying KKT conditions, equations (6)–(9), are characterized as optimal measurements and complementary states to give guessing probabilities.
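As a worked check of lemma 2.2, consider an illustrative example that is not taken from the text above: the three qubit "trine" states ρ_x = |ψ_x⟩⟨ψ_x| with $|\psi_{x}\rangle = \cos ((x-1)\pi /3)|0\rangle + \sin ((x-1)\pi /3)|1\rangle$ for x = 1, 2, 3 and equal priors q_x = 1/3, for which $\sum _{{x}=1}^{3} \rho _{{x}} = \frac {3}{2} I$ . Take as candidate parameters

$\begin{eqnarray*} K = \frac{1}{3} I, \quad r_{{x}} = \frac{1}{3}, \quad \sigma_{{x}} = I - \rho_{{x}}, \quad M_{{x}} = \frac{2}{3} \rho_{{x}}. \end{eqnarray*}$

Each σ_x is a normalized state, since I − ρ_x is the rank-one projector orthogonal to ρ_x, and the conditions of lemma 2.2 can be verified term by term:

$\begin{eqnarray*} q_{{x}} \rho_{{x}} + r_{{x}} \sigma_{{x}} &=& \frac{1}{3} \rho_{{x}} + \frac{1}{3} (I - \rho_{{x}}) = \frac{1}{3} I = K, \\ r_{{x}}\, \mathrm{tr}[M_{{x}} \sigma_{{x}}] &=& \frac{2}{9}\, \mathrm{tr}[\rho_{{x}} (I - \rho_{{x}})] = 0, \\ \sum_{{x}=1}^{3} M_{{x}} &=& \frac{2}{3} \sum_{{x}=1}^{3} \rho_{{x}} = I. \end{eqnarray*}$

The dual constraint K ⩾ q_xρ_x holds because r_xσ_x ⩾ 0. Under these assumptions the candidates satisfy equations (6)–(9), so they are optimal and P_guess = tr[K] = 2/3, which also equals q_x + r_x for each x.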

We call K^⋆ the symmetry operator in the ME discrimination of states {q_x,ρ_x}^N_x=1 for the following reasons. First, the operator is uniquely determined for the ME discrimination of given states, whereas the optimal measurement is not unique, see e.g. [36].¹ The proof is provided later in lemma 2.4. This means that for given quantum states, the symmetry operator, rather than the optimal measurement, characterizes ME discrimination. Next, due to the uniqueness, the operator preserves the properties concerning the guessing probability in ME discrimination. Note that the guessing probability is of practical importance when exploiting sets of quantum states in a communication task. Then, as we will explicitly show later in examples, the guessing probability does not generally depend on detailed relations (e.g. angles or distances between states) of given states {q_x,ρ_x}^N_x=1 but on a single parameter, the operator K^⋆ constructed from the given states.

Let us also be precise about the statement that the guessing probability does not depend on detailed relations of given quantum states. We point out that the observation from two-state discrimination [3], where the trace distance defining the relation between the two states is the parameter dictating the guessing probability, does not generalize to more than two states. We show this by examples: as is going to be shown in section 4, one of three states can be modified independently while the guessing probability remains the same. We emphasize that the operator K^⋆, rather than any other parameter in ME discrimination, directly dictates the guessing probability. We also note that this property is in line with the conclusion in [31] that distinguishability is a global property that cannot be reduced to the distinguishability of each pair of states.

In fact, two distinct sets of quantum states can have the same symmetry operator. Then, since the symmetry operator gives the complete characterization of the ME discrimination, such as the guessing probability and complementary states, the ME discrimination for the two sets is analyzed in terms of the same symmetry operator. This motivates one to construct equivalence classes of sets of quantum states via the symmetry operator in ME discrimination and shows a general structure in ME discrimination, see section 3.

Below, we show that the optimality conditions in equations (8) and (9) are equivalent to those in equations (4) and (5).

Remark 2.1. KKT conditions in equations (8) and (9) are equivalent to the optimality conditions shown in equations (4) and (5).

Proof. To prove the equivalence, it suffices to show that KKT conditions imply optimality conditions in equations (4) and (5). In the following, we derive equations (4) and (5) from KKT conditions.

First, since the symmetry operator in equation (8) gives the guessing probability, we have

$\begin{equation} P_{\mathrm{guess}} = \mathrm{tr}[K^{\star}] = \mathrm{tr}\left[\sum_{{x}=1}^{N} q_{{x}} \rho_{{x}} M_{{x}}^{\star}\right]. \end{equation} \tag{ 10 }$

It follows that $K^{\star } = \sum _{{x}=1}^{N} q_{{x}} \rho _{x} M_{{x}}^{\star }$ with optimal POVMs fulfilling equation (10). From equation (8), noting that r^⋆_xσ^⋆_x ⩾ 0, we have K^⋆ − q_xρ_x ⩾ 0 for all x = 1,...,N. This already proves the condition in equation (5).

Next, from the symmetry operator in equation (8), the equality in equation (10) also implies the following:

$\begin{eqnarray*} \mathrm{tr}[K^{\star}] &=& \mathrm{tr}\left[\sum_{{x}=1}^{N} q_{{x}} \rho_{{x}} M_{{x}}^{\star}\right] = \mathrm{tr}\left[\sum_{{x}=1}^{N} (K^{\star} - r_{{x}}^{\star} \sigma_{x}^{\star} ) M_{{x}}^{\star}\right] = \mathrm{tr}[K^{\star}] - \sum_{{x}=1}^{N} \mathrm{tr} [ r_{{x} }^{\star} \sigma_{{x}}^{\star} M_{{x}}^{\star} ] \\ &\Rightarrow & \sum_{{x}=1}^{N} \mathrm{tr} [ r_{{x} }^{\star} \sigma_{{x}}^{\star} M_{{x}}^{\star} ] = 0. \end{eqnarray*}$

Since tr[r^⋆_xσ^⋆_xM^⋆_x] ⩾ 0 for each x, we conclude that tr[M^⋆_xr^⋆_xσ^⋆_x] = 0 for all x. Moreover, since σ^⋆_x ⩾ 0, M^⋆_x ⩾ 0 and r^⋆_x ⩾ 0, we have r^⋆_xσ^⋆_xM^⋆_x = 0 for all x. We apply this identity to the following, together with the relation q_xρ_x = K^⋆ − r^⋆_xσ^⋆_x in equation (8):

$\begin{eqnarray*} M_{{x}}^{\star} (q_{{x}} \rho_{{x}} - q_{{y}} \rho_{{y}}) M_{{y}}^{\star} & = & M_{{x}}^{\star} ( (K^{\star} - r_{{x}}^{\star} \sigma_{{x}}^{\star} ) - ( K^{\star} - r_{{y}}^{\star} \sigma_{{y}}^{\star} ) ) M_{{y}}^{\star} \\ & = &M_{{x}}^{\star} ( r_{{y}}^{\star} \sigma_{{y}}^{\star} M_{{y}}^{\star} ) - ( M_{{x}}^{\star} r_{{x}}^{\star} \sigma_{{x}}^{\star}) M_{{y}}^{\star} =0. \end{eqnarray*}$

Thus, it is shown that the KKT conditions imply the optimality condition in equation (4). □

2.2.3. Optimality conditions from fundamental principles

In [32], it is shown that the no-signaling principle can generally determine the ME quantum state discrimination. This is shown by proving that if measurement devices are non-signaling, the guessing probability in the state discrimination cannot be larger than that characterized within quantum theory. Here, we briefly sketch the main framework of the derivation and show that the optimality conditions in equations (8) and (9) can be alternatively derived from the fundamental principles.

Two operational tasks in quantum theory are exploited. One is ensemble steering, which states that there always exists a bipartite quantum state such that, by sharing the state, one party can prepare any ensemble decomposition of the other party's state. This was first asserted by Schrödinger [33] and later formalized as the Gisin–Hughston–Jozsa–Wootters (GHJW) theorem [34, 35]. The other is the no-signaling principle, that information cannot be transmitted faster than the speed of light. The result in [32] is then that optimal quantum state discrimination follows when two constraints are compatible: (i) quantum correlations, called ensemble steering, by which ensemble decompositions of quantum states can be steered by a party at a distance, and (ii) the no-signaling condition on probability distributions, that probability distributions of input and output random variables at a location cannot be exploited for instantaneous communication. From these two fundamental principles, the optimality conditions in equations (8) and (9) can be reproduced as follows.

Suppose that two parties share entangled states and one party prepares quantum states (to be discriminated among) for the other, distant party using ensemble steering. For states {q_x,ρ_x}^N_x=1 in the ME discrimination, the other party holds the following identical ensemble in different decompositions:

$\begin{equation} \rho = p_{{x}} \rho_{{x}} + (1-p_{{x}}) \sigma_{{x}}\quad \mathrm{for}\;{x} = 1,\ldots,N \end{equation} \tag{ 11 }$

for some states {σ_x}^N_x=1 with $q_{{x}} = p_{{x}} / \sum _{{y} = 1}^{N} p_{{y}}$ . Notice that an identical ensemble ρ has different decompositions according to the indices x = 1,...,N. The existence of states {σ_x}^N_x=1 that compose the identical ensemble above together with a given set {ρ_x}^N_x=1 follows from the GHJW theorem in [34, 35]. Note that {q_x}^N_x=1 are prior probabilities and thus $\sum _{{x}=1}^{N} q_{{x}} =1$ , whereas {p_x}^N_x=1 are probabilities to steer the quantum states {ρ_x}^N_x=1 in the ensembles and thus we do not necessarily have $\sum _{{x}=1}^{N} p_{{x}} = 1$ .

If a measurement device is set only to discriminate among states {ρ_x}^N_x=1, optimal discrimination of {q_x,ρ_x}^N_x=1 cannot succeed with an arbitrarily high probability. This is because, if ME discrimination worked arbitrarily well, it would violate the no-signaling principle. For instance, suppose that the ensemble decomposition is inferred from the ME discrimination of states {ρ_x}^N_x=1: that is, concluding that ρ_x is found in the discrimination, one guesses that the ensemble corresponds to p_xρ_x + (1 − p_x)σ_x. As long as no information is announced about which of the ensemble decompositions x = 1,...,N has been prepared, the no-signaling condition must be fulfilled, and therefore the ME discrimination of {ρ_x}^N_x=1 must not allow one to gain knowledge about the preparation. Given no information about the preparation, the guess about the ensemble decomposition can be no better than random. Thus, state discrimination is constrained such that the strategy above gives at best the random guess.

In [32] it is shown that the ME discrimination must satisfy the following condition to fulfill two constraints in the above:

$\begin{equation} \sum_{{x}=1}^{N} p_{{x}} P ({x} | {x}, \{ \rho_{{x}}\}_{{x}=1}^{N} ) \leqslant 1, \end{equation} \tag{ 12 }$

where P(x|x,{ρ_x}^N_x=1) is the probability of giving outcome x when one of the states {ρ_x}^N_x=1 is given i.e. the probability of correctly discriminating among states {ρ_x}^N_x=1. The equality holds if and only if, for an ensemble prepared in equation (11) with some x, a measurement device responds only to quantum states ρ_x but not to states σ_x, that is, P(x|x,{σ_x}^N_x=1) = 0. Note that, with the measurement postulate, this is equivalent to tr[M_xσ_x] = 0 for some POVM M_x, which is indeed the optimality condition in equation (9). Recall that the bound in equation (12) is about the ME discrimination among quantum states {ρ_x}^N_x=1 that are given with prior probabilities $\{q_{{x}} =p_{{x}} / \sum _{{y}=1}^{N} p_{{y}} \}_{{x}=1}^{N}$ . All these are summarized as follows.

Lemma 2.3. When quantum states {q_x,ρ_x}^N_x=1 are prepared by ensemble steering, as shown in equation (11), the guessing probability is upper bounded by the no-signaling condition

$\begin{equation} P_{\mathrm{guess}} \leqslant \frac{1}{ \sum_{{x}=1}^{N} p_{{x}}}~\mathrm{with~ equality~ if ~and ~only~ if} ~P({x} | {x}, \{ \sigma_{{x}}\}_{{x}=1}^{N} ) = 0 ~\mathrm{for}~{x} = 1,\ldots,N, \end{equation} \tag{ 13 }$

where {p_x}^N_x=1 are from equation (11). With the measurement postulate, the equality holds when POVM M_x does not respond to σ_x but only to ρ_x for each x, that is

$\begin{equation} \mathrm{tr}[M_{{x}} \sigma_{{x}}] = 0, \end{equation} \tag{ 14 }$

which reproduces the condition in equation (9). Then, the upper bound is equal to the guessing probability from quantum theory, e.g. equation (7).

Proof. The upper bound can be derived from equation (12). Recall that $q_{{x}} = p_{{x}} / \sum _{{y}=1}^{N} p_{{y}}$ with $\sum _{{x}=1}^{N} q_{{x}} =1$ . Dividing both sides of equation (12) by $\sum _{{y}=1}^{N} p_{{y}}$ gives $\sum _{{x}=1}^{N} q_{{x}} P({x} | {x}, \{ \rho _{{x}}\}_{{x}=1}^{N} ) \leqslant 1 / \sum _{{y}=1}^{N} p_{{y}}$ , where the left-hand side is the probability of guessing correctly, and thus the upper bound in equation (13) is obtained. □

Consequently, the two constraints in equations (11) and (14) are optimality conditions for probabilities in ME discrimination. They are equivalent to those in lemma 2.2: in particular, equation (11) is equivalent to the symmetry operator in equation (8). We also remark that this approach of constraining with the no-signaling principle is valid for generalized probability theories, as long as steering is allowed, i.e. different decompositions of an identical ensemble in equation (11) can be prepared at a distance.
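One way to make the correspondence between equations (8) and (11) explicit is to normalize the symmetry operator. The following is a sketch under the identification ρ = K^⋆/tr[K^⋆], which is an assumption made here for illustration and is not spelled out in the text. Taking the trace of equation (8) gives tr[K^⋆] = q_x + r^⋆_x = P_guess for every x, so that

$\begin{eqnarray*} \frac{K^{\star}}{\mathrm{tr}[K^{\star}]} &=& \frac{q_{{x}}}{P_{\mathrm{guess}}} \rho_{{x}} + \frac{r_{{x}}^{\star}}{P_{\mathrm{guess}}} \sigma_{{x}}^{\star} = p_{{x}} \rho_{{x}} + (1 - p_{{x}}) \sigma_{{x}}^{\star} \quad \mathrm{with}\quad p_{{x}} = \frac{q_{{x}}}{P_{\mathrm{guess}}}, \\ \sum_{{x}=1}^{N} p_{{x}} &=& \frac{1}{P_{\mathrm{guess}}} \sum_{{x}=1}^{N} q_{{x}} = \frac{1}{P_{\mathrm{guess}}}. \end{eqnarray*}$

The first line has the form of equation (11), and the second line is consistent both with $q_{{x}} = p_{{x}} / \sum _{{y}=1}^{N} p_{{y}}$ and with equality in the bound of equation (13).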

2.3. Geometric formulation

From the different approaches to ME discrimination, we have shown different forms of the optimality conditions, which are also shown to be equivalent to the KKT conditions. Compared to previously known forms of the conditions in equations (4) and (5), the usefulness of expressing the optimality conditions in KKT conditions in lemma 2.2 is that conditions about states (in the state space) and optimal measurement are separated.

We here interpret the symmetry operator in terms of the quantum state geometry and put forward a geometric approach to optimal state discrimination. This shows that optimal discrimination of quantum states, which is supposed to be explained on the level of probability distributions from measurement outcomes, can be explained only with quantum states and their geometry.

We first recall the optimality conditions in lemma 2.2: for a problem of ME discrimination, there exists a symmetry operator K^⋆ which characterizes optimal discrimination and directly gives the guessing probability. Once the operator is obtained, the rest is straightforward. First, complementary states {r^⋆_x,σ^⋆_x}^N_x=1 can be found as K^⋆ − q_xρ_x from equation (8). Since K^⋆ is uniquely determined by given states, this also means the uniqueness of complementary states in the ME discrimination.

Lemma 2.4. The symmetry operator and complementary states in a problem of ME discrimination are unique.

Proof. We first show that the symmetry operator K^⋆ is unique for given states {q_x,ρ_x}^N_x=1. Suppose that for given states {q_x,ρ_x}^N_x=1, there exist two symmetry operators K^⋆ and $\bar {K}^{\star }$ such that both give the same guessing probability i.e. $P_{\mathrm {guess}} = \mathrm {tr}[K^{\star }] = \mathrm {tr}[\bar {K}^{\star } ]$ while

$\begin{eqnarray}&&\begin{array}{@{}c@{}} K^{\star} = q_{{x}} \rho_{{x}} + r_{{x}}^{\star} \sigma_{{x}}^{\star}\quad \forall {x}, \\ \bar{K}^{\star} = q_{{x}} \rho_{{x}} + r_{{x}}^{\star} \bar{\sigma}_{{x}}^{\star}\quad \forall {x}\end{array} \end{eqnarray} \tag{ 15 }$

with corresponding complementary states {r^⋆_x,σ^⋆_x}^N_x=1 and $\{ r_{{x}}^{\star }, \bar {\sigma }_{{x}}^{\star } \}_{{x}=1}^{N}$ , respectively. Note that the parameters {r^⋆_x}^N_x=1 remain the same in both cases of complementary states: this follows from equation (8) that the guessing probability is also given as P_guess = q_x + r^⋆_x for each x. In addition, let us assume that {M^⋆_x}^N_x=1 and $\{ \bar {M}_{{x}}^{\star } \}_{{x}=1}^{N}$ are optimal measurements, respectively, such that, $P_{\mathrm {guess}} = \mathrm {tr}[K^{\star } ] = \mathrm {tr}[\sum _{{x}} q_{{x}} \rho _{{x}} M_{{x}}^{\star } ]$ and also $P_{\mathrm {guess}} = \mathrm {tr}[\bar {K}^{\star } ] = \mathrm {tr}[ \sum _{{x}} q_{{x}} \rho _{{x}} \bar {M}_{{x}}^{\star } ]$ , or equivalently from the optimality condition in equation (9), r^⋆_x tr[σ^⋆_xM^⋆_x] = 0 and $r_{{x}}^{\star }\,\mathrm {tr}[\bar {\sigma }_{{x}}^{\star } \bar { M}_{{x}}^{\star } ] = 0$ .

Now, with two equations in equation (15), let us compute $\sum _{{x}} K^{\star } \bar {M}_{{x}}^{\star }$ and $\sum _{{x}} \bar {K}^{\star } M_{{x}}^{\star }$ as follows, since $\sum _{{x}} M_{{x}}^{\star } = \sum _{{x}} \bar {M}_{{x}}^{\star } = I$ ,

$\begin{equation} K^{\star} = \sum_{{x}} K^{\star} \bar{M}_{{x}}^{\star} = \sum_{{x}} q_{{x}} \rho_{{x}} \bar{M}_{{x}}^{\star} + \sum_{{x}} r_{{x}}^{\star} \sigma_{{x}}^{\star} \bar{M}_{{x}}^{\star} = \bar{K}^{\star} + \sum_{{x}} r_{{x}}^{\star} \sigma_{{x}}^{\star} \bar{M}_{{x}}^{\star}, \end{equation} \tag{ 16 }$

$\begin{equation} \bar{K}^{\star} = \sum_{{x}} \bar{K}^{\star} M_{{x}}^{\star} = \sum_{{x}} q_{{x}} \rho_{{x}} M_{{x}}^{\star} + \sum_{{x}} r_{{x}}^{\star} \bar{\sigma}_{{x}}^{\star} M_{{x}}^{\star} = K^{\star} + \sum_{{x}} r_{{x}}^{\star} \bar{\sigma}_{{x}}^{\star} M_{{x}}^{\star}, \end{equation} \tag{ 17 }$

where we have used the fact that $K^{\star } = \sum _{{x}} q_{{x}} \rho _{{x}} M_{{x}}^{\star }$ and $\bar {K}^{\star } = \sum _{{x}} q_{{x}} \rho _{{x}} \bar {M}_{{x}}^{\star }$ . From equations (16) and (17), we have

$\begin{eqnarray} K^{\star} + \bar{K}^{\star} &=& \bar{K}^{\star} + K^{\star} + \sum_{{x}} r_{{x}}^{\star} \sigma_{{x}}^{\star} \bar{M}_{{x}}^{\star} + \sum_{{x}} r_{{x}}^{\star} \bar{\sigma}_{{x}}^{\star} M_{{x}}^{\star} \nonumber \\ &\Rightarrow& \sum_{{x}} r_{{x}}^{\star} \sigma_{{x}}^{\star} \bar{M}_{{x}}^{\star} + \sum_{{x}} r_{{x}}^{\star}\bar{\sigma}_{{x}}^{\star} M_{{x}}^{\star} =0 \nonumber \\ &\Rightarrow& \sum_{{x}} r_{{x}}^{\star} \sigma_{{x}}^{\star} \bar{M}_{{x}}^{\star} =0 ~\mathrm{and} ~ \sum_{{x}} r_{{x}}^{\star} \bar{\sigma}_{{x}}^{\star} M_{{x}}^{\star} =0. \end{eqnarray} \tag{ 18 }$

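To conclude equation (18), we have recalled that all operators of measurement and complementary states are positive semidefinite: taking the trace of the second line of equation (18), every term $r_{{x}}^{\star }\,\mathrm {tr}[\sigma _{{x}}^{\star } \bar {M}_{{x}}^{\star } ]$ and $r_{{x}}^{\star }\,\mathrm {tr}[\bar {\sigma }_{{x}}^{\star } M_{{x}}^{\star } ]$ is non-negative and hence vanishes, and two positive semidefinite operators A and B with tr[AB] = 0 satisfy AB = 0. Plugging equation (18) into equations (16) and (17), we have that $K^{\star } = \bar {K}^{\star }$ . This proves that for a set of quantum states, the symmetry operator is unique.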

Then, from equation (8) and the uniqueness of the symmetry operator, it is shown that complementary states are also uniquely determined, ∀x, r^⋆_xσ^⋆_x = K^⋆ − q_xρ_x. □

For the obtained complementary states, it is not difficult to find POVM elements satisfying the orthogonality condition in equation (9). An optimal POVM element for state ρ_x can be found in the kernel of the state σ^⋆_x, $\mathcal {K}[\sigma _{{x}}^{\star }] = \mathrm {span} \{ | \psi \rangle : \sigma _{{x}}^{\star } |\psi \rangle = 0\}$ , i.e. $M_{{x}}^{\star } \in \mathcal {K}[\sigma _{{x}}^{\star }]$ . In doing this, POVM elements should be chosen such that span{M^⋆_x}^N_x=1 = span{ρ_x}^N_x=1 so that the completeness condition $\sum _{{x}=1}^{N} M_{{x}}^{\star }=I$ is fulfilled. Note also that span{ρ_x}^N_x=1 = span{σ^⋆_x}^N_x=1 holds. If given states {ρ_x}^N_x=1 are linearly independent, optimal POVM elements must be of rank-one [37].

Since complementary states are unique, one can alternatively solve the ME discrimination in terms of complementary states, without referring directly to optimal measurement. This is also because optimal POVMs are generally not unique: for complementary states {σ^⋆_x}^N_x=1, optimal POVMs {M^⋆_x}^N_x=1 specified by the optimality condition tr[σ^⋆_xM^⋆_x] = 0 are not unique [36]. From lemma 2.2, optimal discrimination is solved once those parameters satisfy the KKT conditions. If the underlying geometry of given states is clear, complementary states can be found from the given states in the following way.

Let $\mathcal {P}(\{q_{{x}} ,\rho _{{x}} \}_{{x}=1}^{N})$ denote the polytope of given states constructed in the state space, where each vertex of the polytope is specified by q_xρ_x. The condition of symmetry operator in equation (8) can be written as

$\begin{equation} q_{{x}} \rho_{{x}} - q_{{y}} \rho_{{y}} = r_{{y}}^{\star} \sigma_{{y}}^{\star} - r_{{x}}^{\star} \sigma_{{x}}^{\star}\quad \forall\;{x},{y}=1,\ldots,N. \end{equation} \tag{ 19 }$

This means that the unknown polytope of complementary states $\mathcal {P} ( \{ r_{{x}}^{\star },\sigma _{{x}}^{\star }\}_{{x}=1}^{N})$ is congruent to the polytope of given states. Note that, here, we say that two polytopes are congruent if all of the vertices of one polytope are identical to those of the other. This already determines the polytope $\mathcal {P} ( \{ r_{{x}}^{\star },\sigma _{{x}}^{\star }\}_{{x}=1}^{N})$ of the states we search for. Then, the optimal discrimination follows by locating $\mathcal {P} ( \{ r_{{x}},\sigma _{{x}}\}_{{x}=1}^{N})$ in the state space such that, together with given polytope $\mathcal {P}(\{q_{{x}} ,\rho _{{x}} \}_{{x}=1}^{N})$ , the symmetry operator can be constructed. This is equivalent to the condition that corresponding lines are anti-parallel, see equation (19). An approach to construct the symmetry operator can be done by rewriting the symmetry operator in equation (8) as

$\begin{equation} K^{\star} = \frac{1}{N} \sum_{{x}=1}^{N} q_{{x}} \rho_{{x}} + \frac{1}{N} \sum_{{x}=1}^{N} r_{{x}}^{\star} \sigma_{{x}}^{\star}. \end{equation} \tag{ 20 }$

Note an interpretation of K^⋆, that a symmetry operator corresponds to the sum of two centers of two respective polytopes $\mathcal {P}(\{q_{{x}} ,\rho _{{x}} \}_{{x}=1}^{N})$ and $\mathcal {P} ( \{ r_{{x}}^{\star },\sigma _{{x}}^{\star } \}_{{x}=1}^{N})$ . Then, given the two congruent polytopes in the state space, a symmetry operator can be obtained by rotating the not-yet-fixed one $\mathcal {P} ( \{ r_{{x}},\sigma _{{x}}\}_{{x}=1}^{N})$ with respect to the fixed one $\mathcal {P} ( \{ q_{{x}},\rho _{{x}}\}_{{x}=1}^{N})$ , such that operators from two constructions in equations (8) and (20) are identical.

To apply the geometric formulation above, one should be able to describe the geometry of quantum states in the state space. The difficulty is clearly the lack of a general picture of the quantum state space beyond the two-dimensional case, i.e. the qubit state space.

3. General structures

In this section, we show general structures of ME discrimination: equivalence classes of sets of quantum states, construction of the ME discrimination and general expressions of the guessing probability. We mainly exploit results shown in the previous section, that for ME discrimination there always exists a symmetry operator which completely characterizes the optimal discrimination. We first define equivalence classes of sets of quantum states in terms of a symmetry operator. As an approach converse to optimal discrimination for given states, we present a systematic way of constructing a set of quantum states for which the optimal discrimination is immediately known from a given symmetry operator. We then show a general and analytic expression of the guessing probability.

From now on, unless specified otherwise, for simplicity let K and {r_x,σ_x} without ^⋆ denote a symmetry operator and complementary states, respectively.

3.1. Equivalence classes

From lemma 2.2, a symmetry operator gives a complete characterization of optimal parameters in ME discrimination. Once a symmetry operator is found, it is straightforward to find the complementary states and optimal measurements. Note that the guessing probability is given by the trace norm of a symmetry operator, see equation (7). Therefore, if two different sets of quantum states share an identical symmetry operator, the ME discrimination is characterized in an equivalent way in terms of the identical symmetry operator.

Definition 3.1 (Equivalence classes). Two sets of quantum states, say {q_x,ρ_x}^N_x=1 and {q'_x,ρ'_x}^L_x=1, are equivalent in the ME state discrimination if their symmetry operators are identical up to unitary transformations, i.e. they have an identical spectrum. We write two equivalent sets as

$\begin{eqnarray*} &&\{q_{{x}},\rho_{{x}}\}_{{x}=1}^{N} \sim \{{q'}_{{x}},{\rho'}_{{x}} \}_{{x}=1}^{L}, \end{eqnarray*}$

and the equivalence class characterized by a symmetry operator K is denoted by $\mathcal {A}_{K}$ . Then sets of quantum states in the same equivalence class have the same guessing probability.

3.2. Construction of a set of quantum states from a symmetry operator

In this subsection, conversely to solving a problem of ME discrimination (or equivalently, finding the symmetry operator), we introduce a systematic way of generating a set of quantum states from a given symmetry operator. This means that, for the generated quantum states, optimal discrimination is already characterized by the symmetry operator. That is, elements of an equivalence class identified by a symmetry operator are generated in this way.

The main idea is to exploit the structure of ME discrimination shown in the section 2.2. Suppose that a symmetry operator is given by $K \in \mathcal {B} (\mathcal {H})$ which is simply a (bounded) positive operator over a Hilbert space $\mathcal {H}$ . As the guessing probability is to be given as tr[K] at the end (see equation (7)), we note that the operator is not larger than the identity operator in the space. To construct a set of quantum states {q_x,ρ_x}^N_x=1 having operator K as their symmetry operator, the first thing to do is to normalize the operator to interpret it as a quantum state and then make its purification. Let $\widetilde {K}$ denote the operator after normalization, $\widetilde {K} = K/ \mathrm {tr}[K]$ . We write its purification as $|\psi _{K}\rangle _{AB} \in \mathcal {H} \otimes \mathcal {H}$ such that $\widetilde {K} = \textrm {tr}_{A} | \psi _{K}\rangle _{AB} \langle \psi _{K}|$ ; the purification is unique up to local unitary transformations on the A system.

Then, the next step is to construct N two-outcome and complete measurements on the A system, M^x = {M^x₀,M^x₁} with M^x₀ + M^x₁ = I for x = 1,...,N. Since each measurement is complete, the resulting state in the B system on average (i.e. ensemble average) is described by the operator $\widetilde {K}$ . Decompositions of operator $\widetilde {K}$ are determined by the choice of measurement M^x on the A system: let ρ_x (σ_x) denote the state that results from the detection event M^x₀ (M^x₁) in Alice's measurement; we assume that the detection event happens with probability p_x (1 − p_x). In this way, there are N different decompositions of operator $\widetilde {K}$ ,

$\begin{equation} \widetilde{K} = p_{{x}} \rho_{{x}} +(1-p_{{x}}) \sigma_{{x}} \quad \mathrm{for}\;{x} = 1,\ldots,N, \end{equation} \tag{ 21 }$

where {ρ_x}^N_x=1 are those states that we are interested in discriminating. Given that a measurement device is prepared to discriminate among these states {ρ_x}^N_x=1, the a priori probability that ρ_x is generated is given by $p_{{x}} / \sum _{{y}=1}^{N} p_{{y}}$ , which we denote by q_x.

The ensemble decomposition in equation (21) corresponds to the case shown in equation (11), or also equivalently in equation (8), in which an identical ensemble is decomposed in N different ways. In this case, the optimality conditions are presented in equation (13), which is also equivalent to the optimality condition in equation (9). That is, it immediately follows that the optimal discrimination of states {q_x,ρ_x}^N_x=1 is characterized: the guessing probability is P_guess = tr[K] from the given operator; complementary states are {r_x,σ_x}^N_x=1, where r_x can be found by comparing equation (8) with equation (21) multiplied by tr[K], that is r_x = (1 − p_x)tr[K], and σ_x results from measurement M^x₁ in the A system.

Proposition 3.1 (Construction of equivalence classes). When a symmetry operator K is provided, an element of the equivalence class $\mathcal {A}_{K}$ can be constructed as a set of states {q_x,ρ_x}^N_x=1 if there exists a set of POVM elements {M^x₀}^N_x=1 on the A system of the purification |ψ_K〉_AB, such that the states {ρ_x}^N_x=1 are prepared in the B system with probabilities {p_x}^N_x=1, respectively, where p_x = q_x/tr[K] for each x = 1,...,N. Then, the other POVM elements {M^x₁}^N_x=1, which fulfill M^x₀ + M^x₁ = I for each x, uniquely determine complementary states {σ_x}^N_x=1 in the B system.

Before proceeding, we present an example that shows how the method of constructing a set of quantum states can be applied, when a symmetry operator is given. It is simple but also useful to see how all that has been explained works out. Further examples are also presented in section 4.

Example (Equivalence class of a normalized identity operator). Suppose that a symmetry operator is given as the normalized identity operator in a d-dimensional Hilbert space, K = I/d. Taking one spectral decomposition of the identity, we write it using an orthonormal basis {|k〉}^d_k=1: $I = \sum _{k=1}^{d} |k\rangle \langle k|$ . Since the symmetry operator is normalized, the guessing probability for the quantum states to be constructed is P_guess = 1. The purification is the maximally entangled state, $|\psi _{K} \rangle = \sum _{k=1}^{d} |k\rangle |k\rangle /\sqrt {d}$ . Applying measurement {M^x₀ = |x〉〈x|,M^x₁ = I − M^x₀}^d_x=1 on the A system, the following ensemble is prepared in the B system:

$\begin{eqnarray*} &&K = \frac{1}{d} |{x} \rangle \langle {x}| + \frac{d-1}{d} \sigma_{{x}},\quad \mathrm{where}\;\sigma_{{x}} = \frac{1}{d-1}\sum_{{y} \neq {x}} |{y}\rangle \langle {y} |\quad \forall\;{{x}=1,\ldots,d}. \end{eqnarray*}$

This defines a problem of state discrimination for orthogonal states {1/d,|x〉〈x|}^d_x=1. Complementary states are shown above; it is also straightforward to find optimal measurements. Thus, it is shown that $\{1/d, |{x}\rangle \langle {x}| \}_{{x}=1}^{d} \in \mathcal {A}_{I/d}$ .
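The construction in this example can be reproduced numerically. The sketch below is illustrative only: the function name `steer` and the choice d = 3 are assumptions made here, and the purification is written in the eigenbasis of the normalized operator as $\sum _{k} \sqrt {\lambda _{k}} |k\rangle |k\rangle$ .

```python
import numpy as np

def steer(K, M0):
    """Measure {M0, I - M0} on system A of a purification of K/tr[K].

    Returns (p, rho, sigma): the probability of outcome M0 and the
    normalized states left on system B for outcomes M0 and I - M0.
    (Hypothetical helper; assumes both outcomes occur with nonzero probability.)
    """
    d = K.shape[0]
    K_tilde = K / np.trace(K).real
    lam, U = np.linalg.eigh(K_tilde)
    lam = np.clip(lam, 0.0, None)
    # Coefficient matrix C[a, b] of |psi> = sum_k sqrt(lam_k) |u_k>_A |u_k>_B
    C = (U * np.sqrt(lam)) @ U.T

    def branch(M):
        # Unnormalized state on B: tr_A[(M x I) |psi><psi|] = C^T M^T C^*
        unnorm = C.T @ M.T @ C.conj()
        prob = np.trace(unnorm).real
        return prob, unnorm / prob

    p, rho = branch(M0)
    _, sigma = branch(np.eye(d) - M0)
    return p, rho, sigma

d = 3
K = np.eye(d) / d
results = []
for x in range(d):
    M0 = np.zeros((d, d))
    M0[x, x] = 1.0
    results.append((M0, *steer(K, M0)))
```

For each x, the returned triple can be compared with the decomposition displayed above: p_x = 1/d, ρ_x = |x〉〈x| and σ_x the normalized projector onto the remaining basis vectors.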

3.3. Analytic expression of the guessing probability

In this subsection, we present various expressions of the guessing probability in ME state discrimination. They are equivalent but of different forms depending on how they are derived. The first one, based on the optimality conditions, corresponds to a quantum analogy of the probability-theoretic measure shown in equation (3). The next one is in accordance with other physical theories, ensemble steering on quantum states and the no-signaling principle on measurement outcomes.

3.3.1. Quantum analogy to the probability-theoretic expression

Let us first present a general form of the guessing probability (for quantum states) in the framework of probability-theoretic measures. We first recall from lemma 2.1 that the guessing probability about random variable X given Y is generally expressed as the distance of probability P_X|Y deviated from the uniform distribution. This holds true in general, no matter what physical systems are applied to mediate between the preparation and the measurement. Then, it remains to determine the form of d(X|Y ) once quantum systems are applied.

We recall the KKT conditions of the ME discrimination for {q_x,ρ_x}^N_x=1, as follows:

$\begin{eqnarray*} &&K = q_{{x}} \rho_{{x}} + r_{{x}} \sigma_{{x}}~\mathrm{for}~{x} = 1,\ldots,N~\mathrm{with~ complementary~ states}~ \{r_{{x}},\sigma_{{x}} \}_{{x}=1}^{N}. \end{eqnarray*}$

Using the simple identity that $K = \sum _{{x}=1}^{N} K/N$ , we have

$\begin{equation} P_{\mathrm{guess}} = \frac{1}{N} \mathrm{tr}\left[ \sum_{{x}} q_{{x}} \rho_{{x}} + r_{{x}} \sigma_{{x}}\right] = \frac{1}{N} + R, \quad \mathrm{where}\; R= \frac{1}{N} \sum_{{x}=1}^{N} r_{{x}}. \end{equation} \tag{ 22 }$

This compares to equation (3): the distance d(X|Y ) corresponds to the dual parameter R, which now corresponds to the norm of the ensemble average of complementary states.

The parameter R in the expression in equation (22) in fact has a geometrical meaning in the state space. Recall from section 2.3 that each complementary state is determined as a vertex of the polytope that is congruent to the polytope constructed by given states, $\mathcal {P} (\{q_{{x}},\rho _{{x}} \}_{{x}=1}^{N} )$ . Then, each state r_xσ_x plays the role of the (trace) distance between each vertex q_xρ_x of the polytope and the symmetry operator, i.e.

$\begin{eqnarray*} &&r_{{x}} = \mathrm{tr} [ r_{{x}} \sigma_{{x}} ]= \mathrm{tr} [ K - q_{{x}} \rho_{{x}} ] = \| K - q_{{x}} \rho_{{x}} \|_{1}, \end{eqnarray*}$

where we note that K ⩾ q_xρ_x for all x = 1,...,N, from the dual problem in equation (7).

Proposition 3.2. The guessing probability for quantum states {q_x,ρ_x}^N_x=1 constructed from a symmetry operator K is

$\begin{equation} P_{\mathrm{guess}} = \frac{1}{N} + R (K \| \{ q_{{x}} ,\rho_{{x}} \}_{{x}=1}^{N} )\; \mathrm{with}\;R (K \| \{ q_{{x}} ,\rho_{{x}} \}_{{x}=1}^{N} ) = \frac{1}{N} \sum_{{x}=1}^{N} \| K - q_{{x}} \rho_{{x}} \|_{1}, \end{equation} \tag{ 23 }$

where R(K||{q_x,ρ_x}^N_x=1) shows the averaged trace distance of given states {q_x,ρ_x}^N_x=1 deviated from their symmetry operator K, see also equation (3) in lemma 2.1 for comparison.
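As a quick numerical illustration of equation (23), the sketch below evaluates 1/N + R for the orthogonal-state example of section 3.2 and compares it with tr[K]. The helper names `trace_norm` and `guessing_probability` and the choice d = 3 are assumptions made for this illustration.

```python
import numpy as np

def trace_norm(A):
    # Trace norm as the sum of singular values.
    return np.linalg.svd(A, compute_uv=False).sum()

def guessing_probability(K, priors, states):
    """Evaluate 1/N + R(K || {q_x, rho_x}) as in equation (23)."""
    N = len(states)
    R = sum(trace_norm(K - q * rho) for q, rho in zip(priors, states)) / N
    return 1.0 / N + R

d = 3
K = np.eye(d) / d                                   # symmetry operator I/d
states = [np.diag(np.eye(d)[x]) for x in range(d)]  # projectors |x><x|
priors = [1.0 / d] * d
p_guess = guessing_probability(K, priors, states)
```

Here each distance is ∥K − q_xρ_x∥₁ = (d − 1)/d, so the value returned agrees with tr[K] = 1, as expected for orthogonal states.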

3.3.2. General expression for probabilistic and physical models

The next form of guessing probability (for discrimination among states {q_x,ρ_x}^N_x=1) is obtained by the compatibility between ensemble steering on quantum states and the no-signaling condition on probability distributions of measurement outcomes. Then, as is shown in lemma 2.3, the guessing probability is given by

$\begin{equation} P_{\mathrm{guess}} = \frac{1}{p_1 + \cdots + p_{N}} \quad \mathrm{with}\; p_{{x}} = \frac{q_{{x}}}{P_{\mathrm{guess}}}\quad \mathrm{for}\;{x}=1,\ldots,N, \end{equation} \tag{ 24 }$

since the bound in equation (13) is tight. From the ensemble steering shown in equation (11), the guessing probability can be interpreted as follows. Each parameter p_x of {p_x}^N_x=1 in the above shows the probability that a party can prepare a quantum state ρ_x to the other party at a distance via the ensemble steering. Therefore, the guessing probability corresponds to the maximal average probability to prepare a set of chosen quantum states to a distant party via the ensemble steering.

Theorem 3.1 (Steerability and state discrimination). While ensemble steering is allowed within quantum theory, the steerability of a quantum state in the ensemble is generally proportional to the guessing probability for those states prepared in the ensemble.

3.3.3. Simplification for equal prior probabilities

Finally, for the ME discrimination for {q_x,ρ_x}^N_x=1, a significant simplification can be made when prior probabilities are equal, i.e. q_x = 1/N for all x. The symmetry operator in equation (8) shows that P_guess = tr[K] = q_x + r_x = q_y + r_y ∀x,y; since q_x = q_y = 1/N, it holds that r_x = r_y for all x,y = 1,...,N. From proposition 3.2, this means that the distances of given states from the symmetry operator are all equal, which simplifies the expression of R in equation (23). Thus, for convenience, let us write

$\begin{equation} r:= r_{{x}}\;~\forall~{x}=1,\ldots,N\quad \mathrm{and~then}\; r = R(K \| \{ 1/N ,\rho_{{x}} \}_{{x}=1}^{N} ) . \end{equation} \tag{ 25 }$

The problem of ME discrimination becomes even simpler. There is only a single parameter r to find, to solve optimal quantum state discrimination for the uniform prior.

Applying the geometric formulation shown in section 2.3, the geometrical meaning of the parameter r in equation (25) can be found in a simple way. From the KKT condition in equation (8), we have the following relation, see also equation (19):

$\begin{equation} \frac{1}{N} \rho_{{x}} - \frac{1}{N} \rho_{{y}} = r \sigma_{{y}} - r \sigma_{{x}}\quad \mathrm{and~then}\; r = \frac{ \| \frac{1}{N} \rho_{{x}} - \frac{1}{N} \rho_{{y}} \|_{1} }{ \| \sigma_{{x}} - \sigma_{{y}} \|_{1} }. \end{equation} \tag{ 26 }$

Then, the parameter r corresponds to the ratio between two polytopes in the state space, the given one $\mathcal {P}(\{ \frac {1}{N},\rho _{{x}} \}_{{x}=1}^{N} )$ constructed from given states and the other one $\mathcal {P}(\{ \sigma _{{x}} \}_{{x}=1}^{N} )$ only of complementary states.

Proposition 3.3. The guessing probability for quantum states given with the uniform prior is determined by only a single parameter as

$\begin{equation} P_{\mathrm{guess}} = \frac{1}{N} + r, \end{equation} \tag{ 27 }$

where r is the ratio between two polytopes, one from given states and the other from complementary states.
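The single-parameter form can be checked on a small example. The sketch below uses three pure qubit states whose Bloch vectors are 120° apart in a plane, with uniform prior; this choice of states, and the candidate K = I/3, are assumptions made here for illustration. The code checks the candidate against the conditions in equations (8) and (9) rather than taking it for granted.

```python
import numpy as np

X = np.array([[0, 1], [1, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)
I2 = np.eye(2, dtype=complex)

def trace_norm(A):
    # Valid for Hermitian A: sum of absolute eigenvalues.
    return np.abs(np.linalg.eigvalsh(A)).sum()

N = 3
angles = [2 * np.pi * x / N for x in range(N)]
rhos = [(I2 + np.sin(t) * X + np.cos(t) * Z) / 2 for t in angles]

K = I2 / 3                              # candidate symmetry operator (assumption)
comps = [K - rho / N for rho in rhos]   # r_x sigma_x from equation (8)
rs = [np.trace(c).real for c in comps]
sigmas = [c / r for c, r in zip(comps, rs)]
povm = [2 * rho / 3 for rho in rhos]    # candidate optimal POVM (assumption)

r = rs[0]
# Ratio of corresponding edges of the two polytopes, as in equation (26)
ratio = trace_norm(rhos[0] / N - rhos[1] / N) / trace_norm(sigmas[0] - sigmas[1])
p_guess = 1 / N + r                     # equation (27)
```

In this example all r_x coincide, the edge ratio of equation (26) reproduces r, and 1/N + r agrees with tr[K].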

4. Examples: solutions in qubit state discrimination

In this section, we apply general results shown so far to an arbitrary set of qubit states. Since qubits are the unit of quantum information processing, the results presented here are not only of theoretical interest as they characterize the quantum capabilities of state-discrimination-based tasks, but they are also useful for practical applications. Note that, for ME discrimination for qubit states, general solutions are known for two qubit states. For more than two states, optimal discrimination is known for qubit states containing some symmetry, such as a geometrically uniform structure. Recently, there have been analytical approaches to qubit state discrimination, one exploiting the dual problem in equation (7) [38] and the other from the complementarity problem that analyzes the KKT conditions in equations (8) and (9) [30]. In the latter, an analytic method is provided to solve ME discrimination for any set of qubit states with the uniform prior. In the following subsections, we present the optimal discrimination of qubit states when qubit states may not contain any symmetry among them. That is, we present the symmetry operator, complementary states and the optimal measurement.

Let us collect general results to be used for qubit state discrimination. For qubit states, a useful and clear geometric picture is available with the Bloch sphere, in which the state geometry is also clear with the Hilbert–Schmidt distance. The other useful fact is that for qubit states, the Hilbert–Schmidt distance is proportional to the trace distance. From this, the geometric formulation in section 2.3 can also be applied.

Lemma 4.1. For qubit states, the Hilbert–Schmidt and the trace distances, denoted by d_HS and d_T respectively, are related by a constant as follows:

$\begin{equation} d_{\rm HS}(\rho,\sigma) = \sqrt{2}\,d_{\rm T} (\rho,\sigma) \end{equation} \tag{ 28 }$

for any qubit states ρ and σ.
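The constant can be checked numerically. The sketch below assumes the conventions d_T(ρ,σ) = ½∥ρ − σ∥₁ and d_HS(ρ,σ) = (tr[(ρ − σ)²])^{1/2}; these normalizations are an assumption made here, and with the trace norm taken without the factor ½ the constant would instead be 1/√2. Function names are illustrative.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
PAULIS = np.array([[[0, 1], [1, 0]],
                   [[0, -1j], [1j, 0]],
                   [[1, 0], [0, -1]]], dtype=complex)

def qubit(v):
    # rho(v) = (I + v . sigma) / 2 for a Bloch vector v with |v| <= 1
    return 0.5 * (I2 + np.tensordot(v, PAULIS, axes=1))

def trace_distance(rho, sigma):
    # Assumed convention: half the trace norm of the difference
    return 0.5 * np.abs(np.linalg.eigvalsh(rho - sigma)).sum()

def hs_distance(rho, sigma):
    diff = rho - sigma
    return np.sqrt(np.trace(diff @ diff).real)

rng = np.random.default_rng(0)

def random_bloch_vector():
    v = rng.normal(size=3)
    return rng.uniform() * v / np.linalg.norm(v)

pairs = [(qubit(random_bloch_vector()), qubit(random_bloch_vector()))
         for _ in range(5)]
ratios = [hs_distance(a, b) / trace_distance(a, b) for a, b in pairs]
```

Under these conventions both distances are proportional to the Euclidean distance between the two Bloch vectors, which is why their ratio does not depend on the states.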

The lemma is useful when the trace distance between qubit states is computed via the geometry in the Bloch sphere: once the actual distance measured in the Bloch sphere is computed in the Hilbert–Schmidt norm via the geometry, the distance can be converted to the trace norm using lemma 4.1.

The next useful fact is the orthogonality relation. In the two-dimensional Hilbert space, if two non-negative and Hermitian operators O₁ and O₂ fulfill the orthogonality condition tr[O₁O₂] = 0, see equation (9), the only possibility is that the two operators are of rank-one and, once normalized, they are a resolution of the identity operator. That is, if O₁∝|ϕ〉〈ϕ| satisfies the orthogonality, then the other is uniquely determined O₂∝|ϕ^⊥〉〈ϕ^⊥|, since in two-dimensional space, the resolution of the identity is given as I = |ϕ〉〈ϕ| + |ϕ^⊥〉〈ϕ^⊥| for any vector |ϕ〉. This means that as long as the optimal strategy in state discrimination is not the application of the null measurement for some state (i.e. M_x = 0 for some x), optimal POVM elements and complementary states, that would fulfill the optimality condition in equation (9), each must be of rank-one and uniquely determined from the aforementioned relation in the above. All these are summarized in the following, exploiting the optimality condition in equation (9).

Lemma 4.2. The following are the conditions for optimal measurement and complementary states in optimal discrimination of qubit states:

1. If complementary states are not pure states, then the optimal strategy to have a minimal error is to apply the null measurement, i.e. no-measurement for some states gives an optimal strategy [30, 39].
2. For cases where the optimal strategy is not the null measurement, optimal POVM elements are of rank-one [30, 38] and are uniquely determined by complementary states that are also of rank-one [30].

Proof.

1. Let us decompose a complementary state σ_x in the following way: σ_x = s_x|φ_x〉〈φ_x| + (1 − s_x)|φ^⊥_x〉〈φ^⊥_x| with some |φ_x〉 and |φ^⊥_x〉 in the two-dimensional space. One can always find two orthogonal vectors to decompose a qubit state in this way. Suppose that 0 < s_x < 1 so that σ_x is not of rank-one. Then, suppose that there exists a POVM element M_x such that tr[M_xσ_x] = 0. Since M_x is non-negative, this means that 〈φ_x|M_x|φ_x〉 = 〈φ^⊥_x|M_x|φ^⊥_x〉 = 0. Since I = |φ_x〉〈φ_x| + |φ^⊥_x〉〈φ^⊥_x|, it follows that tr[M_x] = tr[M_xI] = 0. Since M_x is non-negative, we have M_x = 0, i.e. no-measurement.
2. Now it is clear that, when two nonzero positive operators, e.g. σ_x and M_x, fulfill the orthogonality relation, both of them must be of rank-one. Once a complementary state is given, the only choice for its corresponding POVM element that satisfies the orthogonality is where the POVM element is in the kernel of the complementary state $\mathcal {K}[\sigma _{{x}}]$ . Since $\dim \mathcal {H}=2$ and σ_x is of rank-one, it follows that $\dim \mathcal {K}[\sigma _{{x}}]=1$ . Thus, the POVM element is also of rank-one and uniquely determined in the one-dimensional kernel space of a complementary state.
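Both cases of the proof can be illustrated with a few lines of code. The helper `kernel_projector` and the two example states are assumptions chosen here for illustration; the helper returns the projector onto the kernel of a qubit state, which is the only direction a POVM element orthogonal to that state can take.

```python
import numpy as np

def kernel_projector(sigma, tol=1e-10):
    """Projector onto the kernel of a Hermitian, positive semidefinite matrix."""
    vals, vecs = np.linalg.eigh(sigma)
    null = vecs[:, vals < tol]       # eigenvectors with (numerically) zero eigenvalue
    return null @ null.conj().T      # zero matrix when sigma has full rank

sigma_pure = np.array([[1.0, 0.0], [0.0, 0.0]])    # rank-one complementary state
sigma_mixed = np.array([[0.7, 0.0], [0.0, 0.3]])   # full-rank complementary state

M_pure = kernel_projector(sigma_pure)
M_mixed = kernel_projector(sigma_mixed)
```

For the pure state the kernel is one-dimensional and the projector is of rank-one; for the mixed state the kernel is trivial, so the only positive operator orthogonal to it is the null element.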

The lemma shown above applies directly to the geometric formulation presented in section 2.3 for qubit states. Recall that complementary states for which the optimal POVM element is not the null one are of rank-one, that is, pure states. This means that these complementary states lie on the surface of the Bloch sphere; their polytope $\mathcal {P}(\{ \sigma _{{x}}\}_{{x}=1}^{N} )$ is therefore maximal within the Bloch sphere. The shape of the polytope $\mathcal {P}(\{ r_{{x}},\sigma _{{x}}\}_{{x}=1}^{N} )$ is also known to be congruent to the polytope $\mathcal {P}(\{ q_{{x}},\rho _{{x}}\}_{{x}=1}^{N} )$ of given states, see the formulation in section 2.3.

Furthermore, let us recall the following. For the uniform prior probabilities q_x = 1/N, as is shown in equation (27), there is only a single parameter to find in order to solve the optimal discrimination. The parameter is expressed by r in equation (27), corresponding to the ratio between two polytopes $\mathcal {P}(\{ \sigma _{{x}}\}_{{x}=1}^{N} )$ and $\mathcal {P}(\{ q_{{x}}=1/N,\rho _{{x}}\}_{{x}=1}^{N} )$ , see the expression in equation (26). Note also that r = r_x for all x = 1,...,N when finding the complementary states {r_x,σ_x}^N_x=1, as it is shown in equation (25).

Lemma 4.3. The guessing probability for qubit states given with the uniform prior probabilities is in the following form:

$\begin{eqnarray*} &&P_{\mathrm{guess}} = \frac{1}{N} + r \end{eqnarray*}$

with the parameter r, the ratio between two polytopes in the state space, $\mathcal {P}(\{ q_{{x}}=1/N, \rho _{{x}}\}_{{x}=1}^{N} )$ constructed from given states and its similar polytope that is also maximal in the Bloch sphere.

Proof. Two polytopes $\mathcal {P}(\{ q_{{x}}=1/N, \rho _{{x}}\}_{{x}=1}^{N} )$ and $\mathcal {P}(\{ r ,\sigma _{{x}}\}_{{x}=1}^{N} )$ are congruent and therefore two polytopes $\mathcal {P}(\{ q_{{x}}=1/N,\rho _{{x}}\}_{{x}=1}^{N} )$ and $\mathcal {P}(\{ \sigma _{{x}}\}_{{x}=1}^{N} )$ are similar with the ratio r. For qubit state discrimination, from lemma 4.2 it holds that complementary states for which the optimal POVM element is not the null one are of rank-one, lying on the surface of the Bloch sphere, and hence $\mathcal {P}(\{ \sigma _{{x}}\}_{{x}=1}^{N} )$ is maximal in the Bloch sphere. Therefore, the ratio r is given from $\mathcal {P}(\{ q_{{x}}=1/N,\rho _{{x}}\}_{{x}=1}^{N} )$ and its similar and maximal one $\mathcal {P}(\{ \sigma _{{x}}\}_{{x}=1}^{N} )$ . This completes the proof.

In what follows, we apply all these results to qubit state discrimination. We also write qubit states using Bloch vectors as $\rho _{{x}} = \rho (\vec {v}_{{x}}) = \frac {1}{2}(I + \vec {v}_{{x}} \cdot \vec {\sigma })$ , where $\vec {\sigma } = (X, Y, Z)$ are Pauli matrices.

4.1. Two states

When two quantum states are given, the state space spanned by them is effectively two-dimensional. Consequently, discrimination of them is equivalently restricted to two qubit states. This does not lose any generality. Then, for two-state discrimination {q_x,ρ_x}²_x=1, optimal discrimination was shown in [3],

$\begin{equation} P_{\mathrm{guess}} = \textstyle\frac{1}{2}(1+ \| q_1 \rho_{1} - q_{2}\rho_{2} \|_1 ). \end{equation} \tag{ 29 }$

We now reproduce the result using the geometric formalism presented in section 2.3.

The optimal discrimination is obtained if complementary states are found, since complementary states provide the symmetry operator that gives the guessing probability and also defines optimal measurements, see lemma 2.2. Thus, we now show how to find complementary states from given states. We first rewrite equation (19) for the two states

$\begin{equation} q_{1}\rho_1 - q_{2} \rho_2 = r_2\sigma_2 - r_1 \sigma_{1}, \end{equation} \tag{ 30 }$

where {r_x,σ_x}²_x=1 are the complementary states to find. Recall from lemma 4.2 that complementary states are of rank-one and thus lie on the surface of the Bloch sphere. Then, equation (30) means that the two polytopes (lines in this case) formed by given states and by complementary states, respectively, are congruent and also parallel. Thus, given the line defined by q₁ρ₁ − q₂ρ₂ in the Bloch sphere, two complementary states can be found by finding a diameter (since they are pure states) such that the diameter is parallel to the given line, see figure 1. Two parameters r₁ and r₂ can be found to satisfy the relation in equation (30). Then, the symmetry operator is found as q_xρ_x + r_xσ_x for x = 1,2, and optimal measurements are also defined in the diameter of complementary states as M₁∝σ₂ and M₂∝σ₁.

**Figure 1.** Discrimination of two states $\{q_{x},\rho_{x}\}_{x=1}^{2}$. In both figures (A) and (B), the given states q₁ρ₁ and q₂ρ₂ are depicted by the two lines OX₁ and OX₂, respectively. The line OK corresponds to the symmetry operator, which is the same in both figures; therefore the guessing probability is the same for the two cases, i.e. (A) and (B) are in the same equivalence class. From the geometric formulation in section 2.3, complementary states can be found as a polytope congruent to the polytope X₁X₂ of the given states. From lemma 4.2, complementary states lie on the surface. Therefore, R₁R₂ is the polytope congruent to X₁X₂; then OC₁ and OC₂ are the complementary states σ₁ and σ₂, respectively, from which the optimal POVM elements for states ρ₁ and ρ₂ are OC₂ and OC₁, respectively. The KKT condition in equation (8) holds as OK = OX₁ + OR₁ = OX₂ + OR₂, and the other in equation (9) as OC₁⊥OC₂.

From the geometric method, the guessing probability can be written explicitly as follows. Note that, from equation (30), (i) tr[q₁ρ₁ − q₂ρ₂] = q₁ − q₂ and (ii) r₁ + r₂ = ∥r₁σ₁ − r₂σ₂∥₁, since tr[σ₁σ₂] = 0, i.e. the two complementary states are orthogonal. Consequently, we have that

$\begin{eqnarray*} (i) ~ r_1 - r_2 &=& q_2 - q_1, \\ (ii)~ r_1 + r_2 & = & \| r_1 \sigma_{1} - r_2\sigma_2\|_1 = \| q_{1}\rho_1 - q_{2} \rho_2 \|_1. \end{eqnarray*}$

Thus, for x = 1,2

$\begin{equation} r_{{x}} = \textstyle\frac{1}{2} (\| q_1 \rho_1 - q_2 \rho_2\| + (-1)^{{x}}(q_{1} - q_{2}) ). \end{equation} \tag{ 31 }$

Thus, from the general expression shown in equation (22) for N = 2, the guessing probability is

$\begin{eqnarray} &&P_{\mathrm{guess}} = \mathrm{tr}[K] = \textstyle\frac{1}{2} + \textstyle\frac{1}{2} ( r_1 +r_2) = \textstyle\frac{1}{2} (1 + \| q_1 \rho_1 - q_2 \rho_2\|)~~~\mathrm{with} \nonumber \\ && K = \textstyle\frac{1}{2} (q_1 \rho_1 + q_2 \rho_2) + \textstyle\frac{1}{2} (r_1 \sigma_1 + r_2 \sigma_2). \end{eqnarray} \tag{ 32 }$

Thus, it is shown that the Helstrom bound is reproduced.
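The construction above can be checked numerically. The following is a minimal sketch, assuming NumPy and hypothetical example data (the priors `q1`, `q2` and Bloch vectors `v1`, `v2` are chosen only for illustration, in the regime where both r₁ and r₂ are positive). It builds the complementary states on the diameter parallel to q₁v₁ − q₂v₂, takes r₁ and r₂ from equation (31), and verifies equation (30), the equality of the two expressions for the symmetry operator, and the agreement of tr[K] with equation (29).

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
X = np.array([[0, 1], [1, 0]], dtype=complex)
Y = np.array([[0, -1j], [1j, 0]], dtype=complex)
Z = np.array([[1, 0], [0, -1]], dtype=complex)

def rho_from_bloch(v):
    """Density matrix (I + v . sigma) / 2."""
    return 0.5 * (I2 + v[0] * X + v[1] * Y + v[2] * Z)

def trace_norm(a):
    """Sum of absolute eigenvalues of a Hermitian matrix."""
    return float(np.sum(np.abs(np.linalg.eigvalsh(a))))

# Hypothetical example data: priors and Bloch vectors of the two states.
q1, q2 = 0.6, 0.4
v1 = np.array([0.0, 0.0, 1.0])
v2 = np.array([0.8, 0.0, -0.2])
rho1, rho2 = rho_from_bloch(v1), rho_from_bloch(v2)

delta = q1 * rho1 - q2 * rho2
norm = trace_norm(delta)

# Equation (31): r_x = (norm + (-1)^x (q1 - q2)) / 2.
r1 = 0.5 * (norm - (q1 - q2))
r2 = 0.5 * (norm + (q1 - q2))

# Complementary states: the diameter parallel to q1*v1 - q2*v2.
n = q1 * v1 - q2 * v2
n = n / np.linalg.norm(n)
sigma1, sigma2 = rho_from_bloch(-n), rho_from_bloch(n)

# Equation (30), and the symmetry operator written from x = 1 and x = 2.
assert np.allclose(delta, r2 * sigma2 - r1 * sigma1)
K1 = q1 * rho1 + r1 * sigma1
K2 = q2 * rho2 + r2 * sigma2
assert np.allclose(K1, K2)

# Guessing probability three ways: tr[K], equation (29), and the
# measurement M1 = sigma2, M2 = sigma1 (normalised since they sum to I).
p_from_K = float(np.real(np.trace(K1)))
p_helstrom = 0.5 * (1.0 + norm)
p_from_povm = float(np.real(q1 * np.trace(rho1 @ sigma2) + q2 * np.trace(rho2 @ sigma1)))
print(p_from_K, p_helstrom, p_from_povm)
```

If the priors are so unbalanced that |q₁ − q₂| exceeds |q₁v₁ − q₂v₂|, one of the r_x in equation (31) vanishes and the diameter construction used in this sketch no longer applies as written.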

In particular, when q₁ = q₂ = 1/2, from proposition 3.3 it follows that r = r₁ = r₂ = ∥ρ₁ − ρ₂∥/4. Then, from the expression in equation (32), the symmetry operator has an even simpler form

$\begin{eqnarray*} &&K = \textstyle\frac{1}{2}\left(\textstyle\frac{1}{2}\rho_{1} + \textstyle\frac{1}{2}\rho_2\right) + \textstyle\frac{1}{2} (rI), \end{eqnarray*}$

since the two complementary states correspond to a diameter and their average is simply proportional to the identity.

In this case, let us also show how the formulation in lemma 4.3 can be applied. First, note again that the polytope constructed by given states is the line q₁ρ₁ − q₂ρ₂ in the Bloch sphere. Since it is a line, the largest polytope similar to the line is clearly a diameter that has the length 2 in trace norm. The parameter r in equation (27) that we look for is the ratio between two lines,

$\begin{eqnarray*} &&\left\| \textstyle\frac{1}{2}\rho_1 -\textstyle\frac{1}{2}\rho_2\right \| : 2 = r :1\quad \mathrm{and~thus}\; r= \textstyle\frac{1}{2}\left \| \textstyle\frac{1}{2}\rho_1 -\textstyle\frac{1}{2}\rho_2 \right\|. \end{eqnarray*}$

With the above, the guessing probability is P_guess = 1/2 + r, which reproduces the Helstrom bound in equation (29) when q₁ = q₂ = 1/2.
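As a small worked instance of this ratio, consider the hypothetical pair ρ₁ = |0〉〈0| and ρ₂ = |+〉〈+| with equal priors (these two states are chosen here only for illustration). Their Bloch vectors are (0,0,1) and (1,0,0), and for qubits the trace norm of ρ₁ − ρ₂ equals the Euclidean distance between the Bloch vectors:

```latex
\begin{eqnarray*}
\| \rho_1 - \rho_2 \|_1 &=& | \vec{v}_1 - \vec{v}_2 | = \sqrt{2}, \\
r &=& \textstyle\frac{1}{2} \left\| \textstyle\frac{1}{2}\rho_1 - \textstyle\frac{1}{2}\rho_2 \right\|_1
   = \textstyle\frac{1}{4} \sqrt{2}, \\
P_{\mathrm{guess}} &=& \textstyle\frac{1}{2} + r
   = \textstyle\frac{1}{2} + \textstyle\frac{\sqrt{2}}{4} \approx 0.854 .
\end{eqnarray*}
```

This agrees with equation (29) evaluated at q₁ = q₂ = 1/2.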

4.2. Three states

We now move to cases of three qubit states. For three states, no general solution for ME discrimination has been known to date, apart from specific cases where they are symmetric. Here, for three states given with equal prior probabilities, we apply lemma 4.3 and show how to find optimal discrimination.

4.2.1. Three-state example I: isosceles triangles

We first suppose that three pure states lying on a half-plane are given with equal prior probabilities 1/3, i.e. $\{1/3,\rho_{x} = |\psi_{x}\rangle\langle\psi_{x}|\}_{x=1}^{3}$. In particular, we suppose that the polytope constructed from the three states forms an isosceles triangle on a half-plane of the Bloch sphere. For convenience, we parameterize the three states as follows, for some θ₀:

$\begin{eqnarray} |\psi_1\rangle & = & \cos \textstyle\frac{1}{2}(\theta_0 + \theta ) |0\rangle + \sin \textstyle\frac{1}{2} (\theta_0 + \theta) |1\rangle, \nonumber\\ |\psi_2\rangle & = & \cos \textstyle\frac{1}{2} \theta_0 |0\rangle + \sin \textstyle\frac{1}{2}\theta_0 |1\rangle, \nonumber\\ |\psi_3\rangle & = & \cos \textstyle\frac{1}{2} (\theta_0 - \theta ) |0\rangle + \sin \textstyle\frac{1}{2} (\theta_0 - \theta) |1\rangle, \end{eqnarray} \tag{ 33 }$

where we suppose θ∈[0,π/2], see figure 2.

We now apply lemma 4.3 to discriminate among these three states, referring to the geometry shown in figure 2. The maximal polytope similar to the given one X₁X₂X₃ is S₁S₂S₃. There are many ways of placing the maximal polytope in the plane; however, it must be placed as shown in the figure to fulfill the optimality condition in equation (19), or equivalently equation (8). Let us first compute the guessing probability, for which we need the ratio r between the two polytopes. It corresponds to the ratio X₁X₃/S₁S₃: the side X₁X₃ has length (2 sin θ)/3 and S₁S₃ is a diameter of length 2, so r = (sin θ)/3. Therefore, the guessing probability is

$\begin{equation} P_{\mathrm{guess}} = \textstyle\frac{1}{3} + \textstyle\frac{1}{3} \sin \theta. \end{equation} \tag{ 34 }$

In figure 2, the guessing probability for the three states is plotted as the angle θ varies. One can easily find that for θ ⩾ π/2 the ratio is r = 1/3, and thus the guessing probability is 2/3.

Let us then find the optimal measurement. Note that OS₁, OS₂ and OS₃ correspond to complementary states, σ₁, σ₂ and σ₃, respectively. It is shown that σ₂ is not a pure state, i.e. not of rank-one. This means that, to fulfill the optimality condition in equation (9), the optimal POVM element is the null measurement, i.e. M₂ = 0, no-measurement on the state. The other complementary states can be written explicitly as follows:

$\begin{eqnarray}&&\begin{array}{@{}c@{}} \sigma_{1} = | \varphi_1 \rangle \langle \varphi_1|,\quad |\varphi_1\rangle = \cos \left(\displaystyle\frac{\theta_0}{2} - \frac{\pi}{4}\right) |0\rangle + \sin \left(\displaystyle\frac{\theta_0}{2} - \displaystyle\frac{\pi}{4}\right) |1\rangle, \\ \sigma_{3} = | \varphi_3 \rangle \langle \varphi_3|,\quad |\varphi_3\rangle = \cos \left(\displaystyle\frac{\theta_0}{2} + \frac{\pi}{4}\right) |0\rangle + \sin \left(\displaystyle\frac{\theta_0}{2} + \frac{\pi}{4}\right)|1\rangle \end{array} \end{eqnarray} \tag{ 35 }$

and the optimal POVM elements are

$\begin{eqnarray*} &&M_{1} = | \varphi_{3} \rangle \langle \varphi_3|, ~M_{2}=0,~M_{3} = | \varphi_{1} \rangle \langle \varphi_1|,\quad \mathrm{so~that}\; \sum_{{x}=1}^{3} M_{{x}} = I. \end{eqnarray*}$

One can see that with these POVMs, the guessing probability is obtained

$\begin{eqnarray*} &&P_{\mathrm{guess}}= \textstyle\frac{1}{3} \mathrm{tr}[ \rho_{1} M_{1}] + \frac{1}{3} \mathrm{tr}[ \rho_{2} M_{2}] + \textstyle\frac{1}{3} \mathrm{tr}[ \rho_{3} M_{3}] = \textstyle\frac{1}{3} + \textstyle\frac{1}{3} \sin\theta \end{eqnarray*}$

as shown in equation (34). The optimal POVM elements show that, for discrimination among the three states given with probability 1/3 each, the optimal strategy corresponds to a measurement setting in which the device responds only to the two most distant states out of the three.
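The isosceles example can be verified numerically. The sketch below, assuming NumPy, builds the states of equation (33) and the complementary states of equation (35), and checks three things: the POVM elements sum to the identity, the resulting guessing probability matches equation (34), and the operator K = ρ_x/3 + rσ_x is the same when written from x = 1 and from x = 3. The function name `isosceles_check` and the sample angles are illustrative. The final return value is the smallest eigenvalue of K − ρ_x/3 over all x; that this must be non-negative is the standard positivity condition for an optimal measurement, stated here as an assumption rather than quoted from the text.

```python
import numpy as np

def proj(alpha):
    """Projector onto cos(alpha/2)|0> + sin(alpha/2)|1>, as in equation (33)."""
    k = np.array([np.cos(alpha / 2), np.sin(alpha / 2)])
    return np.outer(k, k)

def isosceles_check(theta0, theta):
    """Return (P_guess from the POVM, tr[K], min eigenvalue of K - rho_x/3)."""
    rho = [proj(theta0 + theta), proj(theta0), proj(theta0 - theta)]
    # Complementary states of equation (35): Bloch angles theta0 -/+ pi/2.
    sigma1 = proj(theta0 - np.pi / 2)
    sigma3 = proj(theta0 + np.pi / 2)
    # POVM: M1 = sigma3, M2 = 0 (null element), M3 = sigma1.
    M = [sigma3, np.zeros((2, 2)), sigma1]
    assert np.allclose(sum(M), np.eye(2))
    p_guess = sum(np.trace(r @ m) for r, m in zip(rho, M)) / 3
    # Symmetry operator written from x = 1 and from x = 3 must agree.
    r = np.sin(theta) / 3
    K = rho[0] / 3 + r * sigma1
    assert np.allclose(K, rho[2] / 3 + r * sigma3)
    min_eig = min(np.linalg.eigvalsh(K - rx / 3).min() for rx in rho)
    return p_guess, np.trace(K), min_eig

theta0, theta = 0.3, 0.7
p_guess, tr_K, min_eig = isosceles_check(theta0, theta)
print(p_guess, 1 / 3 + np.sin(theta) / 3, min_eig)
```

For θ beyond π/2 the same check returns a negative minimum eigenvalue, which is consistent with the statement that the guessing probability saturates at 2/3 there instead of following equation (34).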

4.2.2. Three-state example II: geometrically uniform states

The example in the previous subsection can be extended to the so-called geometrically uniform states [23, 27], which correspond to the case θ = 2π/3. It is straightforward to apply the method shown above, based on lemma 4.3; the guessing probability is then 2/3. In fact, the guessing probability is P_guess = 2/3 for any three pure (qubit) states given with equal probabilities if their polytope in the Bloch sphere contains the origin.

4.2.3. Three-state example III: arbitrary triangles on a half-plane

We now consider a set of three pure states given with equal probabilities 1/3 and suppose that the polytope constructed from the three states forms an arbitrary triangle in a half-plane of the Bloch sphere: $\{1/3,\rho_{x} = |\psi_{x}\rangle\langle\psi_{x}|\}_{x=1}^{3}$. In contrast to the isosceles triangles considered above, we also suppose that the polytope of the given states contains the origin of the Bloch sphere, see figure 3.

To apply the geometric formulation in lemma 4.3, we also refer to figure 3. Note that the three states ρ_x/3 for x = 1,2,3 correspond to OX_x, x = 1,2,3, respectively; thus the polytope of given states is the triangle X₁X₂X₃. Then, the next thing to do is to find a maximal polytope similar to X₁X₂X₃ within the Bloch sphere. The triangle S₁S₂S₃ in figure 3 could be a possibility. The ratio r corresponds to, for instance, X₁X₂/S₁S₂, which is also equal to OX₁/OS₁: therefore r = 1/3, then the guessing probability is

$\begin{eqnarray*} &&P_{\mathrm{guess}} = \textstyle\frac{1}{3} + r = \textstyle\frac{2}{3}. \end{eqnarray*}$

Note that this is the guessing probability for many cases of three pure states. This can be generalized to cases where three states have the same purity, as follows.

Remark 4.1. Suppose that three qubit states having an equal purity denoted by f are given with equal prior probabilities 1/3. Here, the purity corresponds to the norm of a Bloch vector. Then, if the polytope of the three states in the Bloch sphere contains the origin, the guessing probability is

$\begin{eqnarray*} &&P_{\mathrm{guess}} = \textstyle\frac{1}{3} + \textstyle\frac{1}{3}f. \end{eqnarray*}$

This is independent of the other details of given quantum states, e.g. the angles between them.
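The value in remark 4.1 can be reproduced with an explicit measurement. The sketch below, assuming NumPy, is an illustration and not the construction of the text: it takes POVM elements M_x = c_x·(projector along the Bloch direction of ρ_x), with non-negative weights c_x fixed by Σ c_x n_x = 0 and Σ c_x = 2 so that the elements sum to the identity. Such weights exist when the origin lies inside the polytope of the states. The helper name `povm_along_states`, the purity `f = 0.8`, and the three angles are hypothetical choices.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
PAULI = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def rho_from_bloch(v):
    """Density matrix (I + v . sigma) / 2."""
    return 0.5 * (I2 + np.einsum("i,ijk->jk", np.asarray(v, dtype=float), PAULI))

def povm_along_states(directions):
    """Weights c_x >= 0 with sum_x c_x n_x = 0 and sum_x c_x = 2.

    Returns (c, [c_x * projector(n_x)]). Intended for up to four
    affinely independent unit directions; raises if no valid weights exist.
    """
    n = np.asarray(directions, dtype=float)          # shape (N, 3), unit rows
    A = np.vstack([n.T, np.ones(len(n))])            # balance + normalisation
    b = np.array([0.0, 0.0, 0.0, 2.0])
    c, *_ = np.linalg.lstsq(A, b, rcond=None)
    if not np.allclose(A @ c, b) or np.any(c < -1e-12):
        raise ValueError("origin is not inside the polytope of the states")
    return c, [ci * rho_from_bloch(ni) for ci, ni in zip(c, n)]

# Hypothetical example: three states of equal purity f in the x-z plane,
# at angles chosen so that the triangle contains the origin.
f = 0.8
angles = np.array([0.2, 2.5, 4.4])
dirs = np.stack([np.sin(angles), np.zeros(3), np.cos(angles)], axis=1)
states = [rho_from_bloch(f * d) for d in dirs]

c, M = povm_along_states(dirs)
p_guess = sum(np.real(np.trace(r @ m)) for r, m in zip(states, M)) / 3
print(p_guess, 1 / 3 + f / 3)
```

Changing the angles while keeping the origin inside the triangle changes the weights c_x but leaves the printed guessing probability at 1/3 + f/3, which illustrates the independence from the angles stated in the remark.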

The triangle S₁S₂S₃ in figure 3 does not show complementary states yet, since the optimality condition in equation (19), or equivalently equation (8), is not fulfilled. Therefore, one has to put or rotate S₁S₂S₃ to C₁C₂C₃ in figure 3, so that equation (19) holds true: X_xX_y∥C_yC_x for all x,y = 1,2,3, where ∥ means that two lines are parallel. Then, all of the optimality conditions are satisfied.

Complementary states are then those states that correspond to OC_x for x = 1,2,3. It also follows that optimal POVM elements are OP_x for x = 1,2,3 as each of them is orthogonal to its corresponding complementary state, see the optimality condition in equation (9). Thus, optimal POVM elements are {M_x∝OP_x}³_x=1. Since the convex hull P₁P₂P₃ contains the origin, it is also straightforward to have the completeness, $\sum _{{x}=1}^{3} M_{{x}} =I$ . Here, it is noteworthy that optimal POVMs can be in vectors which are not parallel to corresponding states: in figure 3, we found that $OX_{x} \nparallel OP_{x}$ for some x.

4.2.4. Three-state example IV: arbitrary triangles

Finally, let us show how the geometric formulation in lemma 4.3 can be generally applied to a set of arbitrary three qubit states when they are given with equal probabilities, {1/3,ρ_x}³_x=1. We explain this, referring to figure 4. Arbitrary three states are depicted by OX_x for x = 1,2,3 respectively.

**Figure 4.** Arbitrary three qubit states are given with equal prior probabilities 1/3. They are denoted by OX_x for x = 1,2,3. The polytope of given states X₁X₂X₃ defines a plane; then the complementary states and optimal POVM elements lie on the half-plane which is parallel to the defined plane. See section 4.2.4 for more details.

Once they are given, they define a plane of the triangle X₁X₂X₃ in the Bloch sphere and there exists a half-plane parallel to the plane. To find the guessing probability for the states, one has to first find a maximal triangle that is similar to the triangle X₁X₂X₃ in the Bloch sphere. Then, the maximal one must lie on the half-plane parallel to X₁X₂X₃, to fulfill the optimality condition in equation (19), or equivalently equation (8). Then, the maximal one is rotated such that the condition in equation (19) is fulfilled and finally ends up with C₁C₂C₃. Then, complementary states are immediately found as OC_x for x = 1,2,3, from which optimal measurement also follows as those POVM elements orthogonal to complementary states.

4.3. Four states

We now consider discrimination among four qubit states. We begin with a simple case: pairs of orthogonal states. Then, cases where four states define a quadrilateral are considered. Finally, we also show how the geometric formulation can be applied when four states form a tetrahedron in the Bloch sphere.

4.3.1. Four-state example I: rectangles

Let us first consider pairs of orthogonal states, say {1/4,ρ_x = |ψ_x〉〈ψ_x|}⁴_x=1, where 〈ψ₁|ψ₃〉 = 〈ψ₂|ψ₄〉 = 0. The four states define a plane in the Bloch sphere and then form a rectangle on the plane. To be explicit, they can be written as follows, for some θ₀ and θ:

$\begin{eqnarray} |\psi_1\rangle & = & \cos \frac{\theta_0}{2} |0\rangle + \sin\frac{\theta_0}{2} |1\rangle, \nonumber \\ |\psi_2\rangle & = & \cos \left(\frac{\theta_0}{2} - \theta\right) |0\rangle + \sin \left(\frac{\theta_0}{2} -\theta\right) |1\rangle, \nonumber \\ |\psi_3\rangle & = & \cos \left(\frac{\theta_0}{2} - \frac{\pi}{2}\right) |0\rangle + \sin \left( \frac{\theta_0}{2} - \frac{\pi}{2} \right) |1\rangle, \nonumber \\ |\psi_4\rangle & = & \cos \left(\frac{\theta_0}{2} -\theta- \frac{\pi}{2}\right) |0\rangle + \sin \left( \frac{\theta_0}{2} -\theta- \frac{\pi}{2} \right) |1\rangle, \end{eqnarray} \tag{ 36 }$

which are shown in figure 5. The half-plane in which the four states lie is then defined in the Bloch sphere.

Referring to figure 5, we apply the geometric formulation in lemma 4.3 and show the optimal parameters, optimal measurement and complementary states. The polytope of given states is the rectangle X₁X₂X₃X₄. It is straightforward to see that the largest one similar to X₁X₂X₃X₄ is the rectangle having its diagonal in length 2 (in terms of trace-norm). Then, we rotate it so that the relation in equation (19), or the optimality condition in equation (8), is satisfied; then we have the rectangle C₁C₂C₃C₄. The complementary states are found as OC_x for x = 1,2,3,4, from which optimal POVM elements are also obtained {M_x∝OM_x}⁴_x=1. Therefore, we have optimal parameters as follows:

$\begin{eqnarray}&&\begin{array}{@{}c@{}} M_{1} = \sigma_{3} = | \psi_1 \rangle \langle \psi_1 |,\quad M_{2} = \sigma_{4} = | \psi_2 \rangle \langle \psi_2 |, \\ M_{3} = \sigma_{1} = | \psi_3 \rangle \langle \psi_3 |, \quad M_{4} = \sigma_{2} = | \psi_4 \rangle \langle \psi_4 |.\end{array} \end{eqnarray} \tag{ 37 }$

From these, the guessing probability can be obtained. Alternatively, applying lemma 4.3, one can read off the ratio, for instance,

$\begin{equation} r = \frac{ X_1 X_2 }{ C_1 C_2} = \frac{OX_1 }{ OC_1} = \frac{1}{4}\quad \mathrm{and~thus} \;P_{\mathrm{guess}} = \frac{1}{4} + r= \frac{1}{2}. \end{equation} \tag{ 38 }$

We remark that, as shown above, the guessing probability takes the same value for every angle in the range 0 ⩽ θ ⩽ π/2.

The analysis above implies that the guessing probability does not depend on the detailed relations among the given quantum states but on a property assigned to the set as a whole. This shows the freedom of choosing four states such that the guessing probability does not change. More precisely, this can be seen in terms of the symmetry operator, which in this case is

$\begin{equation} K = \textstyle\frac{1}{4} \rho_{{x}} + r \sigma_{{x}} = \textstyle\frac{1}{4} I \end{equation} \tag{ 39 }$

consistently with equation (38), since tr[K] = 1/2. That is, all sets of four states consisting of two pairs of orthogonal states share the same symmetry operator in equation (39), meaning that all of them are in the same equivalence class, see definition 3.1.

Remark 4.2. Four states of any two pairs of orthogonal qubit states are in the same equivalence class, $\mathcal {A}_{I/4}$ .
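The rectangle example can be checked directly from equations (36) to (38). The sketch below, assuming NumPy, builds the four states, pairs each with its orthogonal partner as the complementary state (following equation (37)), and evaluates ρ_x/4 + rσ_x with r = 1/4 for every x. It also evaluates the guessing probability with the normalised POVM M_x = ρ_x/2; the factor 1/2 is an assumption made here so that the four elements sum to the identity, since equation (37) fixes the elements only up to proportionality. The function name `rectangle_check` and the sample angles are illustrative.

```python
import numpy as np

def proj(half_angle):
    """Projector onto cos(a)|0> + sin(a)|1>, a being the half angle in equation (36)."""
    k = np.array([np.cos(half_angle), np.sin(half_angle)])
    return np.outer(k, k)

def rectangle_check(theta0, theta):
    """Return (list of K written from each x, POVM elements, guessing probability)."""
    half = [theta0 / 2,
            theta0 / 2 - theta,
            theta0 / 2 - np.pi / 2,
            theta0 / 2 - theta - np.pi / 2]
    rho = [proj(a) for a in half]
    # Orthogonal partners: (1, 3) and (2, 4) are the orthogonal pairs.
    sigma = [rho[2], rho[3], rho[0], rho[1]]
    r = 0.25
    K = [rx / 4 + r * sx for rx, sx in zip(rho, sigma)]
    # Normalised POVM (assumption of this sketch): M_x = rho_x / 2.
    M = [rx / 2 for rx in rho]
    p_guess = sum(np.trace(rx @ mx) for rx, mx in zip(rho, M)) / 4
    return K, M, p_guess

K, M, p_guess = rectangle_check(theta0=0.4, theta=1.1)
print(np.round(K[0], 12), p_guess)
```

For any choice of θ₀ and θ the operator comes out the same for every x, is proportional to the identity, and has trace 1/2, which is the sense in which all such sets fall into one equivalence class.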

4.3.2. Four-state example II: quadrilateral

Next, let us consider arbitrary four qubit states defined on a half-plane, which are given with the equal probabilities 1/4. For generality, we do not assume any internal symmetry among given states {1/4,ρ_x}⁴_x=1 except the assumption that they are on a half-plane in the Bloch sphere. It is then straightforward to generalize this to arbitrary four states defined on any plane in the sphere.

We now refer to figure 6 to show how the geometric formulation in lemma 4.3 can be applied to those four states. The polytope of four states is shown as the quadrilateral X₁X₂X₃X₄ where each vertex corresponds to ρ_x/4 for x = 1,2,3,4. Then, one has to expand the given quadrilateral so that they are maximal in the Bloch sphere. In this way, the ratio between two quadrilaterals can be found.

To find complementary states, one has to rotate the obtained maximal quadrilateral so that the optimality condition in equation (19) is fulfilled. The resulting quadrilateral, from which complementary states can be found, is then C₁C₂C₃C₄, where OC_x for x = 1,2,3,4 are complementary states. Since a given quadrilateral X₁X₂X₃X₄ is arbitrarily shaped, it can happen that some vertices do not lie on the surface of the Bloch sphere. As depicted in figure 6, suppose that OC₁ cannot lie on the surface of the sphere. This means that the complementary state σ₁ is not pure; thus the POVM element for state ρ₁ corresponds to the null measurement, M₁ = 0. This follows from lemma 4.2, in order to fulfill equation (9). The optimal discrimination strategy for these states is thus to prepare a measurement with three kinds of outcomes, from M_x with x = 2,3,4. Note also that, as the convex hull of the POVM elements M₂M₃M₄ contains the origin, one can also construct a complete measurement.

4.3.3. Four-state example III: tetrahedron

We now consider four qubit states $\{\rho_{x}\}_{x=1}^{4}$ which form a polytope with non-zero volume in the Bloch sphere, see figure 7. Suppose that they are given with probability 1/4 each and, for convenience, assume that the tetrahedron constructed from the four states is covered by a sphere. Later, this can be relaxed and generalized to cases where not all vertices of the tetrahedron touch a sphere. Here, since the tetrahedron is covered by a sphere, the four states have the same purity, which we denote by f.

Referring to figure 7, let us show how to find the guessing probability, complementary states and optimal measurement. The guessing probability immediately follows by applying lemma 4.3. The ratio r can be computed by finding the ratio of two diameters of two spheres: one is the sphere covering the given tetrahedron and the other the Bloch sphere covering a maximal tetrahedron similar to the given one. As the purity is given by f, the guessing probability is given by

$\begin{eqnarray*} &&P_{\mathrm{guess}} = \textstyle\frac{1}{4} + \textstyle\frac{1}{4}f, \end{eqnarray*}$

which does not depend on detailed relations among given states, such as angles between given states. The above holds true for any four states whose tetrahedron in the Bloch sphere can be covered by a single sphere.

Then, one rotates the maximal tetrahedron within the Bloch sphere such that the resulting one satisfies the optimality condition in equation (19). Since the tetrahedron already fully occupies the Bloch sphere, it is not difficult to see that the resulting tetrahedron is related to the given tetrahedron by reflection through the origin of the Bloch sphere, see figure 7. Each vertex OC_x for x = 1,2,3,4 corresponds to a complementary state; the optimal POVM elements are those rank-one operators orthogonal to the complementary states.
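The reflected tetrahedron can be checked numerically. The sketch below, assuming NumPy, takes four hypothetical Bloch directions whose convex hull contains the origin, gives all four states the same purity f, and uses the antipodal pure states as complementary states with r = f/4. It verifies that corresponding edges of the two tetrahedra are congruent and anti-parallel, that ρ_x/4 + rσ_x is the same operator for every x, and that its trace equals 1/4 + f/4. The directions and the value of f are illustrative assumptions.

```python
import numpy as np

I2 = np.eye(2, dtype=complex)
PAULI = np.array([
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
], dtype=complex)

def rho_from_bloch(v):
    """Density matrix (I + v . sigma) / 2."""
    return 0.5 * (I2 + np.einsum("i,ijk->jk", np.asarray(v, dtype=float), PAULI))

# Hypothetical tetrahedron: four directions whose convex hull contains the origin.
dirs = np.array([[0.0, 0.0, 1.0],
                 [0.9, 0.0, -0.4],
                 [-0.5, 0.8, -0.3],
                 [-0.4, -0.8, -0.5]])
dirs = dirs / np.linalg.norm(dirs, axis=1, keepdims=True)

f = 0.7                                        # common purity (Bloch-vector norm)
rho = [rho_from_bloch(f * n) for n in dirs]    # given states
sigma = [rho_from_bloch(-n) for n in dirs]     # reflected, pure complementary states
r = f / 4

# Corresponding edges of the two tetrahedra are congruent and anti-parallel.
for x in range(4):
    for y in range(4):
        assert np.allclose((rho[x] - rho[y]) / 4, r * (sigma[y] - sigma[x]))

# The symmetry operator is the same when written from any x.
K = [rx / 4 + r * sx for rx, sx in zip(rho, sigma)]
p_guess = float(np.real(np.trace(K[0])))
print(p_guess, 0.25 + 0.25 * f)
```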

4.4. Etc

So far, we have considered discrimination of two, three and four qubit states. In this way, for any number of qubit states given, one can apply the geometric formulation and then find the guessing probability, complementary states and optimal measurement. The framework can be summarized as follows. The first stage is to construct a polytope of given states; then one has to search for the polytope of complementary states such that the optimality condition in equation (19), or equivalently equation (8), is fulfilled. The two resulting polytopes must be congruent and the corresponding labeled lines from the respective polytopes are anti-parallel. For cases when measurement is applied, its POVM element is of rank-one and also corresponding complementary states must be of rank-one, that is, pure states. If it is found that complementary states are not of rank-one, then their POVM elements are the null operator and thus the optimal strategy for those states is to apply no-measurement. For implementation in the experiment, this means that no output port is needed for those states.

As we have also shown, there is a further simplification in optimal discrimination when the polytope of quantum states given with equal prior probabilities is tightly covered by a sphere, such that all vertices of the polytope touch the sphere. Then, the guessing probability simply follows from the ratio between diameters of two spheres, one from the sphere covering a given polytope and the other the Bloch sphere. In this case, it is explicitly shown that the guessing probability does not depend on detailed relations of given states, such as the angles between any two given states.

Remark 4.3. For N qubit states given with equal prior probabilities 1/N, if their polytope in the Bloch sphere contains the origin of the Bloch sphere and is covered by a smaller sphere of radius r such that each vertex touches the sphere, the guessing probability for those states is given by P_guess = 1/N + r, independently of the detailed relations among the given states.
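The remark can be checked in a few lines under two assumptions made here for illustration: the covering sphere is centred at the origin, so that every state has a Bloch vector $f\hat{n}_{x}$ of the same norm f and r = f/N, and the complementary states are the antipodal pure states. This is a sketch under those assumptions, not a general proof.

```latex
\begin{eqnarray*}
\rho_{x} &=& \textstyle\frac{1}{2}\left(I + f\,\hat{n}_{x}\cdot\vec{\sigma}\right), \qquad
\sigma_{x} = \textstyle\frac{1}{2}\left(I - \hat{n}_{x}\cdot\vec{\sigma}\right), \qquad
r = \textstyle\frac{f}{N}, \\
K &=& \textstyle\frac{1}{N}\rho_{x} + r\,\sigma_{x}
  = \textstyle\frac{1}{2}\left(\textstyle\frac{1}{N} + r\right) I
  + \textstyle\frac{1}{2}\left(\textstyle\frac{f}{N} - r\right)\hat{n}_{x}\cdot\vec{\sigma}
  = \textstyle\frac{1}{2}\left(\textstyle\frac{1}{N} + r\right) I, \\
P_{\mathrm{guess}} &=& \mathrm{tr}[K] = \textstyle\frac{1}{N} + r .
\end{eqnarray*}
```

The operator K no longer carries the label x, which is why the result is insensitive to the directions $\hat{n}_{x}$. The requirement that the polytope contains the origin is what allows POVM elements orthogonal to the $\sigma_{x}$ to sum to the identity.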

For high-dimensional quantum systems, a general geometric expression is lacking and therefore the geometric formulation shown in section 2.3 cannot be applied in general. Once the given states provide an underlying geometry in which the convex polytope picture is clear, one can apply and formalize the method of geometric formulation.

5. Conclusion

In this paper, we have considered the problem of ME state discrimination and have shown the general structure of the problem. The main idea of the development is to view the problem from various approaches. Finally, it turns out that the general structure can be summarized in the formulation of the so-called complementarity problem that generalizes convex optimization. The key element in the structure is a single positive operator, called the symmetry operator, which gives the complete characterization of optimal discrimination. Then, one can exploit the symmetry operator to find the optimal parameters in the ME discrimination, such as the guessing probability, complementary states and optimal measurement. The symmetry operator also allows an interpretation of the guessing probability as the averaged distance of given states being deviated from the symmetry operator in terms of the trace norm. The interpretation is in accordance with cases when classical systems are employed, where the guessing probability is interpreted as the deviation from the uniform distribution.

It is shown that in ME discrimination of quantum states, the symmetry operator is uniquely determined whereas optimal measurement is not. This means, rather than optimal measurement, a symmetry operator can characterize the ME discrimination. Symmetry operators are therefore exploited to define equivalence classes among sets of quantum states, such that for those sets in the same class, ME discrimination is completely characterized by an identical symmetry operator. This provides an alternative approach to ME discrimination: by checking whether two given sets are in the same class, one can find the optimal discrimination. Conversely, given a symmetry operator, we have shown how one can construct a set of quantum states for which the ME discrimination is characterized by the operator.

From the general structures found from the optimality conditions, we have provided a geometric formulation of ME state discrimination. In the formulation, the geometry of quantum states is exploited to find the guessing probability, instead of optimization over measurement. More precisely, the polytope of given states in the state space is linked to the guessing probability, without directly referring to measurement operators via the measurement postulate. It is clear that the method can be applied once the underlying geometry of given states is well-defined. Conversely, we have also argued that, from cases where the optimal discrimination is known, the guessing probability is useful to find the underlying geometry of high-dimensional quantum states. We have applied the geometric formulation to qubit states and solved ME discrimination: (i) the complete solution is provided for any set of qubit states when prior probabilities are equal, (ii) this gives an upper bound to cases when prior probabilities are not equal, (iii) solutions are obtained even if given states do not contain any symmetry among them, (iv) it is shown that the guessing probability does not depend on detailed relations among given states but on a geometric property assigned by the set itself. The conclusion (iv) is in line with the conclusion in [31] that distinguishability within an ensemble of quantum states is assigned as a global property that cannot be reduced to properties of pairs of states. We arrive at this conclusion by quantifying the distinguishability with the guessing probability, whereas it is quantified with the von Neumann entropy in [31].

Discrimination of quantum states poses a simple question connected to the fundamental and profound limitations in various contexts of quantum information theory. The results presented here not only provide a useful method of solving optimal discrimination, but also give a general, unique and fresh understanding to ME quantum state discrimination.

Acknowledgments

The author thanks J Bergou, B-G Englert, T Fritz, D Gross, W-Y Hwang, L-C Kwek, A Monras, M Navascués, H K Ng, P Raynal, S Yun, S Wehner, A Winter and H Zhu for their helpful discussions and comments while preparing the manuscript. This work was supported by the National Research Foundation and the Ministry of Education, Singapore.
