Higher entropic uncertainty relations for anti-commuting observables
Uncertainty relations provide one of the most powerful formulations of the quantum mechanical principle of complementarity. Yet, very little is known about such uncertainty relations for more than two measurements. Here, we show that sufficient unbiasedness for a set of binary observables, in the sense of mutual anti-commutation, is good enough to obtain maximally strong uncertainty relations in terms of the Shannon entropy. We also prove nearly optimal relations for the collision entropy. This is the first systematic and explicit approach to finding an arbitrary number of measurements for which we obtain maximally strong uncertainty relations. Our results have immediate applications to quantum cryptography.
Uncertainty relations lie at the very core of quantum mechanics. For any observable, it only has sharp values (in the sense that the measurement outcome is deterministic) for its own eigenstates. However, for any other state, the distribution of measurement outcomes is more or less smeared out, or more conveniently expressed: its entropy is is strictly positive. Hence, if two or more observables have no eigenstates in common, the sum of these respective entropies is strictly greater than for any state we may measure. We thereby say that a set of observables is more “incompatible” than another, if this sum takes on a larger value. But what makes observables more “incompatible”? Or rather, what characterizes maximally “incompatible” observables? Here, we show how to obtain maximally strong uncertainty relations for a large number of binary observables that exhibit simple geometrical properties.
Uncertainty relations are most well-known in the form proposed by Heisenberg Heisenberg (1927) and generalized by Robertson Robertson (1929). Entropic uncertainty relations are an alternative way to state Heisenberg’s uncertainty principle. They are frequently a more useful characterization, because the “uncertainty” is lower bounded by a quantity that only depends on the eigenstates of the observables, and not on the actual physical quantity to be measured Białynicki-Birula and Mycielski (1975); Deutsch (1983), as in Heisenberg’s formulation with standard deviations – see also the more recent paper (5). Following a conjecture by Kraus Kraus (1987), Maassen and Uffink Maassen and Uffink (1988) proved an entropic uncertainty relation for two observables. In particular, they showed that if we measure any state with using observables with eigenbases and respectively, we have
where and is the Shannon entropy arising from measuring the state in basis . Here, the most “incompatible” measurements arise from choosing and to be mutually unbiased bases (MUB). That is, for any and any we have , giving us a lower bound of . Clearly, this bound is tight: Choosing for gives us exactly , with maximum uncertainty for one of the two observables and none for the other.
But how about more than two observables? Sadly, very little is known about this case so far. Yet, this question not only eludes our current understanding of quantum mechanics, but also has practical consequences for quantum cryptography in the bounded storage model, where proving the security of protocols ultimately reduces to finding such relations Damgard et al. (2007). Proving new entropic uncertainty relations could thus give rise to new protocols. Furthermore, uncertainty relations for more than two measurements could also be useful to understand other quantum effects that are derived from such relations, such as locking classical information in quantum states DiVincenzo et al. (2004). Sanchez-Ruiz Sanchez (1993); Sanchez-Ruiz (1995, 1998) has shown that for a full set of MUBs , we have
and for gave a lower bound of . Indeed, strong uncertainty relations for a smaller number of bases do exist. If we choose a set of bases uniformly at random, then (with high probability) we have that for all states : Hayden et al. (2004). This means that there exist bases for which the sum of entropies is very large, i.e., measurements in such bases are very incompatible. However, no explicit constructions are known. It may be tempting to conjecture that simply choosing our measurements to be mutually unbiased leads to strong uncertainty relations in general. In fact, when choosing bases at random they will be almost mutually unbiased. In this case, we might expect the entropy average to be quite large: if the state to be measured is an eigenstate of one of the bases, the corresponding entropy average will be . This value is thus clearly an upper bound on the minimum entropy average for any set of bases, mutually unbiased or not. Perhaps surprisingly, however, choosing the bases to be mutually unbiased is not the right property: there exists up to mutually unbiased bases for which M.Ballester and Wehner (2007). Note that the right hand side is a lower bound for any set of MUBs, since it is the average of pairs of entropies to which we can apply the uncertainty relation by Maassen and Uffink Maassen and Uffink (1988). Hence we call this the trivial lower bound. When considering entropic uncertainty relations as a measure of “incompatibility”, we must thus look for different properties to obtain strong uncertainty relations. But, what properties lead to strong entropic uncertainty relations for more than two observables?
Here, we show that for binary observables we obtain maximally strong uncertainty relations for the Shannon entropy if they satisfy the property that they anti-commute. We also obtain a nearly optimal uncertainty relation for the collision entropy (Rényi entropy of order ) that is of particular relevance to cryptography. As we will see, we can take the anti-commuting observables to have a particularly simple form that in principle allows us to apply our result to quantum cryptography using present-day technology.
I Clifford algebra
For our result we will make use of the structure of Clifford algebra Lounesto (2001); Doran and Lasenby (2003); Dietz (2006), which has many beautiful geometrical properties of which we shall use a few. For any integer , the free real associative algebra generated by , subject to the anti-commutation relations
(1) |
is called Clifford algebra. We briefly recall its most essential properties that we will use in this text. The Clifford algebra has a unique representation by Hermitian matrices on qubits (up to unitary equivalence) which we fix henceforth. This representation can be obtained via the famous Jordan-Wigner transformation Jordan and Wigner (1928):
for , where we use , and to denote the Pauli matrices.
Let us first consider these operators themselves. Evidently, each operator has exactly two eigenvalues : Let be an eigenvector of with eigenvalue . From we have that . Furthermore, we have . We can therefore express each as
where and are projectors onto the positive and negative eigenspace of respectively. Furthermore, note that we have for
That is, all such operators are orthogonal. Hence, the positive and negative eigenspaces of such operators are similarly mutually unbiased than bases can be: we have that for all
The crucial aspect of the Clifford algebra that makes it so useful in geometry is that we can view the operators as orthogonal vectors forming a basis for . Each vector can then be written as . Note that the inner product of two vectors obeys , where is the Clifford product which here is just equal to the matrix product. Hence, anti-commutation takes a geometric meaning within the algebra: two vectors anti-commute if and only if they are orthogonal. Evidently, if we now transform the generating set of linearly to obtain the new operators
then the set satisfies the anti-commutation relations iff is an orthogonal matrix: these are exactly the operations which preserve the inner product. Because of the uniqueness of representation, there exists a matching unitary of which transforms the operator basis on the Hilbert space level, by conjugation:
Essentially, we can think of the positive and negative eigenspace of such operators as the positive and negative direction of the basis vectors. We can visualize the basis vectors with the help of a -dimensional hypercube. Each basis vector determines two opposing faces of the hypercube, where we can think of the two faces as corresponding to the positive and negative eigenspace of each operator.
Note that the face of an -dimensional hypercube is a dimensional hypercube itself.
It will be particularly useful that the collection of operators
form an orthogonal basis for the complex matrices for , again by the anti-commutation relations. By counting, the above operators form a complete operator basis with respect to the Hilbert-Schmidt inner product. Notice that the products with an odd number of factors are Hermitian, while the ones with an even number of factors are skew-Hermitian, so in the definition of the above operators we introduce a factor of to all with an even number of indices to make the whole set a real basis for the Hermitian operators. Working out the above terms using the representation from above, we can see that this gives us the familiar Pauli basis consisting of elements with .
Hence we can write every state on as
(2) |
This expansion has been used before in quantum information theory, see e.g. Dietz (2006). The (real valued) coefficients in this expansion are called “vector” components, the ones belonging to degree products of ’s are “tensor” or -vector components. -vectors also have very nice geometric interpretation within the algebra: they represent oriented plane and higher volume elements. The – unique – coordinate of degree also plays special role (it corresponds to the volume element in ), and is called the “pseudo-scalar” component. Note that it anti-commutes with all the , which has another important consequence: Substituting for any of the again yields a generating set of the Clifford algebra, hence there exists a unitary on taking the original to the new basis by conjugation.
The vector and pseudo-scalar components of the Clifford algebra span a -dimensional space isomorphic to : indeed, extending the symmetry of , the extended has the symmetry of : for every special-orthogonal matrix , we can write transformed Clifford operators obeying the anti-commutation relations. As before (but now this requires an additional proof that we provide in the appendix using the condition ), there exists a unitary of the underlying Hilbert space such that for all , .
Using the orthogonal group symmetry of the Clifford algebra, we show the following lemma in the appendix.
Lemma 1
The linear map taking as in eq. (2) to
(3) |
is positive. I.e., if is a state, then so is , and in this case . Conversely, if , then
is positive semidefinite, hence a state.
It is interesting to note that the map is positive, but not completely positive, for any , as one can see straightforwardly by looking at it’s Choi-Jamiołkowski operator.
Ii Applications
We now first use the tools from above to prove a “meta”-uncertainty relation, from which we will then derive two new entropic uncertainty relations. Evidently, we have immediately from the above that
Lemma 2
Let with be a quantum state, and consider anti-commuting observables as defined above. Then,
Our result is essentially a generalization of the Bloch sphere picture to higher dimensions (see also Dietz (2006)): For () the state is parametrized by where , and are the familiar Pauli matrices. Lemma 2 tells us that , i.e., the state must lie inside the Bloch sphere. Our result may be of independent interest, since it is often hard to find conditions on the coefficients such that is a state.
Notice that the are directly interpreted as the expectations of the observables . Indeed, is precisely the bias of the -variable :
Hence, we can interpret Lemma 2 as a form of uncertainty relation between the observables : if one or more of the observables have a large bias (i.e., they are more precisely defined), this limits the bias of the other oberservables (i.e., they are closer to uniformly distributed).
Indeed, Lemma 2 has strong consequences for the Rényi and von Neumann entropic averages
where is the Rényi entropy at of the probability distribution arising from measuring the state with observable . The minima of such expressions can be interpreted as giving entropic uncertainty relations, as we shall now do for (the collision entropy) and (the Shannon entropy).
Theorem 3
Let , and consider anti-commuting observables as defined above. Then,
where , and the minimization is taken over all states . The latter holds asymptotically for large .
Proof.
Using the fact that we can first rewrite
where the first inequality follows from Jensen’s inequality and the concavity of the log, and the second from Lemma 2. Clearly, the minimum is attained if all . It follows from Lemma 1 that our inequality is tight. Via the Taylor expansion of we obtain the asymptotic result for large .
For the Shannon entropy () we obtain something even nicer:
Theorem 4
Let , and consider anti-commuting observables as defined above. Then,
where , and the minimization is taken over all states .
Proof.
To see this, note that by rewriting our objective as above, we observe that we need to minimize the expression
subject to and , via the identification . An elementary calculation (included in the appendix for completeness) shows that the function is concave in . Hence, by Jensen’s inequality (read in the opposite direction), the minimum is attained with all the being extremal, i.e. one of the is and the others are , giving just the lower bound of .
It is clear that based on Lemma 1 one can derive similar uncertainty relations for other Rényi entropies () by performing the analogous optimization. We stuck to the two values above as they are the most relevant in view of the existing literature; for example, using the same convexity arguments as for , we obtain for ,
This should be compared to Deutsch’s inequality Deutsch (1983) for the case of two mutually unbiased bases of a qubit, because the latter really is about .
Iii Discussion
We have shown that anti-commuting Clifford observables obey the strongest possible uncertainty relation for the von Neumann entropy. It is interesting that in the process of the proof, however, we have found three uncertainty type inequalities (the sum of squares bound, the bound on , and finally the bound on ), and all three have a different structure of attaining the limit. The sum of squares bound can be achieved in every direction (meaning for every tuple satisfying the bound we get one attaining it by multiplying all components by some appropriate factor), the expression requires all components to be equal, while the expression demands exactly the opposite.
Our result for the collision entropy is slightly suboptimal but strong enough for all cryptographic purposes. Indeed, one could use our entropic uncertainty relation in the bounded quantum storage setting to construct, for instance, -out-of- oblivious transfer protocols analogous to Damgard et al. (2007). Here, instead of encoding a single bit into either the computational or Hadamard basis, which gives us a 1-out-of-2 oblivious transfer, we now encode a single bit into the positive or negative eigenspace of each of these operators. It is clear from the representation of such operators discussed earlier, that such an encoding can be done experimentally as easily as encoding a single bit into three mutually unbiased basis given by the Pauli operators , and . Indeed, our construction can be seen as a direct extension of such an encoding: we obtain the uncertainty relations for these three MUBs used in Damgard et al. (2007), previously proved by Sanchez-Ruiz Sanchez (1993); Sanchez-Ruiz (1995), as a special case of our analysis for ().
Alas, strong uncertainty relations for measurements with more than two outcomes remain inaccessible to us. It has been shown Fehr (2007) that uncertainty relations for more outcomes can be obtained via a coding argument from uncertainty relations as we construct them here. Yet, these seem far from optimal. A natural choice would be to consider the generators of a generalized Clifford algebra Morris (1967, 1968), yet this algebra does not have the nice symmetry properties which enabled us to implement operations on the vector components above. It remains an exciting open question, whether such operators form a good generalization, or whether we must continue our search for new properties.
Acknowledgments. The authors acknowledge support by the EC project “QAP” (IST-2005-015848). SW was additionally supported by the NWO vici project 2004-2009. AW was additionally supported by the U.K. EPSRC via the “IRC QIP” and an Advanced Research Fellowship. SW thanks Andrew Doherty for an explanation of the Jordan-Wigner transform.
Appendix A Appendix
SO(2n+1) structure. While the orthogonal group symmetry of the “vector” component of the Clifford algebra, spanned by the generators , is usually covered in textbook accounts, the symmetry of the extended set , including the pseudo-scalar element, seems much less well-known. It is quite natural to consider this set as all its elements mutually anti-commute, so any family of pairwise distinct elements will generate the full Clifford algebra. Hence there exists a unitary mapping the original generators to the :
The initial observation is that indeed an orthogonal transformation of the generators extends to a special-orthogonal transformation of the extended set, since
A nice and easy geometrical way of seeing this is via the higher-dimensional analogue of the well-known Euler angle parametrisation of orthogonal matrices (see Goldstein (1980)):
Euler Angle Decomposition Hoffman et al. (1972). Let be an orthogonal matrix. Then there exist angles for , such that where is either the identity or the reflection along the first coordinate axis, and is the rotation by angle in the plane spanned by the th and th coordinate axes, i.e. (The product is to be taken in some fixed order of the indices, say lexicographically.)
With this, we only have to understand how transforms under the action of the elementary transformations and . Clearly, under the former,
while for the latter (using the abbreviations and ),
Now, for a general special-orthogonal transformation of the coordinates of the extended set, the Euler angle decomposition gives
Then, the unitary representation clearly has to be the product of terms . For we know already what these are, as the transformation is only one of the generating set (and by the above observation the pseudo-scalar is indeed left alone, as required); for on the other hand, we first map the generating set to by the unitary , then apply the unitary belonging to and then map the generators back via . This clearly implements
and we are done.
Proof of Lemma 1. First, we show that there exists a unitary such that has no pseudo-scalar, and only one nonzero vector component, say at , which we can choose to be . Indeed, there is a special-orthogonal transformation of the coefficient vector to a vector whose zeroeth as well as second till last components are all : since the length is preserved, this is consistent with the first component becoming .
Now, let be the corresponding unitary of the Hilbert space. By the above-mentioned representation of on , we arrive at a new, simpler looking state
for some , etc.
There exist of course orthogonal transformations that take to . Such transformations flip the sign of a chosen Clifford generator. They can be extended to a special orthogonal transformation of by also flipping the sign of : . (Using the geometry of the Clifford algebra it is easy to see that fulfills this task.) Now, consider
for .
Clearly, if were a state, then the new operator would also be a state. We claim that has no terms with an index in its Clifford basis expansion: Note that if we flip the sign of precisely those terms that have an index (i.e., they have a factor in the definition of the operator basis), and then the coefficients cancel with those of .
We now iterate this map through , and we are left with a final state , which hence must be of the form
By applying from above, we now transform to , which is the first part of the lemma.
Looking at once more, we see that this can be positive semidefinite only if , i.e., .
Conversely, if , then the (Hermitian) operator has the property
i.e. , so .
Concavity of . Straightforward calculation shows that
and so
Since we are only interested in the sign of the second derivative, we ignore the (positive) factors in front of the bracket, and are done if we can show that
is non-positive for . Substituting , which is also between and , we rewrite this as
which has derivative
and this is clearly positive for . In other words, increases from its value at (where it is ) to its value at (where it is ), so indeed for all .
Consequently, also for , and we are done.
Constructive proof of Lemma 1. For the interested reader, we now give an explicit construction of the unitaries and , which however requires a more intimate knowledge of the Clifford algebra. First of all, recall that we can write two vectors in terms of the generators of the Clifford algebra as and . The Clifford product of the two vectors is defined as , where is the outer product of the two vectors Lounesto (2001); Doran and Lasenby (2003). When using the matrix representation of the Clifford algebra given above, this product is simply the matrix product. Second, it is well known that within the Clifford algebra we may write the vector resulting from a reflection of the vector on the plane perpendicular to the vector (in 0) as . Rotations can then be expressed as successive reflections Lounesto (2001); Doran and Lasenby (2003).
We first consider . Here, our goal is to find the transformation that rotates the vector to the vector , where we let . Finding such a transformation for only the first generators can easily be achieved. The challenge is thus to include . To this end we perform three individual operations: First, we rotate onto the vector with . Second, we exchange and . And finally we rotate the vector onto the vector .
First, we rotate onto the vector : Consider the vector . We have and thus the vector is of length 1. Let denote the vector lying in the plane spanned by and located exactly half way between and . Let with . It is easy to verify that and hence the vector has length 1. To rotate the vector onto the vector , we now need to first reflect around the plane perpendicular to , and then around the plane perpendicular to . Hence, we now define . Evidently, is unitary since . First of all, note that
Hence,
as desired. Using the geometry of the Clifford algebra, one can see that -vectors remain -vectors when transformed with the rotation Doran and Lasenby (2003). Similarly, it is easy to see that is untouched by the operation
since for all . We can thus conclude that
for some coefficients .
Second, we exchange and : To this end, recall that is also a generating set for the Clifford algebra. Hence, we can now view itself as a vector with respect to the new generators. To exchange and , we now simply rotate onto . Essentially, this corresponds to a rotation about 90 degrees in the plane spanned by vectors and . Consider the vector located exactly in the middle between both vectors. Let be the normalized vector. Let . A small calculation anlogous to the above shows that
We also have that , are untouched by the operation: for and , we have that
since . How does affect the -vectors in terms of the original generators ? Using the anti-commutation relations and the definition of it is easy to convince yourself that all -vectors are mapped to -vectors with (except for itself). Hence, the coefficient of remains untouched. We can thus conclude that
for some coefficients .
Finally, we now rotate the vector onto the vector . Note that . Let be the normalized vector. Our rotation is derived exactly analogous to the first step: Let , and let . Let . A simple calculation analogous to the above shows that
as desired. Again, we have for and . Furthermore, -vectors remain -vectors under the actions of Doran and Lasenby (2003). Summarizing, we obtain
for some coefficients . Thus, we can take .
The argument for finding is analogous. A simple computation using the fact that for all gives us .
References
- W. Heisenberg, Zeitschrift für Physik 43, 172 (1927).
- H. Robertson, Physical Review 34, 163 (1929).
- I. Białynicki-Birula and J. Mycielski, Communications in Mathematical Physics 44, 129 (1975).
- D. Deutsch, Phys. Rev. Lett. 50, 631 (1983).
- P. Gibilisco, D. Imparato and T. Isola, J. Math. Phys 48, 072109 (2007).
- K. Kraus, Physical Review D 35, 3070 (1987).
- H. Maassen and J. Uffink, Phys. Rev. Lett. 60 (1988).
- I. Damgard, S. Fehr, R. Renner, L. Salvail, and C. Schaffner, in Proceedings of CRYPTO 2007 (2007), pp. 360–378.
- D. DiVincenzo, M. Horodecki, D. Leung, J. Smolin, and B. Terhal, Physical Review Letters 92 (2004), quant-ph/0303088.
- J. Sanchez, Physics Letters A 173, 233 (1993).
- J. Sanchez-Ruiz, Physics Letters A 201, 125 (1995).
- J. Sanchez-Ruiz, Physics Letters A 244, 189 (1998).
- P. Hayden, D. Leung, P. Shor, and A. Winter, Communications in Mathematical Physics 250, 371 (2004), quant-ph/0307104.
- M.Ballester and S. Wehner, Physical Review A 75, 022319 (2007).
- P. Lounesto, Clifford Algebras and Spinors (Cambridge University Press, 2001).
- C. Doran and A. Lasenby, Geometric Algebra for Physicists (Cambridge University Press, 2003).
- K. Dietz (2006), quant-ph/0601013.
- P. Jordan and E. Wigner, Zeitschrift für Physik 47, 631 (1928).
- S. Fehr, Personal communication (2007).
- A. O. Morris, Quarterly Journal of Mathematics, Oxford (Ser. 2) 18, 7 (1967).
- A. O. Morris, Quarterly Journal of Mathematics, Oxford (Ser. 2) 19, 289 (1968).
- H. Goldstein, Classical Mechanics (Addison-Wesley, 1980).
- D. K. Hoffman, R. C. Raffenetti, and K. Ruedenberg, Journal of Mathematical Physics 13, 528 (1972).