Entanglement can increase asymptotic rates of zeroerror classical communication over classical channels
Abstract
It is known that the number of different classical messages which can be communicated with a single use of a classical channel with zero probability of decoding error can sometimes be increased by using entanglement shared between sender and receiver. It has been an open question to determine whether entanglement can ever increase the zeroerror communication rates achievable in the limit of many channel uses. In this paper we show, by explicit examples, that entanglement can indeed increase asymptotic zeroerror capacity, even to the extent that it is equal to the normal capacity of the channel. Interestingly, our examples are based on the exceptional simple root systems and .
1 Introduction
A classical channel which is discrete and memoryless is fully described by its conditional probability distribution of producing output for a given input . The channel obtained by allowing one use of a channel and one use of is written as , reflecting the fact that its conditional probability matrix is the tensor (or Kronecker) product of those of the two constituent channels. Similarly, denotes uses of .
Definition 1.
Let denote the maximum number of different messages which can be sent with a single use of with zero probability of a decoding error. The zeroerror capacity of is
(1) 
Two input symbols of a channel are said to be confusable if and for some output symbol . The confusability graph of a channel is a graph , whose vertices correspond to different input symbols of and two vertices are joined by an edge if the corresponding symbols are confusable. The confusability graph of is determined by those of and as follows: , where “” denotes the strong graph product.
Definition 2 (Strong graph product).
In general, the strong product of graphs is a graph , whose vertices are the tuples and distinct vertices and are joined by an edge if they are entrywise confusable, i.e., for each either or . Likewise, we define the strong power of graph by and .
An independent set of a graph is a subset of its vertices with no edges between them. The independence number of a graph is the maximum size of an independent set of . As Shannon observed [1], and depend only on the confusability graph of the channel: and where
(2) 
is known as the Shannon capacity of the graph . Clearly, , since has an independent set of size . However, in general can be larger than . The simplest example is the cycle for which but .
Computing the independence number of a graph is NPhard, but conceptually simple. However, no algorithm is known to determine in general, although there is a celebrated upper bound due to Lovász [2]. He defined an efficiently computable quantity called the Lovász number of which satisfies and . Because of these properties we also have .
Definition 3.
Let denote the number of different messages which can be sent with a single use of with zero probability of a decoding error, when both parties share an arbitrary finitedimensional entangled state on which each can perform arbitrary local measurements. The entanglementassisted zeroerror capacity of is
(3) 
As in the unassisted case, the quantities and also depend only on the confusability graph of the channel [3]. For this reason, we will talk about the assisted and unassisted zeroerror capacities of graphs as well as of channels.
In [3] it was shown that graphs exist with . Shortly afterwards it was shown [4, 5] that the Lovász bound also applies to the entanglementassisted quantities, so and hence .
Whether graphs with exist, that is, whether entanglement can ever offer an advantage in terms of the rates achievable in the large block length limit was left as an open question. Clearly, the Lovász bound cannot be used to prove such a separation. Fortunately, there is another bound due to Haemers which is sometimes better than the Lovász bound [6, 7].
Theorem 4 (Haemers [6, 7]).
For let be a matrix with entries in any field . We say that fits if and whenever there is no edge between and . Then . In particular, .
Proof.
Let be a maximal independent set in . If fits , then for all while the diagonal entries are nonzero. Hence, has full rank on a subspace of dimension and thus . As this is true for any that fits , we get .
Next, note that if fits and fits then fits , and . Hence, , which implies the desired result. ∎
The next section shows how Haemers bound applies to a particular graph to determine its unassisted zeroerror capacity, and then provides an explicit entanglementassisted protocol that achieves a higher rate. This shows that entanglement assistance can indeed increase the asymptotic zeroerror rate, thus giving an affirmative answer to the previously open question.
The entanglementassisted protocol is based on the fact that the graph in question can be constructed from the root system [8] , so in Section 3 we investigate constructions based on other irreducible root systems. Most notably we show that a construction based on provides another example with a larger gap in the capacities. In Section 4 we discuss open problems.
2 The zeroerror capacities of the symplectic graph
Definition 5 (Symplectic space).
A nondegenerate symplectic space is a vector space (over a field ) equipped with a nondegenerate symplectic form, i.e., a bilinear map which is

skewsymmetric: for all , and

nondegenerate: if for all , then .
If has characteristic 2, we also require that for all (for other fields this is implied by the antisymmetry property). On a dimensional space, the canonical symplectic form is
(4) 
where is the identity matrix. Any nondegenerate symplectic space with finite dimensional vector space is isomorphic to the canonical symplectic space .
Definition 6 (Symplectic graph).
Let be a finite field and let be a natural number. The vertices of the symplectic graph are the points of the projective space and there is an edge between if . In the case where , the points of the projective space are simply the nonzero elements of .
Remark 7.
The symplectic graph is isomorphic to the graph whose vertices are all the fold tensor products of Pauli matrices except for the identity, i.e., , and which has edges between commuting matrices.
The next two subsections prove that for channels with the confusability graph , the entanglementassisted zeroerror capacity is larger than the unassisted capacity. More precisely,
Theorem 8.
while .
2.1 Capacity in the unassisted case
The fact that is a special case of a result in [9] which we prove explicitly here.
Theorem 9 (Peeters [9]).
.
Proof.
For the upper bound we construct a matrix over which fits and which has rank and use Haemers’ bound (see Theorem 4). Let
(5) 
be the dimensional subspace of that consists of vectors which have an even number of entries equal to one. The restriction of the standard inner product on to the subspace is a nondegenerate symplectic form, so there is an isomorphism such that
(6) 
Let be the allones vector in (note that for all ). For all let
Since if and only if and are not joined by an edge in , matrix fits . Since it is the Gram matrix of a set of dimensional vectors (i.e. the entry at is the inner product of the vector and vector for some ordering of the set of vectors), its rank is at most . Therefore, by Haemers’ bound, .
For the matching lower bound, let for be the standard basis of , and let . Then so and . Therefore is an independent set of size in , so .
Hence, and the upper bound on the zeroerror capacity is attained by a code of block length one. ∎
2.2 Entanglementassisted capacity
In this section we will establish the entanglementassisted capacity of . Our main tool is Theorem 11 together with some combinatorial results.
Definition 10.
A dimensional orthonormal representation of graph is a function that assigns unit vectors to the vertices of such that for each edge vectors and are orthogonal.
The following theorem appeared in [10] but for completeness we include the proof here.
Theorem 11 ([10]).
If graph has an orthonormal representation in and its vertices can be partitioned into disjoint cliques each of size then .
Proof.
Figure 1 describes a protocol that uses a rank maximally entangled state to send one of messages with zero error by a single use of the channel, proving that . Removing edges from cannot decrease the Lovász number and in this way we can obtain the graph which is the strong product of the empty graph on vertices with the clique of size (i.e. the disjoint union of cliques). This graph has Lovász number so . Since [4, 5], this provides the matching upper bound. ∎
Lemma 12.
The vertices of can be partitioned into cliques of size .
Proof.
Such a partition of the symplectic graph is known as a symplectic spread, and is wellknown to exist (see for example [11]). We give a simple construction from [12] below. Another proof in terms of commuting sets of Pauli operators is given in [13, 14].
Let , and identify the vertices of with the nonzero vectors in . Consider the following symplectic form on :
(7) 
where is the finite field trace defined as . As is a nondegenerate inner product in , the form is also nondegenerate. Hence, the symplectic spaces and are isomorphic. We will describe the partition for the later space.
Denoting the multiplicative group (of order ) in by , the cells of a partition of the nonzero elements of are:
It is easy to check that these cells of size partition . Moreover, if and are in the same cell, then
(8) 
Therefore each cell is a clique. ∎
The entire representation, grouped into 9 complete (unnormalised) orthogonal bases, is given in a table in Appendix A and it suffices to check that this has the desired properties to establish the result. Interestingly, it consists of vectors from the root system and it is possible to give a more insightful description and proof of the representation in relation to this. Such a proof is given in Appendix B.
2.3 The connection to
A deeper coincidence underlies the orthogonal representation of by roots of . The automorphism group of is the symplectic group , which is the group of linear maps on which preserve the symplectic form. This group is isomorphic to quotient , where is the Weyl group of .
3 Relationship to the normal capacity
Given a classical channel , its standard classical capacity cannot be increased by the use of entanglement or even arbitrary nonsignalling correlations between the sender and receiver [17]. The standard capacity and the assisted and unassisted zeroerror capacities are related by
(9) 
Theorem 14.
Given a graph which satisfies the premises of Theorem 11 (partitions into cliques of size and has an orthonormal representation in dimension ) and is also vertextransitive, one can construct a channel whose normal capacity and are both equal to and are both achieved by a blocklength one entanglementassisted zeroerror code.
Proof.
Let be the vertices of and let be the set of all cliques of size in . Since is vertextransitive, each vertex is contained in the same number of cliques from . Counting the number of pairs in the set in two ways we have
(10) 
Let be the channel which on the input produces an output uniformly at random from the set . Using the analysis of Section 16 of [18],
(11) 
also partitions into cliques of size and, since it is a subgraph of , has an orthonormal representation in dimension . Therefore and, furthermore, this rate is achieved by the blocklength one entanglementassisted protocol of Figure 1. ∎
The symplectic graphs are all vertex transitive so, remarkably, the channel constructed in this way for has , even though there is a positive lower bound on the error probability for classical codes with any rate greater than (as well as an upper bound, both decaying exponentially with ) [19].
4 Graphs from and other root systems
We define the orthogonality graph of a root system as follows. The vectors of occur in antipodal pairs ; the vertices of the graph are the rays spanned by these antipodal pairs. Two vertices are adjacent if and only if their rays are orthogonal. The graph of Section 2 is precisely the orthogonality graph of . This raises the question of whether a channel whose confusability graph is the orthogonality graph of another irreducible root system can also exhibit a gap between the classical and entanglementassisted zeroerror capacities. We find that the orthogonality graph of provides a second example of such a gap, and furthermore, the ratio between the assisted and classical capacities is larger in this case.
The irreducible root systems consist of the infinite families , , , where , and the exceptional cases , , , , and (see [20]). We show that for all of the infinite families, and for , there is no gap between the independence number and the Lovász number , so for these graphs. However, the orthogonality graph of provides a second example of a gap between the classical and entanglementassisted capacity. For , we show that while . It is interesting to note that the graph used in [3] is precisely the orthogonality graph of and while we know the entanglementassisted capacity of this graph, we still do not know its unassisted capacity or whether it is smaller than the assisted one. We do not give either capacity for the graph of .
In what follows the name of the root system is also used as the name of the orthogonality graph and denotes the th standard basis vector. We ignore correct normalization of the root vectors for simplicity, since it clearly doesn’t affect the orthogonality graph.
Root system
(12) 
As pointed out in [21], is the graph whose vertices are the nonisotropic points in the ambient projective space of the polar space , with vertices adjacent if they are orthogonal with respect to the associated bilinear form. The ambient projective space of is . Since the bilinear form associated with is symplectic, it follows immediately that is an induced subgraph of the symplectic graph with . By Theorem 9,
(13) 
On the other hand, let and identify the vertices of with the nonzero vectors of . Then we may choose the quadratic form of to be , where is the finite field trace. The polarization of this quadratic form is the symplectic form . With this choice, the vertices of , i.e., the nonisotropic vectors in , are those such that . Now, consider the partition of vertices into cliques given in Lemma 12, restricted to the vertices of :
(14) 
Recall that . For each , there are exactly choices of such that . Therefore, is a partition of the vertices of into 15 cliques of size . By Theorem 11,
(15) 
Root system ()
Root system ()
(18) 
Note that the vertices induce a subgraph isomorphic to . Also note that and are adjacent and have the same neighbourhood (apart from themselves). It follows that is isomorphic to , the strong graph product of a Kneser graph and a complete graph on vertices. By Theorem 7 of [2],
(19) 
Since is an independent set of size , it follows that
(20) 
Root system ()
(21) 
To find the Lovász number we consider even and odd separately. When is odd, partition the vertices into sets , where
(22) 
Each is a clique of size . When is even, partition the vertices into sets , where
Again each is a clique of size . In either case, we have partitioned the graph into cliques of size . By Theorem 11, .
Since is an independent set of size , we have
(23) 
Root system ()
(24) 
has the same orthogonality graph as .
Root system
(25) 
By inspection has an independent set of size and can be partitioned into cliques of size . By Theorem 11,
(26) 
5 Conclusion
We have shown that it is possible for entanglement to increase the asymptotic rate of zeroerror classical communication over some classical channels. This is quite different from the situation for families of codes which only achieve arbitrarily small error rates asymptotically. The best rate that can be achieved by classical codes in this context is the Shannon capacity and entanglement cannot increase this rate. The entanglementassisted capacity has a simple formula which reduces to the formula for the Shannon capacity when the channel is classical [22].
It is interesting to note that in every example of a graph with found to date, the entanglementassisted capacity is attained by a code of block length one. This certainly is not true of the entanglementassisted capacities of all graphs. In [23], an interesting observation of Arikan is reported: The graph consisting of a five cycle and one more isolated vertex has . Since no positive integer power of this quantity is an integer, the capacity is not attained by any finite length block code for this graph. Since the Lovász number of this graph is also (the Lovász number is additive for disjoint unions of graphs, and was calculated for cycles in [2]), the same observation is true for the entanglementassisted capacity, which in this case is equal to the unassisted capacity.
Our result has an interesting interpretation in terms of KochenSpecker (KS) proofs of noncontextuality. Such a proof specifies a set of complete, projective measurements, with some projectors in common, such that there is no way to consistently assign a truth value to each projector. An assignment is consistent if (a) precisely one projector in each measurement is “true” and (b) no two “true” projectors are orthogonal.
Ruuge [24] shows that the root systems and can be used to construct KS proofs using computer search to nullify the possibility of a consistent assignment. This is a corollary of our results, but our proof is analytic due to the novel application of the Haemers bound. In fact, the use of the Haemers bound provides a whole sequence of KS proofs which are increasingly strong in the following quantitative sense: For the set of measurements which are obtained by tensoring together of Alice’s 9 measurements, only can be assigned values in accordance with property (a) before property (b) must be violated.
Three main avenues for further research are apparent to us. First, is it possible to give a general algorithm to compute ? More specific related problems include determining whether can be arbitrarily large, and whether there are graphs where is strictly less than .
Secondly, we have already shown that there are some connections to multiprover games and to noncontextuality, but we feel that a deeper understanding of these connections is possible and desirable. For example, the application of our result to KS proofs mentioned above suggests some stronger notion of noncontextuality in quantum mechanics.
Finally, our work on entanglementassisted zeroerror codes can be placed in the wider context of using entanglement to reduce decoding error in finite block length coding of classical information for classical channels (demonstrating this effect is even experimentally feasible [25]), and characterising this phenomenon presents an even wider set of questions.
Acknowledgements
We would like to thank Andrew Childs, Richard Cleve, David Roberson, Simone Severini, and Andreas Winter for useful discussions.
This work was supported by NSERC, QuantumWorks, CIFAR, CFI, and ORF. Aidan Roy acknowledges support by a UW/Fields Institute Award.
References
 [1] Claude E. Shannon. The zero error capacity of a noisy channel. IRE Trans. Inform. Theory, 2(3):8–19, September 1956. doi:10.1109/TIT.1956.1056798.
 [2] László Lovász. On the Shannon capacity of a graph. IEEE Trans. Inf. Theory, 25(1):1–7, January 1979. doi:10.1109/TIT.1979.1055985.
 [3] Toby S. Cubitt, Debbie Leung, William Matthews, and Andreas Winter. Improving zeroerror classical communication with entanglement. Phys. Rev. Lett., 104(23):230503, June 2010. arXiv:0911.5300, doi:10.1103/PhysRevLett.104.230503.
 [4] Salman Beigi. Entanglementassisted zeroerror capacity is upper bounded by the Lovász theta function, 2010. arXiv:1002.2488.
 [5] Runyao Duan, Simone Severini, and Andreas Winter. Zeroerror communication via quantum channels, noncommutative graphs and a quantum Lovász function, 2010. arXiv:1002.2514.
 [6] Willem H. Haemers. On some problems of Lovász concerning the Shannon capacity of a graph. IEEE Trans. Inf. Theory, 25(2):231–232, March 1979. doi:10.1109/TIT.1979.1056027.
 [7] Willem H. Haemers. An upper bound for the Shannon capacity of a graph. Coll. Math. Soc. János Bolyai, 25:267–272, 1978. Available from: http://econpapers.repec.org/RePEc:ner:tilbur:urn:nbn:nl:ui:12402396.
 [8] Karin Erdmann and Mark J. Wildon. Introduction to Lie algebras. Springer, 2006.
 [9] René Peeters. Orthogonal representations over finite fields and the chromatic number of graphs. Combinatorica, 16:417–431, 1996. doi:10.1007/BF01261326.
 [10] Toby S. Cubitt, Debbie Leung, William Matthews, and Andreas Winter. Zeroerror channel capacity and simulation assisted by nonlocal correlations, 2010. arXiv:1003.3195.
 [11] Roger H. Dye. Partitions and their stabilizers for line complexes and quadrics. Annali di Matematica Pura ed Applicata, 114:173–194, 1977. doi:10.1007/BF02413785.
 [12] Simeon Ball, John Bamberg, Michel Lavrauw, and Tim Penttila. Symplectic spreads. Designs, Codes and Cryptography, 32:9–14, 2004. doi:10.1023/B:DESI.0000029209.24742.89.
 [13] Somshubhro Bandyopadhyay, Oscar P. Boykin, Vwani Roychowdhury, and Farrokh Vatan. A new proof for the existence of mutually unbiased bases. Algorithmica, 34:512–528, 2008. arXiv:quantph/0103162, doi:10.1007/s0045300209807.
 [14] Jay Lawrence, Časlav Brukner, and Anton Zeilinger. Mutually unbiased binary observable sets on qubits. Phys. Rev. A, 65(3):032320, February 2002. arXiv:quantph/0104012, doi:10.1103/PhysRevA.65.032320.
 [15] Bianca L. Cerchiai and Bert van Geemen. From qubits to , 2010. arXiv:1003.4255.
 [16] Kevin Purbhoo. Compression of root systems and the sequence. Electronic Journal of Combinatorics, 15(1):R115, 2008. arXiv:math/0609014.
 [17] Charles H. Bennett, Peter W. Shor, John A. Smolin, and Ashish V. Thapliyal. Entanglementassisted capacity of a quantum channel and the reverse Shannon theorem. Information Theory, IEEE Transactions on, 48(10):2637–2655, October 2002. arXiv:quantph/0106052, doi:10.1109/TIT.2002.802612.
 [18] Claude E. Shannon. A mathematical theory of communication. Bell System Technical Journal, 27:379–423, 623–656, July, October 1948. doi:10.1145/584091.584093.
 [19] Claude E. Shannon, Robert G. Gallager, and Elwyn R. Berlekamp. Lower bounds to error probability for coding on discrete memoryless channels. I. Information and Control, 10(1):65–103, 1967. doi:10.1016/S00199958(67)900526.
 [20] James E. Humphreys. Reflection groups and Coxeter groups. Cambridge University Press, 1992.
 [21] Pratima Panigrahi. The diameters graph of the root system is uniquely geometrisable. Geometriae Dedicata, 78:121–141, 1999. doi:10.1023/A:1005270403541.
 [22] Charles H. Bennett, Peter W. Shor, John A. Smolin, and Ashish V. Thapliyal. Entanglementassisted capacity of a quantum channel and the reverse Shannon theorem. IEEE Trans. Inf. Theory, 48(10):2637–2655, October 2002. arXiv:quantph/0106052, doi:10.1109/TIT.2002.802612.
 [23] János Körner and Alon Orlitsky. Zeroerror information theory. IEEE Trans. Inf. Theory, 44(6):2207–2229, October 1998. doi:10.1109/18.720537.
 [24] Artur E. Ruuge. Exceptional and noncrystallographic root systems and the Kochen–Specker theorem. J. Phys. A: Math. Theor., 40(11):2849–2859, 2007. arXiv:0906.2696, doi:10.1088/17518113/40/11/019.
 [25] Robert Prevedel, Yang Lu, William Matthews, Rainer Kaltenbaek, and Kevin J. Resch. Entanglementenhanced classical communication over a noisy classical channel, 2010. arXiv:1010.2566.
Appendix A The orthogonal representation of in full
Appendix B The proof of Lemma 13
We first give some basic definitions and facts about and its lattice. Let be the vectors given in Figure 2, known as simple roots of . Their inner products are encoded by Dynkin diagram shown in Figure 2 as follows:
(27) 
All integer linear combinations of the simple roots form the lattice
(28) 
For , let denote the lattice coordinates of . The inner product between two lattice vectors is
where is the set of edges in the Dynkin diagram. Note that the inner product is an even integer for all , .
The root system is the set vectors of norm in :^{1}^{1}1One can check that this agrees with the more common definition of as the orbit of under the reflection group , where and is the unit vector in direction .
(29) 
In terms of lattice coordinates, the condition can be expressed as
(30) 
Following [15, Section 4.6] and [16, Section 3.2] we we define by
(31) 
The (defined in the figure above) are chosen so that . This extends to all lattice vectors by linearity of :
(32) 
We can write where is defined by with the are treated as vectors in . It is easily checked that the kernel of is the set where (i.e. ). Therefore,
Lemma 15.
iff for some and .
Lemma 16.
If then is not a root.
Proof.
If then for some , , so . The first two inner products are even integers and , so for some integer , and can’t be a root by definition. ∎
Lemma 17.
and iff .
Proof.
iff for some , . We can rule out the case where because, if it were
since . Then, and cannot both be roots due to equation (30), which must necessarily hold modulo two. Therefore, and . Since is a root, the condition that is also a root
reduces to . By the CauchySchwarz inequality, , or equivalently, , with equality iff is a scalar multiple of . Since inner products between lattice vectors are even integers, either and , or and . ∎
There are 126 roots in 63 antiparallel pairs. Let be a subset of with one root from each pair. We have just shown that both roots in a pair have the same image under , that these images are different for different pairs, and none are equal to 0. Therefore, the restriction of to the domain is a bijection between and whose inverse determines (by normalising the vectors in ) an orthonormal representation of thanks to the relationship (32).