On Ergodic Secrecy Capacity of Multiple Input Wiretap Channel with Statistical CSIT

On Ergodic Secrecy Capacity of Multiple Input Wiretap Channel with Statistical CSIT

\authorblockNShih-Chun Lin and Pin-Hsun Lin S.-C. Lin is with Graduate Institute of Computer and Communication Engineering, National Taipei University of Technology, Taipei, Taiwan 10643, e-mail: sclin@ntut.edu.tw, P.-H. Lin is with the Department of Electrical Engineering and Graduate Institute of Communication Engineering, National Taiwan University, Taipei, Taiwan, 10617, e-mail: pinhsunlin@gmail.comThis work was supported by the National Science Council, Taiwan, R.O.C., under Grants 100-2628-E-007-025-MY3.
Abstract

We consider the secure transmission in ergodic fast-Rayleigh fading multiple-input single-output single-antenna-eavesdropper (MISOSE) wiretap channels. We assume that the statistics of both the legitimate and eavesdropper channels is the only available channel state information at the transmitter (CSIT). By introducing a new secrecy capacity upper bound, we prove that the secrecy capacity is achieved by Gaussian input without prefixing. To attain this, we form another MISOSE channel for upper-bounding, and tighten the bound by finding the worst correlations between the legitimate and eavesdropper channel coefficients. The resulting upper bound is tighter than the others in the literature which are based on modifying the correlation between the noises at the legitimate receiver and eavesdropper. Next, we fully characterize the ergodic secrecy capacity by showing that the optimal channel input covariance matrix is a scaled identity matrix, with the transmit power allocated uniformly among the antennas. The key to solve such a complicated stochastic optimization problem is by exploiting the completely monotone property of the ergodic secrecy capacity to use the stochastic ordering theory. Finally, our simulation results show that for the considered channel setting, the secrecy capacity is bounded in both the high signal-to-noise ratio and large number of transmit antenna regimes.

I Introduction

The secrecy capacity of a wiretap channel is the maximum achievable secrecy rate between the transmitter and a legitimate receiver, and a perfect secrecy constraint is imposed to make no information be attainable by an eavesdropper [1][2]. In the wireless environments, the time-varying characteristic of fading channels can also be exploited to enhance the secrecy capacity[3]. Further enhancements are attainable by employing multiple antennas at each node, e.g., in [4][5]. However, these secrecy capacity results [3, 4, 5] rely on perfect knowledge of the legitimate receiver’s channel state information at the transmitter (CSIT). Because of the limited feedback bandwidth and the delay caused by channel estimation, it may be hard to track the channel coefficients if they vary rapidly. Thus for fast-fading channels, it is more practical to consider the case with only partial CSIT of the legitimate channel. However, in this case, only some lower and upper bounds of the secrecy capacity are known [6][7], and the secrecy capacity is unknown. Although the general secrecy capacity formula is reported in [1], the optimal auxiliary random variable for prefixing in this formula is still unknown.

In this letter, we consider one important scenario of partial CSIT, i.e., the transmitter only knows the statistics of both the legitimate and eavesdropper channels but not the realizations of them. Under this scenario, we derive the secrecy capacity of the fast-fading, multiple-input single-output single-antenna-eavesdropper (MISOSE) wiretap channels, where the transmitter has multiple antennas while the legitimate receiver and eavesdropper each have single antenna. Both the coefficients of the legitimate and eavesdropper channels are Rayleigh faded. We first propose a new secrecy capacity upper bound to show that the transmission scheme in [6] is secrecy-capacity achieving, which is based on [1] with Gaussian input but without prefixing. Then we find the optimal channel input covariance matrix analytically to fully characterize the ergodic secrecy capacity, while such a optimization problem is solved numerically in [6] without guaranteeing the optimality. The key is to exploit the completely monotone property of the ergodic secrecy capacity, then invoking the stochastic ordering theory [8].

To obtain a tighter secrecy capacity upper bound than that reported in [7], we introduce another MISOSE channel with a relaxed secrecy constraint for upper-bounding, while finding the worst correlations between the coefficients of the legitimate and eavesdropper channels to tighten the bound. In [7], the upper bound is obtained by directly applying the concepts from [3][4] where the correlation is only introduced between the noises at the legitimate receiver and eavesdropper and the secrecy constraint is left unchanged. Note that the secrecy capacity lower bound in [7] is indeed not achievable. In order to achieve such a bound, the variable-rate coding in [3] must be invoked, where the full CSIT of the legitimate channel must be used to vary the transmission rate in every channel fading state. This can not be done with only statistical CSIT of the legitimate channel as in our setting. In addition to the CSIT assumptions, the secrecy capacity result of [3] is builded on the ergodic slow fading channel assumption where coding among lots of slow fading channel blocks (each block with lots of coded symbols) is used. This assumption may be unrealistic owing to the long latency. For fast fading channels with full CSIT of legitimate channel and statistical CSIT of the eavesdropper channel, only some achievability results are known [9]. In contrast to our results, in [9], the prefixing in [1] may be useful to increase the secrecy rate. More detailed comparisons between our results and those in [7][6][3] can be found in Remarks 1 and 2.

Ii System Model

In the considered MISOSE wiretap channel, we study the problem of reliably communicating a secret message from the transmitter to the legitimate receiver subject to a constraint on the information attainable by the eavesdropper (in upcoming (4)). The received signals and at the legitimate receiver and eavesdropper (each with single antenna) from the transmitter equipped with multiple-antenna, can be represented respectively as *** In this letter, is the vector norm of vector . The trace and complex conjugate transpose of matrix is denoted by and , respectively. The diagonal matrix is denoted by diag(.). The zero-mean complex Gaussian random vector with covariance matrix is denoted as . For random variables (vectors) and , is the probability distribution function (PDF) of , denotes the mutual information between them while denotes the conditional differential entropy. We use to represent that , and form a Markov chain. All the logarithm operations are of base 2 such that the unit of rates is in bit.

 y =hHx+ny, (1) z =gHx+nz, (2)

where is a complex vector representing the transmitted vector signal, is the number of transmit antennas, while and are independent and identically distributed (i.i.d.) circularly symmetric additive white Gaussian noise with zero mean and unit variance at the legitimate receiver and eavesdropper, respectively. In (1) and (2), and are both complex vectors, and representing the channels from the transmitter to the legitimate receiver and eavesdropper, respectively.

In this work, the channels are assumed to be fast Rayleigh fading. That is, and , respectively, while , , and are independent. The channel coefficients change in every symbol time. We assume that the legitimate receiver knows the instantaneous channel state information of perfectly, while the eavesdropper knows those of and perfectly. As for the CSIT, only the distributions of and are known at the transmitter, while the realizations of and are unknown. Thus the transmitter is subjected to a power constraint as

 Tr(Σx)≤P, (3)

where is the covariance matrix of in (1) and (2).

The perfect secrecy and secrecy capacity are defined as follows. Consider a -code with an encoder that maps the message into a length- codeword; and a decoder at the legitimate receiver that maps the received sequence (the collections of over the code length ) from the legitimate channel (1) to an estimated message . We then have the following definitions.

Definition 1 (Secrecy Capacity [3])

Perfect secrecy is achievable with rate if, for any , there exists a sequence of -codes and an integer such that for any

 H(w|zN,hN,gN)/N>R−ε′,andPr(^w≠w)≤ε′, (4)

where is the secret message, ,, and are the collections of , and over code length , respectively. The secrecy capacity is the supremum of all achievable secrecy rates.

Iii Secrecy capacity of the MISOSE fast Rayleigh fading wiretap channel

In this section, we fully characterize the secrecy capacity of the MISOSE fast Rayleigh fading channel in the upcoming Theorem 2. Before that, we present the following Theorem 1 which shows that the scheme in [6], which uses Gaussian without prefixing in [1], is capacity achieving. By introducing new bounding techniques, we obtain tighter secrecy capacity upper bound than that in [7] to attain the secrecy cpapcity. For such a upper bound, we form a better degraded MISOSE channel of (1)(2) with a less stringent perfect secrecy constraint than (4) (in the upcoming (6)), and tighten the upper bound by carefully introducing correlations to the channels and (in the upcoming (8)).

Theorem 1

For the MISOSE fast Rayleigh fading wiretap channel (1)(2) with the statistical CSIT of and , using Gaussain without prefixing is the optimal transmission strategy. And the non-zero secrecy capacity is obtained only when , which is

 (5)

where is the covariance matrix of the Gaussian channel input and subject to (3), while and .

Proof.

From [6], we know that the right-hand-side (RHS) of (5) is achievable and serves as a secrecy capacity lower-bound. Now we present our new secrecy capacity upper bound which matches the RHS of (5). The key for establishing such an upper-bound is first forming a better MISOSE channel than (1)(2) in terms of higher secrecy capacity, and applying the results in [1]. First, we consider a better scenario where the eavesdropper does not know the realizations of , and (4) becomes

 H(w|zN,gN)/N>R−ε′,andPr(^w≠w)≤ε′. (6)

As in [10], equivalently, we can respectively treat the output of the legitimate channel as while that of the eavesdropper channel as (z,). From [11], the secrecy capacity under constraint (6) is only related to the marginal distributions and . Then for any joint conditional PDF such that , the secrecy capacity under constraint (6) is the same. Now we introduce our same marginal legitimate channel for (1), which is formed by replacing in y with as

 y′=(h′)Hx+ny=((σh/σg)g)Hx+ny. (7)

Since is independent of and due to our CSIT assumption, both and have the same conditional distributions condition on (which equal to ). Then we know that =. From (7), we will focus on the MISOSE channel in the following,

 y′′=gHx+(σg/σh)ny,z=gHx+nz. (8)

We now use the secrecy capacity of the MISOSE channel (8), under constraint (6), to upper-bound that of the original channel (1)(2). Again, we can treat as and as the output of the legitimate channel and that of the eavesdropper channel in (8), respectively. As in [9], we apply this fact into the secrecy capacity formula of [1],

 Cs ≤maxp(U,x)I(U;y′′,g)−I(U;z,g) (9) =maxp(U,x)I(U;y′′|g)−I(U;z|g), (10) ≤maxpxI(x;y′′|z,g). (11)

where in (9) is an auxiliary random variable for prefixing, which forms the Markov chain ; the equality (10) follows from [10] by the independence of and due to our CSIT assumptions; and the inequality (11) is from [5]. When , the equivalent channel (8) is degraded, and given . From [12], apply the Markov chain property to (11)

 Cs≤maxpxI(x;y′′|g)−I(x;z|g). (12)

From [4], we know that Gaussian is optimal for the upper bound in (12), and the upper-bound in (12) matches the RHS of (5) when . Note that when , the RHS of (5) is positive. In contrast, when , the upper bound in (11) is zero since from (8), given . And it concludes the proof. ∎

Remark 1: When the transmitter additionally knows the realizations of , e. g. [3], the legitimate channel (7) is not a same marginal channel of (1). In this case, given , may not be Gaussian but is Gaussian, and from (7) may not equal to from (1), In our case, the CSIT assumption makes independent of and , then the legitimate channel in (7) has the same marginal as that in (1). Then we can get rid of the unrealistic ergodic slow fading assumptions in [3] and find the secrecy capacity in fast fading channel. Note that the upper-bound in [7] is obtained by directly applying the derivations in [3] to fast fading channel, and is looser than the upper-bound (11) which is based on the channel (8) and is tightened by the “worst” correlation between and ().

Now we show that the optimal of (5) is , and fully characterize the secrecy capacity as follows.

Theorem 2

For the MISOSE fast Rayleigh fading wiretap channel (1)(2) with the statistical CSIT of and , under power constraint , the non-zero secrecy capacity is obtained only when , which is

 Cs=\mathdsEh[log(1+P||h||2NT)]−\mathdsEg[log(1+P||g||2NT)], (13)

where , , and is the number of transmit antennas.

Proof.

Subjecting to (3), after substituting and into the optimization problem in (5), it becomes

 maxΣx(\mathdsEg[logσ2g/σ2h+gHΣxgσ2g/σ2h]−\mathdsEg[log(1+gHΣxg)]). (14)

By using the eigenvalue decomposition , where is unitary and is diagonal, finding the optimal of (14) is equivalent to solving

 = maxD(\mathdsEg[log(σ2g/σ2h+gHDg)]−\mathdsEg[log(1+gHDg)]), (15)

where the equality comes from the fact that the distribution of is unchanged by the rotation of unitary , and we can set () without loss of optimality.

In the following, we show that subjecting to , the optimal for (15) is

 D∗=diag{P/NT,P/NT,⋯,P/NT}. (16)

First of all, from [6, Section V], the optimal for (15) satisfies . For any where and , we want to prove that for defined in (16)

 \mathdsEg[log(a+gHDg)]−\mathdsEg[log(1+gHDg)] ≤ \mathdsEg[log(a+gHD∗g)]−\mathdsEg[log(1+gHD∗g)], (17)

where we denote by , which belongs to . Here we introduce some results from the stochastic ordering theory [8] to proceed.

Definition 2

[8, p.234] A function is completely monotone if for all and its derivative exists and .

Definition 3

[8, (5.A.1)] Let and be two nonnegative random variables such that , for all . Then is said to be smaller than in the Laplace transform order, denoted as .

Lemma 1

[8, Th. 5.A.4] Let and be two nonnegative random variables. If then , where the first derivative of a differentiable function on is completely monotone, provided that the expectations exist.

To prove (17), we let , , and to invoke Lemma 1. It can be easily verified that (x), the first derivative of , satisfies Definition 2. More specifically, the th derivative of meets

 ψ(n)(x)=⎧⎪⎨⎪⎩n!(a+x)n+1−n!(1+x)n+1>0,if n is even,−n!(a+x)n+1+n!(1+x)n+1<0,if n is odd, (18)

when , since . Now from Lemma 1 and Definition 3, we know that to prove (17) is equivalent to proving or . From [13, p.40], we know that

 log(\mathdsE[e−sB1]\mathdsE[e−sB2])=NT∑k=1log(1+2d∗ks)−NT∑k=1log(1+2dks). (19)

To show that the above is nonnegative, we resort to the majorization theory. Note that is a Schur-concave function [14] in , , and by the definition of majorization [14]

 (d∗1,⋯,d∗NT)=(P/NT,P/NT,⋯,P/NT)≺(d1,d2,⋯dNT),

where means that is majorized by . Thus from [14], we know that the RHS of (19) is nonnegative, . Then (17) is valid, and is the optimal for (15). Note that is also the optimal of (5) according to the discussion under (15). Substituting in (16) as the optimal into the target function of (5), we have (13). ∎

Remark 2: In [6, Sec. VII], the optimal for (13) is found by an iterative algorithm without guaranteeing the optimality. The contribution of Theorem 2 is analytically finding the optimal , which equals to in (16), by exploiting the completely monotone property of the ergodic secrecy capacity and invoking Lemma 1. Finally, as discussed in Section I, the secrecy rate lower-bound in [7] is not achievable, thus the conclusion in [7] that uniform power allocation among transmit antennas as (16) is not secrecy capacity achieving is wrong.

Iv Simulation Results

In this section we compare the secrecy capacities under different channel conditions. The transmit signal-to-noise ratio (SNR) is defined as in dB scale since and both have unit variances. In Fig. 1 we compare the secrecy capacities with under different . The secrecy capacity increases with decreasing . The capacity converges to when the SNR is high, which meets with large . In Fig. 2, we compare the secrecy capacities with different numbers of transmit antennas . We can also find that the capacity converges when is large enough. This results can be easily seen by letting in (13), and applying the central limit theorem on and , respectively.

V Conclusion

In this paper, we derived the secrecy capacity of the MISOSE ergodic fast Rayliegh fading wiretap channel, where only the statistical CSIT of the legitimate and eavesdropper channels is known. By introducing a new secrecy capacity upper bound, we first showed that Gaussian input without prefixing is secrecy capacity achieving. Then we analytically found the optimal channel input covariance matrix, and fully characterized the secrecy capacity.

References

• [1] I. Csiszár and J. Kȯrner, “Broadcast channels with confidential messages,” IEEE Trans. Inform. Theory, vol. 24, no. 3, pp. 339–348, May 1978.
• [2] A. D. Wyner, “The wiretap channel,” Bell Syst. Tech. J., vol. 54, pp. 1355–1387, 1975.
• [3] P. Gopala, L. Lai, and H. El Gamal, “On the secrecy capacity of fading channels,” IEEE Trans. Inform. Theory, vol. 54, no. 10, pp. 4687–4698, Oct. 2008.
• [4] F. Oggier and B. Hassibi, “The secrecy capacity of the MIMO wiretap channel,” IEEE Trans. Inform. Theory, vol. 57, no. 8, Aug. 2011.
• [5] T. Liu and S. Shamai, “A note on the secrecy capacity of the multi-antenna wiretap channel,” IEEE Trans. Inform. Theory, vol. 55, no. 6, pp. 2547–2553, Jun. 2009.
• [6] J. Li and A. P. Petropulu, “On ergodic secrecy rate for Gaussian MISO wiretap channels,” IEEE Trans. Wireless Commun., vol. 10, no. 4, pp. 1176–1187, April 2011.
• [7] Z. Rezki, F. Gagnon, and V. Bhargava, “The ergodic capacity of the MIMO wire-tap channel,” http://arxiv.org/abs/0902.0189, 2009.
• [8] M. Shaked and J. G. Shanthikumar, Stochastic Orders.   Springer, 2007.
• [9] Z. Li, R. Yates, and W. Trappe, “Achieving secret communication for fast Rayleigh fading channels,” IEEE Trans. Wireless Commun., vol. 9, no. 9, pp. 2792 – 2799, Sep. 2010.
• [10] G. Caire and S. Shamai, “On the capacity of some channels with channel state information,” IEEE Trans. Inform. Theory, vol. 45, no. 6, pp. 2007–2019, Sept. 1999.
• [11] Y. Liang, H. V. Poor, and S. S. Shamai, “Information theoretic security,” Foundations and Trends in Communications and Information Theory, vol. 5, no. 4-5, pp. 355–580, Apr. 2009.
• [12] T. M. Cover and J. A. Thomas, Elements of information theory, 2nd ed.   Wiley-Interscience, 2006.
• [13] A. M. Mathai and S. B. Provost, Quadratic forms in random variables.   Marcel Dekker, New York, 1992.
• [14] A. W. Marshall and I. Olkin, Inequalities: theory of majorization and its application.   Academic Press, 1980.
You are adding the first comment!
How to quickly get a good reply:
• Give credit where it’s due by listing out the positive aspects of a paper before getting into which changes should be made.
• Be specific in your critique, and provide supporting evidence with appropriate references to substantiate general statements.
• Your comment should inspire ideas to flow and help the author improves the paper.

The better we are at sharing our knowledge with each other, the faster we move forward.
The feedback must be of minimum 40 characters and the title a minimum of 5 characters