Impact of Angular Spread in Moderately Large MIMO Systems under Pilot Contamination
Abstract
Pilot contamination is known to be one of the main bottlenecks for massive multiinput multioutput (MIMO) networks. For moderately large antenna arrays (of importance to recent/emerging deployments) and correlated MIMO, pilot contamination may not be the dominant limiting factor in certain scenarios. To the best of our knowledge, a rigorous characterization of the achievable rates and their explicit dependence on the angular spread (AS) is not available in the existing literature for moderately large antenna array regime. In this paper, considering eigenbeamforming (EBF) precoding, we derive an exact analytical expression for achievable rates in multicell MIMO systems under pilot contamination, and characterize the relation between the AS, array size, and respective user rates. Our analytical and simulation results reveal that the achievable rates for both the EBF and the regularized zeroforcing (RZF) precoders follow a nonmonotonic behavior for increasing AS when the antenna array size is moderate. We argue that knowledge of this nonmonotonic behavior can be exploited to develop effective usercell pairing techniques.
I Introduction
“Massive” multiinput multioutput (MIMO) is a recent technology that can significantly improve the spectral/energy efficiency of future wireless networks [1, 2, 3], and hence can help to address the exponentially growing traffic demand due to proliferation of smart devices. Interest in timedivisionduplexing (TDD) massive MIMO systems has recently surged [4, 2, 5, 6, 7, 8, 9, 10], due, in part, to their inherent scalability with the number of base station (BS) antennas where a single UL pilot trains the whole BS array. In particular, in TDD massive MIMO systems, the channel state information at the transmitter (CSIT) can be obtained by leveraging the channel reciprocity [11]. However, when the user density gets larger in a TDD massive MIMO network (e.g., as in urban areas), the scarcity of pilot resources necessitates the pilot resource reuse by the user equipments (UEs) in different cells. This results in pilot contamination, which impairs the orthogonality of the downlink transmissions from different BSs in TDD networks, diminishing the achievable aggregate capacity.
Adverse effects of pilot contamination in TDDMIMO networks have been studied extensively in the recent literature, e.g., see the survey [12] and the references therein. In particular, the pioneering papers [4, 13] define and investigate the pilot contamination problem over an uncorrelated MIMO channel. In [11], analytical rate expressions are derived for correlated MIMO channels under pilot contamination. However, this analysis is done only for the asymptotic regime considering very large antenna array sizes. In [6], pilot contamination is considered over a correlated MIMO channel, for finite and large antenna array regimes. Considering an asymptotic analysis, the adverse impacts of the pilot contamination are discussed to be completely eliminated with large antenna arrays. This is achieved when the UL beams have nonoverlapping angular support, which can only be satisfied with small AS values. In a followup work [14], the powerdomain separation of the desired and the interfering user channels is considered to overcome pilot contamination issue (also studied in [15]).
The pilot contamination effect has a strong connection with the AS of the propagation environment. Interestingly, the existing literature lacks a rigorous analysis for the explicit effect of the AS on the user rates under pilot contamination. In particular, focusing on correlated MIMO channels and moderately large antenna array sizes with antenna elements are important since these are common scenarios in present real world deployments. For instance, the rd generation partnership project (3GPP) group is currently focusing on millimeterwave (mmWave) transmissions [16] which consists of correlated MIMO channels due to limited AS (which is frequency dependent). In addition, mmWave transmission is also receiving high attention for vehicular communication mainly due to the possibilities of generating highly directional beams [17] (without much interference), and providing high bandwidth for connected vehicles [18]. The new radio (NR) techniques for the th generation (G) wireless communication are considering moderate array sizes even at mmWave frequencies [19], i.e., at GHz, 128 antenna elements (single polarized) in uniform planar array (UPA). For long term evolution (LTE) systems operating at sub GHz frequencies [20] the number of antennas considered is even smaller, i.e., maximum 32 antenna elements (single polarized) in UPA [21]. Further, for drone based communication networks [22, 23] and moving networks (MNs) [24], having a large antenna array is not practically feasible due to the availability of limited form factor. As a result, it becomes crucial to operate with moderate size antenna arrays for such vehicular communication networks.
In this study, we investigate the impact of the AS on the achievable user rates for a TDD based transmission over correlated MIMO channels. In particular, the effect of pilot contamination on achievable rates is analyzed with a special focus on the moderate antenna array size regime. The specific contributions of this work, which is a rigorous extension of [25], can be summarized as follows:

An exact analytical expression for the achievable rate is derived considering eigenbeamforming (EBF) precoding explicitly taking in to account the impact of AS. In contrast to the earlier work in the literature [11, 6], this analysis is valid for any antenna array size, which is verified to match perfectly with the simulation data under various settings.

We show analytically that although large AS leads to stronger pilot contamination for the EBF precoding by impairing the interference channel orthogonality, this does not necessarily degrade the ergodic rates when the array size is moderate. Interestingly, fluctuation of the channel power around its longterm mean reduces (similar to the socalled channel hardening effect [26, 27, 28]) with the increasing AS, which in turn improves achievable rates for the EBF precoding.

We show that the achievable rates of the EBF and the regularized zeroforcing (RZF) precoders exhibit a nonmonotonic behavior with respect to the AS for moderate antenna array sizes. The AS that results in a minimum/maximum rate depends on the relative positions of UEs and their serving BSs. Hence, the potential for developing efficient usercell pairing algorithms based on the derived rate expression is also discussed with the purpose of maximizing the network throughput.
Table I places the specific contribution of our work in the context of the existing literature. Note that while [6, 14] present simulation results with specific AS values for correlated MIMO and moderate number of antennas, analytical characterization of the achievable rates explicitly as a function of the AS is not carried out.
Reference  Number of  Channel  Investigation of 
antennas  type  Angular spread  
[4]  Asymptotic  Uncorrelated  No 
[6]  Moderate  Correlated  No 
[11]  Asymptotic  Correlated  No 
[13]  Moderate  Uncorrelated  No 
[14]  Moderate  Correlated  No 
Our work  Moderate  Correlated  Yes 
The rest of the paper is organized as follows: Section II introduces the system model for a multicell, TDDbased correlated MIMO network along with UL channel training under pilot contamination. An exact analytical expression to calculate achievable DL rates with the EBF precoding is derived for a given AS and an arbitrary array size in Section III. The individual power terms constituting the achievable rate expression are further investigated for the EBF precoding in Section IV, in order to develop insights on the explicit behavior of the ergodic rate as a function of the AS. Extensive numerical results are provided in Section V, and finally, Section VI provides some concluding remarks.
Notations: Bold and uppercase letters represent matrices whereas bold and lowercase letters represent vectors. denotes the th row and th column element of matrix A. , , , , , , , and represent the Euclidean norm, absolutevalue norm, transpose, Hermitian transpose, complex conjugation, trace of a matrix, Kronecker product, statistical variance and expectation operators, respectively. denotes the complexvalued multivariate Gaussian distribution with the mean vector m and the covariance matrix C, and denotes the continuous Uniform distribution over the interval . and are the identity matrix and zero matrix respectively, and is the Kronecker delta function taking if , and otherwise. denotes the convergence in probability.
Ii System Model
We consider a multicell scenario with cells where each cell includes a single BS equipped with a uniform linear antenna array (ULA) of size . In each cell, a total of UEs each with a single antenna are being served by their respective BSs under perfect timesynchronization. Since we are dealing with pilot contamination with varying AS, we assume that there is one UE in each cell that employs the same pilot sequence with other UEs in other cells during the UL channel estimation. By this way, all the users in this multicell layout are contributing to the pilot contamination, and the scenario where multiple UEs employ nonorthogonal pilot sequences in each cell remains a straightforward extension. Note that the perfect timesynchronization assumption is arguably the worst condition in terms of the pilot contamination as any synchronization approach will make the pilot sequences more orthogonal, and hence reduce the pilot contamination [6].
In our analysis, we assume a TDD protocol consisting of subsequent UL training and DL transmission phases, where the interaction between two adjacent cells are sketched for the DL transmission in Fig. 1. In the UL training phase, all UEs transmitting the same pilot sequence is received by all the BSs in the network. Based on the received pilot sequence, each BS first estimates the channel to its desired UE and then computes the precoding vector based on this estimated channel. During the DL transmission phase, each BS transmits data to its desired UE employing the DL precoding. Note that, due to the pilot contamination, the precoding vector is not aligning well with its desired user channel. Hence, as shown in Fig. 1 DL propagation direction is not the same as the desired user channel direction.
The UL channel between the th UE and the th BS is
(1) 
where is the number multipath components (MPCs), is the complex path attenuation, is the angle of arrival (AoA) of the th MPC, and is the steering vector given as
(2) 
where is the element spacing in the ULA, and is the wavelength. The complex path attenuation and the AoA are assumed to be uncorrelated over any of their indices, and with each other. In particular, is circularly symmetric complex Gaussian with , and the variance captures the effect of the largescale path loss, where is the distance between th UE and th BS, is the path loss exponent, and is the normalization parameter to achieve a given signaltonoise ratio (SNR) at the BS [6]. We consider uniform distribution for the AoA with , where is the lineofsight (LoS) angle between th UE and the th BS, and is the AS. With the channel model in (1), next, we study how to achieve UL training and channel estimation.
Iia UL Training and Channel Estimation with Correlated MIMO Channels
In the UL training phase, the UEs transmit the common pilot sequence of size denoted by , where each pilot symbol is chosen in an independent and identically distributed (iid) fashion from a discrete alphabet consisting of unity norm entries. The matrix of the received symbols at the th BS is given as
(3) 
where is a noise matrix consisting of circularly symmetric complex Gaussian entries with . In an equivalent vector representation, (3) is given as
(4) 
where vectors and are obtained by stacking all columns of and , respectively, and is the training matrix of size satisfying . Following the convention of [4], the SNR is defined for this particular phase to be .
At the th BS, the channel to the th UE can be estimated using linear minimum mean square error (LMMSE) criterion as follows [6]
(5) 
where is the pilotindependent estimation filter given as
(6) 
The covariance matrix in (6) is defined elementwise as follows
(7)  
where is the probability distribution function (pdf) of the AoA, and is the angular covariance matrix of the steering vector. Employing (6) and (7), the resulting covariance matrix of the channel estimate in (5) is given as
(8) 
where we present the detailed derivation steps for covariance matrices in Appendix A.
Iii Achievable DL Rates for Correlated MIMO Channels with Moderate Antenna Array Sizes
In this section, we study the achievable DL rates for correlated MIMO channels specifically considering the moderate size antenna array regime. In particular, we derive an exact analytical expression to calculate achievable DL ergodic rates with eigenbeamforming (EBF) precoding under pilot contamination. This rate expression is applicable to any antenna array size, unlike the case in [11] where asymptotic antenna array regime is taken in to consideration.
During the DL data transmission, each BS employs the channel estimate obtained in the UL training phase as discussed in Section IIA to compute the precoding vector for its own UE relying on the perfect reciprocity of the UL and the DL channels in the TDD protocol [4]. The received signal at the th UE can therefore be given as
(9) 
where is the precoding vector of the th BS for its own user, normalizes the average transmit power of the th BS to achieve the same SNR in the UL training phase [11], is the unitenergy data symbol transmitted from th BS to its own UE and chosen from a discrete alphabet in an iid fashion, and is the circularly symmetric complex Gaussian noise with . The beamforming strategy is assumed to be either the EBF (also known as conjugate beamforming) or the regularized zeroforcing (RZF) [11], and is given at the th BS as follows
(EBF Precoder)  (10)  
(RZF Precoder)  (11) 
In the following, the impact of AS on the achievable rates is investigated under both of these beamforming strategies, with a rigorous analytical rate derivation for the EBF precoding.
Iiia Achievable DL Rates with Precoding
(12) 
We now study achievable DL rates as a function of the AS over the underlying correlated MIMO channel with the EBF and the RZF precoding. In particular, we provide an exact analytical expression to calculate achievable rates with EBF precoding. By assuming UEs have just the knowledge of longterm statistics of the effective channel and not the instantaneous CSI, the ergodic rate as given in (12) is achievable at the th UE [13]. In that, captures the desired signal power, is interpreted as the selfinterference and arises from the lack of information on the instantaneous channel at the UE, and is the intercell interference since it represents the interference from the other BS signals. Here, the power normalization factor is given by .
The rate approximation in (12) is arguably conservative, as discussed in [8], and can be interpreted as “selfinterference limited” rate since the selfinterference term dominates at high SNR regime for finite ULA sizes. However, since our focus in this study is to evaluate the impact of the AS on the correlated MIMO channels at fixed SNR, the rate approximation in (12) is used confidently. It is worth noting that, the achievable rates can also be evaluated by considering the alternative expression suggested in [8, Eqn. (32)] using the first and the second order moments of the effective channel derive subsequently.
In the following theorem, considering that the EBF precoder in (10) is used in the DL transmission, we derive analytical expressions of the first and the second order moments for the effective channel in order to be able to calculate the achievable rate in (12).
Theorem 1
Assuming that LMMSE channel estimation is used in the UL training, and that EBF precoding as in (10) is used prior to DL data transmission, the first order moment of the effective channel is given as
(13) 
(14) 
(15) 
Proof:
See Appendix B. \qed
Iv Impact of AS on Desired and Interference Signal Power Terms
In this section, we study in detail the explicit impact of AS on the desired signal power (Section IVA), the intercell interference (Section IVB), and the selfinterference (Section IVC) terms in (12) in relation to pilot contamination effect from statistical and geometrical perspectives. We also draw useful insights about their behavior for varying AS considering different array sizes. It is worth remarking that, any variation in the achievable rates with varying AS is due to collective contribution from all these three terms, and taking any of them only individually into account may be misleading when evaluating the overall rate results presented in Section V.
Iva Effect of Covariance Matrix Diagonalization on Desired Signal Power
Let us consider the structure of the angular covariance matrices and in (7) with varying AS and antenna array size. Under the assumptions in Section II, the main diagonal entries of these covariance matrices are all , and the offdiagonal entries representing the angular correlation have nonzero norms smaller than . In Fig. 2, the Euclidean norms of the th offdiagonal entries having the minimum and the maximum absolute separation of and , respectively, are depicted along with the increasing AS for and considering in Fig. 1. We observe that each of these covariance matrices gets more diagonalized (the magnitude of the offdiagonal entries decreases) when the array size or the AS increases. The diagonalization rate increases with since both covariance matrices get diagonalized much faster for larger values. Note that any BS in the multicell network will receive signals from a wide range of AoAs as the AS increases, which is similar to the uncorrelated richscattering environment where the possible AoAs span angle support.
The signal power variation with respect to the AS can be assessed through the diagonalization characteristic of the covariance matrices. Employing and the first order moment given in (13), the signal power can be expressed as , which can be expressed more elaborately as follows
(17) 
Since the estimation matrix in (6) becomes diagonal for larger AS values (similar to and ), and that both the terms at the right hand side of (IVA) are real and positive, the second summation in (IVA) decreases when the AS increases. This leads the covariance matrix to become more diagonal. As a result, the signal power in (12) decreases with increasing AS through the diagonalization of the covariance matrices.
This behavior of the signal power with increasing AS can be intuitively interpreted as follows. As we will discuss in Section IVB, when the AS increases, the orthogonality between the desired user precoder and the interfering user channel gets impaired along with more powerful pilot contamination. Hence, does not exactly align with the th user channel direction , any more. This geometrical misalignment accordingly results in transmit power leakage from th BS to some undesired directions (other than the th user direction) during the DL data transmission, which in turn leads to signal power loss at the th user.
IvB Geometrical Interpretation of Intercell Interference Power
In the DL data transmission, the pilot contamination shows its adverse effect by impairing the orthogonality between the desired and the interfering user channels, which is basically captured by the intercell interference term in (9). From a geometrical perspective, the intercell interference involves the inner product between each interference channel for , and the precoder , which is a function of the estimate of the desired user channel . One way to examine how the pilot contamination impairs the orthogonality, and hence amplify the intercell interference power with varying AS is through a geometric interpretation. This can be done by analyzing the pdf of the random angle between and , where we leave the actual numerical evaluation to Section V. Note that, if the channels were perfectly known and spatially uncorrelated, the desired pdf would be given analytically as [29].
To study the impact of AS on intercell interference, we consider an example scenario with the representative setting of Fig. 1. In that, we assume , quadrature phaseshift keying (QPSK) symbols in the UL training phase with the sequence length , and the pathloss exponent . In Fig. 3, we depict the pdf of the random angle for and . We observe that the desired and the interfering user channels are sufficiently orthogonal for small AS values since takes values close to with high probability for perfect CSI, and the resulting orthogonality can even be stronger than the uncorrelated channel case. This geometrical interpretation agrees with [6] in the sense that the training beams of different UEs do not overlap in the UL transmission when the AS is sufficiently small making the scenario free from any pilot contamination effect. As a result, the random angle between the precoder and the interfering user channel has the same pdf for the perfect CSI () and the channel estimation () scenarios when the beams are separated sufficiently, or equivalently the AS is small enough. When the AS starts to increase, spatial correlation between the desired and the interfering user channels becomes stronger since the overlap between AoA domains associated with desired and interfering user channels becomes larger. As a result, the desired orthogonality inherently gets impaired for larger AS values, even for the perfect CSI case.
When the desired user channel is being estimated, this orthogonality gets hurt even more because of the pilot contamination effect. This can be observed in Fig. 3 from the deviation of the pdf of associated with the channel estimation scenario, to the left side (toward ) with respect to perfect CSI scenario when . The orthogonality gets impaired further when the SNR increases since larger SNR in each cell implies more interference power transferred to other cells. Finally, comparing Fig. 3LABEL:sub@fig:pdf_M_10 and Fig. 3LABEL:sub@fig:pdf_M_50 we observe that, for a given AS the precoder and the interfering channels are getting more orthogonal with increasing antenna array size, which is one of the main goals of massive MIMO in the context of the intercell interference rejection [2].
IvC Effect of Channel Power Fluctuation on SelfInterference
The rate bound given in (12) is discussed to be achievable in [13] assuming that the UEs in the network do not know their instantaneous channels, but rather they only know the respective longterm means. This lack of information on the exact instantaneous channel is captured by the selfinterference term in (12). This term actually represents the power of the deviation between the instantaneous channel and the longterm mean, given equivalently as . Assuming EBF precoding in (10) with perfect CSI, this term becomes equivalent to the variance of the channel power. We therefore note that as the fluctuation of the channel squarenorm around the longterm mean decreases, which is similar to the phenomenon known as the channel hardening [26, 27, 28], the selfinterference term should decrease accordingly. In the following, this fluctuation and hence the selfinterference is shown to decrease monotonically when the AS increases, for the EBF precoding.
Lemma 1
The channel considered in (1) hardens, such that as , if we have as , where is the hardening measure given as
(18) 
with for any .
Proof:
We observe that the desired convergence is satisfied only if the number of paths is sufficiently large. Even when the number of paths or the antenna array size have moderate values in contrast to asymptotic approximations, the behavior of the selfinterference power can still be assessed from (18). In Fig. 4, we depict along with AS for various under the assumption that and with representing the AS. We observe that decreases monotonically for increasing AS for all cases, and gets even smaller values as the array size increases. Note that the smaller implies a better convergence of the channel squarenorm to its longterm mean with high probability, and hence less fluctuation in the channel power around its longterm mean. Since the selfinterference power is closely related to the channel power fluctuation, decaying behavior of with the increasing AS implies reduction in the selfinterference power, as well.
V Numerical Results and Discussion
In this section, we present numerical results to evaluate the impact of AS on the achievable rates in a multicell network under pilot contamination and considering the EBF and the RZF precoders. The theoretical derivations presented in Section IIIA are employed for analytical evaluations, and the corresponding simulation data is generated through extensive Monte Carlo runs. Without any loss of generality, we assume QPSK modulated pilot symbols in the UL training of sequence length , the pathloss exponent , with , , and together with the distances and shown in Fig. 1. Note here that rate results presented in this section are for the UE in th cell.
Va TwoCell Scenario: Fixed Interfering UE Position
This section considers a twocell scenario where the th and th cells in Fig. 1 are designated as the interfering and the desired cells, respectively, and the angular position of the th UE is . Fig. 5 captures the achievable rates with the EBF precoding for array sizes of . We observe that the analytical results follow the characteristic behavior of the simulation data in all cases of interest. Further, we can observe from Fig. 5 that, for rates are not monotonically increasing, and actually there is minimum rate value at . As it will become clear in Section VB, this unfavorable AS value corresponding to the minimum rate depends on the underlying geometry. Therefore, the location of this minimum can be controlled through the deployment geometry, and, in particular through the angle of UE separation captured by ’s in Fig. 1.
Remark 1
The real AS value of the propagation environment is independent of the underlying geometry, and it rather depends on the carrier frequency of the communication setting and some other features [30, 16]. As a result, the nonmonotonic behavior of the achievable rates with respect to the AS (e.g. in Fig. 5) can be utilized to enhance the aggregate throughput. This can be achieved by discouraging the formation of usercell pairs if the unfavorable AS value associated with the minimum rate is close to the real AS value of the environment. To the best of our knowledge, none of the existing usercell pairing approaches proposed in the literature exploit the AS of the propagation environment [31, 32, 33, 34, 35, 36]. Note that the nonmonotonic behavior cannot be revealed through an asymptotic analysis due to the moderately large antenna array size regime that we consider in this paper.
Remark 2
Even though user rates for relatively larger antenna array sizes ( and ) exhibit a sharp increase for small AS region, they tend to saturate eventually at larger AS values due to the severe pilot contamination, as discussed in Section IVB. On the other hand, relatively smaller array sizes ( and ) result in no such saturation, which implies that the pilot contamination is not a dominant effect over achievable rates in this array size regime.
In Fig. 6, the signal, the selfinterference, and the intercell interference powers derived in Section IIIA for the EBF precoder are captured separately for the scenario in Fig. 5. We observe that the analytical results follow the simulation data successfully in all cases of interest. The signal and the intercell interference powers are observed to exhibit relatively flat characteristics over a range of small AS values up to approximately . In this region, the AS values are sufficiently small, and the UL training beams of the UEs are therefore well separated. This is the reason for zero intercell interference in this region, which implies no pilot contamination effect and agrees with the geometrical interpretation of Section IVB.
When the AS increases beyond , the intercell interference also starts increasing due to pilot contamination, and saturates around . Further, the DL transmission does not exactly align with the desired signal direction any more, which appears as the decreasing trend in the signal power as discussed in Section IVA. Note that, as captured in Fig. 6 the selfinterference power has a decaying trend with the increasing AS as discussed in Section IVC and for , selfinterference starts decreasing earlier and much faster compared to . Since the decrease in the signal power dominates initially for over selfinterference, we observe a nonmonotonic rate behavior for with a minimum at . On the other hand, since the selfinterference starts decaying quickly for compared to the signal power, we observe monotonically increasing rate behavior for .
The achievable rates for the RZF precoder in the DL transmission is depicted in Fig. 7 for the scenario of Fig. 5 with the ULA sizes of . We observe that the achievable rates with the RZF precoder is higher than that with the EBF precoder in Fig. 5 for the same array sizes at the expense of a larger computational complexity. We also observe a nonmonotonic behavior in achievable rates for all the array sizes of interest, where there is a minimum at for , and a maximum at for , respectively.
Remark 3
Similar to the EBF precoder case, the nonmonotonic behavior as illustrated in Fig. 7 can be utilized effectively to enhance aggregate throughput by: 1) encouraging the formation of usercell pairs if the favorable AS value associated with the maximum rate is close to the real AS value of the environment; and similarly, 2) discouraging the usercell pairs for which the unfavorable AS value associated with the minimum rate is close to the real AS value of the environment.
VB TwoCell Scenario: Varying Interfering UE Position
In this section, we consider the effect of various angular positions of the th interfering UE on achievable rates. Fig. 8, captures the achievable rates for the EBF and the RZF precoding in a cell scenario with at a set of angular positions for the th interfering UE. We observe that as the interfering UE gets closer to the desired UE, which is indicated by the increasing in Fig. 1, the achievable rate reduces for both the precoders. In addition, we observe either lower maxima or deeper minima located at smaller AS values, when increases.
As captured in Fig. 9, the intercell interference increases for larger since the interfering UE gets closer to the desired UE and this indicates more powerful pilot contamination (see Section IVB). In addition, the signal power gets smaller accordingly when increases since DL transmission does not align properly with the desired user direction any more, as discussed in Section IVA. Furthermore, the selfinterference is not highly affected from (and hence from the pilot contamination) as shown in Fig. 9 along with the discussion in Section IVC. As a result, increasing intercell interference and decreasing signal power, both of which occur with increasing , result in reduced user rates. This is also the reason behind the deeper minima observed for the EBF precoding for larger when . Fig. (a)a shows that for , the monotonically increasing rate behavior with the EBF precoding for disappears for , and instead a minimum value appears at . The intercell interference for larger can be very strong for with RZF such that the maxima at switches into a minimum at , as shown in Fig. (b)b.
VC MultiCell Scenario
Finally, we consider the impact of AS in a multicell setting with the number of cells . To this end, a multicell setting is generated by considering five cells as shown in Fig. 1, where the interfering UEs are located at with respect to the horizontal axis for the th, th, th, and th cells, respectively. This layout provides almost the worst condition in terms of the intercell interference power, and hence the pilot contamination. The effect of this multicell setting on the achievable rates with the EBF and the RZF precoders is presented in Fig. 10 for . We observe that as we consider more cells, the resulting interference degrades achievable rates together with much lower maxima or deeper minima. We even observe the formation of an additional maximum for the EBF at and minimum for the RZF at when . Since adding more cells strengthens the intercell interference very rapidly, the desired signal power reduces proportionally, as shown in Fig. 11. These impairing effects eventually reduce and even saturate achievable rates, as shown in Fig. 10.
Vi Concluding Remarks
We investigated the impact of AS on the achievable rates in a multicell environment under pilot contamination, considering moderately large antenna arrays. An exact analytical expression for achievable rate is derived for the EBF precoding considering arbitrary antenna array size. For correlated MIMO channels, we studied how interference channel orthogonality is affected from increasing AS along with the pilot contamination. Further, the channel power fluctuation around its longterm mean is analytically evaluated for varying ASs considering different antenna array sizes.
When the AS gets larger, we showed through rigorous analyses that 1) the covariance matrices tend to have a more diagonalized structure, 2) the channel power fluctuation diminishes (in a similar way as in the channel hardening), and 3) the orthogonality of the interference channel gets impaired due to pilot contamination effect. The overall achievable rate behavior as a function of the AS depends on which of these factors dominate over the other. Our analysis quantitatively identifies the antenna array size beyond which the pilot contamination starts being a dominant factor.
Lastly, our numerical results reveal a nonmonotonic behavior (with respect to the AS) of the achievable rates for both the EBF and the RZF precoders under certain scenarios. The AS values at which the rate minimum/maximum occurs depend on the relative positions of the UEs and their serving BSs. Such a knowledge, along with the rate expression derived in this paper, can be effectively utilized to maximize the aggregate network throughput via careful design of usercell pairing strategies. Due to space limitations we have left analytical rate evaluations with RZF precoder as a future research work.
Appendix A Covariance Matrix Derivation
The covariance matrix of the channel vector in (1) is given as
(19)  
(20) 
where (19) employs , and (20) follows from the fact that the distribution of AoA is identical for any choice of the path index . Defining the angular covariance matrix of the steering vector as , and employing (2), the elementwise angular correlation is given as follows
(21) 
where is the probability distribution function (pdf) of the AoA distribution. In particular, assuming the onering scatterer model [37, 38] and uniform distribution for AoA with , (A) can be given as
We now derive the covariance matrix of the channel estimate , denoted by . Employing the definition of in (5), and the UL signal model in (4), is given as
(23)  
(24)  
(25) 
where we employ the relations and . Since the noise and the channel vectors are uncorrelated and zeromean, the expectations in (25) cancel, and we have
(26) 
where the second term in (26) vanishes since for . Then becomes
(27) 
where we employ hermitian symmetry of covariance matrices and to obtain (8).
Appendix B First and Second Order Moment Derivation
In this section, we derive the first and second order moments and , respectively, for the EBF precoding given in (10). Before the analysis, we define the following property which is used throughout this section while evaluating the mean of the quadratic and the doublequadratic forms involving random vectors.
Lemma 2
Assume that be a set of zeromean random vectors of arbitrary sizes where each of them may be individually correlated with the arbitrary covariance matrices . For the given coefficient matrices A and B of the appropriate sizes and with arbitrary entries, the quadratic form and the doublequadratic form are zeromean if at least one of these random vectors are uncorrelated with the others.
Proof:
Assuming that is uncorrelated with the others, without any loss of generality, regardless of whether are correlated with each other or not, we have
where denotes the th entry of . \qed
Ba First Order Moment
Employing the UL signal model in (4) and the channel estimate in (5), the first order moment of the desired signal is given as follows
(28) 
where the last line employs . Since and are zeromean and uncorrelated, the second expectation in (28) vanishes as per Lemma 2. The desired expectation becomes
(29)  
(30) 
where the second expectation in (29) is similarly zero as per Lemma 2 since the channels of th and th UEs to the th BS, denoted by and , respectively, are uncorrelated from each other, and zeromean by definition. Finally, representing (30) by using the trace operator, employing the Hermitian symmetry of the covariance matrix, and incorporating the covariance matrix of the channel estimate in (8) yield the desired expression given in (13).
BB Second Order Moment
Employing (4) and (5), as in Appendix BA, the second order moment of the desired signal is given as follows
(31)  
(32)  