Interference Exploitation Precoding for MultiLevel Modulations: ClosedForm Solutions
Abstract
In this paper, we study closedform interferenceexploitation precoding for multilevel modulations in the downlink of multiuser multipleinput singleoutput (MUMISO) systems. We consider two distinct cases: first, for the case where the number of served users is not larger than the number of transmit antennas at the base station (BS), we mathematically derive the optimal precoding structure based on the KarushKuhnTucker (KKT) conditions. By formulating the dual problem, the precoding problem for multilevel modulations is transformed into a prescaling operation using quadratic programming (QP) optimization. We further consider the case where the number of served users is larger than the number of transmit antennas at the BS. By employing the pseudo inverse, we show that the optimal solution of the prescaling vector is equivalent to a linear combination of the right singular vectors corresponding to zero singular values, and derive the equivalent QP formulation. We also present the condition under which multiplexing more streams than the number of transmit antennas is achievable. For both considered scenarios, we propose a modified iterative algorithm to obtain the optimal precoding matrix, as well as a suboptimal closedform precoder. Numerical results validate our derivations on the optimal precoding structures for multilevel modulations, and demonstrate the superiority of interferenceexploitation precoding for both scenarios.
I Introduction
PRECODING has been widely studied in multiantenna wireless communication systems to simultaneously support data transmission to multiple users [1]. When the channel state information (CSI) is known at the transmitter side, dirty paper coding (DPC) that subtracts the interference prior to transmission achieves the channel capacity [2]. Despite its promising performance, DPC is generally difficult to implement in practical wireless systems, due to its impractical assumption of an infinite source alphabet and prohibitive complexity. Therefore, suboptimal approximations of DPC in the form of TomlinsonHarashima precoding (THP) and vector perturbation (VP) precoding have been proposed in [3] and [4], respectively. While offering nearoptimal performance, both THP and VP are still nonlinear precoding methods and include a spheresearch process, which makes their complexity still unfavorable, especially when the number of data streams is large. Accordingly, lowcomplexity linear precoding methods such as zeroforcing (ZF) [5] and regularized ZF (RZF) [6] have become popular. On the other hand, downlink precoding based on optimization has also received increasing research attention [7][13]. Among optimizationbased precoding methods, the two most wellknown designs are referred to as signaltonoiseplusinterference ratio (SINR) balancing [7][9] and power minimization [10][12], where SINR balancing aims to maximize the minimum received SINR subject to a total transmit power constraint [7], [8] or a perantenna power constraint [9], and power minimization targets minimizing the power consumption at the transmitter side while guaranteeing a minimum SINR at each receiver [11].
For both the closedform precoding schemes and the optimizationbased precoding approaches described above, the CSI at the base station (BS) is exploited to design the precoding strategy that eliminates, avoids or limits interference. The above approaches ignore the fact that the information in the transmitted data symbols themselves can also be exploited in the downlink precoding design on a symbolbysymbol basis for further performance improvements. With information about the data symbols and their corresponding constellations, the instantaneous interference can be divided into constructive interference (CI) and destructive interference [14]. More specifically, CI is defined as interference that pushes the received signals away from the detection thresholds [15], [16], which provides further benefits for signal detection. A modified ZF precoding method was proposed in [17] to exploit the constructive part of the interference while eliminating the destructive part. A more advanced twostage interference exploitation precoding was proposed in [18], where the phase of the destructive interference was controlled and further rotated such that the destructive interference becomes constructive. Optimizationbased interferenceexploitation precoding for PSK modulations has also been proposed in [19] in the context of vector perturbation precoding, where CI in the form of symbol scaling is proposed. In [20][22], CI precoding based on the phaserotation metric is studied, where it is shown that a relaxed nonstrict phase rotation metric is more advantageous compared to the strict phase rotation in [17], [18]. For multilevel modulations such as QAM, CI can be exploited for the outer constellation points, although all the interference for the inner constellation points is considered to be destructive, as discussed in [23][25] where a symbolscaling metric is introduced. Due to the above benefits, CI has been extended to the area of lowresolution digitaltoanalog converters (DACs) with PSK signaling in [26], as well as quantized constant envelope precoding with PSK and QAM signaling in [27]. More recently, it has been revealed in [28] that there exists an optimal structure for the CI precoding for PSK modulations. Nevertheless, it is still unclear whether a similar result exists for multilevel modulations such as QAM, since CI precoding for PSK modulations is based on the phaserotation metric, while the symbolscaling metric has to be employed for QAM constellations.
In this paper, we study closedform interference exploitation precoding for multilevel modulations, where QAM modulation is considered as a representative example. Due to the fact that the conventional phaserotation CI formulation is not applicable to QAM constellations, the more general symbolscaling metric is employed. We reveal the geometric connection between the phaserotation and symbolscaling metrics in the CI formulation, based on which we propose the optimization problem that maximizes the CI effect for the outer constellation symbols while constraining the inner constellation symbols for multilevel modulations. We first study the case where the number of users simultaneously served by the BS is not larger than the number of BS transmit antennas. Using the Lagrangian and KKT conditions, we analyze the formulated problem and mathematically derive the structure of the optimal precoding matrix, which leads to an equivalent simplified optimization problem. By further formulating the dual problem of this equivalent optimization, we show that, similar to the case of PSK modulations, interferenceexploitation precoding for multilevel modulations is equivalent to a quadratic programming (QP) optimization, and the optimal precoding matrix can be expressed as a function of the dual variables in closed form.
We further extend our analysis to the case where the number of users simultaneously served by the BS is larger than the number of BS transmit antennas, in which case conventional precoding becomes infeasible and the exact inverse included in the above analysis becomes inapplicable. In this scenario, we show that interferenceexploitation precoding may still be feasible. To this end, the more generic pseudo inverse of the channel matrix is employed instead, and we derive the optimal structure of the precoding matrix. Due to the inclusion of the pseudo inverse, an additional constraint is further introduced in the equivalent optimization. Built upon this, the scaling vector for the constellation symbols is shown to be the nonzero solution of a linear equation set, which is equivalent to a linear combination of the singular vectors corresponding to the zero singular values of the coefficient matrix. Accordingly, the optimization can be transformed into an optimization on the weights for each singular vector, which is further shown to be equivalent to a QP optimization as well. Based on the equivalent QP formulation, we discuss the condition under which multiplexing more streams than the number of transmit antennas is possible with interference exploitation precoding.
For both of the scenarios considered above, we also present a generic iterative algorithm to efficiently obtain the optimal precoding matrix for multilevel modulations, where a closedform update is included in each iteration. Based on the above transformation and algorithm, we further develop a suboptimal closedform noniterative CI precoder. Our analysis for multilevel modulations in this paper complements the study on closedform symbollevel interferenceexploitation precoding in [28], which is not applicable to multilevel modulations. Simulation results validate our mathematical derivations and the optimality of the proposed algorithm. Moreover, the superiority of interferenceexploitation precoding over conventional precoding methods for multilevel modulations is also revealed, especially for the case where the BS simultaneously serves a larger number of users than it has the number of transmit antennas.
We summarize the contributions of this paper below:

We present a geometric connection between symbolscaling and phaserotation metrics for interferenceexploitation precoding, based on which we construct the optimization that maximizes the CI effect of the outer constellation symbols while maintaining the performance of the inner constellation symbols for multilevel modulations.

We perform mathematical analysis on interferenceexploitation precoding for multilevel modulations. We show that CI precoding for multilevel modulations can ultimately be simplified into a QP optimization as well. Compared to CI precoding for PSK modulations where the optimization is over a simplex, it is shown that only part of the dual variables need to be constrained as nonnegative in the QP formulation for multilevel modulations.

We further extend our analysis on CI to the case where the number of served users is larger than the number of transmit antennas at the BS. Our transformations show that the optimization for CI precoding in such scenarios is similar to the conventional case where the number of users is smaller than or equal to the number of antennas at the BS, also resulting in a QP optimization. We also present the condition under which multiplexing more streams than the number of transmit antennas based on CI is achievable.

We propose an iterative algorithm that is able to obtain the optimal solution of a generic QP optimization problem subject to specific constraints within only a few iterations. Based on this algorithm, the optimal precoding matrix can be efficiently obtained, for both scenarios considered in this paper. A suboptimal closedform noniterative precoder is also presented.
The remainder of this paper is organized as follows: Section II introduces the system model and illustrates the connection between the two CI metrics. Section III includes the CIbased optimization problems for multilevel modulations when the number of users is smaller than or equal to the number of BS transmit antennas, and the extension to the scenario when the number of users is larger than the number of BS transmit antennas is studied in Section IV. The modified iterative algorithm and suboptimal closedform precoder are presented in Section V. Numerical results are provided in Section VI, and Section VII concludes the paper.
Notation: , , and denote scalar, column vector and matrix, respectively. , , , , and denote conjugate, transposition, conjugate transposition, inverse, pseudo inverse, and rank of a matrix, respectively. is the transformation of a column vector into a diagonal matrix, and denotes the vectorization operation. denotes the entry in the row and th column of . denotes the absolute value of a real number or the modulus of a complex number, and denotes the norm. and represent the sets of complex and realvalued matrices, respectively. and respectively denote the real and imaginary part of a complex scalar, vector or matrix. denotes the cardinality of a set, and represents the Kronecker product. denotes the imaginary unit, denotes the identity matrix, and represents the th column of the identity matrix.
Ii System Model and Constructive Interference
Iia System Model
We study a downlink MUMISO system, where the BS with transmit antennas is simultaneously communicating with singleantenna users in the same timefrequency resource. We separately consider the scenarios of both and . We focus on the downlink precoding designs, and perfect CSI is assumed throughout the paper. The data symbol vector is assumed to be from a normalized multilevel modulation constellation [20], denoted as , and the received signal at the th user can then be expressed as
(1) 
where denotes the flatfading Rayleigh channel vector from user to the BS with each entry following a standard complex Gaussian distribution, is the precoding matrix, and is the additive Gaussian noise at the receiver with zero mean and variance .
IiB Connection between Two CI Metrics for PSK Modulation
In this section, we illustrate the connection between symbolscaling and phaserotation metrics for CI precoding based on Fig. 1, where we employ QPSK (4QAM) as an example.
Phase Rotation Metric: As discussed in [28], we denote and , where is the objective to be optimized. We further denote as the received signal for user excluding noise, which leads to
(2) 
where is a complex scalar that represents the effect of interference on the data symbol for user . For PSK constellations, the CI constraint is then constructed as [28]
(3) 
where , , and for PSK constellations. Accordingly, the optimization problem that maxmizes the distance of the constructive region to the detection thresholds subject to the total available transmit power based on the phaserotation CI metric can be formulated as [28]
(4)  
where . We have enforced a symbollevel power constraint on the precoder, since the exploitation of CI is dependent on the data symbol , which will also be shown mathematically in the following.
Symbol Scaling Metric: Following the coordinate transformation approach in [26], we first decompose the data symbol along the detection thresholds for each user , expressed as
(5) 
where and are the bases that are parallel to the detection thresholds for each specific constellation symbol, as shown in Fig. 1. We refer the interested readers to [26] for a detailed derivation of the expressions for and for generic PSK constellations. Specifically for QPSK modulation considered in Fig. 1 as well as QAM modulations in the following part of the paper, we can obtain
(6) 
Following a similar approach to (5), we also decompose the noiseless received signal for each user along the same detection thresholds, and further introduce two real scalars and for and , respectively, which leads to
(7) 
It is then observed that the values of these two scalars directly indicate the effect of the CI. Subsequently, the corresponding optimization based on the symbolscaling metric can be constructed as
(8)  
Both of the above optimization problems are convex and can be directly solved with convex optimization tools. Subsequently, based on Fig. 1 and the formulation of the above two optimizations, an important geometrical observation is given, which demonstrates the connection between the symbolscaling and phaserotation metric.
Observation 1: Since the noiseless received signal is located on the boundary of its constructive region, the relationship between the minimum value of in and the optimal value of in is expressed as
(9) 
where without loss of generality we have assumed user has the minimum value of . Eq. (9) is derived by considering the isosceles triangle ‘DOA’, where we can obtain
(10) 
Based on the fact that , , and , (10) leads to the expression for in (9).
It’s worth noting that while the above discussion only focuses on QPSK constellations, (9) is in fact generic to any PSK modulation for the connection between the two CI metrics, and the only difference lies in the expression for . In the following section, the symbolscaling CI metric is employed in the derivation of the optimal precoding matrix for multilevel modulations.
Iii CI Precoding for the Case of
In this section, we focus on the common case where , and we consider 16QAM modulation as an example of multilevel modulations. For other multilevel constellations, the problem formulation and the corresponding analysis for the symbolscaling metric readily follows our derivations in this section in a similar way.
For a generic QAM constellation, we employ the symbolscaling metric for CI precoding since there does not exist a generic expression for the phaserotation CI metric for QAM modulations, as shown in Fig. 2 where a 16QAM constellation is depicted as the example. The symbolscaling metric in (7) can be further expressed in vector form as
(11) 
where we have introduced two column vectors
(12) 
For QAM constellations, and are also given by (6). In this work, we consider the interference on the inner constellation points as only destructive, since the interference is less likely to be beneficial for these points. To be more specific, in Fig.2 CI exists for the real part of the constellation point type ‘B’ and imaginary part of type ‘C’, while both the real and imaginary part of the constellation point type ‘D’ can be exploited. Accordingly, we propose to construct the optimization problem that maximizes the CI effect for the outer constellation points while maintaining the performance for the inner constellation points, given by
(13)  
where the set consists of the real scalars corresponding to the real or imaginary part of the outer constellation points that can be scaled, and consists of the real scalars corresponding to the real or imaginary part of the constellation points that cannot exploit CI. Accordingly, we obtain
(14) 
and
(15) 
is a secondordercone programming (SOCP) problem, which can be solved via convex optimization tools such as CVX. Specifically, the optimization objective is equal to the value of in the above optimization, which can also be viewed as a scaling factor for the constellation. Moreover, if we further constrain instead of in the above optimization, the solution of the above optimization problem will become a ZF precoder.
Before we present the subsequent analysis, we first transform the power constraint included in the above optimization problem, which greatly simplifies the subsequent derivations. To be specific, we decompose the precoded signals into
(16) 
and similar to the case of PSK [28], we observe that the distribution of the power among each does not affect the solution of the above optimization problem, as can be viewed as a single vector for both constraints that include in . Therefore, without loss of generality and to be consistent with our problem formulation for PSK modulation in [28], we assume that the norm of each term is identical, and we obtain
(17)  
which further leads to the equivalent power constraint as
(18) 
We then rewrite the above optimization problem in standard minimization form as
(19)  
and we express the Lagrangian of as [29]
(20)  
where , , , and are the introduced dual variables, and , . Each and can be complex since they correspond to the equality constraints.
Based on the Lagrangian in (20), the KKT conditions for optimality can be expressed as
(21a)  
(21b)  
(21c)  
(21d)  
(21e)  
(21f) 
Based on (21b), it is first observed that , and with the premise that we obtain , which further means that the power constraint is met with equality when optimality is achieved. Then, we can express in (21b) as
(22) 
By introducing an auxiliary variable
(23) 
we can express as
(24) 
The above expression further leads to
(25) 
which is constant for any and consistent with our assumption in (17).
With the obtained expression for each , we further express the precoding matrix as
(26)  
where we have introduced two column vectors
(27) 
We express (11) in matrix form as
(28)  
where and are expressed as
(29)  
and the matrix is constructed as
(30) 
By substituting the expression for in (26) into (28), we obtain
(31) 
With the premise that in this section, is invertible, and accordingly we obtain as
(32) 
which further leads to the expression for the precoding matrix as
(33) 
We then substitute in (33) into the power constraint, and we obtain
(34)  
Since is Hermitian and positive semidefinite, and since each entry in is real, (34) can be further transformed into
(35) 
where is symmetric. With the expression for in (33) and the updated power constraint, we are able to construct an equivalent optimization on , given by
(36)  
The optimal precoding matrix for the original optimization is then obtained by substituting the solution of into (33). In the following, we analyze and derive the closedform optimal precoding matrix as a function of the dual variables of .
The Lagrangian of is formulated as
(37)  
where , . To simplify the subsequent KKT conditions, we propose to reorder the columns and rows of the matrices and vectors included in the Lagrangian expression in (37). Specifically, we reorder the expanded symbol vector into
(38) 
where and are given by
(39)  
such that the entries in correspond to the real or imaginary part of the outer constellation points that can exploit CI, and the entries in correspond to the real and imaginary part of the inner constellation points that cannot be scaled. The corresponding scaling vector is accordingly transformed into
(40) 
where and are given by
(41)  
We further introduce a ‘Locater’ function that returns the index of in the original expanded symbol vector , given by
(42) 
We can then express and as
(43) 
where the transformation matrix that transforms the original and into their reordered forms is given by
(44) 
and we note that is invertible. Similarly, the corresponding reordered matrix can be obtained as
(45) 
where the multiplication of at the left side and at the right side correspond to the row and column reordering, respectively. Using the above expressions for , and , the Lagrangian of in (37) can be further transformed into a simple form, given by
(46) 
where , and is the dual vector corresponding to the reordered , given by
(47) 
Subsequently, the KKT conditions for can be formulated as
(48a)  
(48b)  
(48c)  
(48d)  
(48e) 
Based on (48b), we obtain an expression for as a function of , given by
(49) 
where we note that is symmetric and invertible. By substituting the expression for in (49) into the power constraint, we further obtain as
(50)  
For the convex optimization , it is easy to verify that Slater’s condition is met [29], which means that the dual gap is zero. Accordingly, can also be optimally solved via its dual problem, given by
(51) 
For the above dual problem, the inner minimization is achieved by (48a) and (49), and we can further simplify the dual problem into
(52)  
Based on the fact that is a monotonic function, the above dual problem is equivalent to the following minimization problem:
(53)  
which is a QP optimization and can be more efficiently solved than the SOCP formulation. Moreover, based on the expression for in (49) and in (50), we finally obtain the optimal closedform precoding matrix as a function of the dual vector in the case of as
(54)  
where is to order the obtained into the original , with given in (44).
Compared to the final QP formulation for PSK modulation in [28] that is optimized over a simplex, a key difference for the case of QAM constellations is that the variable vector is no longer on a simplex, and only the dual variables that correspond to the real and imaginary part of the constellation points that can exploit CI are constrained to be nonnegative, as observed in . We note that both QP formulations for PSK and QAM modulations can be solved by convex optimization tools. However, for the reasons given above, the more efficient simplex method that is generally used for solving QP problems over a simplex and the proposed iterative algorithm in [28] are not directly applicable to such multilevel modulations.
Iv CI Precoding for the Case of
In this section, we further extend our study to the case where the BS simultaneously serves a number of users larger than the number of the transmit antennas at the BS, i.e., . Specifically, our derivations in this section and the corresponding numerical results show that, by exploiting the information of the channel as well as the data symbols and by judiciously constructing the precoding matrix, CI precoding is able to spatially multiplex more data streams than the number of transmit antennas. Similar to the case of , the subsequent analysis is generic and can be further extended to other multilevel constellations.
When , the direct inverse included in (32) becomes infeasible, as the product is rankdeficient. In this case, the more general pseudo inverse instead of the direct matrix inverse is employed [30]. Based on (31), we can now express in the case of as
(55) 
and the obtained precoding matrix as
(56) 
By substituting the expression for the obtained precoding matrix into the power constraint, we can similarly obtain
(57) 
Then, one can easily follow a similar approach to that in Section III to obtain a QP optimization and the corresponding solution. However, we note that the solution obtained by following the above procedure is not a valid one for the original problem, since the inclusion of the pseudo inverse does not guarantee the equality of the original constraint. To be more specific, if we consider and substitute the obtained precoding matrix in (33) into (28), we obtain
(58)  
which is always true. This means that the symbolscaling constraint in (28) is already implicitly included in the power constraint in (34) for the case of . However, in the case of where the pseudo inverse is employed, the above equality may not hold and simply following the approach for will lead to an erroneous solution. Therefore, the following additional constraint is further required in the case of to obtain a valid and correct solution:
(59)  
where the matrix satifies the following property.
Observation 2: The rank of the coefficient matrix is with probability 1.
Proof: We first consider the matrix