# Optimal Time Reuse in Cooperative D2D Relaying Networks

###### Abstract

Device-to-device (D2D) communication has become one important part of the 5G cellular networks particularly due to the booming of proximity-based applications, e.g. D2D relays. However, the D2D relays may create strong interference to nearby users. Thus interference management in a cellular network with D2D relays is critical. In this paper, we study the optimal time reuse patterns in a cellular network with cooperative D2D relays and derive the corresponding optimized relaying strategies. Due to the binary association constraints and the fact that the total number of feasible time reuse patterns increase exponentially with the number of servers, we have a large scale integer programming problem which is formidable to solve. However, we show that we just need to activate a few time reuse patterns in the order of the number of users. Accordingly, a low-complexity algorithm is proposed to find the set of active time reuse patterns and solve this problem in an approximated way. Numerical simulations demonstrate that our scheme can efficiently allocate the time resources and determine the relaying strategies. Furthermore, the proposed scheme offers significant gains relative to the existing ones.

Optimal Time Reuse in Cooperative D2D Relaying Networks

Zhaowei Zhu, Shengda Jin, Yu Zeng, Honglin Hu, and Xiliang Luo |

ShanghaiTech University, Shanghai, China, Shanghai Advanced Research Institute, CAS, Shanghai, China |

Email: luoxl@shanghaitech.edu.cn |

Index Terms— Cooperative network, device-to-device relay, time reuse, resource allocation, user association.

## 1 Introduction

With the ever-increasing mobile data traffic demands, the proximity-based communication through the device-to-device (D2D) links is regarded as one promising technology in 5G cellular networks [1]. Different from the base station (BS)-based links, D2D links usually appear in short-range scenarios and can significantly enhance the network throughput performance without interfering distant users [2]. In a cooperative D2D network, when both types of links share the same time or spectrum resources, severe interference can still occur among nearby users without careful resource management [3].

To make full use of the limited resources, a lot of works have been done to optimize D2D-enabled cellular networks [4, 7, 5, 8, 6]. In particular, the spectrum allocation and power control were jointly optimized in [4] to maximize different network utility metrics under QoS constraints, while D2D relays were not modeled in [4]. Recent works in [7, 5, 8, 6] emphasized the importance of D2D relays and recommended some D2D user equipments (DUEs) to serve as D2D relays for other DUEs experiencing poor BS-based links. Orthogonal resource allocations and corresponding relaying strategies were optimized jointly under different constraints in [7, 5, 8, 6]. Specifically, the orthogonal time/frequency resources were allocated between the BS and the D2D relays in [5], among the neighboring nodes in [6], and among the D2D relays in [7, 8]. However, we know non-orthogonal resource reuse among the BSs and the D2D relays will further enhance the network throughput performance. Each reuse strategy was modeled as one reuse pattern and the frequency resource allocations were optimized over all possible reuse patterns for heterogeneous networks (HetNets) in [9, 10, 11]. In particular, the network-wide average packet delay was minimized in [9], the energy efficiency was maximized in [10], and the network proportional fairness (PF) metric was maximized in [11].

In this paper, we endeavor to find the optimal resource reuse scheme in a cooperative network with D2D relays. The major contributions can be summarized as follows. Firstly, we study the optimal set of time reuse patterns and the corresponding relaying strategies such that the PF metric of the network is maximized. Even though the total number of feasible reuse patterns increases exponentially with the number of users, we show that we just need to activate a few time reuse patterns in the order of the number of users to achieve the optimal network performance. Secondly, we put forward an efficient algorithm of low complexity to identify the set of reuse patterns to be activated and solve the problem approximately. Numerical results demonstrate our scheme can offer significant gains relative to the existing solutions.

The remainder of this paper is organized as follows. Section 2 describes the system model. Section 3 formulates the problem and characterizes the optimal time reuse profile. Section 4 proposes one low-complexity algorithm. Numerical results are given in Section 5 and Section 6 concludes the paper.

Notations: Notations , , , , and stand for -norm of vector , the transpose of matrix , the convex hull of the set , the cardinality of the set , and the element-wise XOR operation of two vectors. stands for the indicator function and takes the value of () when the specified condition is met (otherwise). denotes the unit vector with only the -th element being and all other elements being .

## 2 System Model

As shown in Fig. 1, a downlink shared cooperative network [6] is considered here, where a number of nodes coexist including the DUEs and the BSs. The sets of BSs and DUEs are denoted by and , respectively. Meanwhile, we define and . Each DUE is assumed to be able to act as either a relay or a user. For instance, it can act as a user to receive data either from the serving BS or from another DUE (behave as one D2D relay) in one time slot. In another slot, it can act as a relay to help other DUEs. Thus we assume each DUE can choose to be either a user or a relay in different time slots. The set of servers including both all BSs and D2D relays is defined as follows.

(1) |

Let denote the total number of servers in the set . We use a vector to indicate the active status of the servers in one time slot and call it the -th time reuse pattern. The -th element of is , i.e. , when server is active under time reuse pattern . Otherwise, we have . Note when a server is active, we assume it utilizes the whole downlink bandwidth for information transmission. There are totally feasible time reuse patterns to be considered and we do not consider the all pattern, which means nobody transmits.

Let denote the set of indices of all the feasible time reuse patterns. Furthermore, the set of all time reuse pattern is denoted as . We also use to denote the set of active servers under pattern , i.e. . Let denote the fraction of the total time duration of length allocated to pattern . The overall time reuse profile is thus given by the vector: . In order to get the optimal network performance, we need to utilize the whole time duration, i.e. . Furthermore, we assume time-division multiple-access (TDMA) and each server only transmits data to one served user at one time. A server needs to divide the allocated time resource under pattern orthogonally among its served users. We use to denote the fraction of the total duration allocated to user by server under reuse pattern . Clearly, it is bounded as . Additionally, as in the current cellular networks, each user is allowed to be associated to a single server under each time reuse pattern, which is characterized by , i.e. indicates user is served by server under pattern and otherwise. Due to this single-server association constraint, the variables satisfy the constraint . Note the corresponding relaying strategies are also implied by .

Denote the transmission power of server by . The signal-to-interference-plus-noise ratio (SINR) of the link between user and server under pattern is denoted by and can be derived as

(2) |

where is the power of the thermal noise at the user and denotes the average gain of the channel from server to user . Let denote the spectral efficiency of this link. To reflect the constraint that one DUE cannot transmit and receive simultaneously, the spectral efficiency is set to be zero if server is active under reuse pattern . Therefore, the spectral efficiency can be expressed as

(3) |

Note that the actual time resource allocated to user by server under pattern is . The average data rate of user over all reuse patterns can be written as follows.

(4) |

where denotes the system bandwidth. Besides, we assume that the DUEs in the network demand different data. For a D2D relay, a portion of its received data are intended for others. The effective data rate of user is thus given by , where denotes the data rate of server , i.e. user acting as a D2D relay, targeted to the served DUEs. The total rate of server serving other DUEs over all reuse patterns is given by

(5) |

The single-server association rule results in a combinatorial problem and finding an optimal solution need exhaustive search. In the next section, we will relax the single-server constraint and allow one user to be served by multiple servers. This will enable low-complexity algorithms to find the optimal time reuse profile.

## 3 Number of Active Time Reuse Patterns

In this section, we would like to maximize the PF metric of the network through optimizing the time reuse profile. To this end, we formulate the optimization problem (6), where the optimization variables , , are the overall time reuse profile: , the overall user time allocation profile: , and the DUE association profile: respectively. Problem (6) is a large knapsack problem, which is NP-hard to find its optimal solution.

(6a) | ||||

(6b) | ||||

(6c) | ||||

(6d) | ||||

(6e) | ||||

(6f) | ||||

(6g) |

By allowing fractional user association, we can drop the single-server association indicator variable and rely on to imply the fractional user association profile. Particularly, given , we can infer the -th user is served by the -th server under pattern . The relaxed multi-server association problem is shown in (7), whose solution offers an upper bound of the problem in (6). Since the active servers in each pattern will use all the available time resources to optimize the network performance, the inequalities in (6f) indeed become the equalities in (7e).

(7a) | ||||

(7b) | ||||

(7c) | ||||

(7d) | ||||

(7e) | ||||

(7f) |

The objective function in (7) is concave and all the constraints are linear. However, the amount of variables, i.e. and , increases exponentially with the number of servers due to the fact that we have different time reuse patterns. For a large cooperative network, it becomes a formidable task to find the optimal solution to the convex problem in (7). Fortunately, we can show that only a very limited number of reuse patterns need to be activated without sacrificing the network throughput performance.

###### Proposition 1.

###### Proof.

Suppose we are given one particular optimal solution to the problem in (7), i.e. and . Each element of can be re-written as . The rates and in (7b) and (7c) can be expressed as and , where and . Define , , , , , and . We see can be written as

(9) |

By defining a set as , we see the vector lies in , i.e. the convex hull of . Since the dimension of the vector is , from the Carathéodory’s theorem [12], we know the vector must lie in the convex hull of affinely independent vector points in with . Let denote the set of those vector points and we know lies in , which is a -simplex. Furthermore, when the solution is optimal, the vector should reach the Pareto efficiency [9, 13] given that the network utilization is measured by the PF metric in (7a), which is concave with respect to . Hence we can find one optimal solution in the face of the -simplex, which is also one -simplex with . Note that the constraints hold for the optimal solution naturally since the objective function is meaningless for any non-positive , otherwise the problem is infeasible. As a result, we see can be represented by a convex combination of at most affinely independent points in . In summary, there exits one optimal time reuse profile satisfying , . ∎

Proposition 1 indicates that we only need to turn on no more than reuse patterns to achieve the optimal network throughput performance instead of activating all the feasible patterns.

## 4 Efficient Pattern Selection and Resource Allocation Algorithm

As indicated by Proposition 1, the following optimization problem shares the same optimal objective value as the problem in (7).

(10a) | ||||

(10b) | ||||

(10c) |

where denotes the candidate set of reuse
patterns containing at most patterns. The problem in (10)
is still very hard to solve.
We split it into two subproblems and solve them iteratively. In particular,
during the -th iteration, we carry out the following updates.

P1. Pattern Selection:
Determine the pattern set such that
and
;

P2. Resource Allocation:
Update the resource allocation and the corresponding association rules as

(11) |

where , and . The problem in (11) is convex and can be solved efficiently by general convex solvers, e.g. CVX [14]. However, the pattern selection subproblem has possible solutions in theory and it is impractical to apply the exhaustive search method. Next, we propose an approximated solution enjoying low-complexity.

From , we can define a set as

(12) |

where and denotes the Hamming distance between patterns and . Furthermore, we add a particular pattern set to the candidate pattern set in the -th iteration when , i.e. and denotes the pattern set to be included in the -th iteration. Inspired by the pattern selection approach in [11] and the Frank-Wolfe method [12], we put forward an iterative pattern selection method according to the solution of the following problem:

(13) |

Basically, in (13), we identify a set of reuse patterns to activate by finding the direction providing the most predominant improvement in the objective value under the specified constraints. Denoting the -th entry of the gradient vector by , we can have

(14) |

Now we can establish the following result.

###### Proposition 2.

Among the solution to the problem in (13), the set contains the following time reuse pattern:

(15) |

where .

###### Proof.

The solution to (13) during the -th iteration simply tells us to activate the particular time reuse pattern to maximize the improvement in the objective value. Clearly, the associated complexity in finding the candidate pattern set is determined by the size of . We have the lowest complexity in the pattern selection subproblem by setting . To enable faster convergence, we select time reuse patterns in each iteration. Specifically, we modify the ON/OFF state of the -th server in each reuse pattern as and define the set as

(19) |

Then we determine a new candidate reuse pattern to activate according to Proposition 2 as

(20) |

Now a temporary reuse pattern set for the -th itervation, i.e. can be derived as

(21) |

Since the number of time reuse patterns in will become larger than when we add time reuse patterns in each iteration, we perform the following pattern trimming as well. In particular, let be the optimal solution to the problem in (11) with . By defining a positive threshold , we can delete those patterns in with negligible allocated resources, i.e.

(22) |

Note that the solution obtained by solving (10) allows multi-server association. To meet the single-server association requirement in (6), we simply associate the user to the server that gives the largest data rate under the particular time reuse pattern , i.e.

(23) |

After finalizing the user association rule as in (23), the original problem in (6) can be solved with low complexity since is given and the number of active patterns in is limited. Algorithm 1 summarizes all the steps.

# DUEs | 10 | 11 | 12 | 30 |
---|---|---|---|---|

(, ) | (8.63, 8.63) | (8.61, 8.51) | (7.12, 7.11) | (?, 7.34) |

(, ) | (735, 20) | (3004, 40) | (10608, 45) | (?, 160) |

## 5 Numerical Results

In this section, we test our algorithm by simulating a cooperative network with only
one pico-BS (dBm transmission power (TxPwr)) since we focus on the relay behavior
of the DUEs (dBm TxPwr). See also Fig. 1 for one example.
Other parameters are set as follows.

DUEs are uniformly dropped in a square specified by m m and the pico-BS is deployed at the center;

System bandwidth is MHz. Noise PSD is dBm/Hz. The
path-loss is determined as (dB)
[4], where is the distance in meters, and stands for
the number of walls [15].

Table 1 compares the geometric mean (GM) of the DUEs’ throughputs and the CPU running time between the brute-force optimal solution (solve the problem in (7) directly with CVX) and our proposed low-complexity solution. In Table 1, () denote the GM throughput in Mbps with the brute-force solution (our proposed algorithm) and () is the consumed CPU time of the brute-force solution (our proposed algorithm) in seconds. Question mark “?” indicates the brute-force CVX solver can not be solved with our lab computer. Our proposed algorithm performs satisfactorily and only suffers a very small amount of degradation as we have DUEs. However, note that the brute-force method becomes impractical even for a middle-sized network, e.g. a network with DUEs.

For a network with DUEs, Fig. 2 shows the set of activated time reuse patterns and the fraction of total time allocated to each pattern. The total number of active time reuse patterns is , which follows Proposition 1. It also indicates that the optimal time reuse patterns are not as those proposed in [5, 7, 8, 6]. The orthogonal scheme in Fig. 3 considers the time reuse patterns where only the BS and one D2D relay are active [8], i.e. . In Fig. 3, it is also worth noting that the orthogonal scheme could provide higher effective rates to some DUEs than our scheme. In fact, this indicates that our proposed algorithm will ask those DUEs with high data rates to serve as D2D relays to help the DUEs with poor channel conditions. It is clear that our proposed algorithm achieves higher GM data rate than the other two existing schemes.

## 6 Conclusions

In this paper, we have studied the optimal time reuse patterns and the corresponding relaying strategies in a cooperative network with D2D relays. The original optimization problem is of a formidable size. This is due to the fact that the total number of feasible time reuse patterns scales exponentially with the number of nodes in the network. To circumvent this dilemma, we have shown that we just need to turn on a limited number of time reuse patterns without sacrificing the network performance firstly. In particular, we have proved that the number of active reuse patterns can be no more than the number of users. Secondly, we have put forward a low-complexity algorithm to identify the small set of active time reuse patterns and solve the large scale optimization problem approximately. Compared to those existing schemes with orthogonal resource allocation constraints, our proposed scheme offers significant gains.

## References

- [1] M. N. Tehrani, M. Uysal, and H. Yanikomeroglu, “Device-to-device communication in 5G cellular networks: Challenges, solutions, and future directions,” IEEE Commun. Mag., vol. 52, no. 5, pp. 86–92, May 2014.
- [2] X. Xu, Y. Zhang, Z. Sun, Y. Hong, and X. Tao, “Analytical modeling of mode selection for moving D2D-enabled celluar networks,” IEEE Commun. Lett., vol. 20, no. 6, pp. 1203–1206, Jun. 2016.
- [3] Y. Liu, R. Wang, and Z. Han, “Interference-constrained pricing for D2D networks,” IEEE Trans. Wireless Commun., vol. 16, no. 1, pp. 475–486, Jan. 2017.
- [4] X. Ma, J. Liu, and H. Jiang, “Resource allocation for heterogeneous applications with device-to-device communication underlaying cellular networks,” IEEE J. Sel. Areas Commun., vol. 34, no. 1, pp. 15–26, Jan. 2016.
- [5] B. Zhou, H. Hu, S. Q. Huang, and H. H. Chen, “Intracluster device-to-device relay algorithm with optimal resource utilization,” IEEE Trans. Veh. Technol., vol. 62, no. 5, pp. 2315–2326, Jun. 2013.
- [6] M. Li, S. Salinas, P. Li, X. Huang, Y. Fang, and S. Glisic, ”Optimal scheduling for multi-radio multi-channel multi-hop cognitive cellular networks,” IEEE Trans. Mobile Comput., vol. 14, no. 1, pp. 139–154, Jan. 1 2015.
- [7] H. Xu and B. Li, “Resource allocation with flexible channel cooperation in cognitive radio networks,” IEEE Trans. Mobile Comput., vol. 12, no. 5, pp. 957–970, May 2013.
- [8] N. Mastronarde, V. Patel, J. Xu, L. Liu, and M. van der Schaar, “To relay or not to relay: learning device-to-device relaying strategies in cellular networks,” IEEE Trans. Mobile Comput., vol. 15, no. 6, pp. 1569–1585, Jun. 2016.
- [9] B. Zhuang, D. Guo, and M. L. Honig, “Traffic-driven spectrum allocation in heterogeneous cellular networks,” IEEE J. Sel. Areas Commun., vol. 33, no. 10, pp. 2027–2038, Oct. 2015.
- [10] B. Zhuang, D. Guo, and M. L. Honig, “Energy-efficient cell activation, user association, and spectrum allocation in heterogeneous networks,” IEEE J. Sel. Areas Commun., vol. 34, no. 4, pp. 823–831, Apr. 2016.
- [11] Q. Kuang, W. Utschick, and A. Dotzler, “Optimal joint user association and multi-pattern resource allocation in heterogeneous networks,” IEEE Trans. Signal Process., vol. 64, no. 13, pp. 3388–3401, Jul. 2016.
- [12] M. S. Bazaraa, H. D. Sherali, and C. M. Shetty, Nonlinear Programming: Theory and Algorithms, 3rd ed. Hoboken, NJ, USA: Wiley, 2006.
- [13] D. Bertsimas and J. N. Tsitsiklis, Introduction to Linear Optimization. Belmont, MA, USA: Athena Scientific, 1997.
- [14] M. Grant and S. Boyd. (Sep. 2013). CVX: MATLAB Software for Disciplined Convex Programming, Version 2.0 Beta. [Online]. Available: http://cvxr.com/cvx
- [15] J. Meinilä, P. Kyösti, T. Jämsä, and L. Hentilä, “WINNER II channel models,” in Radio Technologies and Concepts for IMT-Advanced. Hoboken, NJ, USA: Wiley, 2009, pp. 39–92.