UAVEnabled Radio Access Network: MultiMode Communication and Trajectory Design
Abstract
In this paper, we consider an unmanned aerial vehicle (UAV)enabled radio access network (RAN) with the UAV acting as an aerial platform to communicate with a set of ground users (GUs) in a variety of modes of practical interest, including data collection in the uplink, data transmission in the downlink, and data relaying between GUs involving both the uplink and downlink. Under this general framework, two UAV operation scenarios are considered: periodic operation, where the UAV serves the GUs in a periodic manner by following a certain trajectory repeatedly, and onetime operation where the UAV serves the GUs with one single fly and then leaves for another mission. In each scenario, we aim to minimize the UAV periodic flight duration or mission completion time, while satisfying the target rate requirement of each GU via a joint UAV trajectory and communication resource allocation design approach. Iterative algorithms are proposed to find efficient locally optimal solutions by utilizing successive convex optimization and block coordinate descent techniques. Moreover, as the quality of the solutions obtained by the proposed algorithms critically depends on the initial UAV trajectory adopted, we propose new methods to design the initial trajectories for both operation scenarios by leveraging the existing results for solving the classic Traveling Salesman Problem (TSP) and PickupandDelivery Problem (PDP). Numerical results show that the proposed trajectory initialization designs lead to significant performance gains compared to the benchmark initialization based on circular trajectory.
I Introduction
To support the fastgrowing traffic demand for the next generation mobile communication systems, extensive research efforts have been devoted to exploring various new wireless technologies [1], such as ultradense network, millimeter wave (mmWave) communication, massive multipleinput multipleoutput (MMIMO), nonorthogonal multiple access (NOMA) [2], and machinetype communication. All these technologies were mainly developed for the terrestrial wireless network with base stations (BSs), relays and access points deployed at fixed locations. Recently, there have been significant interests in using unmanned aerial vehicles (UAVs) as aerial platforms to enable terrestrial communications from the sky [3]. Compared to conventional terrestrial communication, UAVenabled communication is more swift and flexible to deploy for unexpected or temporary events. Besides, thanks to the UAV’s high altitude, the favorable lineofsight (LoS) communication links are more likely to be established between UAV and ground users (GUs) [4], [5]. Thus, UAVenabled communication has many potential use cases, such as for public safety communication, ground BS offloading, emergency response, and Internet of things (IoT) communication.
Significant research efforts have been devoted to addressing the various new challenges for UAVenabled communications, such as the UAVground channel characterization [4, 5, 6], performance analysis [7], [8], and UAV placement optimization [9, 10, 11, 12, 13]. In particular, the controllable high mobility of UAVs offers a new design degree of freedom to enhance communication performance via trajectory optimization, which has received significant interests recently [14, 15, 16, 17, 18, 19, 20, 21, 22].
In [14], the authors proposed a general framework via jointly optimizing the transmit power and UAV trajectory to maximize the endtoend throughput for a UAVenabled mobile relaying system. Specifically, the transmit power at the source/UAV relay and the UAV trajectory were optimized in an alternating manner iteratively via the technique of block coordinate descent. To tackle the nonconvex trajectory optimization in each iteration, the successive convex optimization technique was proposed based on the local lower bound of the rate function. Such techniques have then been applied to various other scenarios in UAVenabled wireless communications [15, 16, 17, 18, 19, 20, 21]. Note that for all these works employing successive convex optimization and block coordinate descent techniques, the converged results critically depend on the initial UAV trajectory adopted. A straight line based initial trajectory and a circular based initial trajectory were proposed in [14] and [18], respectively. Though simple and intuitive, such trajectory initialization schemes do not fully exploit the locations and communication requirements of GUs. This thus gives one of the main motivations of the current work, to devise more sophisticated trajectory initialization schemes for UAVenabled communications to achieve better converged performance.
It is worth noting that path planning or trajectory optimization has been extensively studied in the UAV control and navigation literature [23, 24, 25, 26, 27, 28]. For example, in [23], the UAV trajectory was formulated as a mixed integer linear program (MILP) to ensure collision avoidance. In [26], the recedinghorizon path planning approach was applied to demonstrate the capability for a swarm of UAVs to perform autonomous search and localization. Moreover, the authors in [27] and [28] investigated the path planning for a single vehicle to collect data from all sensors. Note that the aforementioned works for path planning either focused on other design objectives rather than communication performance, or assumed simplified communication models, such as the disk model in [27], [28]. In practice, adaptive communication with dynamic power and bandwidth allocation can be exploited along with the UAV trajectory design to achieve enhanced communication performance, as pursued in more recent works such as [14, 15, 16, 17, 18, 19].
In this paper, we study a general UAVenabled radio access network (RAN) as shown in Fig. 1, where the UAV is employed as an aerial platform supporting multimode communications for its served GUs, including data relaying from one GU to another [14], downlink data transmission to GUs [18], and uplink data collection from GUs [20] as special cases. Such a multimode aerial communication platform is more practically relevant for a reallife RAN with different traffic demands of the GUs.
For the considered general RAN, two UAV application scenarios of practical interest are further considered. The first one is periodic operation, where the UAV serves the GUs in a periodic manner by following a certain trajectory repeatedly. In this case, our objective is to minimize each periodic flight duration of the UAV for the purpose of minimizing the communication delay of the GUs [11], while satisfying the average rate requirement of each GU, via jointly optimizing the UAV trajectory, transmit power and bandwidth allocation. The second scenario corresponds to onetime operation, where the UAV serves the GUs with one single fly and then leaves for another mission. This may correspond to practical use cases such as periodic sensing, where the UAV only needs to be dispatched at a given frequency. In this scenario, we aim to minimize the mission completion time for saving UAV time for other missions while satisfying the aggregated throughput requirement of each GU, via jointly optimizing the UAV trajectory and pertinent communication resource allocation. In this case, for the particular data relaying mode, the UAV can only forward to a destination GU the data that has been received from its associated source GU, along its given oneround trajectory, thus resulting in a stringent informationcausality constraint [14]; whereas this constraint can be relaxed in the former periodic operation scenario thanks to the periodic trajectory of the UAV. The main contributions of this paper are summarized as follows.

First, we propose a multimode UAV communication platform with periodic operation or onetime operation. For both operation scenarios, we formulate the optimization problems to minimize the UAV periodic flight duration and mission completion time, respectively, via jointly optimizing the UAV trajectory, bandwidth and power allocation. Since the formulated problems are difficult to be directly solved, we propose efficient iterative algorithms to find locally optimal solutions based on successive convex optimization and block coordinate descent techniques.

Second, as the converged results of the proposed algorithms critically depend on the initial UAV trajectory assumed, we propose new methods to design the initial trajectory by fully exploiting the location information and communication requirements of the GUs. Specifically, as the UAV typically has better communication link when it is near GUs, the initial UAV trajectory should be designed so as to approach each GU as much as possible. To this end, we propose the trajectory initialization design based on the Traveling Salesman Problem (TSP) solution for the case of periodic operation, and that based on the PickupandDelivery Problem (PDP) solution for the case of onetime operation. Compared to the existing UAV initial trajectory designs such as the straightline or circular trajectories, the main novelty of the proposed trajectory initialization lies in the optimized waypoints design and their order of visiting based on the number and location distribution of the GUs, their communication requirements as well as the UAV’s practical mobility constraints such as its maximum speed.
The rest of this paper is organized as follows. Section II introduces the system model and presents the problem formulations for the periodic operation and the onetime operation scenarios, respectively. Section III and Section IV present the proposed algorithms based on successive convex optimization and block coordinate descent techniques for the two operation scenarios, respectively. In Section V, we propose two efficient trajectory initialization designs for the two scenarios, respectively. Numerical results are presented in Section VI to evaluate the performance of the proposed designs. Finally, we conclude this paper in Section VII.
Notations: In this paper, scalars and vectors are denoted by italic letters and boldface lowercase letters, respectively. denotes the space of dimensional realvalued vectors. For a vector a, its Euclidean norm is represented by . denotes the logarithm with base 2. For a timedependent function , represents the firstorder derivative with respect to time . For sets and , means the union of the two sets.
Ii System Model and Problem Formulation
Iia System Model
As shown in Fig. 1, we consider a general UAVenabled wireless RAN, where a UAV serves as an aerial platform for a set of GUs. In general, the communication modes of the UAVenabled wireless RAN can be classified into three categories as follows:
IiA1 Data Collection in Uplink
The UAV is employed as a flying fusion center to collect data from GUs that are data sources on the ground, such as sensors in IoT [20].
IiA2 Data Transmission in Downlink
In this mode, independent information is transmitted from the UAV to GUs. For example, the UAV may act as a data carrier with precached data to transmit to the intended GUs [29].
IiA3 Data Relaying
The UAV functions as a mobile relay to assist in the communications between multiple pairs of GUs. For each pair, the data is firstly received from the source GU in the uplink and then forwarded to the destination GU in the downlink. By exploiting the LoS links between the UAV and GUs, UAVenabled mobile relaying is a promising solution to overcome the unreliable terrestrial links between widespread GUs. Practical application scenarios include service recovery after natural disasters, emergency response, etc., [7], [30].
Accordingly, the GUs can be generally divided into three groups based on their communication modes. Group 1 corresponds to UAVassisted data collection, which only involves the uplink communication from the GUs in this group to the UAV. Within this group, we assume that in total independent information flows are transmitted from their respective GUs to the UAV. Group 2 corresponds to data transmission from the UAV to the GUs belonging to this group, where only downlink communication is involved and the UAV transmits in total independent information flows to their corresponding GUs. Lastly, Group 3 corresponds to data relaying, which involves both uplink and downlink communications. For this group, in total information flows are firstly transmitted to the UAV from the source GUs in this group and then forwarded by the UAV to their respective destination GUs. For information relaying, we assume that the UAV employs the decodeandforward (DF) strategy with a data buffer of sufficiently large size. Notice that in practice, we have , since each GU may correspond to multiple information flows. For ease of presentation, we assume that each GU is only associated with one information flow such that ; whereas the developed results in this paper can be easily generalized to the cases with . By letting and , we define a source GU set as , with the first elements corresponding to source GUs from Group 3 (for information relaying) and the rest from Group 1 (for uplink data collection). Similarly, define a destination GU set as with the first GUs corresponding to destination GUs in Group 3 (for information relaying) and the rest from Group 2 (for downlink transmission). Without loss of generality, we assume that the source GU and the destination GU , , correspond to the same pair in Group 3.
We consider a threedimensional (3D) Cartesian coordinate system, where the locations of each source GU and destination GU are denoted as , , and , , respectively. We assume that the UAV flies at a given constant altitude . Furthermore, for a given time horizon of duration , denote the UAV trajectory projected on the ground as , . Let be the maximum UAV speed in meter/second (m/s). We then have the following constraint , The timevarying distance between the UAV and the GUs can be written as
(1)  
(2) 
We further assume that both the uplink and downlink channels are dominated by LoS links. Thereby, the channel power gains follow the freespace path loss model given by
(3) 
where denotes the channel power gain at the reference distance of m.
Let the total available bandwidth be denoted as . The UAV is assumed to employ the frequency division multiple access (FDMA) scheme with dynamic bandwidth allocation among all GUs. Specifically, at time instant , denote as the fraction of the total bandwidth that is allocated for the source GU , and as that allocated for the destination GU . We then have the following constraints:
(4)  
(5) 
Note that the above dynamic FDMA scheme includes both conventional time division multiple access (TDMA) with dynamic user time scheduling and FDMA with fixed user bandwidth allocation as special cases. In particular, when all and are set as binary values 1 or 0, we have the dynamic TDMA scheme. On the other hand, when , and , , we have the nondynamic FDMA scheme.
Denote by the transmit power for the source GU if , which is assumed to be constant. The instantaneous normalized achievable rate in bits/second/Hertz (bps/Hz) for this GU can be expressed as
(6) 
where represents the additive white Gaussian noise (AWGN) power spectral density in watts/Hz, and is the timedependent channeltonoise power ratio, and denotes the reference signaltonoise ratio (SNR) at the reference distance of m.
Similarly, let denote the UAV’s transmit power for the destination GU at time . The instantaneous achievable rate in bps/Hz for this GU is thus expressed as
(7) 
where is the channeltonoise power ratio from the UAV to the destination GU . Let denote the maximum total transmit power by the UAV. For the downlink transmission from the UAV to the destination GUs, we then have the following power constraint .
IiB Problem Formulation
Generally speaking, a UAV serving as a multimode aerial platform may have two operation scenarios in practice: periodic operation versus onetime operation, explained as follows.
IiB1 Periodic Operation
With periodic operation, the UAV needs to remain airborne to serve the GUs periodically, where the GUs keep generating service requests to the UAV. We assume that the average rate requirements in bps for uplink and downlink communication corresponding to the different flows are , , and , , respectively. In particular, for the data relaying service in Group 3, the uplink and downlink average rate requirements for each pair should be balanced, i.e., , . Without loss of generality, we assume that the UAV flies above the GUs following a periodic trajectory with period , where is a design variable. Note that in practice, it is desirable to minimize in order to avoid large communication delay of GUs [11].
For notational convenience, define , and . Our objective is to minimize the UAV flight period , via jointly optimizing the UAV’s trajectory , the downlink transmit power , as well as the bandwidth allocation , while satisfying the average rate requirements by the GUs. The problem can be formulated as
(8a)  
(8b)  
(8c)  
(8d)  
(8e)  
(8f)  
(8g)  
(8h)  
(8i) 
where the constraint (8i) ensures that the UAV returns to the initial location at the end of each period.
Different from the prior work [31] which focuses on maximizing the minimum throughput over all GUs in downlink communication with given , we here study the flight period minimization problem in a more general setup, where uplink communication, downlink communication and data relaying modes are all taken into account and is also a design variable.
IiB2 OneTime Operation
In the second scenario, the UAV only needs to serve the GUs once by one single fly mission. This corresponds to many practical scenarios where the service requests by the GUs are intermittent. In this case, the UAV mission is regarded as completed once the throughput in bits (instead of the average rate as in periodic operation) for each information flow meets the target requirement of the GUs. Denote the uplink and downlink throughput requirements corresponding to different information flows as bits, , and bits, , respectively. Similar to the periodic operation scenario, for the particular data relaying service, the uplink and downlink throughput requirements should be balanced for each sourcedestination pair, namely , . Further, denote by the flight duration (or mission completion time) required by the UAV to meet the throughput requirements of all the information flows.
Furthermore, for data relaying in onetime operation scenario, we need to impose the stringent informationcausality constraints, i.e., at any time instant , the UAV can only forward the data that has already been previously received from the source GU in Group 3. Note that such information constraints do not need to be explicitly imposed for the periodic operation scenario since the UAV may forward the information received from the previous period, as long as the total information bits received from the source equal to that forwarded to the corresponding destination at each period to ensure the longterm balance. The informationcausality constraints for data relaying in onetime operation can be expressed as [14]
(9) 
Note that the lefthand side (LHS) of (9) is the aggregated information bits that have been forwarded by the UAV to the destination GU at time , and the righthand side (RHS) represents those which have been received from the source GU at the same time. For onetime operation, we aim to minimize the mission completion time via a joint trajectory, spectrum and power allocation design. In practice, minimizing the completion time is of high practical interest since it helps save more time/energy for the UAV to serve other missions. The problem can be formulated as
(10a)  
(10b)  
Note that in (P2), no constraints on the UAV’s initial and final locations are imposed, i.e., they can be freely designed for performance optimization. The developed results can be easily extended to include such constraints similarly as in [14].
Besides, it should also be noted that in the prior work [14], the special case of UAVenabled relaying with one pair of source and destination GUs has been studied, where the endtoend throughput is maximized with a predetermined time horizon . In (P2), we study the completion time minimization problem in the general setup with multiple GUs and modes, where the results in [14] cannot be directly applied.
Iii Proposed Solution for Periodic Operation
In this section, we consider the flight period minimization problem (P1) for the periodic operation. Problem (P1) is challenging to solve for two reasons. First, the problem requires to optimize continuous functions , and , which essentially involve an infinite number of optimization variables that are closely coupled with each other. Secondly, the integral in the LHS of (8a) and (8b) involve the optimization variable as the upper bound of the integration interval, which lack closedform expressions. To tackle these issues, we first introduce the following optimization problem for any given period :
(11a)  
(11b)  
Problem (P1.1) aims to maximize the minimum ratio between the achievable average rate and the target rate requirement of each GU. For any given flight period , denote the optimal value of (P1.1) as . It is not difficult to see that for any given , the target rate requirements of all GUs are achievable if and only if . Therefore, problem (P1) is equivalent to
(12) 
Lemma 1.
The optimal value of problem (P1.1) is nondecreasing with .
Proof:
Please refer to Appendix A. ∎
By applying Lemma 1, problem (P1.2) can be solved by applying a bisection search over until the equality in (12) holds. Thus, the main task of solving (P1) is to find an efficient algorithm for (P1.1) for any given .
To obtain a more tractable form of (P1.1), we apply a discrete statespace approximation. Specifically, the time horizon is equally divided into time slots, i.e., , , with representing the time step which is sufficiently small such that the distance between the UAV and the GUs can be assumed to be approximately constant within each time slot. Therefore, the UAV’s trajectory over can be specified by , . As a result, the UAV speed constraints (8h) can be represented as
(13) 
where denotes the maximum distance that the UAV can travel within each time slot. The bandwidth and transmit power allocation can be similarly discretized as , , , . Then, the achievable rate between the GUs and the UAV at time slot can be expressed as
(14)  
(15) 
where
(16)  
(17) 
Besides, , and are rewritten as , and , respectively. As a result, problem (P1.1) is reformulated as
(18a)  
(18b)  
(18c)  
(18d)  
(18e)  
(18f)  
(18g)  
(18h)  
(18i) 
where constraints (18a)(18i) represent the discretetime equivalents of (11a), (11b), (8c)(8i), respectively. As constraints (18a) and (18b) are nonconvex with respect to variables , and , problem (P1.3) is difficult to be directly solved in general. In the following, we propose an efficient suboptimal solution to (P1.3) based on successive convex optimization and block coordinate descent techniques, similarly as in [14]. The main idea is to solve the two subproblems of (P1.3) iteratively, namely the power and bandwidth optimization with fixed trajectory, and trajectory optimization with fixed power and bandwidth allocation. Then, the block coordinate descent method is employed to optimize the two sets of variables in an alternating manner until the objective value converges within a prescribed accuracy.
Iiia Power and Bandwidth Optimization with Fixed Trajectory
First, we consider the subproblem to optimize the UAV transmit power and bandwidth allocation , for any given feasible UAV trajectory . In this case, the timevarying variables in (16) and (17) are also determined. This subproblem of (P1.3) is given by
(19a)  
(19b)  
It can be shown that the LHS of (19a) is concave with respect to the bandwidth allocation , and the LHS of (19b) is jointly concave with respect to the bandwidth allocation and the transmit power , and all other constraints are convex. Therefore, (P1.4) is a convex optimization problem, which can be efficiently solved via existing software such as CVX [32] or applying the Lagrange duality [33], for which the details are omitted for brevity.
IiiB Trajectory optimization with Fixed Power and Bandwidth Allocation
In this subsection, we consider the other subproblem to optimize the UAV trajectory by assuming that the transmit power and bandwidth allocation are given. However, even with fixed power and bandwidth allocation, the trajectory optimization in (P1.3) is still a nonconvex problem due to nonconvex constraints (18a) and (18b). To tackle such nonconvexity, the successive convex optimization technique similar to that used in [14] and [15] can be applied, for which a lower bound of the original problem is sequentially maximized by optimizing the trajectory at each iteration. To this end, we need the following result.
Proposition 1.
Proof:
Please refer to Appendix B. ∎
Proposition 1 shows that for any given local trajectory , and are respectively lowerbounded by and , which are both concave functions with respect to . As a result, for any given local trajectory , a lower bound of the optimal value of the original problem (P1.3) with fixed power and bandwidth allocation can be obtained by solving the following problem
(22a)  
(22b)  
Note that due to the lower bound given in Proposition 1, if (22a) and (22b) are satisfied, then the constraints (18a) and (18b) with the same power and bandwidth allocation are guaranteed to be satisfied as well, but the reverse is not true. Therefore, the feasible region of (P1.5) is in general a subset of that of (P1.3), and its optimal solution serves as a lower bound to that for (P1.3) with fixed power and bandwidth allocation. (P1.5) is a convex optimization problem, which can be efficiently solved with the standard convex optimization techniques or existing solvers such as CVX [32].
IiiC Iterative Power, Bandwidth and Trajectory Optimization
Based on the results obtained above, we propose an iterative algorithm for (P1.3) based on the block coordinate descent technique. The details are summarized in Algorithm 1.
Since in each iteration of Algorithm 1, (P1.5) is optimally solved with given local trajectory , whose objective value is nondecreasing over iterations and upperbounded by a finite value, Algorithm 1 is guaranteed to converge to at least a locally optimal solution. Note that for step 4 of Algorithm 1, an alternative way is to successively optimize the trajectory multiple times until convergence. The resulted objective value is also nondecreasing over iterations, thus its convergence is also guaranteed.
Iv Proposed Solution for OneTime Operation
In this section, we study the optimization problem (P2) for the onetime operation. Similar to (P1), in order to solve (P2), we first consider the following problem for any given UAV operation time :
(23a)  
(23b)  
Problem (P2.1) aims to maximize the minimum ratio between the achievable throughout and the target requirement. For any given operation time , let the optimal solution to (P2.1) be denoted as . Then it is not difficult to see that all throughput requirements of (P2) are achievable if and only if . Therefore, (P2) is equivalent to finding the minimum such that . Furthermore, as the time only appears in the upper limit of the integral in (23a) and (23b) (no normalization by as in (P1.1)), it is quite obvious that the LHS of (23a) and (23b) are nondecreasing with . Thus, is also nondecreasing with . Therefore, (P2) can be solved by solving (P2.1) and applying a bisection search over the completion time .
Similar to Section III, for any given , problem (P2.1) can be recast in a discrete equivalent form as
(24a)  
(24b)  
(24c)  
where (24) represents the discretetime equivalent of the informationcausality constraints in (9). As constraints (24a)(24) are nonconvex, problem (P2.2) is difficult to be optimally solved. Similar to Section III, we apply the successive convex optimization and block coordinate descent techniques to (P2.2) by iteratively solving the two subproblems, namely the power and bandwidth optimization with fixed trajectory, and trajectory optimization with fixed power and bandwidth allocation, as detailed in the next.
Iva Power and Bandwidth Optimization with Fixed Trajectory
With the given UAV trajectory , problem (P2.2) reduces to optimizing the UAV transmit power and bandwidth allocation . By introducing slack variables , , problem (P2.2) can be equivalently transformed to
(25a)  
(25b)  
(25c)  
(25d)  
(25e)  
Note that if at the optimal solution to (P2.3), there exists one constraint in (25) that is satisfied with strict inequality, we are always able to decrease the corresponding transmit power and/or the bandwidth allocation to make the constraint active. This implies that there always exists an optimal solution to (P2.3) at which all constraints in (25) are active, and thus (P2.3) is equivalent to (P2.2) for any given trajectory. Furthermore, it can be verified that all constraints of (P2.3) are convex, thus (P2.3) is a convex optimization problem, which can be efficiently solved via standard convex optimization software such as CVX [32].
IvB Trajectory optimization with Fixed Power and Bandwidth Allocation
Next, we consider the other subproblem to optimize the UAV trajectory for any given transmit power and bandwidth allocation . To deal with nonconvex constraints (24a)(24), the successive convex optimization is employed based on the lower bounds given in Proposition 1. Specifically, for any given local trajectory, by introducing slack variables , the resulted problem is given by
(26a)  
(26b)  
(26c)  
(26d)  
(26e)  
where and are the lower bounds of , , , respectively, given in Proposition 1. Problem (P2.4) can be verified to be convex, which can be efficiently solved by CVX [32].
With the above two subproblems solved, (P2.2) can be solved by iteratively optimizing the power and bandwidth allocation and the trajectory with similar steps as in Algorithm 1. Furthermore, the completion time minimization problem in (P2) can be solved via a bisection search over while solving (P2.2) in each iteration. The details are omitted for brevity.
V Initial Trajectory Design
The proposed algorithms for both periodic and onetime operation scenarios require the UAV initial trajectory to be specified, and their converged results via the successive convex optimization and block coordinate descent techniques depend on the UAV trajectory initialization in general. In this section, we propose new trajectory initialization schemes for the periodic and onetime operation scenarios, respectively. Note that due to the additional informationcausality constraints for the onetime operation scenario, these two operation scenarios generally require different trajectory initializations.
Intuitively, the UAV trajectory should be designed so that when the UAV is scheduled to communicate with a particular GU, it should be as close to the GU as possible. One intuitive approach for trajectory initialization is to minimize the UAV traveling time among different GUs, so that when the given value of is sufficiently large, the UAV will be able to reach the top of all GUs to enjoy the best communication channel. Given the maximum speed , the problem of minimizing the traveling time is thus equivalent to minimizing the total traveling distance. The above approach will be used in the following designs, as detailed later.
Va Initial Trajectory Design for Periodic Operation
In this subsection, we design the UAV initial trajectory for Algorithm 1 in the case of periodic operation for any given value of . For notational convenience, let denote the set containing all GUs, i.e., , where GUs correspond to source GUs in while GUs correspond to destination GUs in . The locations of all GUs in , and , are compactly denoted as , where represents the index of the GU in .
For given , we first consider the problem of minimizing the traveling distance/time for the UAV to visit all GUs by determining their optimal visiting orders, which is essentially the classic TSP [34]. Although TSP is NPhard, various algorithms have been proposed to find highquality approximate solutions within a reasonable computational complexity [34]. After solving the TSP, we obtain the minimum traveling time required, denoted as , as well as the permutation order , with representing the index of the th GU to be visited. In the following, for any given flight period , the UAV initial trajectory is designed by distinguishing two cases, depending on whether is no smaller than , as follows.
Case 1: . In this case, is sufficiently large so that the UAV is able to reach the top of each GU within each flight period. The remaining time can be spent by the UAV to hover above the GUs. To obtain an effective method for determining the hovering time allocation among the GUs, let denote the time required for the UAV to satisfy the average rate requirement for GU , by assuming that the UAV only communicates with it when hovering on its top. We thus have
(27) 
Then the total hovering time can be proportionally divided among the GUs as
(28) 
Following the visiting order and the hovering time allocation in (28) for each GU, the initial trajectory in the case of can be constructed accordingly.
Case 2: . In this case, the given time is insufficient for the UAV to reach the top of all GUs. To design a feasible initial trajectory, we first specify a diskshaped region for each GU in , which is centered at the corresponding GU with radius . As illustrated in Fig. 2, the main idea is to minimize the UAV traveling distance by properly designing the UAV trajectory and radius , so that the UAV is able to reach each disk region. The problem can be formulated as
(29a)  
(29b)  
(29c) 
where constraints (29a) ensure that for each GU in , there exists at least one time instant such that the distance between the UAV and the GU is no larger than . This guarantees that all disks are traversed by the UAV.
For any given radius , denote the optimal value of (P3) as . It is not difficult to see that is nonincreasing with . Thus, the optimal solution to (P3) can be obtained by solving the corresponding problem with fixed , and then applying a bisection search to find the optimal radius . In the following, we focus on solving (P3) with any given radius .
Lemma 2.
The optimal trajectory to (P3) should only contain connected line segments.
Proof:
Similar to the proof of Theorem 1 in [35], Lemma 2 can be shown by contradiction. Suppose on the contrary that at the optimal solution , there exists at least one curved portion along the trajectory. Then we can always construct an alternative trajectory composed of line segments only, that achieves less traveling time . To this end, it is first noted that at the optimal solution to (P3), the UAV should always travel with the maximum speed , i.e., the constraint (29b) should be satisfied with equality.
For each node with , denote by as the earliest time instance when the UAV reaches its disk region with trajectory , i.e., . Then the trajectory of the UAV can be partitioned into portions, with the th portion specified by time interval , , where , . For the th time interval, we may replace the original trajectory portion with a line segment directly connecting and . Obviously, this replacement not only ensures the feasibility of (29a), but also reduces the traveling distance for the UAV. Therefore, if the optimal solution trajectory contains a curved portion, we are always able to construct an alternative trajectory by sequentially connecting that achieves . Thus, any trajectory with curved portion cannot be the optimal trajectory to (P3). This completes the proof. ∎
With Lemma 2, for any given , problem (P3) is recast to optimizing a set of waypoints inside the disks, which are the starting and ending points of the line segments, and finding the optimal permutation order to visit these waypoints. Let the waypoint inside the disk associated with GU be denoted as , . The traveling time required can be expressed as
(30) 
As a result, problem (P3) reduces to
(31) 
This is reminiscent of the classic Traveling Salesman Problem with Neighborhoods (TSPN) [36], which is a generalization of TSP and also known to be NPhard [37].
In the following, we propose an efficient approach to find a suboptimal solution to (P3.1). The key idea is to let the UAV visit each disk region based on the order obtained by the TSP algorithm (by ignoring the neighborhoods), i.e., , and then find the optimal waypoints inside all disks by solving a similar convex optimization problem as in [35]. With the visiting order obtained, (P3.1) is recast to a convex optimization problem, which can be efficiently solved via standard convex optimization techniques.
It is worth noting that TSPN has been extensively studied in the literature (e.g., [28], [38] and [39]). Based on the permutation obtained by the TSP algorithm, the authors in [28] adopted three evolutionary algorithms to find the shortest path with disjoint disks only. In [39], the authors proposed a combineskipsubstitute (CSS) scheme based on TSP, which is applicable to both joint or disjoint disks. However, there is no guarantee that the optimal waypoints can be found even with given visiting order. In contrast, by applying convex optimization in this work, the optimal waypoints are guaranteed with the given visiting order.