Scheduling in Wireless Networks under Uncertainties: A Greedy Primal-Dual Approach

Qiao Li (qiaoli@cmu.edu) and Rohit Negi (negi@ece.cmu.edu)
Department of Electrical and Computer Engineering
Carnegie Mellon University
5000 Forbes Ave., Pittsburgh, PA 15213
Abstract

This paper proposes a dynamic primal-dual algorithm to solve the optimal scheduling problem in wireless networks subject to uncertain parameters, which are generated by stochastic network processes such as random packet arrivals, channel fading, and node mobility. The algorithm generalizes the well-known max-weight scheduling algorithm proposed by Tassiulas et al., in which only queue length information is used to compute the schedules when the arrival rates are uncertain. Using the technique of fluid limits, sample-path convergence of the algorithm to a solution arbitrarily close to optimal is proved, under the assumption that the Strong Law of Large Numbers (SLLN) applies to the random processes which generate the uncertain parameters. The performance of the algorithm is further verified by simulation results. The method may potentially be applied to other applications where dynamic algorithms for convex problems with uncertain parameters are needed.

I Introduction

Scheduling in wireless networks involves efficiently allocating network resources among competing network users in the presence of uncertainties. These uncertainties may be due either to unexpected events, such as link failures, or to intricate cross-layer interactions in wireless networks. For example, the packet arrival rates, which depend on upper-layer dynamics such as routing and congestion control protocols, may be unknown (e.g., [1], [2]). As another example, the wireless channel statistics may also be unknown (e.g., [2]), since they depend on complex network events such as channel fading, power control and node mobility.

In the presence of such uncertain parameters, it may no longer be optimal to use the static allocation approach (e.g., [3]), which produces periodic schedules by solving a static underlying convex optimization problem (usually of exponential size) with estimates of the uncertain parameters. In particular, if these uncertain parameters converge slowly, or are time varying, the estimates may fail to track the changes in their true values, which often leads to suboptimal schedules. Further, it may be impractical to estimate the uncertain parameters for large wireless networks, as the number of parameters may grow fast (e.g., exponentially) with the size of the network. For a simple illustration, consider a wireless network with $n$ links, where each link randomly switches on or off after a random number of time slots. In this case, a complete specification of the network topology probabilities may require on the order of $2^n$ parameters, which quickly becomes impossible to estimate as $n$ grows.

On the other hand, online algorithms (such as [1], [2], [4], [6], [7]) are more robust to changes in the uncertain parameters (such as arrival rates), since they use queue length information (which can also be interpreted as prices [9], [10]) for scheduling in each time slot. For example, it has been shown in [4] that such algorithms can achieve network stability even if the “instantaneous rates” of the traffic vary arbitrarily inside the network capacity region. Further, compared to the estimation-based approach, these online algorithms may be more scalable with the network size, in the sense that the dimension of the queue length vector corresponds to the number of constraints (such as the rate constraint for each link), which usually grows slowly, whereas the number of uncertain parameters can grow very fast (e.g., exponentially).

In this paper we solve a general class of optimal wireless network scheduling problems with uncertain parameters, whose underlying static problem is described by the convex optimization problem OPT in Section II. Essentially, we require that the structures of the convex objective function and the convex constraint functions are known, except for the values of the uncertain parameters. These parameters are generated by certain stochastic processes and observed by the network gradually over the time slots. We propose a greedy primal-dual dynamic algorithm (Algorithm 1 in Section III) to achieve the optimal scheduling asymptotically. Using the technique of fluid limits [12], optimality can be guaranteed under the assumption that all the network processes generating the uncertain parameters satisfy the SLLN (see details in Section II). Note that this assumption is quite mild, since we can guarantee optimality as long as these processes converge, no matter how slowly the convergence happens. Thus, intuitively, our algorithm can automatically track the convergence of these processes and correct mistakes made within any finite time history.

Our algorithm is a generalization of the well-known max-weight algorithm [1], which was shown to be throughput optimal for i.i.d. arrival processes. Our algorithm is related to, but different from, the utility-optimal scheduling algorithm by Neely [2], which achieves optimal scheduling by cleverly transforming the problem into optimizing the time average of the utility (and constraint) functions, to which a dual-type algorithm applies. Our algorithm also differs from the primal-dual algorithm by Neely [5], since we use a scaled queue length in the scheduling, which corresponds to an approximate gradient. Stolyar [6] also proposed a primal-dual type scheduling algorithm, and proved its optimality using a fluid limit obtained from a different scaling. Since the fluid scaling in [6] is taken over different systems, it is hard to relate optimality in the fluid limit to optimality in the original system. Finally, our algorithm can be used as a MAC-layer solution within the general framework of cross-layer optimization (e.g., [4], [7], [8], [10]) for wireless networks.

The rest of the paper is organized as follows: Section II describes the queueing network model as well as the optimization problem OPT, and Section III describes the scheduling algorithm. Section IV proves the optimality of the algorithm, Section V illustrates the algorithm performance in simulation, and finally Section VI concludes the paper.

II Network Model and Problem Formulation

In this section we describe the queueing network and propose the optimization problem OPT. We first introduce the queueing network model.

II-A Queueing Network

We consider the scheduling problem at the medium access control (MAC) layer of a multi-hop wireless network, where the network is modeled as a set of links. We assume a time-slotted network model, and in each time slot the network is in one of a finite set of network states. The network state can be used to model network topology, channel fading, user mobility, etc. We further assume that these network states can be measured by the user nodes, which are assumed to be equipped with channel monitoring devices. (Note that although the network states are global, they often allow (approximate) decompositions (e.g., [2]) according to either the geographic structure or channel orthogonality, in which case one only needs to measure local network states.) We associate with each network state a finite set of resource allocation modes, where each mode corresponds to a configuration of network resource allocation, such as carrier and frequency selection in OFDM systems, spreading code choice in CDMA systems, and time slot assignment in TDMA systems.

Denote by $f$ the vector of uncertain parameters, which is generated by a cumulative stochastic vector process $F(t)$ whose time average converges to $f$. Specifically, the assumptions on $F(t)$ are: 1) it is subject to the SLLN, i.e., with probability 1 (w.p.1),

\lim_{t \to \infty} \frac{1}{t} F(t) = f, \qquad (1)

and 2) it has uniformly bounded increments in each time slot:

\| F(t+1) - F(t) \|_\infty \le C, \qquad (2)

where $C$ is a finite constant. For a specific example, consider the cumulative network state process:

G_s(t) = \sum_{\tau=1}^{t} \mathbf{1}\{ s(\tau) = s \}, \qquad (3)

where $s(\tau)$ is the network state at time slot $\tau$, and $\mathbf{1}\{\cdot\}$ is the indicator function, i.e., $\mathbf{1}\{s(\tau)=s\} = 1$ if $s(\tau) = s$ and 0 otherwise. Thus, the SLLN implies that w.p.1, the time average $G_s(t)/t$ converges to the asymptotic fraction of time the network spends in state $s$.

As another example, consider the external packet arrival process $A(t)$, a vector representing the cumulative external packet arrivals during the first $t$ time slots. Similarly, the SLLN implies that w.p.1,

\lim_{t \to \infty} \frac{1}{t} A(t) = \lambda. \qquad (4)

Further, we require that the maximum number of packet arrivals in any time slot is uniformly bounded:

\| A(t+1) - A(t) \|_\infty \le C_A, \qquad (5)

where $C_A$ is a finite constant.
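As a toy illustration of assumptions (1)–(5) (a sketch under assumed data; the rates, bound, and variable names are hypothetical and not from the paper), the following Python snippet simulates a cumulative arrival process with uniformly bounded per-slot increments and checks that its time average approaches the underlying mean rates:

```python
import numpy as np

rng = np.random.default_rng(0)

num_links = 3
T = 200_000
lam = np.array([0.3, 0.5, 0.2])   # true (unknown to the scheduler) mean rates
max_arrivals = 4                  # uniform bound on per-slot increments, cf. (5)

A = np.zeros(num_links)           # cumulative arrival process A(t), cf. (4)
for t in range(1, T + 1):
    # Bounded i.i.d. per-slot arrivals with mean lam (Binomial(4, lam/4)).
    a_t = rng.binomial(max_arrivals, lam / max_arrivals)
    A += a_t

print("empirical rates A(T)/T:", A / T)   # close to lam by the SLLN
```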

We finally describe the queueing system model. The queueing dynamics of the network are modeled as follows:

(6)

where $Q(t)$ is the queue length vector at time slot $t$, and $R(t)$ is the routing matrix, whose $(l,k)$-th entry is 1 only if link $l$ serves as the next hop of link $k$ at time slot $t$, as specified by the routing protocol, and is 0 otherwise. Note that the routing matrix is a function of the network state, and therefore the SLLN implies

(7)

Here $D(t)$ is a vector representing the cumulative packet departures during the first $t$ time slots, which are determined by the resource allocation modes specified by the scheduler in each time slot. Specifically, at each time slot with a given network state, if the scheduler chooses a resource allocation mode, there is an associated departure vector, each entry of which corresponds to the number of packets transmitted successfully by the corresponding link. Note that the choice of resource allocation mode is subject to the constraint that no queue is served beyond its current length, so that the queue lengths never become negative. This constraint can easily be satisfied in general systems. For example, if the allocation modes correspond to independent sets of the interference graph (see, for example, [7]), one can simply transmit on the subset of links with nonempty queues, which still forms an independent set. In a compact form, we can express the departure process as

(8)

where the columns of the matrix are the departure vectors of the individual resource allocation modes, and each entry of the vector corresponds to the number of time slots that the corresponding resource allocation mode is chosen during the first $t$ time slots.
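A minimal sketch of the per-slot queue dynamics just described (assuming a 0/1 routing matrix and departures truncated so that queues never go negative; the names and toy topology are hypothetical):

```python
import numpy as np

def queue_update(q, arrivals, departures, routing):
    """One slot of the multi-hop queue dynamics sketched in (6).

    q          : current queue lengths, shape (L,)
    arrivals   : external arrivals in this slot, shape (L,)
    departures : packets served on each link this slot, shape (L,)
    routing    : routing[l, k] = 1 if link l is the next hop of link k
    """
    # The scheduler is assumed never to serve more than the queue holds.
    departures = np.minimum(departures, q)
    # Served packets leave their own queue and join the next-hop queue.
    return q + arrivals + routing @ departures - departures

# Tiny example: 3 links in a line, link 1 feeds link 0, link 2 feeds link 1.
R = np.array([[0, 1, 0],
              [0, 0, 1],
              [0, 0, 0]])
q = np.array([2, 3, 1])
q = queue_update(q, arrivals=np.array([1, 0, 2]),
                 departures=np.array([1, 2, 1]), routing=R)
print(q)   # queues remain nonnegative
```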

A basic requirement on the scheduler is that it should achieve rate stability [12], i.e.,

(9)

so that the departure rate of each link is equal to the arrival rate, as required by the underlying static optimization problem OPT, which we formulate in the next subsection.

II-B Optimization Problem

In this section we introduce the optimization problem OPT, which is implicitly solved by optimal schedulers. The problem OPT is as follows:

(10)
s.t. (12)

In the above formulation, $x_s$ denotes the resource allocation vector for network state $s$. That is, each entry of $x_s$ is the asymptotic time fraction (assuming the limit exists for now) that the corresponding resource allocation mode is chosen, during the time slots in which the network state is $s$. Thus, $x_s$ is subject to the simplex constraint (12). The stacked vector $x$ of all the $x_s$ represents the total resource allocation specified by the scheduler. The objective $c(x; f)$ is a general convex cost function of $x$, and $g(x; f)$ is a vector of general convex constraint functions of $x$. The additional parameter $f$ represents the uncertain parameters, which is valid under the assumption that the corresponding processes are subject to the SLLN. Finally, we assume that both $c$ and $g$ are continuously differentiable as functions of $x$.

The formulation of OPT is quite general and can be used to model various applications in the literature. For example, if we want to minimize the total transmission power, we can choose the cost function as follows:

(13)

where the weight of each network state is its asymptotic time fraction, and the associated power vector has one entry per resource allocation mode, equal to the power consumption when that mode is chosen in that state. Thus, the cost function in (13) can be interpreted as the average power consumption of the scheduler. Note that we can also encode a power constraint into the constraint functions by choosing

(14)

where the diagonal matrix has one diagonal entry per resource allocation mode, equal to the power consumption when that mode is chosen in the corresponding network state, and the right-hand side is the power constraint vector. In this case, (14) is equivalent to imposing a constraint on the average transmission power. In order to encode the network stability constraint, we can instead choose

(15)

Thus, (15) requires that the average external and internal arrivals should be less than the average departures, in which case the network is rate stable.
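To make the static problem concrete, the following cvxpy sketch solves a hypothetical single-state instance of OPT with the average-power objective of (13), the simplex constraint of (12), and the rate-stability constraint of (15), assuming for illustration that the arrival rates are known; it illustrates the problem structure and is not the paper's formulation or code:

```python
import cvxpy as cp
import numpy as np

# Hypothetical single-state instance: 4 resource allocation modes, 3 links.
D = np.array([[1.0, 0.0, 1.0, 0.0],     # D[l, m] = packets link l sends
              [0.0, 1.0, 1.0, 0.0],     #           when mode m is scheduled
              [0.0, 0.0, 0.0, 1.0]])
power = np.array([1.0, 1.0, 2.5, 0.8])  # power drawn by each mode
lam = np.array([0.3, 0.4, 0.2])         # arrival rates (assumed known here)

x = cp.Variable(4, nonneg=True)          # time fraction of each mode

objective = cp.Minimize(power @ x)       # average power, cf. (13)
constraints = [
    cp.sum(x) == 1,                      # simplex constraint, cf. (12)
    D @ x >= lam,                        # rate stability, cf. (15)
]
cp.Problem(objective, constraints).solve()
print("optimal time fractions:", x.value)
print("minimum average power :", power @ x.value)
```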

III Scheduling Algorithm

In this section we describe the algorithm to solve OPT. Following a standard approach for solving constrained convex optimization problems [11], we transform OPT into another static “penalized” problem, PEN, to which our scheduling algorithm directly applies. We then introduce the scheduling algorithm, which solves PEN and, therefore, also solves OPT.

III-A Transformed Problem

Assuming that OPT is strictly feasible, we first change the constraints in (12) as follows

(16)

where $\epsilon$ is a small scalar, and the accompanying constant is sufficiently large that the inequality and equality constraints are equivalent. Denote by $V(\epsilon)$ the optimal cost when the constraint is changed to (16); thus, the optimal value of OPT is $V(0)$. We have the following sensitivity lemma, stating that $V(\epsilon)$ is a good approximation of $V(0)$ for sufficiently small $\epsilon$.

Lemma 1 ([11])

Denote two Lagrange multipliers, for the perturbed problem and for OPT, respectively. We have

(17)

We next define the transformed problem PEN as follows:

s.t.

where $\beta$ is a large constant that controls the approximation accuracy, and the penalty term can be chosen from various standard penalty functions [11], e.g.,

(18)

for exponents $\kappa \ge 1$. In particular, the standard Lyapunov drift analysis (e.g., [1], [2], [4], [7]) corresponds to the case $\kappa = 2$.
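For concreteness, one standard one-sided power penalty consistent with the description above (our assumption, since the displayed form of (18) is not reproduced here) is

\phi_\kappa(y) \;=\; \frac{1}{\kappa}\,\bigl[\max(0,\, y)\bigr]^{\kappa}, \qquad \kappa \ge 1,

whose derivative $\phi_\kappa'(y) = [\max(0, y)]^{\kappa-1}$ is, for $\kappa = 2$, proportional to the constraint violation; when applied to the stability constraint (15), this violation is tracked by the scaled queue length, which is why the case $\kappa = 2$ recovers the queue-length weighting familiar from Lyapunov drift arguments.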

Consider any solution of PEN. The following result holds:

Lemma 2

.

Proof:

Consider a solution of OPT with the constraint in (12) replaced by (16). We then have a chain of inequalities in which one step holds because the point in question solves PEN, and another because the perturbed OPT solution satisfies the constraint (16). Thus, the claim holds. \qed

In the following we will focus on solving PEN, since Lemma 1 and Lemma 2 guarantee that PEN achieves an objective value which is arbitrarily close to the optimum of OPT. We next describe the scheduling algorithm.

III-B Algorithm Description

The problems OPT and PEN are static, whereas the network is dynamic and must be described by time series. Therefore, before describing the algorithm, we need to define dynamic counterparts of the static variables. We first define the empirical resource allocation variable

(19)

i.e., each entry corresponds to the fraction of the first $t$ time slots in which the corresponding resource allocation mode is chosen while the network is in the corresponding state. Note that we have

(20)

Thus, this quantity can be interpreted as the empirical value of the resource allocation variable defined in PEN. Similarly, we denote the empirical value of the uncertain parameter as

(21)

i.e., it is formed by directly taking the time average of the process $F(t)$. Further, define the empirical value of the auxiliary variable as

(22)

where the cumulative process is defined by

(23)

and the per-slot value is computed by the scheduler in Algorithm 1.

Finally, we introduce some notation. Denote the gradient operators with respect to the resource allocation variable and the auxiliary variable, respectively. Further, with an abuse of notation, we use the following abbreviated notation:

  Step 1. At each time slot, given the current network state, choose an allocation mode according to
(24)
  Step 2. Choose the auxiliary variable such that
(25)
and update the empirical variables in (19), (21) and (22) accordingly.
Algorithm 1 Optimal Scheduling

The algorithm is described in Algorithm 1. Essentially, the algorithm updates the empirical resource allocation and auxiliary variables by computing descent directions in Step 1 and Step 2, respectively, where the direction chosen in Step 1 is an all-zero vector except for a one at the entry of the selected mode. Further note that the constraint that queues never become negative can be satisfied implicitly with regular cost and penalty functions, i.e., assuming the cost of transmitting on a set of links is always no smaller than that of transmitting on any of its subsets.

From their definitions, these empirical processes are naturally updated recursively as follows:

Thus, Algorithm 1 can be viewed as a stochastic gradient algorithm for PEN, where the randomness comes from the time-varying cost and penalty functions, which are driven by the changes in the uncertain parameters.
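The following Python sketch (hypothetical names; a paraphrase of Steps 1–2 assuming the two gradients are available numerically, not the paper's implementation) makes the greedy primal-dual structure explicit: Step 1 selects the simplex vertex (resource allocation mode) with the smallest directional derivative for the current state, and Step 2 moves the auxiliary variable to the boundary of its box as indicated by the sign of its gradient.

```python
import numpy as np

def algorithm1_step(grad_x_state, grad_z, z_lo, z_hi):
    """One slot of the greedy primal-dual update (cf. Steps 1-2 of Algorithm 1).

    grad_x_state : gradient of the penalized objective w.r.t. the allocation
                   vector of the *current* network state (one entry per mode)
    grad_z       : gradient w.r.t. the auxiliary variable
    z_lo, z_hi   : box (hypercube) bounds on the auxiliary variable
    """
    # Step 1: the LP over the simplex is solved at a vertex, i.e., the
    # resource allocation mode with the smallest directional derivative.
    mode = int(np.argmin(grad_x_state))

    # Step 2: the LP over the hypercube is solved at a boundary point,
    # chosen coordinate-wise from the sign of the gradient.
    z_choice = np.where(grad_z > 0, z_lo, z_hi)
    return mode, z_choice

def update_running_average(avg, new_sample, t):
    """Update an empirical (running-average) quantity after time slot t."""
    return avg + (new_sample - avg) / float(t)

# Toy usage: 3 modes in the current state, auxiliary variable in [0, 1]^2.
mode, z = algorithm1_step(grad_x_state=np.array([1.3, -0.4, 0.2]),
                          grad_z=np.array([0.6, -0.1]),
                          z_lo=np.zeros(2), z_hi=np.ones(2))
print(mode, z)   # -> 1 [0. 1.]
```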

In general, the optimization in (24) requires tracking the empirical variables. However, in applications the structure of the cost and penalty functions often allows a much simpler computation. For example, in the important case of optimal power scheduling, where the cost function is (13) and the constraint is (15) with the typical quadratic penalty, we have

where the weight of each state is its empirical time fraction. Thus, the optimization in (24) essentially requires only the queue length information (note that the empirical time fraction becomes an irrelevant scaling factor in the optimization). In particular, if we are only interested in rate stability, i.e., if the objective function is set to zero, the optimization in (24) is equivalent to

(26)

which is the same as the max-weight back-pressure algorithm proposed in [1]; a minimal sketch of this special case is given below. We conclude this section with Lemma 3, which formally establishes, in essence, the descent property of Algorithm 1.
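A minimal sketch of this rate-stability special case (hypothetical topology and feasible schedules; the rule itself is the classical max-weight back-pressure rule of [1]): in each slot, choose the feasible activation set maximizing the sum of back-pressure weights (queue differentials) times link rates.

```python
import numpy as np

def max_weight_schedule(q, routing, independent_sets, rates):
    """Back-pressure max-weight rule: pick the feasible set of links that
    maximizes the sum over links of (own queue - next-hop queue) * rate.

    q                : queue lengths, shape (L,)
    routing          : routing[l, k] = 1 if link l is the next hop of link k
    independent_sets : list of 0/1 activation vectors (feasible schedules)
    rates            : packets per slot served by an activated link, shape (L,)
    """
    backpressure = q - routing.T @ q          # differential backlog per link
    weights = [np.dot(backpressure * rates, a) for a in independent_sets]
    return independent_sets[int(np.argmax(weights))]

# Tiny example: 3 links, link 1 feeds link 0, link 2 feeds link 1.
R = np.array([[0, 1, 0],
              [0, 0, 1],
              [0, 0, 0]])
sets = [np.array([1, 0, 1]), np.array([0, 1, 0])]   # two feasible schedules
print(max_weight_schedule(q=np.array([5, 9, 2]), routing=R,
                          independent_sets=sets, rates=np.ones(3)))
```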

Lemma 3

The following properties hold for Algorithm 1:

  1. The allocation direction chosen in Step 1 solves the following problem (GRAD-X):

    (27)
    s.t.
  2. The auxiliary variable chosen in Step 2 solves the following problem (GRAD-Z):

    (28)
    s.t.

Thus, the variables computed by Algorithm 1 can be interpreted as the points in the feasible region of PEN which achieve the minimum inner product with the corresponding (stochastic) gradients.

Proof:

For 1), note that GRAD-X is a Linear Programming (LP) problem over a simplex, and therefore the solution is obtained at a vertex [11] with the minimum directional derivative. For 2), note that GRAD-Z is an LP over a hypercube, and therefore the solution is obtained at a boundary point. The claim then follows by noting that the relevant gradient does not depend on the auxiliary variable. \qed

IV Optimality Proof

In this section we prove the optimality of Algorithm 1. There are two issues to address: 1) we need to show that Algorithm 1 achieves the optimum of OPT asymptotically, and 2) we need to show that Algorithm 1 is feasible for OPT, i.e., constraint (12) cannot be violated. We first briefly introduce fluid limits, which serve as the key technique in the optimality proof.

IV-A Fluid Limits

We extend the domain of all processes to continuous time by linear interpolation, and define the fluid scaling of a function as

h^r(t) \triangleq \frac{1}{r}\, h(rt), \qquad (29)

where $h$ can be any of the network processes defined above. It can be shown that these scaled functions are uniformly Lipschitz continuous. Thus, according to the Arzela-Ascoli Theorem [13], any sequence of scaled functions indexed by $r$ contains a subsequence which converges uniformly on compact sets to a set of absolutely continuous functions (and, therefore, functions that are differentiable almost everywhere [13]). We define any such limit as a fluid limit. (Note that fluid limits are denoted by a bar.) We next state some properties of the fluid limits.

Lemma 4

The processes in any fluid limit satisfy the following: for any time $t$, we have w.p.1,

(30)

and the following properties hold w.p.1: for all $t$,

(31)
(32)
Proof:

(30) follows from Algorithm 1, where each per-slot choice is bounded between its lower and upper limits. (31) and (32) follow directly from the (functional) SLLN. \qed

We next define the resource allocation variables and auxiliary variables in the fluid limit as follows (compare with (19) and (22)):

(33)
(34)

Similarly, define the following variables as the counterparts of the per-slot choices in Algorithm 1:

(35)
(36)

The following lemma states that both sets of variables are feasible points for PEN.

Lemma 5

For any fluid limit and any time $t$, we have

  1. The fluid-limit variables defined in (33)-(34) are feasible for PEN:

    (37)
    (38)
  2. The variables defined in (35)-(36) are also feasible for PEN:

    (39)
    (40)
  3. The derivatives of the fluid-limit variables satisfy

    (41)
    (42)
Proof:

For 1), (37) follows from applying (30) to the definitions in (33)-(34), and (38) follows from applying (32). Similarly we can prove 2). 3) follows from direct calculation. \qed

We are now ready to prove the optimality of Algorithm 1.

IV-B Optimality Proof

For the ease of presentation, we use

(43)

as shorthand notation (note that these are the functions in the fluid limit), with an abuse of notation. We next establish the following key technical lemma, which essentially extends the optimality property of Lemma 3 to the fluid limit.

Lemma 6

Let a fluid limit and a time $t$ be given. The following properties hold:

  1. The fluid-limit counterpart of the Step 1 choice (defined in (35)) solves the following problem (GRAD-XBAR):

    (44)
    s.t.
  2. The fluid-limit counterpart of the Step 2 choice (defined in (36)) solves the following problem (GRAD-ZBAR):

    (45)
    s.t.

Thus, the optimality in Lemma 3 still holds in fluid limits.

We first outline the proof. For 1), since GRAD-XBAR is an LP over a simplex, the optimum must be attained at the vertices with the smallest gradient component. Thus, it is sufficient to prove that any resource allocation mode receives zero weight in the fluid limit whenever there is another mode such that

(46)

which follows from the optimality shown in Lemma 3 along a convergent subsequence. For 2), we will prove that for any feasible point of (45), we have

(47)

which also follows from the optimality in Lemma 3 along a convergent subsequence.

Proof:

For the clarity of presentation, the proof is moved to the Appendix. \qed

Based on the above lemma, we are now ready to prove that Algorithm 1 achieves the optimal cost in the fluid limit.

Lemma 7

For any fluid limit, we have, for all $t$,

(48)

where the right-hand side is the optimal value of PEN. Thus, optimality is achieved in the fluid limit.

We first outline the proof. Note that it is always true that

(49)

since the fluid-limit variables are always feasible points of PEN. Thus, the claim holds if we can prove the reverse inequality. This can be done by defining a proper “Lyapunov” function

(50)

and showing that it is non-increasing, by using the properties in Lemma 6 and the convexity of the objective.

Proof:

Consider the “Lyapunov” function of (50) in any fluid limit. From Lemma 5 we know that for any $t$ the fluid-limit variables are feasible for PEN, and therefore the function is nonnegative by the definition of the optimal value of PEN. On the other hand,

where the first step is obtained by substituting the derivative expressions of Lemma 5, the second follows from Lemma 6, i.e., the fluid-limit directions are solutions of GRAD-XBAR and GRAD-ZBAR, respectively, the third is due to the convexity of the objective, and the last is because the comparison point is the solution of PEN, by definition. Thus, we have

(51)

from which we conclude that the Lyapunov function vanishes, and hence (48) holds. \qed

Having established the optimality in the fluid limit, we are now able to prove optimality in the original system. The following theorem states that Algorithm 1 achieves the optimal cost in the original network.

Theorem 1

(optimal cost) In the original network, the following holds w.p.1:

(52)
Proof:

Suppose that the claim is not true. Then there is a sequence of times such that

(53)

From the Arzela-Ascoli Theorem [13], there is a subsequence which converges to a fluid limit. Lemma 7 implies

(54)
(55)
(56)

where the first step follows from the fact that, for all $t$,

(57)
(58)
(59)
(60)

and that the cost function is continuous, while the last step is because of Lemma 7. Thus, we have a contradiction, and the claim holds. \qed

In the next subsection we prove the feasibility result, namely, that the limit points produced by Algorithm 1 are indeed feasible for OPT.

IV-C Feasibility Proof

Note that Algorithm 1 is designed to solve PEN. Thus, in order to prove that the scheduler produces feasible points for OPT, we need the following lemma, which connects the objective function value of PEN to the constraints of OPT.

Lemma 8

The following property holds for PEN: for large enough $\beta$, we have

(61)

for any solution of PEN.

Proof:

For ease of presentation, we only consider the penalty function in (18), although the proof can easily be extended to the general case. Note that from Lemma 2 we have

(62)

Thus, (61) holds by choosing $\beta$ sufficiently large. \qed

Finally, we conclude this section with the following theorem, which states that the limit points produced by Algorithm 1 are always feasible for the original problem OPT. This, combined with Theorem 1, proves the optimality of Algorithm 1 for OPT.

Theorem 2

(feasibility) For sufficiently large $\beta$, we have

(63)

for any constraint function of OPT.

Proof:

Suppose that this is not true. Then there exists a sequence of times such that

(64)

From the Arzela-Ascoli Theorem, there is a subsequence which converges to a fluid limit. Thus, we have

where the first step can be argued similarly as in the proof of Theorem 1, the second follows from Algorithm 1, and the last is because of the assumption in (64). But according to Lemma 7, the fluid limit solves PEN, and therefore Lemma 8 implies that

(65)

Contradiction! Therefore the claim holds. \qed

Thus, Algorithm 1 produces feasible points for OPT, and achieves a cost which is arbitrarily close to the optimum, by properly selecting the parameters $\epsilon$ and $\beta$.

V Simulation

In this section we verify the performance of Algorithm 1 through a simulation of a random wireless network, shown in Fig. 1. There are 7 links in the network, where square nodes denote the transmitters and round nodes denote the receivers. We simulate a special case of OPT, the following minimum power scheduling problem, which we denote as POW:

(66)
s.t. (68)

where each element of the power vector corresponds to the power consumption when the corresponding independent set is chosen; in the simulation we use a fixed choice of this power vector. Here, $\lambda$ denotes the arrival rate vector, which is assumed to be the only unknown parameter in the network. Thus, (68) corresponds to the rate stability constraint.
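As a toy illustration of the slowly converging sources used in the simulation (assumed rates and drift profile; not the paper's simulation code), the following snippet shows how the running empirical average, which is all the algorithm implicitly relies on, tracks the limiting arrival rate even though the instantaneous rate keeps drifting:

```python
import numpy as np

rng = np.random.default_rng(1)

T = 100_000
lam_final = 0.4                      # limiting arrival rate of one source

A = 0.0
est = np.zeros(T)
for t in range(1, T + 1):
    # Slowly converging source: instantaneous rate drifts toward lam_final.
    lam_t = lam_final * (1.0 - 1.0 / np.sqrt(t))
    A += rng.binomial(1, lam_t)      # bounded increments (0 or 1 packet)
    est[t - 1] = A / t               # empirical rate seen by the scheduler

print("empirical rate after T slots:", est[-1])   # approaches lam_final
```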

Fig. 1: An example wireless network with 7 links, where square nodes are transmitters, and round nodes are receivers.

Fig. 2 shows the convergence of the cost function (the bottom sub-figure) with slowly converging sources (the top sub-figure) over the simulated time slots. It can be observed from the bottom sub-figure that our algorithm achieves the optimal cost. Further, by comparing the convergence of the cost and of the arrival processes, we can conclude that Algorithm 1 tracks the uncertain parameters dynamically. It is further observed that the maximum queue length in the network remains moderate, so that the constraint in (68) is clearly satisfied.

Fig. 2: Convergence of the average cost with sources exhibiting different convergence behaviors.

VI Conclusion

In this paper we formulated a general class of scheduling problems in wireless networks with uncertain parameters, under the assumption that these parameters can be obtained as the limiting empirical averages of certain stochastic network processes. We proposed a class of primal-dual type scheduling algorithms, and showed their optimality as well as feasibility using fluid limits.

Appendix: Proof of Lemma 6

Proof:

We first prove 1). Let a sequence of scaled functions be given which converges to a fluid limit. In the fluid limit, suppose that there are a time $t$ and two resource allocation modes such that

(69)

where the gap is a small positive constant. Then, by continuity, there is a neighborhood of $t$ such that for all times in this neighborhood we have

(70)

Further, since the gradient is continuous (and therefore uniformly continuous when restricted to a compact local region), there is a tolerance such that sufficiently close arguments satisfy the corresponding bound for all times in this neighborhood. Now we define

(71)
(72)
(73)

Then, the definition of fluid limits implies that, for all sufficiently large indices along the convergent subsequence and all times in the neighborhood,

(74)

Thus, combining the above, we have the corresponding bound for all times in the neighborhood. Further, by comparing the above definitions to those of the empirical variables in (19), (21) and (22), respectively, we conclude that they are essentially the same, except for a difference in time scale. Thus, the following holds in the original system: for all sufficiently large times in the corresponding interval,

Therefore, according to Lemma 3, the suboptimal mode is never chosen in any time slot during this interval, and the corresponding empirical count remains constant during the interval, from which we conclude that its derivative in the fluid limit is zero. Therefore, following the definition that