Wireless Power Charging Control in Multiuser Broadband Networks
Abstract
Recent advances in wireless power transfer (WPT) technology provide a costeffective solution to charge wireless devices remotely without disruption to the use. In this paper, we propose an efficient wireless charging control method for exploiting the frequency diversity in multiuser broadband wireless networks, to reduce energy outage and keep the system operating in an efficient and sustainable state. In particular, we first analyze the impact of charging control method to the operating lifetime of a WPTenabled broadband system. Based on the analysis, we then propose a multicriteria charging control policy that optimizes the transmit power allocation over frequency by jointly considering the channel state information (CSI) and the battery state information (BSI) of wireless devices. For practical implementation, the proposed scheme is realized by a novel limited CSI estimation mechanism embedded with partial BSI, which significantly reduces the energy cost of CSI and BSI feedback. Simulation results show that the proposed method could significantly increase the network lifetime under stringent transmit power constraint. Reciprocally, it also consumes lower transmit power to achieve nearperpetual network operation than other singlecriterion based charging control methods.
I Introduction
The performance of wireless systems is fundamentally constrained by limited device battery life. As frequent battery replacement/recharging by manual operation is often costly and inconvenient, numerous energyconservation methods have been proposed to prolong the operating lifetime of wireless communication networks, via transmit power control, energyaware medium access control (MAC) and routing selection, etc [1, 2, 3]. Alternatively, radio frequency enabled (RFenabled) wireless power transfer (WPT) technologies provide an attractive solution to power wireless devices (WDs) over the air [4]. By leveraging the farfield radiative properties of microwave, WDs could harvest energy remotely from the RF signals radiated by the dedicated energy transmitter [5]. Currently, RF power in milliwatt (mW) level could be effectively transferred to WDs from a distance of more than meters.^{1}^{1}1See Powercast Corp. website at http://www.powercastco.com. The received RF energy is sufficient to power the activities of many lowpower devices, such as sensors and RF identification (RFID) tags, where some commercial RFenabled WPT products have already been developed by Powercast Corp. Besides, the recent development of MIMO technology significantly boosts the energy transfer efficiency [6, 7], which also opens up more potential applications for RFenabled WPT in the future.
The practical advantages of WPT have recently attracted increasing efforts to develop wireless powered communication systems [4, 5, 6, 8, 10, 9, 7]. Among them, WPT for broadband networks is of particular interest, where both wireless information and power transfer could benefit from the broadband channel diversity [10, 9]. However, unlike information transfer, WPT does not introduce detrimental interference to unintended cochannel receivers. This fundamental difference motivates this study to redesign the MAC mechanism in WPTenabled broadband systems for efficient charging coordination among the distributed WDs, which are of general power usage besides communication purpose.
Efficient WPT control in broadband systems requires the energy transmitter to have accurate knowledge of both the channel state information (CSI) and battery state information (BSI) of WDs. While accurate CSI could effectively enhance the energy transfer efficiency, wellinformed BSI helps reduce energy outage rate by transmitting on the subchannels in favor of those closetooutage WDs. Although being assumed by many previous studies for the simplicity of analysis (e.g., [10, 9]), in practice, however, achieving perfect knowledge of CSI/BSI consumes significant amount of energy for the WDs to perform CSI estimation and CSI/BSI feedback. The energy cost to the WDs may even offset the power gain obtained from more refined transmit power allocation.
In this paper, we propose a new costeffective charging control method in WPTbased broadband networks. The proposed method has the following main features and advantages:

A multicriteria charging policy is proposed based on the analysis of expected network lifetime of WPTbased systems. Specifically, different metrics are used to evaluate the energyharvesting performance of the WDs depending on their residual battery levels or BSI.

The charging control policy is implemented by a novel limited CSI estimation mechanism embedded with partial BSI, which incurs very low complexity and energy cost on CSI/BSI feedback. In particular, simulations show that good system performance is achievable even with very limited CSI feedback, e.g., by estimating at most out of subchannels for a WD to feed back.

The network lifetime performance of the proposed multicriteria method is robust against different network deployment, and could be further optimized by tuning the control parameters according to specific applications.
The performance advantage of the proposed multicriteria charging control method is evaluated through extensive simulations and comparisons with other heuristically designed singlecriterion based benchmark schemes.
Ii System Model
We consider a WPTenabled broadband wireless network, where an energy node (EN) is connected to a stable power source and broadcasts RF energy to power distributed WDs. Each WD has an RF energy harvesting circuit and rechargeable battery of capacity to store the harvested energy and power the activities of the WD, e.g., event sensing or data transmission. The bandwidth of the system is Hz, which is equally divided into subchannels each with bandwidth . The wireless channels are assumed to be reciprocal in the uplink and downlink, and independent among the WDs. For the simplicity of exposition, we assume that the channel experiences block fading, where the subchannel gains remain constant in a transmission block of length and vary independently over different blocks. Notice that the block fading assumption does not affect the validity of our analysis or the operation of the proposed protocols.
In the th transmission block, , we use to denote the channel power gain of the th subchannel between the EN and the th WD, to denote the transmit power of the EN on the th subchannel. The total transmit power is bounded by , . Then, the harvested energy by the th WD within the block is [5]
(1) 
where is a fixed parameter denoting the energy harvesting efficiency which is assumed to be known by the EN. Let denote the residual energy of the th WD at the end of th transmission block, and denote the amount of energy consumed within the transmission block. Within each block, we assume that the energy consumption rate is constant so that the energy level increases or decreases monotonically. Then, the residual energy at the end of the th block is
(2) 
In this paper, is assumed to follow a general distribution with an average consumption rate , .
The output voltage of a battery decreases with the residual energy level. We say an energy outage occurs if the remaining energy of a WD is no larger than a certain threshold , such that normal device operation could not be maintained. Without loss of generality, we set throughout this paper. Given the initial battery level , we define network lifetime as the duration until a WD is in energy outage. Whenever an energy outage occurs, the device in energy outage will enter a hibernation mode, which harvests RF energy and returns to normal operation only after the battery level reaches a prescribed threshold (). This is a common practice to avoid frequent energy outage for continuous device operation. Notice that the results obtained from homogeneous battery assumption here could also be extended to heterogeneous case (i.e., WDs have different capacities and threshold parameters) by scaling the respective channel gains and energy consumption rates accordingly.
Due to channel diversities among the WDs over both time and frequency, the network lifetime is determined by the transmit power allocation strategy over time . Besides, the knowledge of CSI and BSI, although costly to obtain, is critical to the performance of transmit power allocation. In the following, we first analyze the impact of charging control method to the network lifetime. Based on the analysis, we then propose an efficient charging control method with limited CSI/BSI feedback mechanism.
Iii Charging Control Policy Analysis
A point to notice is that the network lifetime is affected by many factors besides transmit power allocation, such as EN placement, total transmit power and channel distributions. In this section, we assume that all the other parameters are fixed to focus on the design of transmit power allocation policy. We denote as the expected network lifetime achieved by a charging policy . Specifically, a charging control policy specifies the transmit power allocation in each transmission block, and thus the harvested energy , for and . To void trivial results, we assume that the expected network lifetime is finite regardless of the charging policy used, i.e.,
(3) 
where is the set of all feasible policies that satisfy the transmit sumpower constraint, . In other words, is assumed to be sufficiently small, so that the total energy harvesting rate is always lower than the consumption rate, to be consistent with the condition in (3). In fact, as verified later by simulations, a charging policy that achieves a longer under a given transmit power budget also requires lower transmit power to achieve nearperpetual network operation (i.e., is a very large number). Therefore, finding the optimal charging policy under finite network lifetime assumption also has important implication in practical system designs with the guarantee of sufficiently large .
Iiia Wireless Charging: A Gametheoretic View
The actual battery dynamic in (2) complicates the analysis because of the max/min operators, yet unable to provide extra insight into charging policy design. To make the problem tractable, we adjust the battery model as follows:

The energy level could surpass the battery capacity at the end of a transmission block, but such an overcharged battery cannot harvest any energy in the following transmission block(s) until the energy level drops below the capacity due to power usage;

When an energy outage occurs, the residual energy could be negative at the end of the current transmission block.
The first modification helps remove the min operator in (2), while ensuring that the energy level is approximately upper bounded by the capacity . Specifically, it overestimates the harvested energy when battery is fullycharged in a transmission block, but underestimates the harvested energy in the following transmission block(s). The second modification removes the max operator in (2). Both modifications have limited effect to the modeling accuracy, because a practical transmission block length is sufficiently small, such that the energy harvested/consumed within a block is marginal compared to the battery capacity. With the two minor modifications, we could express the battery dynamics as a random process , where
(4) 
denotes the harvested energy in the th transmission block of the modified battery model, with given in (1). The average energy harvesting rate is determined by the control policy , and denoted by . Accordingly, an energy outage occurs if .
Equivalently, the charging problem could be modeled by a gambling game with the WDs as gamblers. is the balance of gambler , who continuously plays a betting game with as the income and as the loss in the th bet, . The game starts with each gambler holding initial balance, and stops once a gambler’s balance becomes zero or negative. Then, the stopping time of the game is also the network lifetime. Evidently, the game is unfair because the average income and loss of each bet are not equal in general, i.e., . In particular, a gambler receives zero income in the th bet when . In the following, we derive the expected stopping time of the above gambling game by constructing an auxiliary fair game, such that Martingale stopping time theorem [11] could be applied.
IiiB Expected Network Lifetime
We first introduce an auxiliary random process , , where and
could be considered as the cumulative compensation given to gambler up to the th bet, where the gambler is compensated for in the th bet if its balance before the bet, i.e., , is less than . Otherwise, it receives if . The following lemma shows that the random process is a Martingale.
Lemma 1: The random process is a Martingale, or equivalently the bet is a fair game.
Proof: To prove Lemma , we need to show that for all it satisfies 1) and 2) [11]. Condition ) holds from the assumption that (the number of bets) is finite. For condition ), we have for each
where is an indicator function that equals if and otherwise. This completes the proof.
Then, the following Martingale Stopping Theorem [11] could be used to derive the expected network lifetime.
Proposition 1 (Martingale Stopping Theorem): Let be a Martingale and a stopping time that depends only on the value of . If , then .
In our case, corresponds to the number of bets until for some . For an initial state , we have
(5) 
where is the value of , and denotes the initial sumenergy of the WDs. Similarly, at the end of the th bet, we have
(6)  
where denotes the probability that the battery of the th WD is fully or overcharged, and denotes the expected residual sumenergy achieved by policy . From Proposition , we could infer that the righthand sides of (5) and (6) are equal. With simple calculations, the expected stopping time conditioned on is obtained as
(7) 
We notice that is always positive by our assumption that the total energy harvesting rate is smaller than the consumption rate, i.e., , . Besides, the network lifetime expression in [3] for conventional batterypowered wireless sensor network is a special case of ours when , i.e., WPT is not used.
IiiC Charging Policy Analysis
To prolong the expected network lifetime in (7), a charging policy should produce

low total residual energy ;

high total energy harvesting rate ;

and low overcharging probabilities ’s.
For condition ), the ideal scenario is for all the WDs to have zero residual energy simultaneously. In this case, the optimal charging policy is the maxmin approach, i.e., maximizing the minimum residual energy among the WDs after each energy transmission block. For the second condition, the optimal policy is the maxrate approach, i.e., maximizing the total harvested energy by all the WDs in each transmission block. For the third condition, the optimal policy is the minmax approach, i.e., minimizing the maximum residual energy among the WDs after each transmission block. In a practical wireless powered network, however, the above three charging policies may conflict with each other. For instance, the maxrate approach is strongly biased towards nearby WDs with high average channel gains. The power allocation of the maxmin approach, however, is in favor of WDs that are faraway from the EN with smaller channel gains. This simple fairnessefficiency tradeoff indicates that the three objectives could not be satisfied by a singlecriterion charging policy in general.
To balance the three objectives, we therefore consider a multicriteria approach. Our observation is that overcharging only occurs to closetocapacity WDs. Thus, the EN should apply a minmax criterion to the closetocapacity WDs, to discourage allocating power to the subchannels that may lead to overcharge in the coming transmission block(s). Besides, using maxmin approach to charge the currently closetooutage WDs could effectively reduce , because otherwise larger amount of energy will be harvested by the WDs of moderate/high energy levels in the subsequent transmission block(s), leading to higher . For those WDs of moderate energy levels, the maxrate approach is most suitable, because charging them with the highest efficiency does not have immediate effect to either ’s or , but could effectively enhance the sum charging rate .
The above analysis motivates a multicriteria charging control method that jointly considers the CSI and BSI. However, a practical question arises on how to efficiently feed back CSI/BSI to the EN in an energyconstrained broadband system with a large number of WDs and subchannels. To tackle this problem, we propose in the following a simple yet efficient charging control protocol based on limited CSI/BSI feedback.
Iv Proposed Protocol
In this section, we propose a multicriteria charging control protocol using a limited CSI feedback mechanism. The protocol operates in the following steps and is illustrated in Fig. 1. Without causing confusions, we omit the transmission block index in this section for the brevity of notation.

At the beginning of a transmission block, the EN sends pilot signals on the subchannels. Then, each WD estimates its own subchannel power gains, i.e., ’s;

Depending on the residual energy , each WD sends back to the EN orthogonal narrowband pilot signals on its strongest subchannels, with
(8) where are positive integers, and are two predetermined energy thresholds. The subset of subchannels reported by WD is denoted by , .

Here, we assume that the channel estimation by the EN is perfect. Besides the knowledge of the channel gains, i.e., , the EN is also aware of the quantized BSI of each WD by counting the number of pilots sent by the WDs, i.e., ’s. The EN then optimizes its transmit power allocation on all the subchannels using a policy detailed later in (9).

The WDs harvest RF energy in the remaining transmission block. Then, the iteration repeats from Step .
Instead of sending pilots on all the subchannels, each WD only reports on a small subset of subchannels with the largest channel gains, which significantly reduces the energy cost. The intuition behind is that transmit power is in general only allocated to relatively strong subchannels, thus the knowledge of weak subchannels’ CSI has marginal effect to the power allocation solution. Although consuming more energy to send pilots on a larger number of subchannels, from (8), a WD of lower residual energy could in fact benefit from a more favorable power allocation due to the better knowledge of CSI by the EN. On the other hand, the number of pilots sent by a WD also contains partial BSI indicating the energy level, which will be exploited by our proposed power control. In addition, the values of could be tuned in different scenarios, e.g., setting such that a WD sends energy request only when it is close to outage. The impact of these parameters will be shown by simulations.
The knowledge of CSI and BSI enables a multicriteria charging policy as discussed in Section III.C. In particular, from the values of ’s, the EN is aware of the subsets of WDs in low, moderate and high residual energy levels, denoted by and , respectively. We design the EN to allocate transmit power to the th subchannel, , by solving the following linear programming (LP) problem:
(9)  
subject to 
where denotes the estimated harvested energy by the th WD, given by , where
(10) 
The three terms in the objective of (9) denote the weighted energy harvested by WDs in and , respectively. and are two positive weights set for balancing the three terms. Specifically, the first term maximizes the minimum harvested energy of the closetooutage WDs. The second term maximizes the total harvested energy by the WDs with moderate energy level. The last term discourages the EN to charge the closetocapacity WDs. Generally speaking, a larger (smaller) would enhance the energy harvesting efficiency (fairness). A larger would increase the penalty to charge the WDs that are closetocapacity. In general, and are set much smaller than to ensure that priority is given to the closetooutage WDs.
Note that from (10), is calculated using both the explicit CSI reported on , and the implicit CSI for those subchannels not reported by WD . In (10), we use conditional expectation to calculate given that the channel gains of unreported subchannels are lower than the known channel gains in . Then, the calculation of for is determined by the distribution of ’s. Denote . For instance, the estimate is if ’s are independent and uniformly distributed within . If ’s follow independent exponential distribution with mean , the estimate is .
The proposed charging control protocol incurs low implementation cost. All the computations are borne by the EN, while each energyconstrained WD only needs to send out limited number of pilots based on the measurement of channel gains and its own residual energy level.
V Performance Evaluation
In this section, we evaluate the performance of the proposed multicriteria charging (MCC) control algorithm. Unless otherwise stated, the parameters used in all simulations are listed in Table I, which correspond to a typical indoor sensor/RFID network. Without loss of generality, the path loss exponent is , such that the path loss is roughly dB at meters from the EN. In practice, multiple ENs are needed to cover a large area, which is beyond the scope of this paper and will be considered in our future work. The wireless channel power gains follow exponential distributions with mean obtained from the path loss model. Here, we consider an i.i.d. energy consumption model where a WD consumes mW power with probability within a block, and no power with probability . In this case, the average energy consumption rate is mW, where a fullycharged battery will be depleted in about hours without WPT. The initial battery level of all WDs is of the capacity, and is assumed.
EN Tx power  W  Path loss exponent  

Center frequency  MHz  Tx block length  ms 
No. of SCs  Ave. WD power cons.  mW  
SC bandwidth  KHz  Battery voltage  V 
Tx antenna gain  Battery capacity  mAh  
Rx antenna gain  Pilot Tx power per SC  mW 
For the proposed MCC algorithm, we set mAh and . That is, a WD with less than of the battery capacity will report the best subchannels, while those with more than battery will report subchannel. Otherwise, a WD will report on subchannels. Besides, we set and in (9). We have also considered three singlecriterion based benchmark schemes for comparison, including

UNI: uniform power allocation, i.e., , .

MaxRate: maximize the total harvested energy by the WDs, i.e., .

MaxMin: maximize the minimum harvested energy by the WDs of the lowest energy level. That is, when , or when and , otherwise when , where denotes empty set.
The case of MinMax method (minimize the maximum energy harvested) is omitted here due to its poor charging performance when it is used alone. Notice that the UNI scheme is completely oblivious to the CSI and BSI, thus the WDs do not need to send any pilot. The MaxRate utilizes only CSI and the MaxMin approach uses both CSI and BSI. For fair comparisons, these two methods use the same CSI feedback method as the MCC algorithm.
In Fig. 2, we compare the average network lifetime achieved by the methods. We assume that WDs are uniformly placed within a ring region meters with the EN being the center, where a smaller (larger) leads to lower (higher) distance disparity to the EN. Fig. 2 shows the lifetime of different schemes as increases. Each point in the figure is the average performance of random placements, while the lifetime of each placement is the mean of independent simulations. We see that UNI has the worst performance for not being able to exploit the channel diversity. We also see that charging efficiency dominates the lifetime when is small, where MaxRate performs the best when . However, its performance degrades drastically as increases, as its charging control is strongly biased towards the nearby WDs. On the other hand, user fairness dominates when is large, where the MaxMin approach achieves the longest network lifetime for . In between, the proposed MCC method achieves the best performance for . Overall, it achieves over longer lifetime than MaxRate and MaxMin, and over performance gain than the UNI method. More importantly, it is robust in different placement scenarios, unlike the MaxRate and MaxMin methods that perform poorly when is either large or small. In addition, the performance of MCC could be further optimized according to the specific placement by adjusting the parameters and . We observe in experiments that a smaller is preferred when the network size is large and the WDs are more sparsely deployed, i.e., large . Detailed results are omitted here due to the page limit.
In Fig. 3, we plot the average network lifetime as the amount of feedback varies. Here, we fix as m. The xaxis is the value of . For convenience, we set and . We see that when the transmit power per subchannel (SC) is mW, achieves the longest network lifetime. However, as increases, energy consumed on CSI feedback would eventually offset the energy gain from more refined power allocation, resulting in a decreased lifetime. This is more evident as the transmit power increases to mW per SC, where the maximum lifetime is achieved when the minimum value of allowed in this case is attained, i.e., . The results show that for the proposed MCC, the pilot energy consumption plays a critical role in the resulting network lifetime.
We also plot in Fig. 4 the minimum power required by different charging methods to achieve nearperpetual network operation. Due to the randomness of channel fading and energy consumptions, it is not possible to truly sustain perpetual network operation in practice. Here, we say a WPTenabled system is nearperpetual if the network lifetime is longer than hours in all the independent simulations conducted. For fair comparison, we deploy the WDs in a line equally spaced from to . Not surprisingly, the UNI method requires the highest transmission power in all the placements. The transmit power of MaxRate is the lowest when , but increases drastically as increases. In contrast, MCC and MinMax require much lower transmit power than the other two methods. The power increment is slow as the distance disparity among the WDs increases. In particular, the proposed MCC outperforms the MinMax method due to its higher energy transfer efficiency. The results in Figs. 2 and 4 show that a scheme that achieves a longer network lifetime under a given transmission power constraint, is in general also more powerefficient to achieve selfsustainable operation. This also justifies the practical value of our analytical results in Section III under the finite network lifetime assumption.
Vi Conclusions
In this paper, we have proposed a novel multicriteria charging control method to prolong the lifetime of a wireless powered multiuser broadband system. The proposed charging policy is efficiently implemented using a limited CSI feedback mechanism embedded with partial BSI. Simulations results show that the proposed charging control could significantly increase the network lifetime under a fixed transmit power constraint, and is also power efficient to achieve selfsustainable network operations.
References
 [1] E. UysalBiyikoglu, B. Prabhakar, and A. El Gamal, “Energyefficient packet transmission over a wireless link,” IEEE/ACM Trans. Netw., vol. 10, no. 4, pp. 487499, Aug. 2002.
 [2] O. Younis and S. Fahmy, “HEED: a hybrid, energyefficient, distributed clustering approach for ad hoc sensor networks,” IEEE Trans. Mobile Comput., vol. 3, no. 4, pp. 366379, Oct. 2004.
 [3] Y. Chen and Q. Zhao, “On the lifetime of wireless sensor networks,” IEEE Commun. Lett., vol. 9, no. 11, pp. 976978, Nov. 2005.
 [4] S. Bi, C. K. Ho, and R. Zhang, “Wireless powered communication: opportunities and challenges,” to appear in IEEE Commun. Mag., available online at http://arxiv.org/abs/1408.2335
 [5] X. Zhou, R. Zhang, and C. K. Ho, “Wireless information and power transfer: architecture design and rateenergy tradeoff,” IEEE Trans. Commun., vol. 61, no. 11, pp. 47544767, Nov. 2013.
 [6] R. Zhang and C. K. Ho, “MIMO broadcasting for simultaneous wireless information and power transfer,” IEEE Trans. Wireless Commun., vol. 12, no. 5, pp. 19892001, May 2013.
 [7] J. Xu and R. Zhang, “Energy beamforming with onebit feedback,” IEEE Trans. Signal Process., vol. 62, no. 20, pp. 53705381, Oct. 2014.
 [8] H. Ju and R. Zhang, “Throughput maximization in wireless powered communication networks,” IEEE Trans. Wireless Commun., vol. 13, no. 1, Jan. 2014.
 [9] P. Nintanavongsa, M. Y. Naderi, and K. R. Chowdhury, “Medium access control protocol design for sensors powered by wireless energy transfer,” in IEEE Proc. INFOCOM, pp. 150154, Apr. 2013.
 [10] X. Zhou, R. Zhang, and C. K. Ho, “Wireless information and power transfer in multiuser OFDM systems,” IEEE Trans. Wireless Commun., vol. 13, no. 4, pp. 22822294, Apr. 2014.
 [11] G. Grimmett and D. Stirzaker, Probability and random processes, 3rd ed., Oxford University Press, New York, 2001.