Communications with decodeandforward relays in mesh networks
Abstract
We consider mesh networks composed of groups of relaying nodes which operate in decodeandforward mode, where each node from a group relays information to all the nodes in the next group. We study these networks in two setups, one where the nodes have complete channel state information from the nodes that transmit to them, and another when they only have the statistics of the channel. We derive recursive expressions for the probabilities of errors of the nodes and present several implementations of detectors used in these networks. We compare the mesh networks with multihop networks, the latter being formed by a set of parallel sections of multiple relaying nodes. We demonstrate with numerous simulations that there are significant improvements in performance of mesh over multihop networks in various scenarios.
1 Introduction
An old concept in radio communications, known as relaying, has been in the center of interest of various studies in communications (Ephremides, 2002; Goldsmith and Wicker, 2002; Ribeiro et al., 2008). To a good extent, this interest has been driven by commercial applications and by the ability of wireless networks to exploit relaying so that they have reduced energy consumption and thereby increased lifetime. In general, relaying is used to provide for improved error performance and capacity (Liu et al., 2009).
In networks with relaying, nodes cooperate in moving information from source to destination (Liu et al., 2009). In the study of these networks, the important notion of cooperative diversity was introduced in (A. Sendonaris et al., 1998), and then extended in (A. Sendonaris et al., 2003a) and (J. N. Laneman et al., 2004). A class of wireless networks that are formed by a set of parallel sections of multiple relaying nodes was thoroughly studied in the literature (Lin et al., 2006; T. Wang et al., 2007). They are known as multibranch multihop networks, and they can reach a full diversity order (T. Wang et al., 2007). However, unlike mesh networks, these networks do not allow for full connectivity among nodes.
Optimum detection in a network demands knowledge of all its channels. There is some work on this subject in the literature, and especially on particular network topologies. For example, in a recent paper, the problem of detection by a set of sensors and the communication of their information were considered together (B. Chen et al., 2004). Various fusion rules were proposed that correspond to different scenarios in terms of what is known for deriving the rules. The study was performed for multibranch multihop topologies, but not for general mesh networks.
An analysis of the maximum likelihood (ML) detector in cooperative multibranch multihop networks was presented in (D. Chen and J. N. Laneman, 2006). A closed form expression was obtained for the bit error rate of a network with one relay. The difficulties with ML detection in DF networks have led to several suboptimal alternatives. In (A. Sendonaris et al., 2003b), a variant of the maximalratio combiner (MRC), the MRC, was introduced to combine signals from several branches. In order to explicitly obtain a full diversity order, the cooperation MRC (CMRC) was proposed in (T. Wang et al., 2007). For this detector, the weight of the channel of relaydestination was chosen to maximize the equivalent SNR for the channel. In that paper the multibranch multihop topology was considered. It is interesting to mention that a multinode cooperative network can be viewed as a virtual MIMO system (Liu et al., 2009; A. K. Sadek et al., 2007). In successive phases the relays combine the received signals from previous relays and the source using the MRC criterion. The system is not optimal and it can be adapted to a mesh network.
Most of the above solutions assume perfect knowledge of the channel state information (CSI). However, having such knowledge can be expensive. For instance, in a sensor network the battery consumption can be high if the nodes have to inform about their CSI in a variable environment. In other circumstances it could be impossible to have complete knowledge of the CSI. For these cases it is important to develop schemes that operate with less information about the channel: instead of full information one has available only the channel statistics. There is some previous work in the literature on detection using channel statistics. In (A. Bravo et al., 2006), the average SNR is used as a weight for combining signals in a multibranch multihop relay network. In (Lin et al., 2006), optimum ML fusion rules are derived for a joint sensorcommunication problem with a multibranch multihop topology.
Data traffic in a sensor network is usually small. We take this statement as true, although it is not always the case, and we assume that most of the available channels can be used for cooperation and we do not put a limit to this number. This is a usual assumption in the literature, see (T. Wang et al., 2007). A consequence of this assumption is that the spectral efficiency is reduced. In the literature, there are examples of simple networks where the error probability and the spectral efficiency are improved at the same time (A. Sendonaris et al., 2003b). However, it is not straightforward to extend these results to arbitrary networks. In this paper, we look for the objective of minimizing the transmitted power, or, what is related to it, minimizing the error probability, with no restrictions in the number of available channels.
The cooperative detection problems mentioned above were posed for multibranch and/or multihop networks. For them, the optimal ML solution was abandoned in favor of tractable solutions. In this paper we present and analyze the optimal maximum a posteriori (MAP) detector for cooperative mesh networks, and we provide explicit analytical solution for a general topology. We consider optimal detection in two cases related to the channels in the network. One is when the CSI is available and the other when the channel statistics are only known. The first case was studied in part by the authors in (A. Bravo and P. M. Djurić, 2009). The relaying in the network is DF with uncoded and symbolbysymbol demodulators. In summary, the two main contributions in our paper are the following: (a) we derive optimal MAP detectors for the nodes in mesh networks and (b) we propose several implementations of the detectors for both, the case of known CSI and scenarios of known channel statistics. In the paper we do not consider the problems of routing and protocols that may arise, for example, in ad hoc mesh networks. How they relate to the proposed schemes here will be addressed elsewhere.
The paper is organized as follows. In Section 2 we formulate the problem. In Section 3, we provide the general MAP solution that holds for known CSI as well as for known channel statistics. In Sections 4 and 5 we present detectors in the first and higher groups of nodes, respectively, and provide the specific solutions for the above two scenarios. Simulation results that demonstrate the performance of the mesh networks and how they compare with the multibranch and multihop networks are shown in Section 6. In the last Section 7, we have some concluding remarks about our findings.
2 Problem statement
We observe a mesh network where its nodes are grouped into relay groups and where the notation signifies the th node of the th group. We denote the source by and the destination by , the superscript implying that a message is relayed times on its way from the source to the destination. For example, refers to the fourth node from the third group of nodes. We use the symbol to denote the total number of relay nodes in group . In Fig. 1 we show a drawing of a general mesh network of our interest.
In this paper, for comparison purposes, we also work with multibranch multihop networks. Multibranch multihop networks can be obtained from mesh networks when we remove connections between nodes so that each node in a group is connected to only one node from the previous group, as in Fig. 2. Clearly, a wireless network formed by a set of interconnected nodes can operate as a mesh network or as a multibranch multihop network. In mesh networks, a node receives and processes information from more than one node from a previous relay group, whereas in a multihop network, it does only from one node.
We consider binary modulations only (the extension to other modulations is straightforward), and we assume phase coherent reception. The received signal by node and transmitted by node , where , is denoted by
(1) 
where is the transmitted symbol, is the channel fading between and , and is the channel noise. In this formulation, is the realvalued fading envelope of the channel. We denote the transmitted symbol by the source with and not so that we avoid superfluous notation. The noise is modeled as a zero mean Gaussian random variable with variance , i.e., , and it is considered identically distributed in all the channels.
We assume a simple protocol where every node has access to an orthogonal channel. This can be implemented with time division, frequency division or hybrid multiplexing. This assumption could be considered unrealistic, but with a little traffic offered in a network, as is frequently the case in sensor networks, most of the available channels can be used for cooperation with the node source of information and the time/frequency channels can be reused by distant nodes. We also point out that in a real system the number of available channels is not necessarily small. For example, the Zigbee standard for sensor networks (ZigBee Alliance, 2009) envisages a hybrid access for its next release in order to increase the number of channels and, thus, reduce message collisions.
We distinguish two scenarios:

all the channels in the network are completely known (CSI is available to all the nodes), and

only the statistics of the channels are known, where it is assumed that the channels follow the Rayleigh distribution, i.e., for the fading of a channel we write
(2)
The signaltonoise ratio (SNR) in the former case is defined by
(3) 
and in the latter, by
(4) 
Obviously, we can have a combination of the two scenarios, but we will not consider it here.
Given the available information, the objective is to determine the optimal decisions at each node and study the performance of the systems under the two scenarios.
3 General solution
In this section we derive the decision rule of the nodes under the assumptions from the previous section. We denote the probability of correct decision of node in the th group by and the probability of error by . In the sequel, we assume .
Clearly, the nodes , make their decisions according to
(5) 
where , and is the likelihood of , with being or .
The following groups of nodes will receive in general more than one signal and a decision is made by using all of them. Let the signals received by node be denoted by the vector , and let the decisions of the nodes in the previous group, be given by . Then we have
(6) 
The decision rule at node is
(7) 
For and general mesh networks, the decision rule (7) requires that a node must know the states or statistics of the channels and the joint probability mass function (pmf) of correct decision of its relaying nodes, and . The former requirement in most studies is assumed satisfied. The second requirement is much more restrictive, and in the sequel we propose ways of dealing with it.
We note that in (6), we have likelihoods of the form . For them we have the following claim:
Claim 1: For the likelihood , in mesh networks, we can write
(8) 
Proof: The proof follows directly from the conditional independence of the channels and the decision independence of the nodes in the first group.
From Claim 1, for the likelihood ratio in nodes of group 1 and 2, , ; , we can write
(9)  
where
(10) 
is the probability of correct decision, and
(11) 
is the probability of error, with .
The likelihood ratio in (9), in general, does not have the same simple form for because of the dependence among the decisions of the relaying nodes. However, if we approximate
(12) 
we will obtain suboptimal detectors that use (9), as is shown in Section 5. We note that with the assumption (12), we only need to know the marginal probabilities of correct decisions of the relaying nodes.
Claim 2: If the channel likelihoods satisfy and for all , we have
(13) 
Proof: See Appendix 1.
We note that if the conditions for Claim 2 are satisfied, for optimal processing of the received signals all the information about the joint pmf of correct decisions needed by a node in the th group is in . Then, for the likelihood ratio (6) we can formally write
(14) 
where all the probabilities are conditioned on .
4 Detectors in the first group of nodes
In Section 2, we described two scenarios, one where the necessary CSI is available to the nodes, and another where only statistics of the channels are known. Here we describe the detectors of the nodes in the first group and we present their performances.
4.1 Completely known channels
In the case of known channels and based on the assumptions from Section 2, we can express the likelihood as follows:
(15) 
The decision rule of node is based on
(16) 
which simplifies to
(17) 
For the nodes in this group, we can easily find the probability of error, and it is given by
(18) 
with , where , and is the SNR of the channel that links the transmitter and the th node in group one (see (3)).
4.2 Channels with known statistics
When the channels are not known, and instead the nodes only have available the channel statistics, we proceed as follows (Niu et al., 2006). We write for the likelihoods
(19) 
where is given by (2).
For simplicity, we rewrite the last integral without subscripts and superscripts, and we get
It is easy to show that can further be expressed as
(20) 
where
(21) 
We can show that the integral in the above equation can be analytically solved. We get
(22) 
After substituting the solution (22) in (20) and returning all the superscripts and subscripts, we obtain
(23) 
where
In Fig. 3, we plotted the probability density functions (pdfs) of given for SNR 3dB for the two cases of interest, when the channels are completely known and when their statistics are only known. As expected, the latter case leads to deterioration in performance because then the pdfs are shifted towards and are flatter in comparison to the pdfs that correspond to known channels.
For the probability of error at node , we have
(24) 
where
i.e.,
(25) 
Note that is the probability of error of a coherent binary signaling over a Rayleigh fading channel (Proakis, 2001). The above integral can be solved and expressed in a closed form by using integration by parts, and the result is given by
(26) 
5 Detectors in the groups of nodes beyond the first group
The detectors for the nodes in groups are challenging to implement because of the need to know the joint probabilities of correct decisions of the relaying nodes (see (7)). It is interesting to point out that there is one exception to the need for pmfs, and it is the case of mesh networks with two nodes per group, known CSI, and assumptions as in Claim 2. Here, we describe four types of detectors for the nodes , , . We also briefly describe the detectors in multihop networks.
5.1 Estimation of the joint pmf based on Monte Carlo sampling (MCS)
The joint pmf of interest is , and we can show that it can be expressed as
(27) 
where the random variables are not independent.
From (6), we see that the s are expressed via the decisions , which allows us to propose a computational method for obtaining the joint pmf. The procedure is the following:

Each node generates samples where , from the mixture distribution given by
(28) 
The node computes the signs of the obtained , at the drawn , and stores them as a vector in a counter.

The process is repeated times.

From the obtained outcomes, the node obtains the estimates of the joint probabilities.
The generation of samples in the case of a mixture Gaussian is easy. When the channel statistics are known only, the sampling of must come from a distribution given by (23). To that end, one could apply rejection sampling.
Clearly, at the end, all the nodes in the th group will have different estimates of the joint pmf. The next group may combine all these estimates, for example, by taking the average of the estimates. This approach becomes tedious when the number of nodes in a group gets large. For example, when , one has to work with a joint pmf of a 10dimensional vector, which, in general, requires the computation of 1023 estimates. However, there are two ways to decrease the computational load of this approach and they are (1) to have a designated node in a group compute the joint pmf and (2) under an assumption of symmetry, instead of computing the values of elements, a node computes only elements.
5.2 Estimation of the joint pmf based on pilot signals (PS)
Suppose that node , receives messages from relay nodes, . In the rest of the subsection, we suppress the subscript and the superscripts . For example, for the received signals , we write .
Define the variable if and if . Let also
(29) 
where identifies the th binary digit of represented in binary notation. Thus, for example, corresponds to and to . In other words, the joint pmf of can succinctly be represented by the pmf of .
Similarly, we represent the joint pmf of the transmitted symbols of the relaying nodes. Let and be defined by
(30) 
and the joint pmf of is equivalent to the pmf of .
We want to estimate from pilot data (where the source, say, repeatedly transmits ). Then the node first estimates the pmf of and, based on the model for the channels, estimates the pmf of . We can write for
(31) 
where the probabilities are known, i.e., we can show that
(32) 
where
(33) 
where means
and is the probability of correct decision based on the transmitted signal from the th node, and is the corresponding probability of error, that is . This probability of error is obtained from (26).
We can represent the probabilities of by
and the probabilities of by
We can also construct a matrix, with probabilities as the ones defined by (32). Then we can write
(34) 
We note that the matrix is a doubly stochastic matrix, that is, its rows and columns sum up to one.
We reiterate that we estimate the vector from the pilot data, and we denote it by . Then the estimate of is readily obtained from
(35) 
This estimate, however, may produce negative probabilities (although the sum of all the elements of equals one), which is due to errors in estimating . Thus, we change the set of equations (34) to the following minimization problem with linear constraints:
(36) 
where has minimum norm and adds to 0. In Appendix 2, we show that the solution vector to the system (36) has elements given by
(37) 
where is the positive part of , , , and is a parameter related to the Lagrange multipliers. It is worth pointing out that (37) is in fact an upsidedown waterfilling scheme.
In summary, according to this scheme the source node transmits symbols known to the rest of the network. The nodes in groups can estimate from the received signals from their relaying groups the joint pmfs as just shown. When the estimation is completed, the nodes have the necessary information for processing of the signals with unknown transmitted symbols. Obviously, a drawback of the scheme is that some transmitted symbols are used for estimating joint pmfs and some power is used for estimating the pmfs.
5.3 Estimation of the joint pmf based on a predefined joint pmf (PJP)
The biggest values of the joint pmf are for combinations when most of the arguments . This suggests that we approximate the joint pmf by
(38) 
where is probability obtained from the number of combinations of the elements of that satisfy the condition in (38), is the cardinality of the set, and . In other words, the probability of correct decision is constant and different from zero whenever the number of arguments is big enough. This detector is efficient because only a few terms in the numerator of the likelihood ratio (6) are different from 0. Claim 2 is used for selecting the significant terms of the denominator of (6). This simple detector does not require the use of pilot signals.
5.4 Estimation of the joint pmf based on the assumption of independent decisions (ID)
When for we approximate the joint pmf of by (12), for decision making we can use the likelihood ratio (9). If we take the logarithm on both sides of (9), and we define
(39) 
where
(40) 
and
(41) 
representing the likelihood ratio for the transmitted symbol of , we can write for the overall loglikelihood ratio
(42) 
The decision rule then simplifies to
(43) 
For the probability of error at , we can write
We note that under these assumptions the new random variables are independent. Clearly, if the s are identically distributed, so are the loglikelihood ratios .
The implementation of this detector requires information about the probability of correct decision of the nodes in the previous group , . These probabilities can readily be computed as described in subsections A and B. The computations in this case are much simpler because the problem is broken into independent problems, where in each problem we estimate the probability of detection (error) of only one relaying node at a time.
Another possibility is that the nodes compute their own probabilities of errors from (LABEL:Pe) and transmit them to the nodes of the next group. The advantage of this approach is that there is no need for transmitting pilot signals. We note that in the transmission of the probability, some form of quantization must be adopted. In Section 6, we study the influence of quantization via simulations.
We now briefly describe the implementation of the scheme in the cases of completely known channels and known channel statistics, respectively.
Completely known channels
For the conditional distributions of the observations, we can write
(45) 
and for the likelihood ratios,
(46) 
whereas for the loglikelihood terms we have
(47)  
We can also find the distribution of by change of variables. We obtain
(48) 
The probabilities of error of the nodes in group are obtained by the recursive equation (LABEL:Pe). A closed form analytical solution of the recursive equation in the general case, however, cannot be obtained.
Channels with known statistics
5.5 Detectors for multihop networks
In multihop networks, the variables , , are conditionally independent. Therefore, for detection at the relay nodes one uses the likelihood ratio in (9). The last node in the network, , collects the information of all the nodes from the previous group and uses the decision rule (43), where the sum loglikelihood is calculated from (42). For this node, the obtained results under the independence assumption apply.
6 Simulations
We conducted many experiments where we compared the performances of the mesh and multihop networks shown in Figs. 1 and 2, respectively. In the experiments, the channels were Rayleigh distributed and, for better understanding of the results and easier comparisons, the SNRs were the same for all the nodes. Most of the simulations were for the scenario where the channel statistics are known. This case presents two main advantages in real networks: (a) the information about the statistics is provided just once for each node and (b), the probabilities of correct detection are evaluated also once. By contrast, when detecting with complete CSI at the nodes, the channel information must be updated if the CSI changes, and the probability of correct detection has to be evaluated accordingly.
First, we considered a mesh network with a decision rule based on complete knowledge of the probabilities of correct decision of the nodes of the previous groups and based on (7). The results of this setup were a benchmark for comparison. The joint pmfs of correct/incorrect decisions of the groups for were computed using the MCS method.
In Fig. 4, we show in the curve Mesh MCS the probabilities of error of mesh networks with different number of hops when all the joint pmfs were known. In the simulations, the number of nodes per group was 10 and the SNR 3 dB. The error probability decreased as the number of hops increased until a point where it remained constant with the number of hops. The spatial redundancy introduced by the mesh network lowered the error probability until an error floor was reached.
In Fig. 4, we also present the performance of the mesh network using the PS method. As can be seen, the obtained results are close to the ones of mesh networks that have complete information about the pmfs.
To these curves, four other curves are also displayed. One of them shows the performance of the PJP detector. With the curve Mesh ID Q=4 bits, we show the performance of the detector based on the ID detector, and where the probabilities of correct decision are quantized with four bits. Note that a node uses the probabilities of correct decision of previous nodes and quantizing these variables reduces the amount of information to be distributed. Finally, with the Multihop ID Q=4 bits curve we plotted the performance of the multihop network, and with the MRC curve, we displayed the performance of the MRC detector (Proakis, 2001). For the multihop network, the probabilities of correct decision of the previous node were again quantized with four bits. The MRC detector was implemented by combining the signals arriving to a node using the MRC criterion.
The worst performance was clearly achieved by the multihop network. Also, its performance deteriorated steadily as the number of hops increased. The performance of the network whose nodes had to estimate the joint decision pmfs of the nodes in the previous group performed almost as well as the network whose nodes knew the joint pmfs. Surprisingly well performed the mesh network that used the independence assumption and where the nodes quantized their probabilities with four bits. Next in performance of the mesh networks came the one that employed the simple detector and finally, the worst was the network with MRC detectors.
The analytical study of the quantization effect on the ID detectors is difficult. Instead, we performed experiments where the number of bits for representing the quantized probabilities was either one bit (the quantized probabilities were zero and one) or four bits (for 16 probability values, that is, the quantized probabilities were 0, 0.0666, , 1). We also obtained results when the probabilities were not quantized. The results are shown in Fig. 5 . There we see the probability of error as a function of the number of hops. The nodes assumed independence of the decisions of the nodes of the previous group. The SNR was 3 dB. In all the mesh networks with 10 nodes per group, the BER decreased with the number of hops. The results show that four bits were enough for achieving almost as good performance as the one when there was no quantization. Interestingly, using only one bit for quantization also yielded good results. Note that representing the probability of correct decision with one bit is a form of selection combining: the nodes with low probability withdraw themselves from combining in the following group of nodes.
In the case of five nodes per group (curve Mesh ID 5 nods/grp Q=4 bits) the decrease in probability of error probability was slower. The performance of this network was worse than that with 10 nodes per group by an order of magnitude. In Fig. 5, we also plotted the performance of two multihop networks. One of them was the same network whose performance is shown in Fig. 4, and the other corresponds to the multihop network with CMRC detectors (Multihop CMRC 10 nods/grp) (T. Wang et al., 2007). Note that the CMRC detector uses complete information about the channel. Nevertheless, it has a larger error probability than the multihop ID detector (curve Multihop ID 10 nods/grp Q=4 bits), where only channel statistics are used. In both cases the error probability increases very fast with the number of hops.
Finally, in Fig. 6 we present curves of probability of error versus SNR for a mesh network with MCS detectors, a mesh network with ID detectors and four bits for quantization, and a multihop network. The number of nodes per group was 10 and the curves were obtained after 9 hops. From the experiment we can draw the following conclusions: a) the assumption of independence is adequate and gives results that are very close to the case of dependence between the probabilities of correct decision of the nodes in a given group and b) the mesh networks offer great advantages in performance over multihop networks.
7 Conclusions
In this paper we studied the performance of mesh networks whose nodes operate in the decodeandforward mode. In particular, we investigated the probability of error in these networks as a function of SNR, number of nodes per group, and number of hops. We studied two cases, where in the first case the nodes knew the CSI and in the second, the nodes worked only with the statistics of the channels. We compared the mesh networks with multihop networks and showed the gain in performance of the former with respect to the latter. In the presented work we dealt with binary modulations, but the generalization of the proposed detectors to deal with ary, , modulations is not difficult.
Appendix 1
Proof of Claim 2
We prove the claim by induction. Suppose that the channel likelihoods satisfy and , and that the claim is true for , i,e. Then we have the likelihood ratio (6)