Soft Cache Hits and the Impact of Alternative Content Recommendations on Mobile Edge Caching
Abstract
Caching popular content at the edge of future mobile networks has been widely considered in order to alleviate the impact of the data tsunami on both the access and backhaul networks. A number of interesting techniques have been proposed, including femtocaching and "delayed" or opportunistic cache access. Nevertheless, the majority of these approaches suffer from the rather limited storage capacity of the edge caches, compared to the tremendous and rapidly increasing size of the Internet content catalog. We propose to depart from the assumption of hard cache misses, common in most existing works, and consider “soft” cache misses, where if the original content is not available, an alternative content that is locally cached can be recommended. Given that Internet content consumption is increasingly entertainmentoriented, we believe that a related content could often lead to complete or at least partial user satisfaction, without the need to retrieve the original content over expensive links. In this paper, we formulate the problem of optimal edge caching with soft cache hits, in the context of delayed access, and analyze the expected gains. We then show using synthetic and real datasets of related video contents that promising caching gains could be achieved in practice.
C.2.1Network Architecture and DesignStore and forward networks, Wireless communication \categoryC.4Performance of SystemsModelling techniques
1 Introduction
In the context of cellular networks, it is widely believed that aggressive densification, overlaying the standard macrocell network with a large number of small cells (e.g., pico or femtocells), is a promising way of dealing with the ongoing data crunch [1]. As this densification puts a tremendous pressure on the backhaul network, researchers have suggested storing popular content at the “edge”, e.g., at small cells [2], user devices [3, 4, 5, 6], or vehicles acting as mobile relays [7] in order to avoid congesting the capacitylimited backhaul links, and reduce the access latency to such content.
Local content caching has been identified as one of the five most disruptive enablers for 5G networks [8], sparking a tremendous interest of academia and industry alike. While caching had been widely studied in peertopeer systems and content distribution networks (CDNs) [9], the number of storage points required in future dense HetNets are many orders of magnitude more than in traditional CDNs (e.g., 1000s small cells per area covered by one CDN server). Therefore, the storage space per local cache must be significantly smaller to keep costs reasonable. Hence, even though studies assuming a large (CDNtype) cache deep inside the core network [10] give promising hit ratios, only a tiny fraction of the constantly and exponentially increasing content catalog could realistically be stored at each edge, leading to low “local” cache hit ratios [11, 12].
Additional “global” caching gains could be sought by increasing the “effective” cache size visible to each user through: (a) small cell overlaps, where each user is in range of multiple cells and caches (e.g., in the femtocaching case [2]), (b) collocated users overhearing the same broadcast channel and benefiting from cached content in other users’ caches (as in coded caching [13]), and (c) delayed content access, where a user might wait up to a TTL for its request, during which time more than one (fixed [6] or mobile [4, 5, 7]) caches can be seen. These ideas could theoretically increase the cache hit ratio significantly, when the “global” cache size becomes large enough (e.g., when, in the latter example, the aggregate size of all caches a user sees within a TTL becomes comparable to the content catalog). Nevertheless, in most practical cases a local edge cache would realistically fit at most of the catalog (e.g., just the entire Netflix catalogue is about 3PBs). Even if the above methods offered a effective cache increase, they would not suffice to achieve significant cache hit ratios (e.g., in the notation of [13], the key factor would be equal to , leading to a global caching gain of , a mere of extra gain).
Operators, are thus left with a very costly dilemma: bear a huge cost for the backhaul infrastructure (e.g., fiber everywhere) or bear a huge cost for CDNlike storage at each and every small cell. We believe this dilemma stems from the common underlying assumption of almost every caching scheme to try to satisfy every possible user request, either from the local cache or, in the worst case, the content server. This leads to an immense catalogue of potential content. Our main assertion in this paper is that, in an Internet which is becoming increasingly contentcentric and entertainmentoriented, a radically different approach could be beneficial, namely moving away from satisfying a given user request towards satisfying the user. E.g., a user requesting a content X, not available locally (e.g., a fan wanting to follow last weekend’s premier league’s games), might be equally satisfied (in the best case) or not fully dissatisfied (in many cases), if she receives another content Y related to X (e.g., another premier league game from that weekend). Another example is users streaming content in sequence (e.g., browsing YouTube videos backtoback or listening to personalized radio). In that case, the selected content at each step is often recommended related to the previous one, and the user might be almost equally happy with many alternatives. We will use the term soft cache hit to describe such scenarios. Finally, we believe such a system is timely given the recent interest of content providers with sophisticated recommendation engines, such as NetFlix and YouTube (i.e., Google), to act as Mobile Virtual Network Operators (MVNO) in the context of RAN Sharing [14].
To this end, we perform here a preliminary analysis and performance evaluation of such a system, in order to obtain initial insights. We first formulate the problem of edge caching with soft cache hits, and analyze the expected gains. We then show using both synthetic data and a real dataset of related video contents that interesting caching gains could be achieved in practice. Our problem formulation and analysis takes place in the context of delayed content access via static or mobile small cells [7, 6], for two reasons: (a) we believe such delayed access is interesting for lowcost users (e.g., 2 euro plans for operators like Free [15]) or developing regions, and (b) could be easily combined with soft cache hits to achieve multiplicative gains. Nevertheless, the basic tenets of our approach are equally applicable to femtocaching (i.e., the framework of [2]) or even other PHYaware caching systems [16].
To the best of our knowledge, the closest related work to the idea of soft cache hits is Roadcast [17], proposing a queryresponse based P2P VANET system, where users’ query requirements can be relaxed in order to get a matching response sooner. Nevertheless, this work focuses mostly on content similarity metrics and considers heuristics to achieve a square root based allocation policy, known to be optimal in P2P systems. Square root policies are suboptimal in our problem setup, as proven later, with or without soft cache hits [7].
2 Problem Setup
Content Model: We consider a wireless network with randomly distributed users, requesting contents from a catalogue with contents. A user requests content with probability . Without loss of generality (“w.l.o.g.”) we assume all contents have the same size.
Network Model: Our network consists of small cells (SC). These SCs can be either static (as in the femtocaching model [2]) or mobile (e.g. a vehicular cloud as in [7]). We denote the set of all SCs as . We also assume that each SC is equipped with storage capacity of contents. Accessing content directly from the local cache, i.e. a cache hit, is considered “cheap” while a cache miss leads to an “expensive” access (e.g. of the backhaul link in [2] or the macrocell in [7]).
Delayed Access Protocol: If the requested content is not available in a nearby small cell, the user waits until it encounters other small cells (as a result of user or cell mobility), until a TimeToLive . If the content is not found in any SC within , a cache miss occurs and the content is fetched over the expensive link.
Meeting Model: Meetings between each user and each SC are IID, with the residual time until such a meeting occurs being a random variable with CDF .
Lemma 2.1
If there are total SCs storing the requested content, the probability of not encountering any of them within is
(1) 
The above result follows directly from the definition of and the assumption of IID meetings.
For simplicity, in this paper we will focus on , so that . The identical meeting rates assumption can be further relaxed, as explained in Section 5.
Up to this point, the problem setup is the same as in [7, 6]. The main departure from that model is captured in the following.
Content Relation Graph: Each content has a set of related contents. Let denote the utility a given user gets if she originally asks for content but instead receives content , where and . The set of related contents can be formally defined as: . These relations define a content relation matrix (or graph) .
Delayed Access with Soft Cache Hits (SCH): A user again performs delayed access. However, if the requested content is not found within , but a content in is found in one of the encountered caches, a soft cache hit occurs (and thus no expensive access is needed). A cache miss occurs if neither the requested nor any related content is found within , in which case the original content is retrieved over the expensive link. The soft cache hit utility is equal to . We will consider two main cases for .

Soft Cache Hits (Case 1): . Any related content gives a cache hit. As soon as one is found, the user stops looking.

Soft Cache Hits (Case 2): . If a related content is found before , the user now continues looking for until . If it fails, a soft cache hit occurs and the access to the expensive link is still avoided. However, the utility attained is less than 1 (equal to ), which creates an interesting tradeoff. If neither nor any related is found by , then a cache miss occurs, as usual.
3 Caching with Related Content
3.1 Objectives
The goal in the above defined problem is to minimize the number of bytes accessed over the expensive “link” (which is, as explained, a radio access link to a macrocell and/or the backhaul network). When all contents have the same size, this is simplified to minimizing the number of (expensive) accesses, or equivalently, maximizing the cache hit ratio.
Definition 1 (Feasible Placement)
Let denote the number of SC caches storing content . A placement vector is “feasible”, if it satisfies the following constraints:
(2)  
(3) 
are the main optimization variables for our problem. Constraint (2) says that the number of SCs storing content is nonnegative and at most equal to the total number of SCs, and constraint (3) that the total number of content replicas stored at all the edge caches cannot exceed their total capacity.
In the traditional case of delayed access no soft cache hits are allowed. This will serve as our baseline scenario. The problem objective (i.e., the expected hit ratio) in this case is given in the following lemma.
Lemma 3.1 (Cache Hit Ratio  Base)
Assume a feasible placement vector . The cache hit rate, i.e., the expected number of user requests served locally when no soft cache hits are allowed is equal to
(4) 
The objective (Eq.(4)) in the above lemma is straightforward in light of Lemma 2.1 and the model of Section 2.
As explained earlier, when we do allow soft cache hits, if content is requested, a cache hit can occur also if other contents (related to ) can be accessed on time. The modified objective for Cases 1 and 2 of the content relation graph is given in the following two lemmas (the proofs are based on basic probabilistic arguments, and are omitted for brevity).
Lemma 3.2 (Soft Cache Hit Ratio (Case 1))
Assume a feasible placement vector , and a content relation graph , where . The cache hit rate for is equal to
(5) 
Lemma 3.3 (Soft Cache Hit Ratio (Case 2))
Assume a feasible placement vector , and a content relation graph , where , and . The cache hit rate for is equal to
(6) 
The main difference between these two cases is that, in the first case, finding a related content gives utility and is equivalent to a normal cache hit. However, in the second case, a related content allows the operator to avoid accessing the expensive link, but is penalized because the utility for the user is lower, leading to a utility of (we remind the reader that in the second term of Eq.(6) includes all related contents , such that , but does not include content ).
3.2 Performance Improvement Under the
Baseline Placement
Maximizing the objective of Lemma 3.1 within the feasibility region of Definition 1, defines the optimal cache allocation problem for the baseline scenario (no soft cache hits). This is in general an INLP (Integer NonLinear Program) that relates to a “multiple knapsack” problem (with equal capacities and logarithmic rather than linear utilities) and is NPhard to solve. Various polynomial approximation algorithms exist with good performance when the size of the caches are large enough to fit many contents. One such approximation can be achieved by solving a continuous relaxation of the problem (related to the fractional knapsack problem), where the optimization variables are continuous. In that case, it is easy to show that the baseline problem is convex, whose optimal solution can be found analytically using Lagrangian multipliers and solving the KKT conditions (we refer the interested reader to [7] for more details). Specifically, the optimal solution is given by
(7) 
where , , and is an appropriate Lagrange multiplier corresponding to the capacity constraint of Eq.(3).^{1}^{1}1An integer solution could be obtained by rounding [7, 2]. Alternatively, one could interpret a noninteger value as follows: If , of content is allocated to caches, and one more cache stores only of the content. If a user encounters the latter, she retrieves the remaining from the infrastructure.
Replacing in the objective of the baseline problem (Eq.(4)) gives us the optimal cache hit ratio, if we ignored related content. At the same time, replacing in the objective of Eq.(5) gives us the cache hit ratio when we can satisfy a request with related content, but the caching decisions were already taken and are the original ones. (We will show later that we could do even better by considering the related content graph when solving the cache placement problem.) The following theorem provides the expected improvement in terms of load on the expensive link, for a simple scenario where .
Theorem 3.4
Assume that . The expected improvement in the cache hit ratio by recommending alternative contents, when the optimal cache placement algorithm is oblivious to these recommendations, is equal to
(8) 
The cache miss ratio (or “load” on the main infrastructure) in the baseline problem is . Replacing Eq.(7) into Eq.(4) gives
(9) 
Similarly, let’s assume that an original request could be satisfied with a related content as in Lemma 3.2. The cache miss ratio, denoted as , can be calculated as:
Hence, the gain from soft cache hits (case 1) is equal to , which gives the desired Eq.(4).
The case where some contents receive no or maximum () copies, as in Eq.(7), can be easily derived by modifying the summation in the above proofs. As a very simple example, consider the case of uniform content popularity, i.e. . After some simple calculations, we get that the performance benefits by related content are equal to . However, we know that , since it is the cache miss rate of the base policy (see Eq.(9)). Therefore, the above gain , and is increasing in , the number of related contents per content , as one would expect. A similar result can be easily derived for Case 2, as well as when the number of nonzero elements on each row of is different (i.e. not all equal to ).
3.3 Content Graph Aware Optimal Caching
We have so far assumed that the caching policy is unaffected by the ability to recommend alternative contents. While this already leads to performance gains, as shown earlier, it is still suboptimal. For example, assume a user requesting content would be OK to receive instead content (i.e. ) and a user requesting content would be OK to receive content instead (i.e. ). If both contents and are popular, a standard caching policy would give a high number of replicas to both, according to Eq.(7). However, this is clearly suboptimal here, since the caching algorithm could just store only one of the two at each cache, saving valuable capacity that could be used to store other contents. The following two theorems formalize this for the two content relation graph cases, discussed in Section 2. Due to space limitations, we only show the proof for the more generic Case 2.
Theorem 3.5 (Uaware Optimal Caching (Case 1))
Assume a content relation graph , where . The optimal content placement that directly exploits related contents is given by vector which is the solution to the following optimization problem
Furthermore, the above problem is a convex optimization problem.
Theorem 3.6 (Uaware Optimal Caching (Case 2))
It is easy to see that the feasibility region (Definition 1) is convex. The objective function needs to be concave (since this is formulated as a maximization problem). A sufficient condition is if its Hessian matrix H is negative semidefinite, i.e., .
Taking the derivatives of the objective function , we calculate the terms of the Hessian matrix
and for
where is if ; otherwise is .
Then, the product is given by the expression
which is always .
4 Performance Evaluation
4.1 Simulations Setup
Mobility Trace. We use the TVCM mobility model to generate a trace, where nodes move in a square area comprising three subareas of interest (communities). Each node moves inside its community for 60% of the time, and leaves it for a few short periods. The area is entirely covered by macrocell BSs, and also includes 25 non overlapping smallcell base stations (SCs), with a communication range of 100m.
Content Popularity. We create contents and assign to each of them a popularity value drawn from a Zipf distribution, with shape parameter . Powerlaw distributions have been shown to capture well real popularity patterns [18, 19, 20].
Utility Matrix. To investigate the effect of the matrix U, we generate different matrices belonging to two generic classes:
(a) random U: for each content pair , the utility is with probability (otherwise it is 0), such that each content has on average related contents, i.e., .
(b) popularity proportional U: for each content pair , the utility is with probability (otherwise it is 0), where is the popularity of content , and is a normalization parameter that determines .
YouTube datasets. In addition to the synthetic popularity/utility patterns, we use real datasets from YouTube that contain information about video popularity and related video lists [21]. Table 1 contains information about the datasets we use, and some main statistics. We preprocess the data to remove entries with or no popularity value. For each video appearing in the related videos list of a video , we set and . Due to the sparseness of the datasets, we consider only the videos belonging to the largest connected component of the graph with vertices and edges .
Data (date / depth of search [21])  

Instance 1  27 July 2008 / 3  2098  5.3 
Instance 2  27 March 2008 / 1  1086  7.9 
4.2 Effects of Utility Matrix
We first study the effects of the (a) density, , and (b) type (random U / popularity proportional U) of the utility matrix. Specifically, in Fig. 1 we present the softcache hit ratio for the SCH1 and SCH2 (with ) cases, under the base optimal policy , as well as the hit ratio of the scenarios without soft caches (nosoft caches). As expected, the cache hit rate improves as the density of the matrix U (xaxis) increases. Under random U matrices the increase in the cache hit rate is almost linear (Fig 1(a)) on , but quickly plateaus for the popularity proportional U matrices (Fig. 1(b)). This is reasonable as the achieved cache hit ratios for the popularity proportional U case already reach values , for few related contents. The reason is that popular contents that have higher probability to appear in the related list of other contents, are also stored in more caches (under the base optimal policy).
These initial observations show that the performance can be improved by recommending more contents (density) and/or by selecting carefully which contents to recommend (type of matrix U). This is a positive message, since there are more than one degrees of freedom for a system design, allowing thus improvements under various settings (e.g., restriction on the max number of recommended contents, predefined content relations), and enabling crosslayer (application/network) design and optimization approaches.
4.3 Gains of Optimal Caching Policies
In Fig. 2 we compare the performance gains of the base optimal policy and the Uaware optimal policy the SCH1 case (Theorem 3.5). Under random U matrices (Fig. 2(a)), the achieved cache hit rate by is always higher than in the policy, with an increase of (for min) and (for min). Here, we need to stress that the extra performance gain from the Uaware optimal caching policy comes without any cost for the system: the recommendation system (matrix U) and the caching capacity ( and ) remain the same, and only the caching policy changes (i.e., in practice, this corresponds to a simple modification in the content placement algorithm).
In the popularity proportional U case (Fig. 2(b)), the performance improvement of the Uaware optimal policy over the base optimal policy is moderate ( and , for min and min, respectively). This indicates that when a recommendation system is carefully designed for a mobile environment (i.e., in our example, resulting to a popularity proportional matrix U), the Uaware caching policy does not add significant gains. As a result, only the content popularities is needed for the caching placement algorithm. Hence, the network provider does not need to cooperate further with a content provider (which designs also the recommendation system), e.g., YouTube or Netflix, and this facilitates the deployment of a softcache system in practice.
4.4 Gains of the YouTube’s Recommendation System
We conduct simulations on the TVCM mobility trace using the popularity/utility patterns of the YouTube datasets (see Section 4.1). In Table 2 we present the relative gain in the softcache hit ratio, i.e., vs. hit ratio under nosoft caches scenario. The improvement in performance by using softcaches can be up to , and on average is higher in Instance 1 where the content catalogue is larger (cf. Table 1). The gains are similar in other simulated scenarios we tested; with parameters and .
Placing contents with the Uaware optimal policy gives similar gains as in the case in the simulated scenarios. In light of the synthetic results, this perhaps suggests that the content relation graph for these YouTube instances more closely resemble the popularity proportional case, rather than the random.
Instance 1  Instance 2  

11 
5 Discussion
Our initial results suggest that soft cache hits could be a promising way to make edge caching scale, opening up new interesting operatoruser performance tradeoffs. Some limitations and potential extensions of the proposed model are discussed here.
Userdependent recommendations: Throughout this work, we have been assuming that the related contents for a requested content item , and their related utilities depend only on item , and not on the user that requested it. In a sense, this relates to itemitem collaborative filtering, where a new/alternative item is recommended based on its similarity with the requested one. Itemitem recommendations have been claimed to offer some advantages compared to useruser collaborative filtering [22]. Nevertheless, one user might be less happy than another, with the same alternative content. On the modeling side, one could take this into account by making a random variable and using its expected value in the objective functions of Section 3. Finally, on the recommendation side, a recommendation system could actually combine both types of collaborative filtering to make better recommendation. This would lead to different graphs per user (or user clusters), whose integration and impact on our framework is part of future work.
Generalization of graph: For simplicity, in our analysis we assumed that related contents bring the same amount of utility ( in case 1, and in case 2). In general, different related contents might bring different amounts of utility. We could generalize our model by assuming a Case 3 where . As in Case 2, if a user requesting content , accesses (before ) any content , she will be satisfied (less than ). She will keep on requesting till time , but will not accept any other related content^{2}^{2}2An alternative approach would be to keep requesting every cache encountered for potentially better related content. However, we believe this might put a high burden on the battery of the UE and the UESC traffic.. Contrary to Case 2, however, the value of the utility (to be contributed at the objective function) is not known a priori, since we cannot know a priori which content will be accessed. One can still derive a closed form objective function with appropriate conditioning on all possible , but we defer elaborating on this scenario for future work.
Generic mobility: Although it would be quite hard to relax the independent mobility assumption (using traces in simulations, where most such assumptions break, tends to be the de facto way of testing this) the identical contact rate assumption could be relaxed. E.g., in the context of exponential meetings, it has been shown that heterogeneous rates could be approximated with their mean, either asymptotically or as a bound [23].
Soft Cache Hits for Femtocaching: The proposed approach of soft cache hits and alternative content recommendations could apply equally well to more traditional caching frameworks that do not allow any delay, as is the popular femtocaching framework [2]. The relation between users and small cells that each user can access is captured by a bipartite graph, and the control variables define whether a content is stored in a cache . In the case of as in Case 1, if some user requests content , and the small cells in her range are , the hit probability is given by
(10) 
instead of , in the original femtocaching case (see [2] for more details).
6 Conclusions
In this paper, we have proposed the idea of soft cache hits for mobile edge caching systems with delay tolerance, where a user request can sometimes be (partially) satisfied, even if the original content is not available locally, by recommending some related contents. We have formulated and analyzed the performance of such a joint system, and derived the optimal related content aware cache placement. Our theoretical analysis and initial evaluation suggest that significant performance gains can be achieved, even with simple modifications to the baseline system. Furthermore, our results suggest that the structure of the content relation graph plays an important role on the actual achievable performance.
References
 [1] N. Sapountzis, T. Spyropoulos, N. Nikaein, and U. Salim, “Optimal downlink and uplink user association in backhaullimited hetnetsn,” in Proc. IEEE Infocom, 2016.
 [2] N. Golrezaei, K. Shanmugam, A. G. Dimakis, A. F. Molisch, and G. Caire, “Femtocaching: Wireless video content delivery through distributed caching helpers,” in Proc. IEEE INFOCOM, 2012.
 [3] N. Golrezaei, A. G. Dimakis, and A. F. Molisch, “Scaling Behavior for DevicetoDevice Communications With Distributed Caching,” IEEE Trans. Information Theory, 2014.
 [4] B. Han, P. Hui, V. S. A. Kumar, M. V. Marathe, J. Shao, and A. Srinivasan, “Mobile data offloading through opportunistic communications and social participation,” IEEE Trans. on Mobile Computing, vol. 11, no. 5, 2012.
 [5] J. Whitbeck, M. Amorim, Y. Lopez, J. Leguay, and V. Conan, “Relieving the wireless infrastructure: When opportunistic networks meet guaranteed delays,” in Proc. IEEE WoWMoM, 2011.
 [6] P. Sermpezis, , and T. Spyropoulos, “"effects of content popularity in the performance of contentcentric opportunistic networking: An analytical approach and applications,” ACM/IEEE Trans. on Networking, 2016.
 [7] L. Vigneri, T. Spyropoulos, and C. Barakat, “Storage on Wheels: Offloading Popular Contents Through a Vehicular Cloud,” in Proc. IEEE WoWMoM, 2016.
 [8] F. Boccardi, R. Heath, A. Lozano, T. Marzetta, and P. Popovski, “Five Disruptive Technology Directions for 5G,” IEEE Comm. Mag. SI on 5G Prospects and Challenges, 2014.
 [9] S. Borst, V. Gupta, and A. Walid, “Distributed caching algorithms for content distribution networks,” in Proc. IEEE INFOCOM, pp. 1–9, 2010.
 [10] J. Erman, A. Gerber, M. Hajiaghayi, D. Pei, S. Sen, and O. Spatscheck, “To cache or not to cache: The 3g case,” IEEE Internet Computing.
 [11] M. Leconte, G. Paschos, L. Gkatzikis, M. Draief, S. Vassilaras, and S. Chouvardas, “Placing dynamic content in caches with small population,” in Proc. IEEE Infocom, 2016.
 [12] G. S. Paschos, E. Bastug, I. Land, G. Caire, and M. Debbah, “Wireless caching: Technical misconceptions and business barriers,” http://arxiv.org/abs/1602.00173.
 [13] M. A. MaddahAli and U. Niesen, “Fundamental limits of caching,” IEEE Transactions on Information Theory, vol. 60, pp. 2856–2867, May 2014.
 [14] R. Kokku, R. Mahindra, H. Zhang, and S. Rangarajan, “Cellslice: Cellular wireless resource slicing for active ran sharing,” in COMSNETS, pp. 1–6, 2014.
 [15] “Free mobile plans.” http://mobile.free.fr/.
 [16] W. C. Ao and K. Psounis, “Distributed caching and small cell cooperation for fast content delivery,” in Proc. ACM MobiHoc, 2015.
 [17] Y. Zhang, J. Zhao, and G. Cao, “Roadcast: A popularity aware content sharing scheme in vanets,” SIGMOBILE Mob. Comput. Commun. Rev., vol. 13, pp. 1–14, Mar. 2010.
 [18] P. Gill, M. Arlitt, Z. Li, and A. Mahanti, “Youtube traffic characterization: A view from the edge,” in Proc. ACM IMC, 2007.
 [19] J. Erman, A. Gerber, K. K. Ramadrishnan, S. Sen, and O. Spatscheck, “Over the top video: The gorilla in cellular networks,” in Proc. ACM IMC, 2011.
 [20] P. Sermpezis and T. Spyropoulos, “Inferring contentcentric traffic for opportunistic networking from geolocation social networks,” in Proc. IEEE WoWMoM (AOC workshop), 2015.
 [21] http://netsg.cs.sfu.ca/youtubedata/.
 [22] G. Bresler, D. Shah, and L. Voloch, “Collaborative filtering with low regret,” in Proc. ACM SIGMETRICS, 2016.
 [23] P. Sermpezis and T. Spyropoulos, “Delay analysis of epidemic schemes in sparse and dense heterogeneous contact environments,” Tech. Rep. http://www.eurecom.fr/~sermpezi/TechRep_HetEpid.pdf, Eurecom, 2012.