| Measurement-based characterisation of application and user behaviour |
|
|
|
|
This deliverable presents our work on the characterisation of the data collected for the two main applications envisioned for NANODATACENTERS: Video-on-Demand applications and multiplayer games. Video on DemandTwo aspects greatly impact the NANODATACENTERS architecture with respect to Video-on-Demand: (i) the distribution of content popularity, and (ii) its dynamics over time. The extent to which popularity distributions are skewed, or dominated by a few highly popular applications, directly influences the ability to cache content, and hence the memory requirements for efficient video delivery. Moreover, the dynamics of popularity affect the extent to which the system has enough time to adjust to changes in this popularity. Another aspect concerns the profiles of users in the case where users might be categorised according to very different profiles. Identifying and managing each user’s dedicated storage for NANODATACENTERS in accordance with their profile can improve the performance of the NANODATACENTERS approach. So far, three types of content have been identified for the applications envisioned in the NANODATACENTERS gateway: User Generated Content (UGC), Catch-up TV and traditional pay-per-view Video-on-Demand. Our findings, and the implications for NANODATACENTERS of each of the three above stated types of content,
User Generated Content
Our collected YouTube workload has enabled us to run trace-driven simulations and assess the memory requirements of a NANODATACENTERS node for supporting such workloads (these results are further detailed in Deliverable D1.2). As an example, 128GB of hard drive per NANODATACENTERS node suffices to serve a YouTube catalogue under standard assumptions about uplink bandwidth capacity and video encoding. Such dimensioning information is essential in order to assess the economical viability of the NANODATACENTERS model. Given the price of memory, it is certainly feasible to provision each gateway with a 128GB flash drive, and hence to enable NANODATACENTERS to support YouTube-like UGC applications without (almost) any assistance from the traditional datacenter infrastructure.
Catch-up TV
To evaluate the overall architecture, and particularly the specific adaptive content replication schemes which will be developed in order to manage each type of content, be it UGC, Catchup TV, or other, these synthetic workloads will be made available. The last type of video content is pay-per-view Video-on-Demand. The focus on understanding users’ tastes for content was key for offering targeted recommendations. In the case of NANODATACENTERS, minimising the cost of accessing the content is dvantageous in, deciding the content preloading strategies. A study [3] was also performed on the publicly available Netflix data trace. Besides characterising the popularity distribution (highly skewed), an investigation was made regarding to what extent users behave according to distinct profiles. Profiling techniques were developed which showed that the user population is far from homogeneous, but behaves according to distinct profiles. The benefits of using the proposed user profiling for informing content preloading strategies were also demonstrated empirically on the Netflix dataset. Specifically, a comparison was made of user profile-unaware content preloading strategies with strategies which systematically preload the most popular content among users with the same profile. These preliminary results show that such profile-informed strategies improve the overall platform efficiency. Additional evaluations with various workloads need to be performed before determining whether such a profiling module is necessary in the NANODATACENTERS architecture. Multiplayer gamesIn the case of multiplayer games, the mobility patterns of user avatars determine the system requirements of user-to-user communications, via the frequency of user encounters and departures. These mobility patterns also determine how frequently a user’s view of the virtual world needs to be updated, and thus specifies the constraints on the corresponding supporting
More details are available in the report. |



