Performance Evaluation - Enhancing mobile multimedia services

We evaluate our solution using real-world data of Vine videos and the user gestures recorded.

For comparison, we implement another two downloading schemes [135]. Sequential Down-loading (SeqD) downloads all the videos according to their order in the playlist, and disre-gards all the user actions, which takes the playlist as a single long video, and is the most aggressive downloading scheme with the least flexibility. Next-one Downloading (NextD) always attempts to download the next video that enters the viewport, which emulates the caching strategy mostly used in Vine-like services. If the current watching duration is not long enough to download the next video, its playback will be interrupted. Besides the two downloading scheme, we also implement a popularity-based raw Pre-fetching (rPF) method, which caches instant video clips solely based on their popularity.

We consider a set of metrics including playback discontinuity, monetary cost, energy consumption in our evaluation, which directly relate to the three goals of our optimization problem. We randomly introduce 20 video watching events from 9 a.m. to 9 p.m. in the week-long dataset, emulate the user behaviors by using the user gesture traces, and produce the average results. Each video watching event consumes 50 videos from popular sections and 150 videos from recent sections. We assume that, during the watch-time only cellular links are available with 1 MB/s bandwidth, and WiFi is available for pre-fetching once every hour. We adopted this setting to emulate a worst case scenario, since if WiFi is available during watch time, our solution can perform better. We assume each Wi-Fi session is long enough to complete a round of pre-fetching. The application local storage is set as 100 MB.

The monetary cost model is 10 dollar per 100 MB traffic, which is close to the major mobile

Figure 3.6: Popularity distribution

(a) Videos uploaded into popular sections (b) Videos uploaded into recent sections

Figure 3.7: Average video uploading rate

(a) Inter-arrival time of user actions under 3G sce-narios

(b) Inter-arrival time of user actions under WiFi scenarios

Figure 3.8: Density of inter-arrival time of user gestures

(a) Fling speed under 3G scenarios (b) Fling speed under WiFi scenarios

Figure 3.9: The initial scrolling speed of flings

operators’ data add-on prices⁷. The energy model is adopted from [9], which considers practical details such as the tail time in 3G/4G communications [43].

3.5.1 Data Traces of Vine Videos and User Gestures

In our simulations, we use the dataset collected and presented in Section 3.3. Fig. 3.6 plots the video popularity versus its rank for the two types of videos (from popular sections and recent sections, respectively), in which videos from recent sections exhibit a higher skewness.

The average numbers of hourly video uploads during a day for popular and recent sections are shown in Fig. 3.7.

7http://www.telus.com/en/ca/mobility/prepaid/add-ons/

0 1 2 3 4 5

(a) Impact of different p/q values on efficiency and playback discontinuity (r = 0)

(b) Impact of different p/r values on efficiency and playback discontinuity (q = 0)

Figure 3.10: Impacts of p, q and r

To obtain the real-world user gesture traces, we implement an Android app to record user touch events. We recruit 10 volunteers to watch Vine videos using the official client on Android under cellular connections and WiFi connections, respectively. Each experiment is conducted around 5 minutes. We plot two important characteristics of user behavior from the collected traces: Fig. 3.8 shows the probability density distribution of the user gestures’ inter-arrival time, and its curve fitting result; and Fig. 3.9 plots the histogram of the measured initial scrolling speed of the triggered flings. The user gesture traces are used to simulate video watching events by applying the dragging/flinging model.

3.5.2 Impacts of p/q and p/r Ratios

As we have three goals in our objective function, two of which are designed for efficiency, we vary the ratio of p/q with r = 0, and vary the ratio p/r with q = 0, respectively.

The results are shown in Fig. 3.10a and Fig. 3.10b, which demonstrate how the playback discontinuity, the cost efficiency and the energy efficiency change with different parameters.

As p, q and r only affect our optimization goal during the watch-time, we disable pre-fetching in this experiment, and only focus on studying the proper settings of p/q and p/r for watch-time downloading (referred to as WT). Since changing the ratios between p, q and r is essentially the trade-off between efficiency and performance, it is not surprising that the plots in Fig. 3.10a and Fig. 3.10b are very similar. Note that, as we adopt both a linear monetary cost model and a linear energy model, the normalized results for monetary cost and energy consumption is nearly the same when pre-fetching has been disabled. When p/q (p/r) is small (< 0.5), very limited number of videos have been downloaded and the video playback is severely affected, so as to reduce the monetary cost (energy consumption).

On the other hand, if p/q (p/r) increases to a certain degree (> 3), the video watching

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

(a) Relative monetary cost efficiency and playback discontinuity of WT+PF normalized by WT

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8

(b) Relative energy efficiency of WT+PF normal-ized by WT

Figure 3.11: Impact of α

experience is optimized but results in a much higher monetary cost (energy consumption).

Moreover, there is a small interval near 1.5 in the figure, in which the playback quality is acceptable with a relatively good cost efficiency (energy efficiency). We thus pick p/q = 1.5 and p/r = 1.5 as the default setting for the remaining evaluation.

3.5.3 Impact of Pre-fetching Aggressiveness α

We vary α from 0.1 to 0.9, which represents how aggressively our pre-fetching (referred to as PF) acts, and normalize the performance metrics of WT+PF by WT as the baseline. As shown in Fig. 3.11a, when α grows large, the playback discontinuity decreases gradually, while the monetary cost first decreases and then becomes stable. This is because when the pre-fetching becomes more aggressive, more videos to be watched are downloaded through WiFi, thereby saving the monetary cost; however, as one cannot accurately predict which video (and which portion of it) will be consumed and the application local storage is also limited, there are still some parts of videos to be downloaded. At the same time, as more and more videos are pre-fetched, some of the videos, which originally will not be download-ed during watch-time according to the performance-efficiency trade-off, are pre-fetchdownload-ed and make the playback discontinuity continue to decrease. On the other hand, Fig. 3.11b shows that the normalized energy consumption keeps growing (from 1.2x to 3.8x) until α reaches 0.5, where the application local storage is used up and limits the amount of pre-fetched videos even if α gets larger than 0.5 (and thus the energy consumption). It is worth not-ing that here the comparison is only between WT+PF and WT, not WT+PF and SeqD (WT+PF and NextD), where, as will be discussed in the next subsection, the energy saving can achieve as much as over 90%. Fig. 3.11 suggests us setting α = 0.2 as the default value,

0.5 1 1.5 2 2.5 3

(b)Monetary and energy costs normalized by the base-line results of SeqD

Figure 3.12: Impact of downloading bandwidth when p/q = 1.5, p/r = 1.5

0.5 1 1.5 2 2.5 3

(b)Monetary and energy costs normalized by the base-line results of SeqD

Figure 3.13: Impact of downloading bandwidth when p/q = 3.5, p/r = 3.5

where WT+PF consumes about 2x energy of WT to save around 45% monetary cost and improve the playback discontinuity by over 60%.

3.5.4 Performance Enhancement

We next show the overall performance gain of our solution (WT+PF) compared to Se-quential Downloading (SeqD), Next-one Downloading (NextD), and the two downloading schemes with raw pre-fetching (SeqD+rPF and NextD+rPF) under different operating con-ditions. To conduct a fair comparison, we ensure that PF and rPF consume the same amount of storage. Note that the scheduling results of our pre-fetching (PF) affects directly those of watch-time downloading (WT). Therefore, we consider our solution as a whole piece and compare it with other baseline approaches. To simulate different levels of connection quality, we vary the downloading bandwidth from 150 KB/s to 3 MB/s.

Fig. 3.12 shows the results with p/q = 1.5 and p/r = 1.5, which plots the playback discontinuity in Fig. 3.12a, and the normalized monetary cost and the normalized energy consumption in Fig. 3.12b. As we keep α fixed in this experiment, the monetary cost and the energy consumption exhibit similar patterns due to their linear models, and thus we see overlapping plots for NextD in Fig. 3.12b. As SeqD naively downloads videos and disregards their playbacks, which introduces the highest monetary cost/energy consumption, we use it as the baseline, and normalize the costs of the other two approaches. Our proposed approach (WT+PF) shows a very stable performance with different downloading bandwidths, in terms of both playback discontinuity and cost efficiency. On the contrary, the playback discontinuity of NextD and that of SeqD increase dramatically when the bandwidth is low (< 0.5 MB/s). The reason is that, as the bandwidth becomes lower, it is more and more difficult for these two downloading schemes to finish each downloading before the playback, while our proposed approach can still keep the playback discontinuity at a low level by smartly managing the downloading according to user actions and efficient pre-fetching. On the other hand, the monetary cost/energy consumption of NextD quickly increases after reaching the minimal at 0.5 MB/s. The reason for both NextD and SeqD introducing high monetary cost/energy consumption is that, when the bandwidth is low, both NextD and SeqD keep the downloading link busy almost all the time; whereas both of the downloading schemes attempt to download all the videos if the bandwidth becomes high enough. After introducing the popularity-based raw pre-fetching, both NextD+rPF and SeqD+rPF provide better viewing experience (lower playback discontinuity) than their counterparts. On the cost efficiency, NextD+rPF causes slightly higher costs than NextD, while SeqD+rPF can save around 10% costs than SeqD when the bandwidth is high. This is because unlike NextD, SeqD can always benefit from pre-fetching. Fig. 3.12b shows that our approach can save at least over 40% monetary cost and 30% energy consumption, and the cost/energy saving under high bandwidths can be higher than 90%.

The other four caching schemes may achieve the similar performance to ours in terms of playback discontinuity when the bandwidth is high enough. This is because the setting of p/q and p/r asks for a balance between cost efficiency and playback discontinuity. If a user is more aggressive on the video watching experience, s/he can further increase of the ratio of p/q and p/r, e.g., to p/q = 3.5 and p/r = 3.5. We plot the corresponding results in Fig. 3.13, which shows that our solution can always achieve the best playback discontinuity at diverse bandwidths, and still with huge amounts of (over 90%) cost/energy savings.

In document Enhancing mobile multimedia services (Page 38-44)