Peer Sampling Layer - Gossip-based Building Blocks

5.3 Gossip-based Building Blocks

5.4.1 Peer Sampling Layer

We start by evaluating the two fundamental guarantees of the peer sampling layer: the construction of a random overlay and membership management. The former is measured in terms of clustering degree and in-degree distribution, and the resulting balance in bandwidth utilization. The latter is measured as the time required to remove stale entries from the peers’ view after a failure. Our experiments reproduce some of the results obtained by simulation in [30, 31, 117, 139] but using a real implementation.

We instantiate four protocols in the framework of [30] (see Section 5.3.1). The view size is always c = 20. Blind, with H = 0 and S = 0 essentially performs random selections for view selections at each exchange. Swapper corresponds to Cyclon [31] and uses H = 5 and S = 0. It favors randomness quality and in particular seeks at reducing clustering

0 0.1 0.2 0.3 0.4 0.5

random graphBlind H=0,S=0Swapper H=0,S=5Hybrid H=2,S=3Healer H=5,S=0

clustering ratio clustering distribution 0 10 20 30 40 50 60

random graphBlind H=0,S=0Swapper H=0,S=5Hybrid H=2,S=3Healer H=5,S=0

in-degree in-degree distribution Percentiles: Max 75th 50th 25th Min

Figure 5.4: PSS: clustering and in-degree for the four strategies and reference random graph.

0 50 100 150 200

Blind H=0,S=0Swapper H=0,S=5Hybrid H=2,S=3Healer H=5,S=0

bandwidth (Bytes/s)

upload bandwidth distribution

0 50 100 150 200

Blind H=0,S=0Swapper H=0,S=5Hybrid H=2,S=3Healer H=5,S=0

bandwidth (Bytes/s)

download bandwidth distribution

Percentiles: Max 75th 50th 25th Min

Figure 5.5: PSS: upload and download bandwidth usage distributions.

and load imbalance (directly linked to the in-degree distribution). Healer corresponds to Newscast [117], with H = 0 and S = 5. It favors membership management to randomness. Finally, Hybrid mitigates the two objectives by setting H = 2 and S = 3.

Figure 5.4 presents the quality of randomness of a snapshot of the graph obtained after 5 minutes of execution. The results are presented for each variant as well as for a reference random graph generated offline with the same number of vertices. We present the clustering distribution (left) and in-degree distribution (right). We use a representation based on stacked percentiles throughout this section. The white bar at the bottom represents the minimum value, the pale grey on top the maximal value. Intermediate shades of grey represent the 25th, 50th–the median–, and 75th percentiles. For instance, the median clustering ratio for the Healer is 0.2. This means that 50% of the peers have up 20% or less of all of possible pairs of their neighbors that are neighbors themselves.

We observe that, as expected, Blind performs badly for both aspects: the clustering is high and the in-degree is skewed, with some peers being in more than 50 views (while the distribution should be as narrow as possible around the average in-degree of 20). The Swapper gives the best results in terms of clustering and in-degree, for which it gives similar, respectively, better results than the random graph itself. While the Healer performs correctly in terms of in-degree, it yields high clustering, which will result in slower convergence at the

0 20 40 60 80 100 120

Blind H=0,S=0Swapper H=0,S=5Hybrid H=2,S=3Healer H=5,S=0

time (seconds)

time between failure and last stale entry removed

Percentiles: Max 75th 50th 25th Min

Figure 5.6: PSS: time to remove descriptors of failed peers from the system after half of the network fails.

Topology Construction layer and have a higher chance of leading to partitions. We could not reproduce the skewed in-degree distribution for the Healer presented in [30]. This is due to the smaller size of our deployments compared to what can be obtained in synchronous simulations. The results indicate that the Swapper is the best choice in terms of randomness guarantees, while the Hybrid seems to be a good compromise with approximate characteristics to that of the random graph.

Next, we evaluate the overhead imposed by the peer sampling layer by observing the consumed upload and download bandwidth. Because the peer sampling service is a low level service it should be as frugal as possible in terms of bandwidth consumption. The active task is invoked every 10 seconds. The distribution of bandwidth usage, both for upload and download, are presented in Figure 5.5. All variants exhibit highly balanced distributions, with the exception of Blind. This is clearly a consequence of the high imbalance in in-degree distribution, resulting in some peers being contacted much more or much less than the average.

Finally, we study the quality of the membership management by evaluating the time required for discarding failed peers from the views of all other peers. Until then, the upper layer may try to contact the failed peers for some time, resulting in wasted communication. Note however that the monitoring of peers at the upper layer is performed aggressively through pre-established TCP links, and membership management at the peer sampling layer only has an influence on the efficiency of peer discovery, and on partition resilience (as many stale descriptors in peers’ views may lead to a poorly connected graph overall). We use a worst-case scenario where half of the peers (100 out of 200) are removed from the network after a stabilization period of 5 minutes. We observe the views of all peers, reported after each exchange, and in particular the last time each of the failed peers appears in a view. Results in Figure 5.6 indicate that, as expected, Blind performs the worst in that respect. Indeed, this strategy does not use the age field in the descriptors and thus stale descriptors are discarded only due to random selections after a large number of exchanges. Healer performs the best (in terms of median time), and in particular better than Swapper. This is on par with the conclusions in [30]. This illustrates the compromise made when setting the H and S parameters, which will have an influence on the service given to the upper layers. Again here, Hybrid performs well; as mentioned in [30], even a small value of H allows discarding old

% perfect entries in T-Man view

T-Man active task exchanges Yao-Yao on physical coordinates

Max 75th 0 20 40 60 80 100 0 2 4 6 8 10 12 14 16 18

T-Man active task exchanges vivaldi (70%) + random (30%) Percentiles: 50th 25th Min 0 20 40 60 80 100 0 2 4 6 8 10 12 14 16 18

Figure 5.7: TC: Convergence time.

descriptors quickly enough to grant rapid reaction to membership change. As a result, we select the Hybrid strategy for the remaining of our evaluation.

In document Topology-aware protocols, tools and applications for large-scale distributed systems (Page 101-104)