• No results found

Internet Dial-Up Traffic Modelling

N/A
N/A
Protected

Academic year: 2021

Share "Internet Dial-Up Traffic Modelling"

Copied!
12
0
0

Loading.... (view fulltext now)

Full text

(1)

Internet Dial-Up Traffic Modelling

Villy B. Iversen

Arne J. Glenstrup

Jens Rasmussen

Abstract

This paper deals with analysis and modelling of real Internet dial-up traffic based on exten-sive data, which are described and analysed. When recording the traffic process the time axis is digitalised and analysis and modelling must take this into account. In particular, this is very important for arrival processes. It is also important to distinguish between fast vari-ations within short time intervals and slow varivari-ations during the day. The detailed analysis shows that the arrival process is a Poisson process with slow variations.

Keywords: scanning principle, time-dependent Poisson process, mean squared difference

NTS–15, Fifteenth Nordic Teletraffic Seminar Lund, Sweden, August 22–24, 2000

1

Introduction

Characteristics of dial-up Internet calls are of great interest when dimensioning the local access network [2], [7]. Internet calls have characteristics which are different from ordinary voice calls.

The field data used for the study are Internet dial-up calls from Tele Danmark. During one week, Monday 18 – Sunday 24, January 1999, about 3.8 million calls have been recorded. Every call is recorded with arrival time, holding time, and other useful information such as identity of modem-pool, ISDN, subscription type, etc. Only about 0.5 % of the call attempts are rejected for various reasons. About 79 % of all calls are PSTN, and the remaining 21 % are ISDN calls. ISDN calls are always counted as one call independent of the number of channels occupied. The accuracy of the time recordings is one second.

Several papers deal with the same subject. The daily traffic profile has the maximum in the evening. In Fig. 1 we show a typical profile for the number of calls. The mean value of the holding times is 5–10 times bigger the the mean value of voice holding times, and a typical pattern is shown in Fig. 2. Similar patterns are found by other authors.

The distribution of the holding time has a coefficient of variation which is bigger than for voice calls, and the modelling has been studied using hyper-exponential, log-normal and Pareto distributions.

[email protected], Dept. of Telecommunication, Technical University of Denmark, DK–2800 Denmark.

[email protected], Dept. of Telecommunication, Technical University of Denmark, DK–2800 Denmark. ‡

(2)

0 2000 4000 6000 8000 10000 12000 14000 0 2 4 6 8 10 12 14 16 18 20 22 24 arrivals hour of day

Figure 1: Number of calls per 15 minutes, Tuesday 19.01.1999.

0 200 400 600 800 1000 1200 0 2 4 6 8 10 12 14 16 18 20 22 24

service time (sec)

hour of day

Figure 2: Mean holding time n seconds for calls arriving during 15 minutes, Tuesday, 19.01.1999.

(3)

In this paper we shall focus upon a study of the arrival process. In some papers it is concluded that the arrival process of Internet calls is more bursty than the Poisson process, whereas [7] concludes that within intervals of up to 15 minutes it can be considered as a stationary Poisson process. In larger time intervals we have intensity variations, but still a Poisson process.

This corresponds to voice traffic where we have slow variations due to the daily call profile, but where the fast variations are of random (Poissonian) nature. The concept of slow and fast variations was introduced by Palm [8] and studied in detail for voice traffic in [5].

There are two basic ways to study an arrival process. We may keep a time interval fixed

and study the number of calls within this interval (number representation), or we may keep a

number of calls fixed, and study the time interval to get this number of calls (interval

represen-tation). We shall study both approaches in this paper.

2

Interval representation

The accuracy of the recordings isone second. That is, the continuous real time is discretized to

an integer. This causes additional problems for theinterval representation, whereas thenumber

representationfits with this approach.

When observing a continuous time process at regular time instants the continuous time intervals are transformed into a discrete time distribution. The important case of the Pois-son process is transformed into a discrete time process with correlation between inter-arrival times of duration zero. Thus a model in discrete time based on geometric inter-arrival times

(Bernoulli arrival process) isnotequivalent to a Poisson process in continuous time.

The transformation of traffic processes from continuous time to discrete time has been dealt with in the theory of traffic measurements where we observe the traffic process at regular time intervals. In the Holbæk measurements [5], which were the first computerized measure-ment, some fundamental aspects were investigated. The discrete time increases the observed burstiness of the traffic process, so it looks heavy-tailed, even if it is Poissonian.

2.1

The scanning method

We shall only consider regular (constant) scanning intervals. The scanning method is e.g. applied to traffic measurements, call charging, numerical simulations, and processor control. By the scanning method we observe a discrete time distribution for the holding time which in real time usually is continuous.

In practice, we usually choose a constant distance h between scanning instants, and we are able to derive a relation between the observed time interval and the real time interval (Fig. 3).

We notice that there are overlaps between the continuous time intervals, so that the discrete distribution cannot be obtained by a simple integration of the continuous time interval over a

fixed interval of length h. If the real holding times have a distribution function F(t), then it can

be shown that we will observe the following discrete distribution [5]:

p(0) = 1 h Z h 0 F(t)dt (1) p(k) = 1 h Z h 0 {F(t + kh) − F(t + (k−1)h)}dt, k= 1,2,... (2)

(4)

0 0 1 1 2 2 3 3 4 4 5 5 6

Observed number of scans

Interval for the real time (scan)

Figure 3: By the scanning method a continuous time interval is transformed into a discrete time interval. The transformation is not unique.

Interpretation: The arrival time of the call is assumed to be independent of the scanning pro-cess. Therefore, the density function of the time interval from the call arrival instant to the first

scanning time is uniformly distributed and equal to (1/h). The probability of observing zero

scanning instants during the call holding time is denoted by p(0) and is equal to the probability

that the call terminates before the next scanning time. For a fixed value of the holding time t this probability is equal to (F(t)/h), and to obtain the total probability we integrate over all

possible values t(0 ≤ t < h) and get (1). In a similar way we derive formula (2) for p(k).

For exponential distributed holding time intervals, F(t) = 1 − e−µt, we will observe a

discrete distribution,Westerberg’s distribution[5]:

p(0) = 1 − 1 µ h  1− e−µh  (3) p(k) = 1 µ h  1− e−µh 2 · e−(k−1)µh, k = 1,2,... (4)

The i’ th derivative of the probability generating function

Z

(z) (Z–transformed) for the value

z= 1 becomes:

Z

(i)(1) = i!

i h· 1

(eµ h− 1)i−1 (5)

from which we find the mean value m1and form factor ε(second moment divided by the

squared mean value) [6]:

m1 =

Z

(1)(1) = 1 µ h, (6) σ2 =

Z

(2)(1) +

Z

(1)(1) −

Z

(1)(1)2, (7) ε =  σ m1 2 = µh ·eµ h+ 1 eµ h− 1 ≥ 2. (8)

(5)

Thus for any distribution function F(t) we willalways observe the correct mean value. When using Karlsson charging we will therefore always in the long run charge the “correct amount”.

For a continuous measurement the form factor is 2. The contributionεT−2 is thus due

to the influence from the measuring principle. In [1] the scanning method is generalized to phase-type distributions for both the scanning interval and the inter-arrival time.

The form factor is a measure of accuracy of the measurements. Fig. 4 shows how the form factor of the observed holding time for exponentially distributed holding times depends on the length of the scanning interval (8). By continuous measurements we get an ordinary sample. By the scanning method we get a sample of a sample so that there is uncertainty both because of the measuring method and because of the limited sample size.

0 1 2 3 0 1 2 3 4 5 Formfactor" Scan interval [s ,1] ... .... . ... .... . . .. . ... ... ... ... .... . . .. . ... . . .. . ... .... . . .. .... . . .. . ... .... . ... .... . . . . . ... ... . . .. . .... . . . . . .. .... . . .. . ... .... . ... .... . . .. . ... .... . ... .... . . . . . .. ... . .. . ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . .. . .... . . .. ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . .. . .. ... . ... ... .... . . .. .... . . .. . ... .... . . .. .... . . .. . ... .... . ... .... . . .. . ... ... . . ... ... .... . . .. .... . . .. . ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... . . . . . .. . .... . . . . ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . . . ... ... . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . .. .... . . .. . .. . .... . ... ... .... . . ... ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... .... . ... ... ... . ... . . . . . .. . ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... . . . . . ... .... . . . . ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . .. .... . . .. . .. . .... . .. . ... .... . . .. . ... . . .. . ... .... . . .. .... . . .. . ... .... . ... ... ... . .. . ... . . ... ... .... . . .. .... . . .. . ... .... . . .. .... . . .. . ... .... . . .. . .. .... . . ... ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . ... ... ... .... . . . . . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . .. .... . . . . . ... .... . .. . ... .... . . .. . ... . . .. . ... .... . . .. .... . . .. . ... .... . ... .... . . .. . .. . .... . .. . ... .... . . .. .... . . .. . ... .... . ... .... . . .. . ... .... . ... .... . . .. . ... ... ... ... .... . . .. . ... .... . ... .... . . .. . ... . . .. . ... .... . . .. .... . . . . . ... ... .... . . . . .. . ... .... . . ... ... .... . . ... .... . .. . .... .... . ... ... . .. . ... ... . ... ... . . .. . . ... .... . .. . . ... .... . ... .... . . .. . ... ... . .. . ... . . ... . ... . . .. . .. . .... . . .. . ... . . . . . . .. . ... . . . .. . .. . ... . . . .. . ... . . .. . ... .... . . .. ... . . . . ... .... . . . .. ... .... . . . .. .... . . . . . . .. .... . . .. .... .... . ... ... . .. . ... .... . . ... ... .... . . ... .... . .. . ... . . . . ... .... ... ... ... . ... .... . .. . . ... .... . ... . ... . . .. . ... ... . .. . ... ... . ... ... . . ... . ... . . .. . .. . .... . . .. . ... . . . . . . .. . ... . . . .. . .. .... . . . .. . ... . . . . . .... . . . . . .. ... . . .. ... .... . . . .. .... . . . . . . .. .... . . .. .... .... . . .. ... . . . . ... ... . ... ... .... . . ... .... . .. . . ... .... . ... .... .... . ... ... . ... ... ... . ... .... . .. . . ... . . .. . ... . ... . . .. . ... ... . .. . ... ... . ... ... . . ... . ... . . .. . .... . . . . . .. .... . . . .. . .. .... . . . .. . ... . . . . . . .. . ... . . .. . .... . . . . . .. ... . . .. ... .... . . . .. .... . . . . . . . .... ... ... . . ... ... ... . ... ... . ... ... . . . .... ... . ... ... ... . ... ... ... . . ... . ... . .... ... ... . ... .... ... ... ... . .... . ... ... ... ... ... . . ... . ... . ... . .... . . ... . .... . . ... . .... . .... . .... . .... . .... . ... . . .... . ... . . .... . ... . .... .... ... .... ... . .... .... . ... . ... . . .... . . ... . .... . . . .. ... ... .... . ... . .... . ... . . . .. ... ... .... . .... . . ... ... . . . .. . .... . ... . . .... . . .. ... . .... . ... . ... . . .. ... .... . ... . ... ... ... . ... . . .. . ... .... . .... . . ... . . .. . . ... . .... . .... .... . .... . . ... . . .. ... .... . ... ... ... . .... . . .. .... . ... . ... . . .. . .... . . .. ... . .... . ... . . . .. .... . ... . ... .... . .... . . .. .... . . ... .... . ... . . .... . . .. ... . ... . . .. ... . ... ... .... . ... . . . .. .... . ... ... . ... . . .. . .... . ... ... ... .... . . . . . ... . . ... . . . .. .... . .... .... . ... . . . .. .... . ... ... .... . . .. . .... . ... . . ... . ... . ... ... . ... . . .. ... . ... .... . . ... . . .. ... . .. . .... . .... . . . . ... . . . ... ... . ... .. . . ... ... ... . ... . ... ... ... . ... . ... . ... . ... . . . ... ... . ... . ... . ... . . ... ... ... . ... . . . .... . . ... ... ... ... ... . ... . .... ... . ... . . .... ... . ... . . . .... ... . .... ... ... . ... . . ... . ... ... . . .... . .... ... ... . . .... ... . ... . .... . . .... . .... . ... . . ... . . .... . .... . .... . .... ... ... . . . ... . ... . . . ... . .... . ... .... . .... . .... . ... .... . .... . .... . .... ... . ... . ... . .... . .... . ... . ... ... . . .. ... .... ... . .... .... . . .... . . ... . .... ... ... .... . .... . . ... . . ... . . .... . . .. .... . . .... . .... . .... . . .. . .... . ... .... . .... ... .... . . .... . ... . . . ... . . .. .... . . .... . ... ... .... . . ... . . ... .... . ... . ... . . .. ... .... . .... .... . ... . . . ... . . .. .... . . ... . . .. . ... . ... . ... ... . .... . .... . . .. .... . .... .... . ... . . . ... . . .. ... . ... .... . .... . . .. .... . . ... .... .... . . ... ... ... . .. .... . ... . ... .... . . .. ... ... . ... .... ... . . .. .... . . ... . . .. . ... . . .. . .... . .... . . . . ... ... ... . . .. ... . ... .... . . ... . . .. .... . . ... . . . . . ... ... ... ... . ... . . ... . . ... ... ... ... ... . ... . ... . ... . ... ... ... . . ... ... ... . ... . ... ... ... . .... ... ... . ... ... ... ... . ... . ... . . .... . ... ... ... ... . ... . . ... . ... . .... ... . .... ... ... ... . .... . ... ... ... . . ... . .... . .... . ... . .... .... . . .... ... . .... ... ... . . .. ... ... ... ... .... . .... . .... ... . .... . .... ... ... . ... .... . ... . ... . ... . . .. ... ... . .... . .... . . ... . . .. ... .... . ... . ... .... . . .... . ... . . ... .... . .... . . ... . . .. . ... . . .... . ... ... . ... . . ... . . . .. .... . .... .... . ... . . . .. .... . .... . . ... . . .. . .... . . .. .... . .... .... .... . . ... .... . ... . . . .. .... . .... .... . ... . . . .. .... . ... ... . ... . .. . . .... . .. .... . . ... .... . .... . . .. .... . .... ... . . .. ... . ... .... . . ... . . .. ... . . . .. . ... . . .. . .... . ... .... . . . .. .... . . .. . . ... . . ... ... . ... .... . .... .... . ... ... ... .... . . ... ... . ... . . . .. . ... .... . .... .... . ... ... . ... .... . .... .... . ... ... . . .. .... . . . .. ... . . k=1 2 5 1

Figure 4: Form factor for exponentially distributed holding times which are observed by

Erlang-kdistributed scanning intervals in an unlimited measuring period. The case k=∞

corresponds to regular (constant) scan intervals which transform the exponential distribution into Westerberg’s distribution. The casek= 1corresponds to exponentially distributed scan intervals. The caseh= 0corresponds to a continuous measurement. We notice that by regular scan intervals we loose almost no information if the scan interval is smaller than the mean holding time (chosen as time unit).

In Fig. 5 and Fig. 6 we compare for ISDN calls the observed formfactor with the formfactor calculated from Westerberg’s distribution using the total number of calls during 15 minutes to calculate the mean arrival rate. We notice the two curves are very similar. The ratio between the two results is shown in Fig. 7. So even though the formfactor is greater than two, we may assume it is a Poisson process within 15 minutes.

(6)

0 0.5 1 1.5 2 2.5 3 3.5 0 2 4 6 8 10 12 14 16 18 20 22 24

Palm form factor

hour of day

Figure 5: Observed formfactor of inter-arrival times for ISDN-calls in 15-minutes intervals from observed data, Tuesday 19.01.1999. Cf. Fig. 6.

0 0.5 1 1.5 2 2.5 3 3.5 0 2 4 6 8 10 12 14 16 18 20 22 24

Westerberg form factor

hour of day

Figure 6: Formfactor of inter-arrival times for ISDN-calls calculated from Westerberg’s distri-bution (8) based on the total number of calls observed per 15 minutes, Tuesday 19.01.1999. Cf. Fig. 5.

(7)

0 0.2 0.4 0.6 0.8 1 1.2 0 2 4 6 8 10 12 14 16 18 20 22 24

Palm/Westerberg form factor ratio

hour of day

Figure 7: The ratio between the observed values (Fig. 5) and the theoretical values (Fig. 6).

will be identical, because all inter-arrival times will be zero except for 900 inter-arrivals equal to one second due to 900 scans during 15 minutes. An example is shown in Fig. 8. In this case we cannot make any conclusions about the inter-arrival time distribution for the busy periods. Investigation of intervals with fewer calls, e.g. for individual modem pools or during the night, show that Westerberg’s distribution is a very good model for inter-arrival times for pe-riods up to one hour. If we include the ISDN-calls as two simultaneous arrivals, the observed and calculated values become different for the cases where we don’t have too many calls per scan interval.

3

Number representation

In this case we study the number of calls during intervals of fixed duration. The measuring method is exact. If the arrival process is a stationary Poisson process, then the number of calls during a fixed time interval will be Poisson-distributed. We may observe the number of calls per time interval (e.g. one second) during 15 minutes and perform tests for the Poisson

distribution. In general, aχ2-test will accept a Poisson hypothesis. In the following we shall

look at a second order statistics to study intensity variations.

3.1

Generalized mean differences

Let us consider a discrete distribution function p(i) (i = 0,1,...) with mean value m1, second

(8)

0 2 4 6 8 10 12 14 16 0 2 4 6 8 10 12 14 16 18 20 22 24

Palm form factor

hour of day

Figure 8: Observed formfactor of inter-arrival times for all calls in 15-minutes intervals from measurements. The values calculated from Westerberg’s distribution will be identical for the busy periods. Tuesday 19.01.1999.

The generalized mean difference of order r is defined by: δr=

i

j | i − j |r p(i) p( j). For r= 2 we have δ2 =

i

j (i − j)2p(i)p( j) =

i

j 

i2p(i) p( j) + j2p( j) p(i) − 2i j p(i) p( j)

=

i i2p(i) +

j j2p( j) − 2

i i p(i) !

j j p( j) ! = 2 m2− m21  = 2σ2.

For the discrete distributions applied in the BPP (Binomial–Poisson–Pascal) traffic model we get the normalized generalized mean difference (NGMD) (In the Binomial and Pascal distri-butions p denotes the probability of success):

(9)

Binomial: δ2 m1 = 2(1 − p). Poisson: δ2 m1 = 2. Pascal: δ2 m1 = 2(1 + p).

In the Binomial and Pascal distributions p denotes the probability of success.

A double stochastic Poisson process (a Cox process) is a Poisson process where the intensityλ

is a stochastic variable. If for exampleλisΓ-distributed, then the interarrival times are

Pareto-distributed [6]. Letλhave the mean value be mλand the varianceσ2λ, then we get for the the

number of calls in a fixed time interval:

Palm: δ2

m1 = 2 + 2 ·

σ2 λ mλ.

The above statistics may be used to distinguish between fast and slow variations in a point

process [5]. If we let p(i) and p( j) denote the number of calls in two consecutive intervals of

fixed length, then the slow variations have no influence upon the the result, because the statistic is a linear function of the intensity. On the other hand, fast variations within the duration of the two intervals will increase the value of the statistics.

In Fig. 9 we show the NGMD for periods of 15 minutes based on consecutive intervals of duration 1 second each. The average is about 2.2 indicating that the arrival process is a little more bursty than the Poisson process. In Fig. 10 we show the NGMD for the same period when the consecutive intervals are of length 60 seconds each. The mean value increases to 2.53 indicating that there are small intensity variations during 2 minutes.

In Fig. 11 we show the NGMD for increasing values of the consecutive intervals from 1 to 1200 seconds. The curve indicates significant intensity variations during two consecutive periods of each 20 minutes.

If we consider individual model pools, the NGMD becomes close to 2 for intervals of length 5–10 minutes, which indicates a Poisson process. By Palm’s theorem [6] the total arrival process should be even more Poissonian. In the above examples the average number of call per second is about 7 for all 24 hours of the day. It seems as if the large number of calls per time interval tends to increase the value of NGMD above 2, even for a Poisson process. Further detailed investigations have to be carried out.

4

Conclusion

Above we have presented results for the analysis of the arrival process of dial-up Internet calls. Most results are based on all recorded calls. In [4] investigations have also been carried out for individual modem-pools.

All investigations indicate that the arrival process is a Poisson process with slow variations during the day. Only at particular times, e.g. at 19:30 in the evening when the tariff is reduced to 50 %, there are rather fast variations (Fig. 1).

(10)

0 0.5 1 1.5 2 2.5 3 0 2 4 6 8 10 12 14 16 18 20 22 24

generalised mean difference

hour of day

Figure 9: Normalized generalised mean differences using 1-second intervals for periods of 15 minutes. Average = 2.2079. Tuesday 19.01.1999.

0 1 2 3 4 5 6 7 0 2 4 6 8 10 12 14 16 18 20 22 24

generalised mean difference

hour of day

Figure 10: Normalized generalised mean differences using 60-seconds intervals for periods of 15 minutes. Average = 2.5280. Tuesday 19.01.1999.

(11)

0 10 20 30 40 50 60 0 200 400 600 800 1000 1200

generalised mean difference

interval length (sec)

Figure 11: The mean-squared-deviation for all calls as a function of the period. Tuesday 19.01.1999.

References

[1] Christensen, T. K. & Nielsen, B. F. & Iversen, V. B. (2000): Distribution of channel holding times in cellular communication systems. NTS–15, Fifteenth Nordic Teletraffic Seminar, Lund, August 2000. 9 pp.

[2] F¨arber, J. & Bodamer, S. & Charzinski, J. (1999): Statistical evaluation and modelling of Internet dial-up traffic. IND, Institute of Communication Networks and Computer Engi-neering, University of Stuttgart 1999. Technical Report. 10 pp.

[3] Fredericks, A. A. (1999: Impact of holding time distributions on parcel blocking in multi– class networks with application to Internet traffic on PSTN’s. ITC–16, Sixteenth Interna-tional Teletraffic Congress, Edinburgh, June 1999. Proceedings pp. 877–886. Elsevier Science B. V. 1999.

[4] Hussain, I. (1999): Characterization of dial-up based Internet user traffic. Master’s thesis. Department of Telecommunication, Technical University of Denmark, August 1999. 89 pp.

[5] Iversen, V. B. (1973): Analyses of real teletraffic processes based on computerized

mea-surements.Ericsson Technics, vol. 29 (1973): 1 3–64.

[6] Iversen, V.B. (1999): Introduction to Traffic Engineering. Textbook. Department of

(12)

[7] Naldi, M. (1999): Measurement-based modelling of Internet dial-up access connections.

Computer networks, Vol. 31 (1999) : 22, 2381–2390.

[8] Palm, Conny (1943): Intensit¨atsschwankungen im Fernsprechverkehr. Ericsson Technics No. 44, 1943. 189 pp. English translation by Chr. Jacobæus: Intensity Variations in Tele-phone Traffic, North-Holland Publ. Co. 1987.

References

Related documents

The expansion of aquaculture and the recent development of more intensive land-based marine farms require commercially-valuable halophytic plants for the treatment

To Cancel All Extensions being limited to 3 minutes for Outgoing calls Dial Password Beep * 93 # Beep Hang Up...

Relay to PSTN (or ISDN) Line: Enter phone numbers for regular phone calls (that do not use the Internet) that you always want to dial without the prefix number.. For example,

This is the total incoming trunk traffic for the trunk route (in CCS) between seizure and disconnect (including non-ACD calls, if any) divided by the total number of calls that came

Notes: 1 Over an unspecified period of time; 2 Referent drug for the secondary analysis; 3 excluded suicidality in the 6 months before study entry only;

Dial from a call log Press the Menu button and select Directories &gt; Missed Calls, Received Calls, Placed Calls, Speed Dial, or Directory Services.. To dial from a listing,

For a call centre manager or customer service director, flexibility is key?. They need to be able to scale numerous channels up and down to account for seasonal fluctuations and peak

Business Continuity Options Optus Inbound Service Management Optus IntelleManager Optus Insight Plus Optus On-demand Contact Centre Integration with IT and.. Web