• No results found

An overview on Internet Measurement Methodologies, Techniques and Tools

N/A
N/A
Protected

Academic year: 2021

Share "An overview on Internet Measurement Methodologies, Techniques and Tools"

Copied!
65
0
0

Loading.... (view fulltext now)

Full text

(1)

An overview on Internet

Measurement Methodologies,

Techniques and Tools

AA 2012/2013

[email protected]

(2)

(Agenda) Lezione 24/04/2013

[email protected] © Part 1 ¨  Intro ¤  basic concepts ¤  ISP

¤  Traffic exchange (peering)

¨  Measurement Topics

¤  Active vs passive

n  tools overview (we will see them in the next classes)

¤  One or many obs. points

¤  Net vs App

n  The IPPM framework

¤  Sampling techniques

Part 2

(3)

(Agenda) Lezione 2/05/2012

Part 3

¨  The most common metrics

¨  Availability

¨  Reliability

(4)

Who need to measure Internet

traffic and performance?

¨ 

Corporate administrators who buy network service

for their organization.

¨ 

Smaller service providers who buy network service

from a larger provider and resell it to their own

customers.

¨ 

Anyone buying outsourced Services who must judge

the service provider’s ability to serve their end user

community adequately.

(5)

Who need to measure Internet

traffic and performance?

¨ 

Course Projects:

¤ 

A projects on network measurement and analysis, e.g.

to collect and characterize the network traffic

generated from your internet connection

(6)

Basic Terminology

(7)

Networks and Internets def.

¨ 

A network is a collection of hosts connected together

so that they can exchange information.

¨ 

Hosts in a network communicate using a

mutually-agreed network protocol, i.e. they exchange

packets of information

¨ 

An internet is a collection of networks with links

between them

¨ 

Routers are devices with links to more than one

network

(8)

Adjacency, Connectivity, Internet

and Intranets

¨ 

The Internet is the internet of networks that use the

TCP/IP suite of protocols

¨ 

The Internet can be viewed as a graph in the sense

that it is

¤ a set of nodes (networks containing routers) and

¤ edges between them (links between routers)

(9)

Visualization from the Opte Project of the various routes through a portion of the Internet (http://opte.org/)

(10)

[email protected] ©

http://www.caida.org/research/topology/as_core_network/

An Autonomous System (AS) is a collection of connected Internet Protocol (IP) routing prefixes under the control of one or more network operators that presents a common, clearly defined routing policy to the Internet.

Old definition RFC 1771 New definition RFC 1930 A unique ASN is allocated to each AS for use in BGP routing. AS numbers are important

because the ASN uniquely identifies each network on the Internet.

(11)

RFC: Request for Comment

¨  In computer network engineering, a Request for Comments

(RFC) is a memorandum describing methods, behaviors, research, or innovations applicable to the working of the Internet and Internet-connected systems

¨  RFC are usually published by the RFC Editor on behalf of the

Internet Engineering Task Force (IETF), ¨  http://www.ietf.org/rfc.html

(12)

Adjacency and Reachability

¨ 

We describe two networks as adjacent if there is at

least one link directly between them

¨ 

A network is reachable from another network if

there is a path between them

¤ 

reachability is one-way; X may be reachable from Y

while Y is unreachable from X

n e.g. VPN à Internet; Internet à VPN

¤ 

two networks are connected if there is a path (made

up of one or more routers and/or links) between them

that provides reachability in both directions

(13)

Connectivity

¨ 

A distinguishing feature of the Internet is that it

provides universal connectivity to its hosts—every

host is connected to every other host and able to

communicate with it

¤ 

individual hosts within a network may be prevented

from communicating with Internet hosts by a firewall;

such hosts do not have Internet connectivity

(14)

Network layers

¨  Once connectivity is established, it becomes possible for a

network to provide services

¤  e-mail is one example of such a service.

¨  It is useful to think of network services as being implemented in

layers.

¨  Each layer is considered an entity in itself, providing support

for higher layers

¤  This allows each layer to be developed independently of the other

layers, which is an effective simplification.

(15)

Firewall and Intranet

¨  A firewall is a hardware device or a software program running

on a secure host that sits at the junction point or gateway

between two networks usually a private network and a public network such as the Internet, and has connectivity to both

¨  An intranet is an internet using TCP/IP, but in which all the hosts

belong to a single organization, for example a large company with office networks at various geographic locations.

¤  hosts on an intranet are only accessible to members of the

organization that owns it

¤  an intranet is not part of the Internet

¤  an intranet may have some hosts that are Internet-connected

(16)

Who provide connectivity

Internet Service Providers

(17)

ISP common concept

¨ 

ISPs provide a complete range of services, e.g. they

will

¤ 

Connect networks (or individual hosts) to the Internet

¤ 

Provide services for their customers such as e-mail,

network news and Web access

(18)

Specialized ISP

¨ 

Transport providers

¨ 

Traffic Exchange providers (Internet Exchange

Points)

¨ 

Service providers

¨ 

Content providers

(19)

Transport providers

¨  ISP running their own wide-area network and provide Internet

connectivity for their customers via that network ¤  e.g. telecom, fastweb

¨  ISPs who focus on end-user customers can be described as access

providers

¨  ISPs having high-speed networks covering large geographical

areas and connecting to many other ISP networks are commonly described as backbone providers

¤  A special case of transport provider is a transit provider, i.e. one whose

customers are other transport providers rather than individuals or companies.

(20)

E.g. Verizon transport provider

(21)

http://www.wiley.com/legacy/compbooks/press/ch14.htm

Many ISP cover all the layers, other only a portion

(22)(23)

Relathionship between networks

¨  The relationships between networks are generally described

by one of the following three categories:

¤  Transit (or pay) – The network operator pays money (or settlement)

to another network for Internet access (or transit).

¤  Peer (or swap) – Two networks exchange traffic between each

other's customers freely, and for mutual benefit.

¤  Customer (or sell) – Another network pays a network money to

(24)

Transit providers: Peering

¨  Peering: how providers agree to cooperate

¤  peering is a voluntary interconnection of administratively separate

Internet networks for the purpose of exchanging traffic between the customers of each network.

n  The pure definition of peering is settlement-free or "sender keeps all,"

meaning that neither party pays the other for the exchanged traffic; instead, each derives revenue from its own customers.

¤  Peering requires physical interconnection of the networks, an

exchange of routing information through the Border Gateway

Protocol (BGP) routing protocol and is often accompanied by peering agreements of varying formality

(25)

Peering and network relathions

¨  Bilateral peering: exchange of routing information

¤  each ISP becomes aware of the other's routes to their customers'

networks. Information received/transmitted is used to control the flow of traffic exchanged with the outside

¤  ISPs may or may not agree to carry traffic to some or all of those

customers

n  the details are determined by a contract between the two providers.

¨  Transit: carry traffic on behalf of another ISP

¤  traffic may be carried with or without exchange of routing info

(26)

[email protected] ©

http://www.wiley.com/legacy/compbooks/press/ch14.htm

(27)
(28)

Traffic

measured @

NaMeX (RM)

and MIX-IT

(MI)

[email protected] ©

(29)

Traffic

measured @

NaMeX (RM)

and MIX-IT

(MI)

(30)

[email protected] ©

Traffic

measured @

NaMeX (RM)

and MIX-IT

(MI)

(31)

An issues to be aware of:

Oversubscription

[email protected] ©

3 messages between hosts. Once connectivity is

established, it becomes possible for a network to provide

services; e-mail is one example of such a service.

It is useful to think of network services as being implemented in layers. Each layer is considered an entity in itself, providing support for higher layers. This allows each layer to be developed independently of the other layers, which is an effective simplification.

The commonly used layers are listed in Figure 1. The layer numbers were assigned by the Open Systems Interconnection (OSI) Reference Model [OSI-REF], and are widely used as shorthand for the layer names.

1 Physical Provides hardware interfaces

between machines

2 Link Sends and receives packets on

interfaces

3 Network Carries packets between hosts so

as to form networks

4 Transport Provides well-defined information

transport for applications

5-7 Application Provides application services to

users Figure 1: Network layers

2. Internet Service Providers (ISPs)

Companies that provide connectivity to the Internet are known as Internet Service Providers. Many ISPs provide a complete range of services, e.g. they will

• Connect networks (or individual hosts) to the Internet

• Provide services for their customers such as e-mail, network news and Web access

• Provide support services such as Web hosting

Other ISPs may, however, choose to concentrate on particular services, as discussed below.

2.1. Transport

Transport providers are ISPs who run their own

wide-area network and provide Internet connectivity for their customers via that network. ISPs having high-speed networks covering large geographical areas and connecting to many other ISP networks are commonly described as backbone providers. A special case of transport provider is a transit provider, i.e. one whose customers are other transport providers rather than individuals or companies. The notion of transit is discussed in section 2.4 (below).

ISPs who focus on end-user customers can be described as access providers. Access providers connect customer networks to the ISP's own network, and thus to the Internet. Customer connections may use various technologies, e.g. dial-in modems (low speed) or fixed connections (higher speeds).

One issue that customers need to be aware of is

over-subscription. To help define this term, we introduce two

others: proximal, meaning “very near” and its opposite,

distal, meaning "far away".

An access provider's network (illustrated in Figure 2) connects to other transport provider networks via fixed links with a well-defined maximum capacity, for example, a T3 link, with 45 Mbps capacity. We'll call this the

proximal capacity, because it is nearer to the Internet

backbone. Access Provider Network customer 1 customer N Transport Provider Network proximal capacity distal capacity Internet Access Provider Network customer 1 customer N Transport Provider Network proximal capacity distal capacity Internet

Figure 2: Example of a (small) ISP network oversubscription ratio R

R = (total distal capacity) / (proximal capacity) R>1 means oversubscription

(32)

Service providers

¨ 

Web hosting/housing

¨ 

Application service providers

¤ 

complex applications with web service access

n Software as a Service ¤ 

development platform

n Platform as a Service ¨ 

Infrastructure providers

¤ 

Infrastructure as a Service

[email protected] ©

Large ASPs built their own network:

http://community.comsoc.org/blogs/alanweissberger/googles-largest-internal-network-interconnects-its-data-centers-using-software

(33)

Content Providers

¨ 

Any network host that is the source of

downloadable content

¨ 

Content delivery networks

¤ 

CDNs are used by popular commercial Web sites to

(34)

Lord Kelvin said, “

when you can measure what

you are speaking about, and express it in numbers,

you know something about it; but when you cannot

measure it, when you cannot express it in numbers,

your knowledge is of a meager and unsatisfactory

kind”

Measurement Topics

(35)

Why measurement

¨  Without measurements, you have no objective record or

benchmark of how a network behaves

¨  Measurements show whether changes improve or degrade the

network’s performance, and by how much

¨  If you are buying Internet connectivity from an ISP you need to

understand the kind of service being offered

¤  Only by measuring actual performance can you verify that you're

(36)

Active vs. Passive measurement

¨ 

Passive measurements: by observing normal network

traffic

¤ Not invasive

¤ Challenging when rate increases

¤ tools: e.g. tcpdump, CoralReef, NeTraMet

¨ 

Active measurements: by sending test traffic into the

network

¤ Invasive

¤ tools: e.g. Scamper

(37)

One or Many observation points

¨  Some measurements rely on observations at more than one

point in the network

¤  to measure the time a packet takes to travel from A to B, you must

record the times when the packet leaves A and arrives at B using accurate, synchronized clocks.

¨  For measuring traffic flows through a large network is not a

good idea observing flows at many points

¤  It is difficult to correlate measurements of flows taken simultaneously at

even a few different places

¤  It is much simpler to measure traffic at the ingress/egress links of your

network

¤  Follow individual packets on their various paths through the network

ONLY IF you need to produce a traffic matrix showing overall traffic flows through it

(38)

Network vs. Application Measurements

¨  Application-level measurements are needed for a clear view

of overall application performance, which cannot easily be synthesized from lower level data

¤  They may also offer some insights into the performance of the client and

server hosts, and of the network links between.

¨  Application measurements allow to bypass traffic filtering

¤  some ISPs today block ICMP echo packets or limit the rate at which they

are processed

¤  measurements using such packets (e.g. using ping) are still useful, but the

increasing use of traffic filtering is decreasing that utility

¨  RFC 2330 clarify the difference between application and

network metrics

(39)

ISP performance evaluated using Application Measurement by www.truenet.co.nz/

(40)
(41)

The IPPM Framework

(started in 1998 still active)

¨  To clarify the differences between application and network metrics, the

IETF's IP Performance Metrics (IPPM) WG has developed a Measurement Framework for measuring network performance

¤  RFC 2330: Framework for IP Performance Metrics

¨  The Framework presents

¤  terms for describing networks,

¤  explains the need for metrics to be useful, concrete, well defined, and capable

of being measured repeatedly and reliably

¨  With the Framework defined, the IPPM WG has continued to specify

network metrics such as

¤  A One-way Delay Metric for IPPM -- RFC 2679

¤  IPPM metrics for measuring connectivity -- RFC 2678

(42)

Sampling Techniques and Traps

¨ 

Observing packet

¤ 

packet missing is not admitted

¤ 

if packets were missed the number of lost packets must

be reported

¨ 

What to do when packet rate increase?

¤ 

sample the network traffic

(43)

Sampling Techniques and Traps (2)

¨ 

What sampling algorithm should one use?

¤ to examine every n-th packet

¤ examine packets at fixed time intervals; this is harder to

implement and may be affected by aliasing

(synchronization) effects (RFC 2330 discuss this issues)

¤ to measure effects that happen at frequencies of F Hz or

less, you must sample the signal at least 2F times per second (from the Sampling Theorem)

¨ 

In practice sampling granularity depends on

¤ the phenomena we want catch

(44)

The IP Performance Metrics WG’s view

Common Metrics

(45)

Latency

¨ 

widely used measure of network latency is

round-trip time (RTT)

¤ 

the time for a packet to make the round trip from a

(46)

Network latency component times

a)  The time it takes a packet to travel along the physical links that make up

its path through the Internet (transport time)

b)  The time it takes to pass through routers between those links (queuing and

transmission time)

c)  The time required for the server to process an incoming packet and

generate a response packet (server response time) (a) + (b) = { Forward delay (Client à Server) } and { Reverse delay (Server à Client) }

(c) Server delay

Measuring (a) and (b) requires measurement at both client and server, using one-way delay techniques such as those as described in RFC 2679

(47)

Measuring latency

¨ 

Use a method that is implemented within the server's

IP stack, so that it requires very little server

processing to generate a response

¤ 

Ping the most common used

¨ 

Factors influencing latency

¤ 

Server load

¤ 

Path congestion

(48)

The ISP point of view

¨ 

An ISP can manage latency only for packets travelling

within its own network.

¤ Maximum latency can be higher than the average

¤ You need to ask

n  how often the latency is measured,

n  by how much it may vary, and

n  over what period it is averaged.

¨ 

ISPs will measure latency using a tool such as ping,

which uses the ICMP protocol

¤ This may not accurately represent the performance of

applications that use other protocols

(49)

Technical Note: Some network equipment is known to give ICMP a much lower

priority than other traffic, this could make the results for ISPs that use that

equipment appear to be slower than they really are.  However, we perform a large number of pings, which should reduce the affects of routing

equipment giving ICMP traffic a low priority.

(50)

Packet Loss

¨  Network packet loss is the fraction of packets lost in transit

from client to server and back during a specified time interval, expressed as a percentage of the packets sent to the server during that interval

¨  TCP relies on detecting lost packets to sense congestion and

control the rate at which packets are sent, we expect to see occasional packets lost from TCP streams

(51)

Throughput

¨  The rate at which data is sent through the network, usually

expressed in bits per second (bps), bytes per second (Bps) or packets per second (pps)

¨  Throughput is measured by counting bytes transported during a

specified time interval

¤  a long interval will average out short-term bursts in the data rate

¤  Short intervals imply a higher data collection rate, and may exaggerate

the burstiness of the data.

¤  A good compromise is to use one- to five minute intervals, and to

(52)

SpeedTest.net

(53)

Performance Measurement Tools

Taxonomy

¨ 

http://www.caida.org/tools/taxonomy/

performance.xml

(54)

Availability and Reliability

(55)

Availability

¨ 

ITU-T E.800, namely:

¤ 

… the ability of an item to be in a state to perform a

required function at a given instant of time or at any

instant of time within a given time interval ...

¨ 

From the user point of view, availability over a

specified time interval is the percentage of that

interval during which the system was available for

normal use

(56)

What is supposed to be available?

¨ 

Service availability

¤ 

Web service availability test

¨ 

Host availability

¤ 

Host availability test

¨ 

Network availability

¤ 

Network availability test

(57)

When is the service unavailable?

¨ 

Tests will produce latency and packet loss values for

each case

¨ 

A service is effective if are satisfied

¤ 

maximum latency and

¤ 

minimum packet loss

¨ 

if the measured values fall outside these limits, the

service will be considered unavailable

(58)

Example

¨  From the user point of view, you could decide that network

availability is lost if, when you ping your ISP's access router at one-minute intervals, ping latency is worse than 10 ms or ping packet loss is greater than 1 percent.

¨  You need to be realistic when setting these limits, for example

2% packet loss may be unacceptable for Web or ftp data transfer, but 5% packet loss could be acceptable for email.

(59)

Reporting availability

¨  Availabilities are usually reported as a single monthly figure

giving the percentage of the time that the network was available

¤  One minute is approximately 0.01% of a week, hence if service

availability was 99.99%, the service was unavailable for about four minutes during the month.

¨  When you discuss availability with your ISP, bear the above in

mind as you work together to agree on a definition of availability

¨  ISP do not normally specify availability so carefully. Instead

they use a single value of backbone availability, without specifying exactly what it means.

(60)

Other couse of unavailability

¨ 

MMTR

¨ 

MTTF

¨ 

Scheduled downtime

[email protected] © a=MTTF/(MTTR+MTTF)

(61)

Reliability

¨ 

Is closely related to availability

¨ 

Is a measure of how often you get a response back

that is wrong, or get a part that is defective

¤ 

higher the reliability lower the frequency of getting

back a wrong answer

¨ 

In networking a wrong response (e.g. a corrupted

packet) is often equal to a no response (e.g. a lost

packet)

(62)

“Managers are effected by

Mononumerosis

E.C.

Statistics

(63)

Working with statistics

¨ 

Mean, Median and Percentiles

¤ 

used to define SLA

¨ 

Sampling periods must be the same for comparison

(64)

Performance Measurement Tools

¨ 

http://www.caida.org/tools/taxonomy/index.xml

¤ 

Anonymization   

¤ 

Topology

¤ 

Workload

¤ 

Performance

¤ 

Routing   

¤ 

Multicast   

¤ 

Measurement Infrastructures

[email protected] ©

(65)

Source

¨ 

Fundamental of Internet Measurement: A tutorial

http://edu.eap.gr/pli/pli23/documents/Parallila_Keimena/B_Tomos/pdf/ Fundamentals_of_Internet_Measurement_A_Tutorial.pdf

References

Related documents

Všechna vlákna v rámci jednoho bloku mají p ístup do společné sdílené paměti, což je velice mocný nástroj pro zvýšení paměťové propustnosti

High outdoor PM concentrations, limited ven- tilation in schools during winter, and resuspension of particles are the most probable reasons for the ele- vated indoor PM

consumers which are informed by both firms, the captive segment of firm j is larger than that of firm i so that firm j is the less aggressive price-competitor and its expected

multivariate probit is a 5-equation recursive model, with a structural equation for health insurance coverage and reduced form equations for each of our four lifestyle choices:

Jennings all the necessary documents that must be maintained for resident records including the following: Annual Resident Care Agreement; Annual Resident Assessment Plan;

− Increases risk of severe diarrhea, fp malaria, measles severity; child mortality. „

T ropical countries because of a wide variety of bioactive compounds that have found significant therapeutic applications in folkloric medicine. I t is well-known and used in

• To meet the integrity and authenticity requirements Portugal accepts advanced electronic signatures, “proper EDI” with an interchange agreement based on the European