• No results found

Network Performance Optimisation and Load Balancing. Wulf Thannhaeuser

N/A
N/A
Protected

Academic year: 2021

Share "Network Performance Optimisation and Load Balancing. Wulf Thannhaeuser"

Copied!
22
0
0

Loading.... (view fulltext now)

Full text

(1)

Network Performance Optimisation

and

Load Balancing

(2)
(3)

3

Network Optimisation: Where?

VDET TRACK ECAL HCAL MUON RICH

LHC-B Detector Data rates

Trigger Level 2 & 3 Event Filter SFC SFC CPU CPU CPU CPU Sub-Farm Controllers (SFC) Storage 60 SFCs with ca. 16 “off the shelf” PCs each

RU RU RU Read-out units (RU) Front End Links

Read-out Network (RN) Control & Monitoring 2-4 GB/s 4 GB/s 20 MB/s LAN Timing & Fast Control Front-End Electronics 1 MHz 40 MHz 40 TB/s Level 0 Trigger L0 L1 Level 1 Trigger 1 MHz 40 kHz Fixed latency 4.0 µs Variable latency <1 ms 1 TB/s Front-End Multiplexers (FEM)

(4)

Ethernet Speed: 10 Mb/s

Fast Ethernet Speed: 100 Mb/s

(Fast) Ethernet Gigabit Ethernet

Gigabit Ethernet Speed: 1000 Mb/s

(considering full-duplex: 2000 Mb/s)

(5)

Network Optimisation: Why?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP/IP Overhead -Context Switching -Packet Checksums

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

(6)

Network Optimisation: How?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP/IP Overhead -Context Switching -Packet Checksums

Reduce per packet Overhead:

Replace TCP with UDP

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

(7)

TCP / UDP Comparison

• TCP (Transfer Control Protocol):

- connection-oriented protocol - full-duplex

- messages received in order, no loss or duplication ⇒ reliable but with overheads

• UDP (User Datagram Protocol): - messages called “datagrams”

- messages may be lost or duplicated - messages may be received out of order ⇒ unreliable but potentially faster

(8)

Network Optimisation: How?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP/IP Overhead -Context Switching -Packet Checksums

Reduce per packet Overhead:

Replace TCP with UDP

Reduce number of packets:

Jumbo Frames

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

(9)

Jumbo Frames

Normal Ethernet

Maximum Transmission Unit (MTU): 1500 bytes

(10)

Test set-up

• Netperf is a benchmark for measuring network performance

• The systems tested were 800 and 1800 MHz Pentium PCs using (optical as well as copper) Gbit Ethernet NICs.

• The network set-up was always a simple point-to-point connection with a crossed twisted pair or optical cable. • Results were not always symmetric:

With two PCs of different performance, the benchmark results were usually better if data was sent from the slow PC to the fast PC, i.e. the receiving process is more

(11)

Results with the

optimisations so far

Frame Size tuning: throughput

597.67 704.94 648.53 856.52 647.76 854.52 0 100 200 300 400 500 600 700 800 900 1500 9000 MTU Thr oughput ( M bit /s )

(12)

Network Optimisation: How?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP/IP Overhead -Context Switching -Packet Checksums

Reduce per packet Overhead:

Replace TCP with UDP

Reduce number of packets:

Jumbo Frames

Reduce context switches:

Interrupt coalescence

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

(13)

13

Interrupt Coalescence

Packet Processing without Interrupt coalescence:

Memory

Interrupt NIC

Memory

Packet Processing with Interrupt coalescence:

CPU Interrupt Interrupt Interrupt CPU NIC

(14)

Interrupt Coalescence: Results

Interrupt coalescence tuning: throughput (Jumbo Frames deactivated)

526.09 633.72 587.43 556.76 666.43 593.94 0 100 200 300 400 500 600 700 32 1024

us waited for new packets to arrive before interrupting CPU

Thr oughput ( M bit /s )

(15)

Network Optimisation: How?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP /IP Overhead -Context Switching -Packet Checksums

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

Reduce per packet Overhead:

Replace TCP with UDP

Reduce number of packets:

Jumbo Frames

Reduce context switches:

Interrupt coalescence

Reduce context switches:

(16)

Checksum Offloading

• A checksum is a number calculated from the

data transmitted and attached to the tail of each

TCP/IP packet.

• Usually the CPU has to recalculate the

checksum for each received TCP/IP packet in

order to compare it with the checksum in the tail

of the packet to detect transmission errors.

• With checksum offloading, the NIC performs this

task. Therefore the CPU does not have to

calculate the checksum and can perform other

operations in the meanwhile.

(17)

Network Optimisation: How?

An “average” CPU

might not be able to process such a huge amount of data packets per second:

-TCP/IP Overhead -Context Switching -Packet Checksums

An “average” PCI Bus is 33 MHz, 32-bit wide.

Theory: 1056 Mbit/s

Actually: ca. 850 Mbit/s (PCI overhead, burstsize)

Reduce per packet Overhead:

Replace TCP with UDP

Reduce number of packets:

Jumbo Frames

Reduce context switches:

Interrupt coalescence

Reduce context switches:

Checksum Offloading

Or buy a faster PC with a better PCI bus… ☺

(18)
(19)

19

Load Balancing: Where?

VDET TRACK ECAL HCAL MUON RICH

LHC-B Detector Data rates

Trigger Level 2 & 3 Event Filter SFC SFC CPU CPU CPU CPU Sub-Farm Controllers (SFC) Storage

RU RU RU Read-out units (RU) Front End Links

Read-out Network (RN) Control & Monitoring 2-4 GB/s 4 GB/s 20 MB/s LAN Timing & Fast Control Front-End Electronics 1 MHz 40 MHz 40 TB/s Level 0 Trigger L0 L1 Level 1 Trigger 1 MHz 40 kHz Fixed latency 4.0 µs Variable latency <1 ms 1 TB/s Front-End Multiplexers (FEM)

Throttle

60 SFCs with ca. 16 “off the shelf” PCs each

(20)

20

Load Balancing with round-robin

Fast Ethernet Gigabit Ethernet … … SFC

2

1

3

4

Event Event Event Event Event Event Event

Event Event Event Event

Event

Problem: The SFC doesn’t know if the node it wants to send the event to is ready to process it yet.

(21)

21

Load Balancing with control-tokens

Fast Ethernet Gigabit Ethernet … … SFC

2

1

3

4

Event Event Event Event Event Event Event Event Token 2 1 3 1 Token Token Token

With control tokens, nodes who are ready send a token, and every event is forwarded to the sender of a token.

(22)

LHC Comp. Grid Testbed Structure

100 cpu servers on GE, 300 on FE, 100 disk servers on GE (~50TB), 10 tape server on GE

3 GB lines 3 GB lines 8 GB lines 64 disk server 64 disk server Backbone Routers Backbone Routers 36 disk server 36 disk server 10 tape server 10 tape server 100 GE cpu server 100 GE cpu server 200 FE cpu server 200 FE cpu server 100 FE cpu server 100 FE cpu server 1 GB lines SFC

References

Related documents

67 move move yer de yer de  ii  tirmek, ta tirmek, ta  ınmak, ta ınmak, ta  ımak, hareket etmek ımak, hareket etmek 68. 68 notice notice fark etmek,

The $19.9 million grant went to a group of 13 colleges including all three of the schools in Kern Community College District: Bakersfield College, Cerro Coso College and Porterville

Another five competencies did not achieve 60% or greater agreement of appropriate level of nursing prac- tice (“uses decision support systems,” “expert systems and other aids

A The resu Inpatien the Bene Regulati viewed a St D 605 W ebruary 14 acey, CFO Hospital of toria Street Mesa, CA 9 GE HOSPIT NAL PROVI PERIOD E e examined Our examin titutions Co

Sau ®ã thùc hiÖn lÖnh hiÓn thÞ néi dung local macros nµy, nhng macros nµy kh«ng tån t¹i ë ®o¹n ch¬ng trinh kh¸c hay ë bé nhí cña Stata. end

This prototype virtual office environment provides support for communication, cooperation, and awareness in both the synchronous and asynchronous modes, smoothly integrated into

• An oscar for innovative payment solutions for its instant issuing solution developed with PASS financial services ‘Monext is partnering BNP Paribas in the creation of

On the other hand the impact of the, budget deficit and current account balance itself are positive while growth rate investment, balance of trade ,inflation, exchange