• No results found

Headline in Arial Bold 30pt. The Need For Speed. Rick Reid Principal Engineer SGI

N/A
N/A
Protected

Academic year: 2021

Share "Headline in Arial Bold 30pt. The Need For Speed. Rick Reid Principal Engineer SGI"

Copied!
13
0
0

Loading.... (view fulltext now)

Full text

(1)

Headline in Arial Bold 30pt

The Need For Speed

Rick Reid

Principal Engineer

SGI

(2)

Commodity Systems

• Linux

• Red Hat

• SUSE

• SE-Linux

• X86-64

• Intel Xeon

• AMD

• Scalable Programming Model

• MPI

• Global Data Access

• NFS

• NFS/RDMA

• pNFS

0

10 20 30 40 50 60 70

2005 2006 2007 2008 2009 2010

x86 EPIC RISC CISC 0 5000 10000 15000 20000 25000 30000 35000 40000 45000 50000

2005 2006 2007 2008 2009

Linux Windows Unix

Linux and Windows Growth

x86 Increasingly Dominant

$ B R e ve n u e , W W S e rv e rs IDC 2006 IDC 2006 (0 0 0 ) S h ip m e n ts , W W S e rv e rs CAGR x86 6.5% CAGR Linux 20% Windows 9%

(3)

Workflow

Characteristics

• Performance Oriented

• Data-Intensive

• Product Timelines

• Interactivity

• Rapid development

cycles

Workflow

Characteristics

• Price-performance key

• Little data sharing

• Predictable Workloads

• Non-interactive

• Mature Apps

Capability

&

Performance

Computing

Capacity

&

Compliant

Computing

Focus on Innovation

Focus on Efficiency

(4)

Commodity Xeon Motherboard – Today

Seaburg chipset

Wolfdale and Harpertown

1600 MHz or 1333 MHz front side bus

16 FB DIMM slots

64 GB ECC DDR2 FBD

8 SATA2 or SAS drives

H/W RAID 0, 1, 5, 6, 10, 50, 60

ATI 32 MB DDR2 Graphics

Dual GbE interfaces

Redundant power supplies

5 external I/O slots

2 x PCIe x8 gen 2

1 x PCIe x8 gen 1

1 or 2 x PCIe x4 gen 1

(5)

Commodity Motherboard - Next Year

32 Cores

320 GF

64 DIMMs – 256 GB

1U

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

MB

QPI

BoxboroBoxboro

X4 ESI

Boxboro Boxboro

ICH 10

Nehalem-EX Nehalem-EX

Nehalem-EX Nehalem-EX

x8 x8

2x4

x8 x4

x8 x8 x8

x16

x8 x4 x8

x16 2x4

2x4

2x4 X4 PCIe Gen1

Nehalem-EX Nehalem-EX

Nehalem-EX Nehalem-EX

(6)

Fabrication Technology Drives the Need

Research

Production

2007

2007

4 - 8

4 - 8

cores

cores

2005

2005

2 - 4

2 - 4

cores

cores

2003

2003

1 - 2

1 - 2

cores

cores

2009

2009

6 - 12

6 - 12

cores

cores

2011

2011

8 - 16

8 - 16

cores

cores

2013

2013

12 - 18

12 - 18

cores

cores

2015

2015

20 - 28

20 - 28

cores

cores

2017

2017

24 - 32

24 - 32

cores

cores

90 nm

90 nm 65 nm65 nm 45 nm45 nm

Development

16 nm 16 nm 22 nm 22 nm 32 nm

32 nm 11 nm11 nm 8 nm8 nm

12 - 24

12 - 24

GF/s

GF/s

24 - 48

24 - 48

GF/s

GF/s

48 - 96

48 - 96

GF/s

GF/s

72 - 144

72 - 144

GF/s

GF/s

96 - 192

96 - 192

GF/s

GF/s

144 - 216

144 - 216

GF/s

GF/s

240 - 336

240 - 336

GF/s

GF/s

288 - 384

288 - 384

GF/s

GF/s 3 GHz , 4 FLOPS / clock

2 sockets per motherboard / blade

24 - 48

24 - 48

GF/s

GF/s

48 - 96

48 - 96

GF/s

GF/s

96 - 192

96 - 192

GF/s

GF/s

144 - 288

144 - 288

GF/s

GF/s

192 - 384

192 - 384

GF/s

GF/s

288 - 432

288 - 432

GF/s

GF/s

480 - 672

480 - 672

GF/s

GF/s

576 - 768

576 - 768

GF/s

GF/s If operations per word of input = 500 than GB/s of I/O required to prevent stalls (20% O/S overhead)

0.31 - 0.61

0.31 - 0.61

GB/s

GB/s

0.61

0.61 –– 1.23 1.23 GB/s

GB/s

1.23

1.23 –– 2.46 2.46 GB/s

GB/s

1.84

1.84 –– 3.69 3.69 GB/s

GB/s

2.46

2.46 –– 4.92 4.92 GB/s

GB/s

3.69

3.69 –– 5.53 5.53 GB/s

GB/s

6.14

6.14 –– 8.6 8.6 GB/s

GB/s

7.37

7.37 –– 9.83 9.83 GB/s

GB/s

(7)
(8)

NFS/RDMA - A new NFS standard

• NFS/RDMA is provided by transport switch in the RPC layer which

allows data to be transported via TCP/IP, UDP/IP or RDMA over

Infiniband.

• The transport switch allows NFS to use the efficient, high

bandwidth native Infiniband protocol.

• NFS/RDMA can be simultaneously intermixed with NFS over

IP-based protocols on the same server and even the same files

• NFS/RDMA transfers the data from the Severs memory to the

client memory without host CPU intervention

• This allows the NFS server to serve more clients • This improves NFS bandwidth

• This allows the NFS client to dedicate more CPU resources to the application

• Latest OpenFabric drivers provide most of the capabilities required

(9)

What is NFS over RDMA?

NFS

RPC/XDR

UDP

IP

TCP

RDMA

(10)

NFS/RDMA

IB or 10 GbE connectivity between NFS server and NFS

clients

IB, SAS or FC backend to storage

Performance

10 RAIDs, 480 15K SAS drives, 96 GB RAM, 2 IB interfaces

NFS/RDMA IB Read: 3.5 GB/s

NFS/RDMA IB Write: 2.25 GB/s

NFS 10 GbE Read: 800 MB/s

NFS 10 GbE Write: 400 MB/s

Higher aggregate bandwidth

(11)

pNFS - Future Technology

• Parallel NFS: Extension of NFS 4.x Standard

• Expected draft release in NFS 4.1

• Dr. Garth Gibson, professor at CMU, wrote the Internet

Draft:

“pnfs-problem-statement”

www.pdl.cmu.edu/pNFS/archive/gibson-pnfs-problem-statement.html

• File base sharing

• Uses metadata servers

• NFS semantics for metadata access

• Metadata server points client to the storage node

• Supports Block, File System, or Object Storage backend

• Expected to be able to serve 100K+ of clients

(12)

NAS Storage

Clients

WAN Admin

NAS 1 NAS 2

Storage Module 1

Storage Module 2 IB or 10 GbE

Switch

. . .

. . .

IB or 10 GbE Switch Compute Server 1 Compute Server 2 Compute Server 3 Compute Server “n”

NAS “n” NAS “n+1”

Storage Module n

Storage Module n+1

. . .

NFS RDMA pNFS

Clients WAN Admin

NAS 1 NAS 2 IB or 10 GbE

Switch

. . .

. . .

IB or 10 GbE Switch Compute Server 1 Compute Server 2 Compute Server 3 Compute Server “n”

NAS “n” NAS “n+1”

. . .

Storage Module 1 Storage Module 2 Storage Module n Storage Module n+1 MDS 1 MDS 2

(13)

Headline in Arial Bold 30pt

Questions

Thank You

References

Related documents

 Staff reviews mitigation protocols every 30 days, and since the Governor relaxed requirements, changes to current protocols are: we removed some Table Games barriers;

Commercial Cloud Providers Vendor Instance storage Object storage Block storage Semi- structured data storage Relational Database storage Distributed File System Online

Unified storage is a single shared pool of storage that can combine both block and file data, eliminating the need to manage separate block and file systems.. Fewer storage

• OpenStack provides redundant, scalable object storage using clusters of standardized servers capable of storing petabytes of data. • Object Storage is not a traditional file

• Object Storage is not a traditional file system, but rather a distributed storage system for static data such as virtual machine images, photo storage, email storage, backups and

Storage 6/27/15 Cloud 101 25 Instance Storage Volume/Block Storage Object Storage. Speed High Medium/High

Object Storage System Application Network Object Storage System HTTP / REST Objects Metadata Typically 10GbE File Attributes File Storage Data & Attributes

Software Defined Storage stack overview System Center Windows Server SMI-S Storage Service Block storage provisioning File storage provisioning Hyper-V Storage