• No results found

Virtualized Infiniband: Enabling HPC in the Cloud

N/A
N/A
Protected

Academic year: 2021

Share "Virtualized Infiniband: Enabling HPC in the Cloud"

Copied!
37
0
0

Loading.... (view fulltext now)

Full text

(1)

National Aeronautics and Space Administration

www.nasa.gov

Virtualized Infiniband:


Enabling HPC in the Cloud"

Hoot Thompson"

NASA Center for Climate Simulation (NCCS)"

[email protected]"

(2)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 2

Focus on the research side of climate study (versus NOAA’s operational position)"

Simulations span multiple time scales"

Days for weather prediction !

Seasons to years for short term climate prediction!

Centuries for climate change projection!

Examples:"

High fidelity 3.5 KM global simulations of cloud and hurricane predictions!

Comprehensive reanalysis of the last thirty years of weather/climate –MERRA !

Multi-millennium analysis for the Intergovernmental Panel on Climate Change!

Integrated set of supercomputing, visualization and data management technologies"

Discover computational cluster !

»

Mix traditional Intel cores, nVidia GPUs and Intel Xeon Phis!

»

DDR/QDR/FDR InfiniBand (IB) backbone!

»

1 GbE and 10 GbE management infrastructure!

»

~17 PBytes RAID based shared parallel file system (GPFS)!

Tape archive of over 30 PBytes!

(3)

National Aeronautics and Space Administration!

Discover Computational Growth"

(4)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 4

Objective: HPC Science Cloud"

Adjunct to Discover hosted science processing"

• 

Special/temporary debug queues!

• 

Customized run-time environments!

• 

Code validation against older system images!

Expanded customer base!

• 

Temporal processing campaigns (e.g. IFloodS)!

• 

Mission support (e.g. SMAP)

!

Issue is matching HPC levels of performance"

• 

Node-to-node communication critical – high speed, low latency, scalable!

• 

Shared, high performance file system mandatory!

• 

Management and rapid provisioning of resources – cluster formation!

Potential obstacle – performance loss in virtualized space"

Red Hat

Storage

RHEL

(5)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 5

RHEL Virtualized IB Test Bed"

Set-up eight node POC environment – Westmere based"

• 

Ran representative benchmarks!

• 

Contrasted bare Host (KVM hypervisor) with VM (guest)!

!

Investigated multiple techniques for improving performance"

• 

VM tuning – hugepages and NUMA awareness!

(6)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 6

Summarized Virtualized IB Results"

VM memory bandwidth actually exceeded bare-metal"

"

VM bandwidth/latency between nodes matched bare-metal"

"

(7)

National Aeronautics and Space Administration!

Scale Out Comparisons"

(8)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 8

GlusterFS Operational Prototype "

"

Recently acquired 960TB raw storage"

• 

Four I/O servers!

• 

IB frontend!

• 

SAS backend!

• 

Four 60-bay JBODS populated with 4TB drives!

• 

One per I/O server!

• 

Various Gluster volume configurations"

"

Bare metal Gluster clients"

• 

Connected to I/O servers using IB/RDMA!

"

VM Gluster clients"

• 

Connected to VMs servers using virtualized IB/RDMA!

VM VM VM

Client

VM VM VM

Client

VM VM VM

Client

Infiniband Switch

I/O Server

(9)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 9

Red Hat OpenStack (RHOS) Cloud"

Set-up three node evaluation system"

• 

Havana release!

• 

One controller!

• 

Two compute node!

• 

More compute nodes as available!

!

Objective"

• 

Gain hands on familiarity!

• 

Work with Red Hat/Mellanox!

• 

Constructs for declaring virtualized IB connections!

• 

Rapid HPC cluster instantiation!

• 

Define architecture – mix of IB and traditional Ethernet!

• 

Seed the Science Cloud!

Instance

Compute

Instance Instance

Compute

Instance

GlusterFS

(Future)

Controller

(10)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 10

Decadal Water Products for ABoVE"

Small lakes and ponds are a prominent

feature of the landscape in the High

Northern Latitudes. These ponds will be

mapped at 30m spatial resolution at 3

epochs (1991, 2001, 2011) prior to the

Arctic Boreal Vulnerability Experiment

(ABoVE) field campaign. This will allow

researchers to identify areas to study that

are either constant or ones that are

changing. The effort will take advantage of

the time series of Landsat data that is

available in this region to provide the max,

min, and average condition of each lake/

pond 1ha or larger for each epoch.!

Landsat image, false color composite, from near

Barrow, AK

Landsat image, false color composite, from near

Barrow, AK

Landsat image, false color composite, from near

Barrow, AK!

(11)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud

11

Science Cloud Use Case"

"

"

The time series for each epoch will draw from 3

years (1990–1992; 2000–2002; 2010–2012)"

"

To cover the study region this translates to

>25,000 scenes to process"

"

Each scene is categorized into land, water, and

other (cloud, ice, shadow, undetermined)"

"

The results are then stacked and summed to

produce 1 map for each epoch that is the

“average” condition for that period"

"

The resultant maps can be used to identify areas

of change and areas that are stable"

VM VM VM

Client

VM VM VM

Client

VM VM VM

Client

Infiniband Switch

I/O Server

(12)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 12

Testing In-Progress / Next Steps"

Virtualization overhead elimination (push for 0%)"

• 

Single node VM matches bare metal!

• 

Different VM configurations"

Newer Intel CPU performance – virtualized IB"

• 

Eight node Sandy Bridge cluster in test!

• 

Different I/O structure stirring interest!

• 

Ivy Bridge system in the works!

• 

80 node test system – scaling out > 8 nodes!

GlufterFS performance tuning!

• 

RDMA write/read rates!

• 

Various volume and file system configurations

!

Red Hat OpenStack"

• 

Grow test cluster into Science Cloud!

• 

Host science cloud use cases directly – self directed resources!

Red Hat

Storage

RHEL

(13)

National Aeronautics and Space Administration!

"

Thanks to …..!

Red Hat – software and tuning support!

Mellanox – hardware loaners and technical support!

OSU – mvapich2 software support!

(14)

National Aeronautics and Space Administration

www.nasa.gov

Questions?


"

Hoot Thompson"

NASA Center for Climate Simulation (NCCS)"

[email protected]"

(15)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 15

(16)

National Aeronautics and Space Administration!

Virtualized IB Test Configuration"

(17)

National Aeronautics and Space Administration!

Item  

 Details  

 Processor  Type  

 Westmere  

 Processor  Number  

 X5660  

 Processor  Speed  

 2.8GHz  

 Sockets  per  Node  

2

 Cores  per  Socket  

6

 Cores  per  Node  

12

 Main  Memory  

24GB

 Interconnect  

Mellanox MT26428 QDR IB

 OperaGng  System    

Red Hat 6.4

 Kernel  

2.6.32-358.2.1

Configuration Details"

(18)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 18

Single node"

Memory bandwidth!

LINPACK!

!

Multiple node"

Node-to-node bandwidth!

Node-to-node latency!

Eight node LINPACK!

Eight node NPB!

Spread host file!

(19)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 19

Two different LINPACK versions"

Openmp – single node!

Hybrid – one or more mpi processes each starting 1 to 12 threads!

!

Different block sizes (NB)"

144, 168, 192 and 216!

!

P Q settings"

P X Q equals number of mpi processes to fill node/cluster!

P always less than Q!

Minimize P – Q (square is best)!

(20)

National Aeronautics and Space Administration! Virtualized Infiniband: Enabling HPC in the Cloud 20

Mimic the computation and data movement in CFD applications"

"

Different class levels (C, D) reflect different problem sizes"

"

Five Kernels"

IS – Integer Sort, random memory access!

EP – Embarrassingly Parallel!

CG – Conjugate Gradient, irregular memory access and communication!

MG – Multi-Grid on a sequence of meshes, long and short distance!

FT – discrete 3d fast Fourier Transform, all-to-all communication!

"

Three pseudo applications"

BT – Block Tri-diagonal solver!

SP – Scalar Penta-diagonal solver!

LU – Lower-Upper Gauss-Seidel solver!

(21)

National Aeronautics and Space Administration!

Memory Bandwidth – Single Node"

B

E

T

T

E

R

(22)

National Aeronautics and Space Administration!

LINPACK – Single Node Host"

B

E

T

T

E

R

(23)

National Aeronautics and Space Administration!

LINPACK – Single Node VM"

B

E

T

T

E

R

(24)

National Aeronautics and Space Administration!

LINPACK – Host versus VM Comparison"

B

E

T

T

E

R

(25)

National Aeronautics and Space Administration!

Bandwidth – Node-to-Node "

B

E

T

T

E

R

(26)

National Aeronautics and Space Administration!

Latency – Node-to-node Latency"

B

E

T

T

E

R

(27)

National Aeronautics and Space Administration!

LINPACK – Eight Hosts"

B

E

T

T

E

R

(28)

National Aeronautics and Space Administration!

LINPACK – Eight VMs"

B

E

T

T

E

R

(29)

National Aeronautics and Space Administration!

LINPACK – Eight Hosts Versus Eight VMs"

B

E

TT

E

R

B

E

T

T

E

R

12% delta

(30)

National Aeronautics and Space Administration!

NPB IS – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

88% Efficiency

Integer Sort, random memory access

(31)

National Aeronautics and Space Administration!

NPB EP – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

98% Efficiency

Embarrassingly Parallel

(32)

National Aeronautics and Space Administration!

NPB CG – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

94% Efficiency

Conjugate Gradient, irregular memory access

and communication

(33)

National Aeronautics and Space Administration!

NPB MG – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

96% Efficiency

Multi-Grid on a sequence of meshes,

long and short distance

(34)

National Aeronautics and Space Administration!

NPB FT – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

100% Efficiency

discrete 3d fast Fourier Transform,

all-to-all communication

(35)

National Aeronautics and Space Administration!

NPB BT – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

90% Efficiency

Block Tri-diagonal solver

(36)

National Aeronautics and Space Administration!

NPB SP – Eight Hosts Versus Eight VMs"

B

E

T

T

E

R

88% Efficiency

Scalar Penta-diagonal solver

(37)

National Aeronautics and Space Administration!

NPB LU – Eight Hosts Versus Eight VMs"

91% Efficiency

B

E

T

T

E

R

Lower-Upper Gauss-Seidel solver

References

Related documents

White balance is adjusted when you press the HOME button on the supplied remote commander or the joystick top button on the remote controller while [ONE PUSH] is selected in

Pupae infection with Beauveria bassiana, Metarhizium anisopliae and Verticillium lecanii at 25 ± 2 °C and 85 ±5 % RH.. After treatment

Moreover, to basic messaging WhatsApp Messenger users can send each other images, video as well as audio media messages.. WhatsApp Android is not compatible

 Allowing for time variation in the premia has the most significant impact on the estimated contribution of graduate skills to productivity growth, especially in countries such

In addition to standard Windows elements such as the task pane (if you choose to display it), menu bar, toolbars, scroll bars, and status bar, Excel has a reference area that shows

When you go to Google, you will see a nice little link that says “Images”. What many people don't realize is that this is a gold mind. These images work the same as the search

This information demonstrates that the application profiles in each registry were heavily weighted toward the Dublin Core element set and the qualifiers explicitly defined by the

Cooperative education and work experience students must have personalized placement learning plans that identify the overall and specific curriculum expectations of the related