• No results found

ECLIPSE Best Practices Performance, Productivity, Efficiency. March 2009

N/A
N/A
Protected

Academic year: 2021

Share "ECLIPSE Best Practices Performance, Productivity, Efficiency. March 2009"

Copied!
17
0
0

Loading.... (view fulltext now)

Full text

(1)

ECLIPSE Best Practices

Performance, Productivity, Efficiency

(2)

ECLIPSE Performance, Productivity, Efficiency

• The following research was performed under the HPC Advisory Council

activities

– HPC Advisory Council Cluster Center

• Special thanks to AMD, Dell, Mellanox, Schlumberger – In particular to

• Joshua Mora (AMD)

• Jacob Liberman (Dell)

• Gilad Shainer and Tong Liu (Mellanox Technologies)

• Owen Brazell (Schlumberger)

• For more info please refer to

(3)

The HPC Advisory Council

• Distinguished HPC alliance (OEMs, IHVs, ISVs, end-users) – More than 60 members worldwide

– 1st tier OEMs (AMD, Dell, HP, Intel, Sun) and systems integrators across the world

– Leading OSVs and ISVs (LSTC, Microsoft, Schlumberger, Wolfram etc.)

– Strategic end-users (Total, Fermi Lab, ORNL, OSU, VPAC etc.)

– Storage vendors (DDN, IBRIX, LSI, Panasas etc)

• The Council mission

– Bridge the gap between high-performance computing (HPC) use and its potential

– Bring the beneficial capabilities of HPC to new users for better research, education, innovation and product manufacturing

– Bring users the expertise needed to operate HPC systems

– Provide application designers with the tools needed to enable parallel computing

(4)

HPC Advisory Council Activities

HPC Technology Demonstration HPC Outreach and Education Cluster Center

End-user applications benchmarking center

Network of Expertise

Best Practices - Oil and Gas

- Automotive - Bioscience - Weather - CFD - Quantum Chemistry - and more….

(5)

ECLIPSE Best Practices Objectives

• Ways to improve performance, productivity, efficiency

Knowledge, expertise, usage models

• The following presentation reviews:

ECLIPSE performance benchmarking

Interconnect performance comparisons

ECLIPSE productivity

Understanding ECLIPSE communication patterns

(6)

Test Cluster Configuration

Dell™ PowerEdge™ SC 1435 24-node cluster

– Dual socket 1-U rack server supports 8x PCIe and 800MHz DDR2 DIMMs

– Energy efficient building block for scalable and productive high performance computing

Quad-Core AMD Opteron™ Model 2382 processors (“Shanghai”)

– Industry leading technology that delivers performance and power efficiency

– Opteron™ processors provide up to 21 GB/s peak bandwidth per processor

Mellanox® InfiniBand ConnectX® DDR HCAs and Mellanox InfiniBand DDR Switch

– High-performance I/O consolidation interconnect – transport offload, HW reliability, QoS

– Supports up to 40Gb/s, 1usec latency, high message rate, low CPU overhead and zero scalable latency

Application: Schlumberger ECLIPSE Simulators 2008.2

– ECLIPSE is the market leading reservoir simulator

– Has become the de-facto standard for reservoir simulation in the oil and gas industry

Benchmark Workload

(7)

ECLIPSE Performance - Interconnect

• InfiniBand enables highest performance and scalability

– Performance accelerates with cluster size

• Performance over GigE and 10GigE is not scaling

– Slowdown occurs beyond 8 nodes

Lower is better Single job per cluster size

Schlumberger ECLIPSE (FOURMILL) 0 1000 2000 3000 4000 5000 6000 4 8 12 16 20 22 24 Number of Nodes E la p s e d Ti m e ( S e c onds )

(8)

ECLIPSE Productivity

By allowing multiple jobs to run simultaneously, ECLIPSE productivity can be increased

– Maximum gain with one job per core, maximum % improve with 1 job per 2 cores

Three cases are presented

– Single job over the entire systems

– Four jobs, each on two cores per CPU per server

– Eight jobs, each on one CPU core per server

Eight jobs per node increases productivity by up to 142%

Schlumberger ECLIPSE (FOURMILL) 0 50 100 150 200 250 300 8 12 16 20 22 24 Number of Nodes N u mb er o f Jo b s

(9)

ECLIPSE Productivity – Interconnect Comparison

• Productivity improvement depends on the cluster interconnect

– As number of job increases, I/O becomes the system bottleneck

• InfiniBand demonstrates scalable productivity compared to Ethernet

– GigE shows performance decrease beyond 8 nodes

– 10GigE demonstrates no scaling beyond 16 nodes

4 Jobs on each node

Higher is better Schlumberger ECLIPSE (FOURMILL) 0 50 100 150 200 250 4 8 12 16 20 22 Number of Nodes N u m b e r of J obs

(10)

ECLIPSE MPI Profiliing MPI_Isend 0 2 4 6 8 10 [0..128B] [128B..1KB] [1..8KB] [8..256KB] [256KB..1M] [1M..Infinity] N u m b e r o f M e s s a g e s ( M illio n s ) Message Size

4nodes 8nodes 12nodes 16nodes 20nodes 24nodes

ECLIPSE Profiling – Data Transferred

• Sensitivity points:

8K-256KB messages

• Relate to data communications

0-128B messages

• Relate to data synchronizations

• MPI message profiling

• Determine Application

sensitivity point

Both Send and Receive

ECLIPSE MPI Profiliing MPI_Recv 0 1 2 3 4 5 6 7 [0..128B] [128B..1KB] [1..8KB] [8..256KB] [256KB..1M] [1M..Infinity] N u m b er o f M essag es (M il li o n s ) Message Size

(11)

Eclipse MPI Profiliing 30% 40% 50% 60% 4 8 12 16 20 24 Number of Nodes P e rcen ta g e o f M e ssag es

MPI_Isend < 128 Bytes MPI_Isend < 256 KB MPI_Recv < 128 Bytes MPI_Recv < 256 KB

ECLIPSE Profiling – Message Distribution

• Majority of MPI messages are large size

Percentage slightly increase with cluster size

(12)

Data Sent 0 200 400 600 800 1000 1200 1400 1 45 89 133 177 221 265 309 353 397 441 485 529 573 617 661 705 749 793 Da ta T ran sf e rr e d ( M B /s ) Data Sent 0 200 400 600 800 1000 1200 1400 1 48 95 142 189 236 283 330 377 424 471 518 565 612 659 706 753 800 847 Da ta T ran sf e rr e d ( M B /s ) Data Sent 0 200 400 600 800 1000 1200 1400 1 86 171 256 341 426 511 596 681 766 851 936 1021 1106 1191 Timing (s) Da ta T ran sf e rr e d ( M B /s ) Data Sent 0 200 400 600 800 1000 1200 1400 1 146 291 436 581 726 871 1016 1161 1306 1451 1596 1741 1886 2031 Timing (s) D a ta T ra n s fe rr e d (MB /s )

Interconnect Usage by ECLIPSE

• Total server throughput increases rapidly with cluster size

4 Nodes 8 Nodes

(13)

Power Consumption 0 200 400 600 800 1000 1200 1400 1600 1800 2000 P o w e r pe r J ob ( W h)

GigE 10GigE InfiniBand

Power Consumption/Productivity Comparison

66% 33%

4 Jobs on each node

By enabling higher productivity on a given system, power/job decreases

By using InfiniBand power/job consumption decreases by

– Up to 66% vs GigE and 33% vs 10GigE

– For productivity case – 4 jobs per node

With a single job approach, InfiniBand reduces power/job consumption by more than 82% compared to 10GigE

(14)

ECLIPSE Profiling Summary - Interconnect

• ECLIPSE was profiled to determine networking dependency • Majority of data transferred between compute nodes

– Done with 8KB-256KB message size

– Data transferred increases with cluster size

• Most used message sizes

– <128B messages – mainly synchronizations

– 8KB-256KB – data transferring

• Message size distribution

– Percentage of smaller messages (<128B) slightly decreases with cluster size

– Percentage of mid-size messages (8KB-256KB) increases with cluster size

• ECLIPSE interconnects sensitivity points

– Interconnect latency and throughput for <256KB message range

(15)

Quad-Core AMD “Shanghai” Performance and Power consumption

• AMD Opteron “Shanghai” versus Opteron “Barcelona”

Overall 20% performance increase

• Both GFLOP/s and GB/s

While sustaining similar power consumption.

• Memory throughput intensive applications (such as ECLIPSE)

Performance is driven by north-bridge and DDR2 memory frequencies

Consume less energy than cache friendly compute intensive applications

• Power management schemes

Delivers the highest application performance per Watt ratio

Critical for low power consumption of the platform while in idle

(16)

Conclusions

• Eclipse is widely used to perform reservoir simulation – Developed by Schulmberger

• ECLIPSE performance and productivity relies on: – Scalable HPC systems and interconnect solutions

– Low-latency and high-throughout interconnect technology

– NUMA aware application for fast access to memory

– Reasonable job distribution can dramatically improve productivity

• Increasing number of jobs per day while maintaining fast run time

• Interconnect comparison shows:

– InfiniBand delivers superior performance and productivity in every cluster size

(17)

Thank You

HPC Advisory Council

All trademarks are property of their respective owners. All information is provided “As-Is” without any kind of warranty. The HPC Advisory Council makes no representation to the accuracy and completeness of the information contained herein. HPC Advisory Council Mellanox undertakes no duty and assumes no obligation to update or correct any information presented herein

References

Related documents

We ultimately find that with a careful application of technical and fundamental data, as well as a thorough understanding of common patterns in financial markets, it is possible

(c) If any capital ratios are less than required by the ORDER, as determined by the date of any Report of Condition and Income or at an examination by the FDIC or

Using a subset of genes with known interactions, we show that the inferred NEMix network has high accuracy and outperforms the classical nested effects model without hidden

* Holding Company: AIC Holdings Company (Pty) Ltd Unity is a product of African Unity Health (Pty) Ltd (“African Unity Health”), a new Financial Services Provider, created to

• ThunderX_SC™: Up to 48 highly efficient cores along with integrated virtSOC, 10/40 GbE connectivity, multiple PCIe Gen3 ports, high memory bandwidth, dual socket coherency, and

To monitor the slow path delay of a switch, we send packet probes to the switch using PacketOut, use a carefully placed rule to trigger a PacketIn, and then drop the probe with-

Our model gauges the relationship between risk and reward in several ways, including: the pricing drawdown as compared to potential profit volatility, i.e.how much one is willing

Cost/Benefit - benefits equal or exceed the costs ƒ Services Providers • Information Technology • Real Estate • Finance • Human Resources • Public Affairs ƒ Service Recipients