• No results found

Crossing the Performance Chasm with OpenPOWER

N/A
N/A
Protected

Academic year: 2021

Share "Crossing the Performance Chasm with OpenPOWER"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Crossing the Performance Chasm with

OpenPOWER

Dr. Srini Chari

Cabot Partners/IBM

[email protected]

Join the conversation at #OpenPOWERSummit 1

#OpenPOWERSummit

(2)

Disclosure

Please read accompanying Cabot Partners Whitepaper for Additional Detail,

References and Notes on information presented in this document.

Copyright® 2015. Cabot Partners Group. Inc. All rights reserved. Other companies’ product names,

trademarks, or service marks are used herein for identification only and belong to their respective owner.

All images and supporting data were obtained from IBM , NVIDIA, Mellanox or from public sources. The information and product recommendations made by the Cabot Partners Group are based upon public information and sources and may also include personal opinions both of the Cabot Partners Group and others, all of which we believe to be accurate and reliable. However, as market conditions change and not within our control, the information and recommendations are made without warranty of any kind.

The Cabot Partners Group, Inc. assumes no responsibility or liability for any damages whatsoever (including incidental, consequential or otherwise), caused by your or your client’s use of, or reliance upon, the information and recommendations presented herein, nor for any inadvertent errors which may appear in this document. This paper was developed with IBM funding. Although the paper may utilize publicly available material from various vendors, including IBM, it does not necessarily reflect the positions of such vendors on the issues addressed in this document.

(3)

Agenda

 Key Technology Trends

• Cloud, HPC, Analytics, Social, Mobile and Internet of Things

• Centered Around Data

 Open Innovation Vital for Value Creation

• Data-Centric HPC Growing Rapidly

 Client Considerations in HPC Systems Evaluations

 System Attributes Impacting Real Life Performance

• Why the LINPACK Benchmark is Inadequate

 IBM Data Centric Approach and Solutions

 Why OpenPOWER

 Examples of Performance Gains

 Key Takeaways

Join the conversation at #OpenPOWERSummit 3

(4)

Key Intertwined Technology Trends

(5)

Key Intertwined Technology Trends

Join the conversation at #OpenPOWERSummit 5

 Cloud, High Performance Computing, Analytics,

Social, Mobile and IoT

• Enterprise cloud growth - $70B (2014) to $250B (2017)

• Annual growth: Smart phones 20% . Mobile data 81%

• IoT at 12B today reaching over 1 Trillion in a decade

• Social media users - 1.79B (2014) to 2.44B (2018)

 DATA

• 2.5 exabytes (10

18

bytes) created daily. Individuals create

70% and enterprises manage 80%

• Annual spending 30% to reach $114B in 2018

(6)

What IT Must Consider to Deal with Data

 Volume

 Variety

 Velocity

 Veracity

 Vulnerability

 Visualize

 Virtualize

Value

V 8

(7)

Extracting Value From Data with HPC

Join the conversation at #OpenPOWERSummit 7

Requires Open Innovation across the stack

(8)

HPC Drives Value Across Many Industries

 Overall HPC servers growing ~ 6.4% annually

 Traditional HPC

 Data-Centric HPC growing ~ 23.5%

Risk Analytics Life Sciences Oil and Gas

(9)

Considerations to Evaluate HPC Systems

 Not Just Point Benchmarks

 But Workflows Across the HPC Data Life Cycle

 Example in Seismic Processing

Join the conversation at #OpenPOWERSummit 9

(10)

Total Value of Ownership Framework

Value Delivered

• Business Value: e.g. customer revenues, new business models, compliance

regulations, better products, increased business insight, faster time to market, and new breakthrough capability

• Operational Value: e.g. faster time to results, more accurate analyses, more users supported, improved user productivity, better capacity planning

• IT Value: e.g. improved system utilization, manageability, administration, and

provisioning, scalability, reduced downtime, access to robust proven technology and expertise.

Costs Incurred

• IT /Data Center Capital e.g. new servers, storage, networks, power distribution units, chillers, etc.

• Data Center Facilities e.g. land, buildings, containers, etc.

• Operational Costs: e.g. labor, energy, maintenance, software license, applications, etc.

• Other Costs: e.g. system management, deployment and training, downtime,

Holistic Cost Benefit Analysis for Entire HPC Workflow

(11)

What Impacts Traditional HPC Performance

Join the conversation at #OpenPOWERSummit 11

0

Flops/Core

Cores

Memory Capacity

Memory Bandwidth I/O Performance

Network Latency Network Bandwidth

Structures Crash Fluids

Computer Aided Engineering

0

Flops/Core

Cores

Memory Capacity

Memory Bandwidth I/O Performance

Network Latency Network Bandwidth

Quantum Chemistry Molecular Modeling Bioinformatics

Life Sciences

0

Flops/Core

Cores

Memory Capacity

Memory Bandwidth I/O Performance

Network Latency Network Bandwidth

Low Latency Trading Monte Carlo Risk Analytics

Financial Services

0

Flops/Core

Cores

Memory Capacity

Memory Bandwidth I/O Performance

Network Latency Network Bandwidth

Reservoir Seismic Weather

Energy and Environmental Sciences

(12)

Why LINPACK is Inadequate

Most HPC Analytics Involve Sparse Matrices

but LINPACK Solves Dense Matrix Problems

0

Flops/Core

Cores

Memory Capacity

Memory Bandwidth I/O Performance

Network Latency Network Bandwidth

LINPACK HPC Analytics

LINPACK vs. HPC Analytics

(13)

IBM’s Data Centric Approach and Solutions

Join the conversation at #OpenPOWERSummit 13

Traditional System DesignData Centric System

(14)

Major HPC Win for OpenPOWER

(15)

Key Benchmarks*: POWER8 2 - 2.5X Better

Join the conversation at #OpenPOWERSummit 15

SPECint_rate2006 (greater is better) 1.8 x Performance

0 10 20 30 40 50 60 70 80

Dell PowerEdge T620 2s/36c/72t Intel Xeon Haswell

POWER S824 2s/24c/192t IBM POWER8

Dell PowerEdge T620 2s/36c/72t Intel Xeon Haswell

POWER S824 2s/24c/192t IBM POWER8 0

10 20 30 40 50 60

SPECfp_rate2006 (greater is better) 2.1 x Performance

0.0 0.5 1.0 1.5 2.0 2.5 3.0

POWER8 Cisco

Relative System Performance

Terasort Big Data Hadoop (greater is better)

2.5x

0 50 100 150 200 250 300 350

Stream Triad (greater is better) 2.9 x Performance

GB/s

Intel Xeon Haswell 2s/24c/48t

IBM POWER8 2s/24c/192t

Performance / Core Performance / Core

(16)

Application Performance with POWER8* 1.4 – 2.6 X better

Molecular Dynamics - NAMD apoa1 (greater is better)

1.4 x Performance

0 0.5 1 1.5 2 2.5 3

Intel E5-2690 V3 2s/24c/2.6GHz Intel Xeon Haswell

POWER S824L 2s/24c/3.6GHz IBM POWER8

Intel E5-2690 V3 2s/24c/2.6GHz Intel Xeon Haswell

POWER S822L 2s/24c/3.358GHz

IBM POWER8 0

0.5 1 1.5 2 2.5 3

Seismic – RTM (greater is better) 2.6 x Performance

PostGreSQL (higher is better)

2.6x

5.00E+06 1.00E+07 1.50E+07 2.00E+07 2.50E+07 3.00E+07

STAC A2 – Options Pricing (greater is better) 2.07 x Performance

Max Paths (10 min)Nanoseconds / day

(17)

Key Takeaways

 HPC is becoming more data-centric

 Traditional system evaluations based on point

benchmarks such as LINPACK are inadequate

 Focus evaluation on cost-benefit analysis of workflow

across HPC data lifecycle

 Many system features impact HPC performance

 Benefits of OpenPOWER HPC Offerings:

• Deliver Choice and Flexibility

• Minimize Costly Data Motion for Entire Workflow

• Accelerate Compute and Data Intensive Tasks with Lower TCO

• Provide Investment Protection

Join the conversation at #OpenPOWERSummit 17

(18)

*Appendix - Additional Benchmark Detail

SPECcpu (int_rate & fp_rate)

SPECcpu2006 results are based on best published results on E5-2699 v3 from the top 5 Intel system vendors (HP, Oracle, Lenovo, Dell, Fujitsu) submitted as of 9/8/2014. For more information go to http://www.specbench.org/cpu2006/results/ . The IBM POWER8 published data is based on Power S824 2s/24c/3.5GHz POWER8. The x86 Xeon published data is based on Dell PowerEdge T620 2s/36c/2.3GHz E5-2699 v3.

Hadoop Tersort

IBM Analytics Stack: IBM Power System S822L; 8 nodes each with 24 cores / 192 threads, POWER8; 3.0GHz, 512 GB memory, RHEL 6.5, InfoSphere BigInsights 3.0

Cisco Stack: 16 high-density Cisco UCS C240 M3 Rack Servers each with 16 cores / 32 threads, Intel Xeon E5-2665; 2.4 GHz, 256 GB of memory, Cisco UCS VIC 1225, and LSI 9266 8i with 24 1-TB SATA 7200-rpm disk running Apache Hadoop open source distribution.

Stream Triad

The Stream Triad results are based on results reported in published papers.

IBM POWER8:

http://www.dcs.warwick.ac.uk/~sdh/pmbs14/PMBS14/Workshop_Schedule_files/2-PerformancePower8.pdf Intel Xeon E5-2600 v3

http://www.google.com/url?sa=t&rct=j&q=&esrc=s&source=web&cd=1&ved=0CB8QFjAA&url=http%3A%2F%2Fdownload.boston.co.uk%2Fd ownloads%2F9%2F3%2Fc%2F93c022fd-0d6d-46a4-9124-28c9e32f2533%2FIntel-

Whitepaper.pdf&ei=mLgBVbysL8KrggT774CICw&usg=AFQjCNFal5q5Vz2-

ly6ZbsaKZ2QPPad1fg&sig2=3LzktTXeKPvS2QW9ndXgfQ&bvm=bv.87920726,d.eXY

STAC and all STAC names are trademarks or registered trademarks of Securities Technology Analysis Center LLC.

https://stacresearch.com/system/files/asset/files/STAC-A2%20Intel%20Composer%20on%204%20x%20IVT%20EX%20-%20INTC140509.pdf

References

Related documents