• No results found

Computing for ereseach

N/A
N/A
Protected

Academic year: 2021

Share "Computing for ereseach"

Copied!
39
0
0

Loading.... (view fulltext now)

Full text

(1)

v

© 2006 IBM Corporation

Computing for eReseach

Jeffrey Dunn

(2)

What is Deep Computing?



As deeper understanding of physics and biology lifts the human spirit,



As better physical and biological models are devised,



As volumes of experimental data are collected,



As the Internet grows to encompass more people and institutions,



As pervasive devices connect to the network,



As more business is done online,

A wealth of data is becoming available in digital form.



Finding the value buried in that data will be an increasingly powerful tool

For business and for society

Extending the scale of simulations to more accurately model phenomenon



Deep computing combines several techniques to solve extremely

complex problems in this sea of digital data:

Advanced mathematics

Domain-specific knowledge

Specialized software

(3)

3 © 2006 IBM Corporation

Why Does IBM Do Deep Computing?



Good for our customers:

Biological and Life Sciences

Pharmaceutical Discovery

Materials and Manufacturing

Automotive and aerospace design

Environment and Energy

Finance

Inventory optimization

Value at risk

Portfolio and trading

optimization

Security

Homeland security

Military simulations



Good for the world:

Stop epidemics

Invent new drugs

Understand nature



Good for IBM:

Good business

Drives innovation

Better design of our chips,

servers, software

(4)

Systems Challenges



Performance:

How do we provide sustained

growth in application level performance in the

face of a technology discontinuity?



Productivity:

How do we, at the same time,

make this massive compute power more

consumable and reduce time-to-insight?

(5)

5 © 2006 IBM Corporation

Deep Computing Innovation

Addressing Challenges Beyond Computation



System Design

Scalability

Packaging & density

Network infrastructure

Power consumption & cooling



Software

System management

Security

Software integration

Programming models &

productivity



Data

Data management

Archival & compliance

Performance & reliability

Simulation & modeling

Data warehousing & mining

Capacity management &

virtualization



Economics

Hybrid financial & delivery

models

(6)

Source: www.top500.org

Semiannual independent

ranking of top 500

supercomputers in the world

“IBM remains the dominant vendor of supercomputers

with almost half of the list carrying its label. “

IBM Supercomputing Leadership

TOP500 Nov 2006

44

4

156 7 7 1 9 2 9 1 10 1 3 1 19 1

239

20 15 18 4 39 0% 20% 40% 60% 80% 100%

TOP500 TOP100 TOP10

Others NEC Dell Sun Cray SGI HP IBM

IBM is clear leader ...



#1 System – DOE/LLNL - BlueGene/L (280.6 TF)



Most entries on TOP500 list with 239 (47.8%)



Most installed aggregate throughput with over

1,763 Teraflops (49.8%)



Most in TOP10 with 4 systems (40%)



Most in TOP20 with 6 systems (30%)



Most in TOP100 systems with 44 (44%)



Most Commodity Clusters with 171 of 359 (47.6%)

(7)

7 © 2006 IBM Corporation

Deep Computing Portfolio



Systems

Servers, blades, clusters, workstations

POWER, Intel, AMD Opteron

Linux, Unix, Windows

High performance interconnects

Special-purpose systems and accelerators

Blue Gene, Cell

Gov’t & research partnerships (e.g., ASC,

Roadrunner)

Storage systems and virtualization



Software & Tools

System & data management (e.g., CSM, GPFS)

Compilers, schedulers, libraries, tools

Grid & on demand middleware



Solutions & Services

Deep Computing Capacity on Demand

Deep Computing Visualization

Industry-specific solutions

Engineering & technology services

IBM Global Financing

High Performance Switch

P5 575

BladeCenter

IntelliStation

Clusters (1350)

Blue Gene

(8)
(9)

9 © 2006 IBM Corporation

ASC Purple: World’s #1 General Purpose Supercomputer



Compute cluster of

8-way p5 575 SMPs

>12000 processors

93.4 Teraflop peak



>6 TByte/sec bisection BW

3-stage dual-rail fat-tree



2 PBytes global storage

Can be single file system

100 GB/s to single file



Power:

4.8 MegaWatts to run

3.0 MegaWatts to cool



Area:

131 compute racks

269 racks (w/ storage)

6720 ft

2

http://www.llnl.gov/asc/platforms/purple/



Culmination of 10-year multi-step program initiated by

DOE/NNSA



Goals

Safeguard the U.S nuclear stockpile

Adhere to the Comprehensive Nuclear Test Ban Treaty

Eliminate the need for nuclear testing via a 100x improvement

on nuclear simulations



Along the way: a significant impact on science and

business

Automotive safety

Seismic processing

Weather forecasting

(10)

IBM System Cluster 1350



IBM Leadership in high end clusters: Top 500



Driving genetic research



Helping to find cures for liver disease, breast cancer



Enhancing manufacturing processes

(11)

11 © 2006 IBM Corporation

Cluster 1350 Delivers Choice of Nodes

System x3550

1U, 2 socket

dual core Xeon™

System x3755

System x3650

14 Blades Per Chassis

HS21 – Intel Processor-based

Blade

JS21– PowerPC

Processor-based Blade

LS21/LS41– AMD

Processor-based Blades

QS20 – Cell processor blade

BladeCenter with HS21, JS21

and LS21 or LS41

OpenPOWER 710

1- or 2-way high

performance 2U node

OpenPOWER 720

Up to 4-way high performance 4U node

IBM System p5 505

IBM System p5 550/550Q

64-bit POWER5+ processor, 4U

node

High density 64-bit POWER5

processor, 1U node

System x3655

System x3455

2U, 2 socket

dual core Xeon™

4U, 4 socket dual core Opteron® 2U, 2 socket dual core Opteron® 1U, 2 socket dual core

(12)

IBM Blue Gene Solution

Chip

(2 procs.)

5.6 GF/s

4 MB

Compute Card

(2 chips, 1x2x1)

11.2 GF/s

1.0 GB DDR

Node Card

(32 chips, 4x4x2)

16 Compute Cards

(up to 2 IO cards)

180 GF/s

16 GB DDR

Rack

32 Node Cards, 8x8x16

in 2 midplanes

5.6 TF/s

1 TB DDR

System

(64 racks, 64x32x32)

360 TF/s

32 TB DDR

Blue Gene at Lawrence Livermore

National Laboratory

(13)

13 © 2006 IBM Corporation

Cell BE: Revolutionizing the digital experience



Five-year collaboration with Sony, Toshiba



Gaming provides a roadmap for visual, interactive, and

collaborative methods



200 Gigaflop peak (@ 3.2 GHz, single precision)



221 mm

2

in 90nm SOI technology



Significant cost performance advantages compared to Intel/AMD

for key applications including digital media, entertainment,

communication, graphics, and HPC



Application areas may include:

Digital Media

Rich Media Mining

Financial Services/Analysis

Seismic

Life Sciences, Drug Simulation

Computational mathematics

Virtual worlds

Real time computing

SSL encoding

SSL decoding

HTTP

Processing

Ray casting

3-D rendering

Softbody physics

Compositional models

Collision detection

On-line Game

Background filtering

Motion tracking

Video Surveillance

Key algorithms

Applications

(14)

Multi-Core Acceleration Capability: Cell Broadband Engine™

Cell Broadband Engine



“Supercomputer & Network on a Chip”



1 PPE + 8 Synergistic Processing Elements

(SPE) cores



Element Interface Bus (EIB) @ 300+GB/s



1 Cell BE Blade = 2 Cell BE Chips = 16 SPEs

x86 Linux

Master Cluster

Cell BE

Accelerator

Cluster

(Blade)

Revolutionary Hybrid Supercomputer at Los Alamos National Laboratory

Will Harness Cell BE Chips and AMD Opteron™ Processor Technology

Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc. Development collaboration between Sony, Toshiba and IBM.

(15)

15 © 2006 IBM Corporation

Scalable Lower Latency I/O: General Parallel File System (GPFS)



High Performance

Has provided 15GB/s to a single node and 100GB/s against a single file



Scalability

Currently supports 100s of nodes and 200+TB of storage (more by special bid)



Reliability

Parallel operation means no single point of failure

One large research customer reported 100% uptime for GPFS for an entire year



Shared Disk

All system data & metadata is equally accessible from all nodes

Any node can read from or write to any of the disks



Parallel

All data & metadata flows between the disks and nodes in parallel



Cluster

100s of servers can share data over a LAN or other high-speed medium

The entire cluster can be administered from a single node

(16)

Deep Computing Capacity on Demand

In-house HPC

Infrastructure

Virtual Private Network

Scale up/down by tapping into IBM hosted HPC capacity in a security-rich

environment to help satisfy peak or long term demand and avoid data center upgrade.

Variable Capacity

Variable Cost

Fixed Capacity

Fixed Cost

IBM POWER

London, England

Poughkeepsie, NY

Rochester, MN

> 3,200 Servers

> 14,700 CPUs

> 24TB Storage

(17)

17 © 2006 IBM Corporation

Project Checkmate

Project Checkmate seeks to model mutations and

interactions between avian flu and human influenza to

determine strains that could cause an epidemic:

• Before the epidemic occurs.

• In time to produce vaccine to avoid the epidemic.

Project Checkmate is a collaboration with the

Scripps Institute.

Identify anticipated

antigen and antibody

using

(18)

Blue Brain

The first objective of the Blue

Brain is to create a cellular

level, software replica of the

Neocortical Column for

real-time simulations.

Blue Brain will search for:

• New insights into how human beings think and

remember.

• How specific defects in our circuitry may contribute to

autism, schizophrenia and Parkinson's.

With Blue Brain, research that used to require several

years of lab work can be done in a matter of days or

minutes – using

Blue Gene

.

(19)

19 © 2006 IBM Corporation

ASTRON

LOFAR

(LOw Frequency ARray) digitizes

10-240 MHz signals from an array of

simple omni-directional antennas and

processes the data on a central

computer system to emulate a

conventional dish antenna

STELLA

(Supercomputing TEchnology for

Linked Lofar Applications) uses 6144

dual-CPU compute nodes of eServer

Blue Gene® providing 27.45 Tflops/sec



Blue Gene enables LOFAR to provide higher resolution and sensitivity

than any other low-frequency radio telescope.



Digital techniques provide extreme agility in frequency and pointing.



Multi-beaming capability allows simultaneous, full-sensitivity

observations in widely separated directions.



High-bandwidth, fiber-optic network handles terabits/second.

(20)
(21)

21 © 2006 IBM Corporation

Collaborative Research on Hybrid Clusters

Nanyang Polytechnic Launches the First Cell Broadband Engine™

Technology Innovation Centre in ASEAN Region

Acquisition of Cell technology to kick start development of Singapore’s Interactive

and Digital Media industry to nurture homegrown talent

(22)

University Challenge

SANTA CRUZ, Calif. — Aiming at finding innovative applications that will take the IBM Cell Broadband Engine (BE) processor beyond video games, IBM is sponsoring a Cell BE contest for university students from 25 different countries. It includes a Cell BE trivia quiz and a software design competition.

IBM Launches Cell/B.E. University Challenge '07

Cash prizes ranging from $10,000 for the first-place winner to $2,500 for the fourth-place winner.

7

февраля

, 2007

От вдохновения

к инновациям с

денежными призами для победителей

КорпорацияIBMобъявила о проведении первого в своем роде конкурса лучших приложений, созданных в инструментальной средеCell Broadband Engine (Cell BE.), ориентированного на студентов из25 стран, включая Россию, и предусматривающего денежные призы и награды.

(23)

23 © 2006 IBM Corporation

(24)



Petaflop systems with ultra scalability



Hybrid architectures



System level accelerators



Mature software stack



Rich application infrastructure

Deep Computing Conceptual Roadmap

Blue Gene/L

Blue Gene/P

Blue Gene/Q

POWER5

POWER6

POWER7

Linux Clusters / AMD/Intel Processors

(25)

25 © 2006 IBM Corporation

Ultrascalability



Ultrascalability is a reality.

And better than people – including us – thought.



Performance



Cost / Performance



Power efficiency



Ease of use



Ease of porting

(26)
(27)

27 © 2006 IBM Corporation

The premium on SYSTEM design continues



Power



Reliability / Availability



Interconnect



Programmability

Debuggability



Cost:

Efficiency vs cost

Not Efficiency vs Peak

(28)

The Tyranny of Large numbers



Each extra Dollar per GigaFlop/s costs $1,000,000



Each extra Watt per GigaFlop/s costs 1 Megawatts



$1M annual electricity costs



An additional cubic inch per GigaFlop/s costs 580

cubic feet

82 sq ft floor space = ~45TFlops of BlueGene/L



Every penny/Megabyte in DRAM memory costs will

cost $5,000,000-$10,000,000 assuming a balanced

system

(29)

29 © 2006 IBM Corporation

The Tyranny of large numbers – itsy bitsy performance

losses



As you increase the number

of processing elements from

N to N+1, performance should

increase by (N+1)/N



Suppose at each step the

best you could achieve was

.99999 * (N+1)/N

performance improvement



Then you’d have to stop

designing after you had

added the 100,000

Processing elements

0 50 100 150 200 250 300

100,000 Processing elements

(30)

Tyranny of Large numbers … continued



Reliability

If Hardware fails on any node roughly once every 5 years

a node failure would be expected on a 10,000-20,000 node system every 2-5 hours

a node failure would be expected on a 100,000 node system every 27 minutes

If software (or anything else) fails on any node (independently) once in a

month

a node failure would be expected on a 10,000-20,000 node system every 2-5 minutes

a node failure would be expected on a 100,000 node system every 27 seconds



Real solutions are likely to require anywhere from 100,000 to

1,000,000 compute nodes

(31)

31 © 2006 IBM Corporation

“Thick” gate oxide

Semiconductor technology issues: We’re down to atoms



Consider the gate oxide in a CMOS transistor (the smallest

dimensions today)

Assume only 1 atom high “defects” on each surrounding silicon

layer.

For a modern “scaled” oxide, 6 atoms thick, 33% variability is

induced.

Result:

Single atom defects can cause local current leakage 10-100x

higher than average

present

future

1.2 nm oxynitride

Scaled gate oxide

Field effect transistor

Source

Drain

Gate

(32)

Power is THE problem

Single thread focus has resulted in power inefficient design

Steam Iron

5W/cm2

(33)

33 © 2006 IBM Corporation

Accelerators and Special Purpose HW

Special

Purpose

HW

General

Purpose

Systems

Programmability



Generality



Flexibility



MPEG chips

Crypto chips

….

DSPs

FPGAs

Game Processors

UNIX / Linux

SMP Clusters

Commodity

Linux

Clusters

Blue Gene

Accelerator boards

(e.g. Clearspeed)

Cell Blade

clusters

Hybrid

systems

Special Purpose HW advantages

Cost / performance

Performance / Watt

Performance / sq mm

(34)

Commercial Supercomputing



Powerful systems to tackle the most demanding commercial workloads

Self-sustaining from a business perspective



Differs from scientific computing

More complex software stack (hypervisor, OS, Java RTE, middleware, apps)

Different system balances with respect to memory (+), processor (-), I/O (+) and

inter-node communication (-)



Areas of supercomputing with commercial workloads

Discovery (search + analytics)

Content management with discovery

Digital media, media analytics

On-line auctions and sales

Financial applications, including portfolio and trading optimization

Health care, including medical imaging and data mining of health data

Large sensor applications

(35)

35 © 2006 IBM Corporation

In the end, it’s not about the technology. It’s

what you do with it that counts. IBM will:



Continue to innovate across the whole systems stack to

deliver leadership performance and usability.



Help solve problems that are currently intractable or not

cost-effective.

(36)
(37)

37 © 2006 IBM Corporation This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in

other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM offerings available in your area.

Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions on the capabilities of non-IBM products should be addressed to the suppliers of those products.

IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give you any license to these patents. Send license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY 10504-1785 USA.

All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.

The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or guarantees either expressed or implied.

All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations and conditions.

IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal without notice.

IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies.

All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary. IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply.

Many of the features described in this document are operating system dependent and may not be available on Linux. For more information, please check: http://www.ibm.com/systems/p/software/whitepapers/linux_overview.html

Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this document may have been made on development-level systems. There is no guarantee these measurements will be the same on generally-available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document

should verify the applicable data for their specific environment. Revised January 19, 2006

(38)

The following terms are registered trademarks of International Business Machines Corporation in the United States and/or other

countries: AIX, AIX/L, AIX/L(logo), alphaWorks, AS/400, BladeCenter, Blue Gene, Blue Lightning, C Set++, CICS, CICS/6000,

ClusterProven, CT/2, DataHub, DataJoiner, DB2, DEEP BLUE, developerWorks, DirectTalk, Domino, DYNIX, DYNIX/ptx, e

business(logo), e(logo)business, e(logo)server, Enterprise Storage Server, ESCON, FlashCopy, GDDM, i5/OS, IBM, IBM(logo),

ibm.com, IBM Business Partner (logo), Informix, IntelliStation, IQ-Link, LANStreamer, LoadLeveler, Lotus, Lotus Notes,

Lotusphere, Magstar, MediaStreamer, Micro Channel, MQSeries, Net.Data, Netfinity, NetView, Network Station, Notes, NUMA-Q,

Operating System/2, Operating System/400, OS/2, OS/390, OS/400, Parallel Sysplex, PartnerLink, PartnerWorld, Passport

Advantage, POWERparallel, Power PC 603, Power PC 604, PowerPC, PowerPC(logo), Predictive Failure Analysis, pSeries,

PTX, ptx/ADMIN, RETAIN, RISC System/6000, RS/6000, RT Personal Computer, S/390, Scalable POWERparallel Systems,

SecureWay, Sequent, ServerProven, SpaceBall, System/390, The Engines of e-business, THINK, Tivoli, Tivoli(logo), Tivoli

Management Environment, Tivoli Ready(logo), TME, TotalStorage, TURBOWAYS, VisualAge, WebSphere, xSeries, z/OS,

zSeries.

The following terms are trademarks of International Business Machines Corporation in the United States and/or other countries:

Advanced Micro-Partitioning, AIX 5L, AIX PVMe, AS/400e, Chipkill, Chiphopper, Cloudscape, DB2 OLAP Server, DB2 Universal

Database, DFDSM, DFSORT, DS4000, DS6000, DS8000, e-business(logo), e-business on demand, eServer, Express

Middleware, Express Portfolio, Express Servers, Express Servers and Storage, GigaProcessor, HACMP, HACMP/6000, IBM

TotalStorage Proven, IBMLink, IMS, Intelligent Miner, iSeries, Micro-Partitioning, NUMACenter, On Demand Business logo,

OpenPower, POWER, Power Architecture, Power Everywhere, Power Family, Power PC, PowerPC Architecture, PowerPC 603,

PowerPC 603e, PowerPC 604, PowerPC 750, POWER2, POWER2 Architecture, POWER3, POWER4, POWER4+, POWER5,

POWER5+, POWER6, POWER6+, Redbooks, Sequent (logo), SequentLINK, Server Advantage, ServeRAID, Service Director,

SmoothStart, SP, System i, System i5, System p, System p5, System Storage, System z, System z9, S/390 Parallel Enterprise

Server, Tivoli Enterprise, TME 10, TotalStorage Proven, Ultramedia, VideoCharger, Virtualization Engine, Visualization Data

Explorer, X-Architecture, z/Architecture, z/9.

A full list of U.S. trademarks owned by IBM may be found at: http://www.ibm.com/legal/copytrade.shtml.

Revised April 27, 2006

(39)

39 © 2006 IBM Corporation

Cell Broadband Engine is a trademark of Sony Computer Entertainment, Inc.

Rambus is a registered trademark of Rambus, Inc.

XDR is a trademark of Rambus, Inc.

UNIX is a registered trademark in the United States, other countries or both.

Linux is a trademark of Linus Torvalds in the United States, other countries or both.

Microsoft, Windows, Windows NT and the Windows logo are trademarks of Microsoft Corporation in the United

States, other countries or both.

Intel, Intel Xeon, Itanium and Pentium are trademarks or registered trademarks of Intel Corporation in the

United States and/or other countries.

AMD Opteron is a trademark of Advanced Micro Devices, Inc.

Java and all Java-based trademarks and logos are trademarks of Sun Microsystems, Inc. in the United States

and/or other countries.

TPC-C and TPC-H are trademarks of the Transaction Performance Processing Council (TPPC).

SPECint, SPECfp, SPECjbb, SPECweb, SPECjAppServer, SPEC OMP, SPECviewperf, SPECapc, SPEChpc,

SPECjvm, SPECmail, SPECimap and SPECsfs are trademarks of the Standard Performance Evaluation Corp

(SPEC).

NetBench is a registered trademark of Ziff Davis Media in the United States, other countries or both.

AltiVec is a trademark of Freescale Semiconductor, Inc.

Other company, product and service names may be trademarks or service marks of others.

Revised April 27, 2006

References

Related documents

Za razliku od odraslih, u djece su prijelomi proksimalnog dijela femura vrlo rijetki te čine manje od jedan posto svih prijeloma u dječjoj dobi.. (19) No, unatoč tome što su rijetki,

IBM, the IBM logo, ibm.com, ILOG and WebSphere are registered trademarks of International Business Machines Corporation in the United States, other countries, or both.. If these

A logical question often asked is “What type of forage do I feed my endurance horse?” To actually determine which hay to buy, one should consider the fiber, digestible energy

(foregut fermenters benefit more from 2 and 3... some hindgut use coprophagy to run food through a second time to better digest and absorb the work of those microbes, YUCK)..

Versione 2 cilindri e maniglia per Porte dʼIngresso, Porte con serratura 2 cylinders and handle version on Entrance Doors. Barre supplementari a 2 punti di chiusura, azionate

The following terms are trademarks of the International Business Machines Corporation in the United States, other countries, or

SurePOS and IBMLink are trademarks of International Business Machines Corporation in the United States or other countries or both. PS/2 is a registered trademark of

The MHM practices, beliefs and behaviours that exist in the study areas were identified. These together with other findings from the study not shared in this paper provide