• No results found

Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing

N/A
N/A
Protected

Academic year: 2021

Share "Scalability in the Cloud HPC Convergence with Big Data in Design, Engineering, Manufacturing"

Copied!
29
0
0

Loading.... (view fulltext now)

Full text

(1)

Scalability in the Cloud

HPC Convergence with Big Data in Design, Engineering, Manufacturing

July 7, 2014

David Pellerin, Business Development Principal Amazon Web Services

(2)

Want to run larger HPC workloads, more often

• Need to scale up and scale out for faster results

• Want to run more jobs for higher quality of results

Need to globally collaborate on critical data

• To enhance the productivity of distributed teams

• To operate globally, with greater data governance

Need to manage large, diverse datasets

• Ingest and analyze, query, and take action

• Including connected devices, Internet-of-Things

What Do We Hear From Customers?

(3)

A “Cloud First” Strategy at HGST

“HGST is using the cloud for a higher performance, lower cost, faster deployed solution vs buying a huge on-site cluster.”

- Steve Philpott, CIO

HGST HPC and big data roadmap:

 Molecular dynamics simulations

 Collaboration tools for engineering

 Big data for manufacturing yield analysis

 CAD, CFD, EDA – “cloud first”

Every application presents unique challenges…

some technical, some business

(4)

Computer-Aided Design, Simulation, Analysis, Visualization

Across industries, the trend is Simulation-Driven Design and Discovery

Examples in Design and Manufacturing

Computer-Aided Design

Electronic Design Automation

Computational Fluid Dynamics

Molecular Modeling

Seismic Modeling and Reservoir Simulations

Cloud enables scalable simulations for global manufacturers

Cloud also enables HPC convergence with big data

Cloud Scalability for Simulations

(5)

Courtesy of Cypress Semiconductor

Simulations for Electronic Product Design

No job queue… Using cloud, simulation sweeps are possible in hours instead of weeks

(6)

Job Queues Are Evil!

Conflicting goals

• HPC users seek fastest possible time-to-results

• IT support team seeks highest possible utilization

Result:

• The job queue becomes the capacity buffer

• Users are frustrated and run fewer simulations, or they try to game the system

• Money is being saved, but for the wrong reasons!

?

(7)

Instead, run multiple clusters

at the same time, on-demand

(8)

Match the Architectures to the Jobs

(9)

In a secure Virtual Private Cloud

Automation allows easier management of

clusters, and monitoring of jobs and costs

(10)

Many HPC Deployment Methods

• Traditional HPC schedulers and cluster managers

• MOAB/Torque, Bright

• LSF/OpenLava, Grid Engine, SLURM, Condor

• “Born in the cloud” tools

• MIT StarCluster

• Cycle Computing CycleServer

• AWS-provided tools and APIs

• Cloudformation, Auto Scaling

• cfncluster (github.com/awslabs/cfncluster)

(11)

On-Demand

Pay for compute capacity by the hour with no long-term commitments

For spiky workloads, or to define needs

Multiple Consumption Models

Reserved

Make a low, one-time payment and receive a significant discount on the hourly charge

For committed utilization

Spot

Bid for unused capacity, charged at a Spot Price which fluctuates based on supply and demand

For time-insensitive or transient workloads

Dedicated

Launch instances within Amazon VPC that run on hardware dedicated to a single customer

For highly sensitive or compliance related workloads

(12)

Spot Instances for Scale-Out Computing

18 hours

205,000 materials analyzed 156,314 AWS Spot cores at peak

2.3M core-hours Total spending: $33K

(Under 1.5 cents per core-hour)

Mark E. Thompson

University of Southern California

(13)

The Democratizing of HPC

• Deployed in one AWS region (we have 10)

• In one availability zone in that region

• Using one placement group in that AZ

• Consisting entirely of one new instance type (c3.8xlarge)

• Brought up, benchmarked, and torn down in hours

(14)

AWS

Globa

l

Infras

truct

ure

API

AWS Global Infrastructure

AWS Global Infrastructure

Customer Applications – Built on Higher Level Services

AWS Global Infrastructure

AWS Global Infrastructure

Foundation Services Application Services

Development & Deployment

Applications

Libraries, SDK’s

Networking

VPC Direct

Connect ELB Route53 Databases

RDS Dynamo ElastiCache RedShift

Content Delivery

CloudFront

SES SNS SQS Elastic

Transcoder CloudSearch SWF

IAM Federation Identity & Access

Web Console Interaction Human Interaction

Support

AWS Global Infrastructure

Regions Availability Zones Edge Locations

Analytics

DataPipeline

EMR Kinesis

EC2 Compute

WorkSpaces

AppStream

Storage

S3 EBS Glacier Storage

Gateway Monitoring

CloudWatch

Deployment & Management

BeanStalk Cloud

Formation

OpsWork CloudTrail

Command Line

(15)

Cloud

Collaboration

is

Secure

Collaboration

Agility

(16)

Collaboration is More Secure in the Cloud

Bring the users to the data, don’t send the data to the users

(17)

Collaboration is More Secure in the Cloud

Bring the users to the data, don’t send the data to the users

(18)
(19)

Collaboration with Secure Remote Access

Image courtesy of Calgary Scientific

Data and computation hosted in a secure, customer-managed virtual private cloud, with controlled access via a wide variety of client devices.

Virtual Private Cloud

(20)

NVIDIA GRID K520 in AWS Cloud

Product Name GRID K520

GPUs 2 x GK104 GPUs

CUDA cores 3,072 (1,536 per GPU)

Core Clocks 800 MHz

Memory Size 8GB GDDR5 (4GB per GPU)

HW Video Encoder 2x h.264 (1 per GPU)

Power Consumption 225W

Supported APIs

OpenGL 4.3, DirectX 9, 10, 11, CUDA 5.5, OpenCL 1.1, NVFBC, NVIFR,

NVENC

(21)

Application Streaming Middleware

(22)

• Application Streaming

• Remote

visualization

• Thin client 3D applications

Amazon AppStream

(23)

Cloud Plus Big Data

Equals Awesome

Agility

(24)

Integration of Big Data and HPC via Cloud

Cloud provides a global, distributed, secure, and scalable environment for collaborative design and manufacturing

(25)

AWS Has Always Been About Big Data

(26)

Big Data is Everywhere!

(27)

Big Data in Manufacturing

Managing big data for competitive advantage

For design, engineering, production environments

Quality and Yield Analysis, Statistical Process Control

Structured and unstructured queries

Processing Input

Yield analysis

Manufacturing facility monitoring In-field device monitoring

Logging

Log4J Appender

push to

Kinesis

Elastic MapReduce

Hive

Pig Cascading

MapReduce

pull from

(28)

Sending & Reading Data from Kinesis Streams

HTTP Post

AWS SDK

LOG4J

Flume

Fluentd

Get* APIs

Kinesis Client Library +

Connector Library

Apache Storm

Amazon Elastic MapReduce

Sending Reading

(29)

Cloud for Scalability

Cloud for Global Collaboration

Cloud for Big Data

Summary

References

Related documents

DICAL HOUSE gifts and wine hampers are always well received, and there is a hamper for every taste so step inside the flagship Store located on the outskirts of Mosta, or if more

scope of this essay, but below I explore the reasons why it is plausible to believe that the gap is large enough for private governance initiatives to

Classical  

Evaluation of time-intensity curves in ductal carcinoma in situ (DCIS) and mastopathy obtained using dynamic contrast-enhanced magnetic resonance imaging, Magn Reson Imaging

Although the impetus for biosimilars development has been largely a reduction in the cost of biologics, there are other benefits emerging from the main argument of cost saving,

Finally, from a longer-term perspective, differences in short- and long-term elasticities allow circumventing a trade-off between tax revenue growth and stability of tax revenues:

Population and economic growth within the Durban Metropolitan region in eastern South Africa has increased the demand for water supply. This ever-increasing demand means that

2) To give private sectors with production incentives of fertilizers for them to increase local production at affordable prices 3) To introduce quality