• No results found

Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research

N/A
N/A
Protected

Academic year: 2021

Share "Cloud Computing for Research Roger Barga Cloud Computing Futures, Microsoft Research"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

Cloud Computing for Research

Roger Barga

Cloud Computing Futures, Microsoft Research

(2)

Trends: Data on an Exponential Scale

• Scientific data doubles every year

– Combination of inexpensive sensors + exponentially faster computing – Cost of acquiring data has dropped close to zero

• Changes the nature of research computing

• Cuts across disciplines (eScience)

• Becoming increasingly harder to extract knowledge

• Got data, now what?

• And it is really is about data

, not the FLOPS (going faster)…

– Very extended distribution: data sets on all scales!

– When data collection does grow large, not able to analyze.

– Tools are limited, must dedicate resources to build analysis tools (and this doesn’t help complete the actual research).

(3)

Trend: Lack of Broad Access

1M

14M 14M 1M

Scientists & Engineers

55M Little to no access to high performance data-intensive Computing capacity

70M

53% of technical organizations are forced to scale down their advanced problems ‘to fit’ within their technology limitations, and 57% of companies have problems they can’t solve with existing computers. US Council on Competitiveness, 2010.

(4)

Widespread On Demand Access

1M 14M High Performance Data-intensive Capacity 80% 20% 14M 1M

Scientists & Engineers

55M Little to no access to high performance data-intensive Computing capacity

Cloud Computing for

Compute and Data Intensive Research

(5)

Bridging the Gap with the Cloud

With cloud computing, researchers can:

• Marshal needed storage and compute resources on demand without knowing or caring how it happens

• Access curated reference datasets, in the cloud.

• Run key algorithms as Software as a Service without having to know coding details or how to install software • Collaborate and share data and algorithms.

• Use familiar client tools, leverage the cloud as a resource and intellectual amplifier.

• Offer new (higher value) data services for research – Scalable analytics to explore data sets

– Interactive data visualizations – Result provenance

(6)

Queue

Suggested Application Model

Using queues for reliable messaging

Web Role ASP.NET, WCF, etc. Worker Role main() { … } 1) Receive work 2) Put work in

queue 3) Get work from queue

4) Do work

(7)

Storage Service in Windows Azure

Azure applications can use Azure storage services or SQL Azure

Blobs: large,

unstructured data (audio, video, etc)

Tables: simply structured data, accessed using

ADO.NET Data Services Queues: serially accessed

messages or requests, allowing web-roles and worker-roles to interact

SQL Azure

(8)

Modern Data Center Network

Internet CR CR AR AR … AR AR S S LB LB Data Center Layer 3 Internet S S … S S … … Layer 2 Key: • CR (L3 Border Router) • AR (L3 Access Router) • S (L2 Switch) • LB (Load Balancer) • A (20 Server Rack/TOR)

Source: Albert Greenberg and Cisco

GigE 10 GigE

(9)

Worker Role TCP-Endpoints

0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% Cumul at iv e P er cen tag e MB/sec Bandwidth 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 0 20 40 60 80 100 Cumul at iv e P er cen tag e RTT ms Latency

(10)

http://azurescope.cloudapp.net/

> Azure service, Silverlight frontend, data archived in SQL Azure

> Microbenchmarks, algorithm benchmarks, best practices

> Includes utility to measure local to cloud throughput (

download

)

> Run weekly on datacenters across the world

> New version(s) in development pipeline

(11)

• Most important app in bioinformatics, exponential data growth

– Research groups are unable to secure sufficient compute resources

• External research group, blasted ~5K proteins (700K sequences)

• Normal BLAST job, between 700 – ~1000 CPU hours, couldn’t run on NCBI

• 3 day run was reduced to 30 min, a 30 minutes reduced to 30 sec

• Total cost $100, publishable result after one hour of cloud computing

“Reduced a weeks work of research down to one afternoon of computing”

• All-against-all non-redundant protein database (10m sequences)

• Theoretically 100 billion sequence comparisons, 6 years on 8 core node

• One of the largest BLAST jobs completed to date;

• 1.85 billion results

AzureBLAST

Bioinformatics in the Cloud

(12)

The $24,000 Slide Deck

4000 cores (500 XL VMs), running for over 7 days

35 Nodes experience blob writing failure at same time

(13)

Application Benchmarks Inform Design

Task size vs. Performance

• Benefit of the warm cache effect

• 100 sequences per partition is best choice

Instance size vs. Performance

• Super-linear speedup with larger size worker

instances

• Primarily due to the memory capability.

Task Size/Instance Size vs. Cost

• Extra-large instance generated the best and

the most economical throughput

(14)

AzureMODIS –

Computing Evapotranspiration (ET) in the Cloud

Project with UC Berkeley, Univ Virginia, Indiana University, and LBNL

Two MODIS satellites

• Terra, launched 12/1999 • Aqua, launched 05/2002 • Near polar orbits

• Global coverage two days • Sensitive in 36 spectral bands

(15)

AzureMODIS – Four stage image processing pipeline

http://research.microsoft.com/en-us/projects/azure/azuremodis.aspx

Pipeline for download, processing, and

reduction of satellite imagery.

• ~35 different data products;

• Atmospheric and land products are in different projections;

• Re-projection, spatial temporal resolve.

• Integrate data from different swaths, days,… • 5 TB data processed (600,000 files)

• 35,000 hours reprojection stage

• 12,000 hours derivation reduction stage • 3,000 hours analysis state

• Total cost for computing one US year ET computation: ~ $2000.

(16)

Supporting Smart Sensors and Data Fusion

in collaboration with Ed Lazowska, Bill Howe, Keith Grochow & Mark Stoermer, U of Washington

NSF Ocean Observing Initiative

– Hundreds of cabled sensors and robots exploring the sea floor

– Data to be collected, curated, mined

(17)

Conclusions

• Clouds are the

largest scale computer centers ever constructed

and have

the potential to be important to both large and small scale research.

• Suitable for “loosely coupled” data parallel applications

, but tightly

coupled low-latency applications perform poorly on clouds

today

.

– Landscape fast changing, research computing market is significant – Provide valuable fault tolerance and scalability abstractions

• Use patterns enormously benefit from

databases

and

data services

– Rapidly extract small subsets of large data sets – Compute aggregates, perform compression, etc. – Fast sequential read performance is criticial.

• Software is becoming a new kind instrument

– Value added federated data sets – Simulations

(18)

Questions?

[email protected]

http://research.microsoft.com/en-us/people/barga/

Microsoft Research eXtreme Computing: research.microsoft.com/en-us/labs/xcg/default.aspx

Microsoft Research Cloud Research Team: www.research.microsoft.com/cloud

The Fourth Paradigm: www.research.microsoft.com/fourthparadigm Microsoft Cloud Services: www.microsoft.com/cloud

References

Related documents

A) A computer system running Microsoft Windows (XP, Vista, or Windows 7) with a separate test volume. B) A suitable tool or utility such as FragmentFreeSpace.exe (see attached

Abstract: This collection comprises records created and collected during a Historic American Buildings Survey (HABS) project completed prior to the demolition of the Irvine

The present research investigates these standards and is framed by critical reflections on CBT literature and existing sustainable tourism standards (STS) practices,

To the authors’ knowledge, this is the first published research where the detailed analysis of the NVH problem using newest findings in the squeal prevention such as the complete

In the event that Interoute fails to achieve the SLO, Customer will be entitled to a Service Credit equal to 10% the Dedicated Equipment Charge (as specified on the Purchase

The 2008 financial collapse catalyzed sweeping changes in the legal profession that resulted in dwindling work for law firms and client demands for deep price discounts. But most

Customer Involvement. It was difficult at the start of the course to convince both the customer and students of both projects that high customer involvement was needed. This

The Automotive Industry Global Purchasing and Supply Chain Management Seminar is a joint offering by Wayne State University’s School of Business Administration and the