• No results found

SC12 Cloud Compu,ng for Science Tutorial: Cloud Challenges

N/A
N/A
Protected

Academic year: 2021

Share "SC12 Cloud Compu,ng for Science Tutorial: Cloud Challenges"

Copied!
18
0
0

Loading.... (view fulltext now)

Full text

(1)

NIMBUS www.nimbusproject.org  

SC12  Cloud  Compu,ng  for  Science  Tutorial:    

Cloud  Challenges  

Kate Keahey, John Breshnahan, Patrick Armstrong, Pierre Riteau Argonne National Laboratory

Computation Institute, University of Chicago

1  

(2)

Cloud  versus  Cloud  

Custom  user  environments!   On-­‐demand  access!  

Elas5c  compu5ng!  

Growth  and  cost  management!  

Capital  expense  -­‐>  opera5onal  expense!  

Do  I  need  to  become  a  sys  admin?  

Elas5city  is  fine,  but  how  do  I  use  it  in  prac5ce  over     many  incompa5ble  clouds?    

How  long  does  it  take  to  deploy  a  virtual  cluster  of  a     few  thousand  nodes  ?      

(3)

NIMBUS www.nimbusproject.org  

Appliance  Management  

•  Users require control

•  The emergence of

community appliance

management and

maintenance

•  More and better tools

•  Users are not skilled in

appliance management

•  Roles take time to

emerge

•  The configuration/

deployment trade-offs

are not obvious

11/13/12   3  

I  want  to  manage  

(4)

Deployment  Performance  

•  Scientific images

have lots of stuff on

them

•  We typically deploy

lots of them at a time

•  LANTorrent

•  Large deployments

often operate on

same image

•  Caching

(5)

NIMBUS www.nimbusproject.org  

Execu,on  Performance  

•  In 2011, ~87% of the

top500 systems were

commodity clusters

•  Amazon Cluster Compute

•  I/O is an issue, but there is

a configuration trade-off

•  Aspects of performance:

–  Hardware

–  Configuration

•  Availability@price

11/13/12   5  

From  Santos  et  al.,  Xen  Summit  2007   How  good  are  clouds  for  HPC?  

“Comparisons:  not  as  Odious  as  Once  Thought”,  

see  www.scienceclouds.org/blog  

Soon  to  be  updated…  

(6)

What’s  Not  To  Like?  

•  Latency!

•  … and bandwidth

•  … and lack of IB

•  … and anything

having to do with I/O

•  … and noise (esp.

with multi-tenancy)

•  … and MTBF

Courtesy  of  Ramakrishnan  et  al.,  PMBS’11  

How  good  are  clouds  for  HPC?   “Mahommad  and  the  Mountain”,  

(7)

NIMBUS www.nimbusproject.org  

What’s  Not  To  Like?  

•  … and overall impact on tightly-coupled applications

11/13/12   7  

Courtesy  of  Ramakrishnan  et  al.,  PMBS’11  

(8)

…but  Wait,  There’s  More!  

•  New hypervisors

–  KHVMM (Blue Gene/P)

–  Palacios

•  Xen-IB (since ~2006)

•  “Symbiotic virtualization”

–  Passthrough I/O

–  Preemption control

–  Optimized paging

•  How do we put that into

practice?

CTH  (shockwave  simulaVon)   within  ~5%  of  naVve  performance  

On  Cray  XT4  (Sandia)  

(9)

NIMBUS www.nimbusproject.org  

HPC  Cloud  BOF  Tomorrow  @  5:30!  

•  Are clouds a viable platform for HPC?

•  Join us for a fun discussion in 355-BC!

(10)

Security  

•  Isolation

– Allows the user high degree of freedom while

mitigating risk to the provider

– Network traffic controlled via virtual routers

•  Virtualization in security solutions

– Short-lived VM deployments

– Honeypots

(11)

NIMBUS www.nimbusproject.org  

What’s  Not  To  Like?  

•  Layers:

–  Hypervisor, virtual network, control domain

–  Cloud infrastructure

–  VM image management

•  More complexity == more things to attack

•  New model == new attacks

•  Incentives: commercial versus community

•  Data Privacy

–  Trusting the provider

•  Trusted computing

•  Homomorphic encryption

•  New legal challenges

(12)

Cloud  Fungibility  

•  Can’t easily move from one provider to

another:

– Cloud infrastructure APIs are incompatible

– Contextualization across providers is

incompatible

– VM image formats are incompatible

•  No easy way to compare performance

across cloud providers

(13)

NIMBUS www.nimbusproject.org  

What’s  to  like?  

•  Emergent API standards:

–  Cloud standards: OCCI (OGF), OVF (DMTF), and many

more…

–  www.cloud-standards.org

•  IaaS API adapters

–  Deltacloud, jcloud, libcloud, and many more…

•  VM image standards….

•  …and adapters

–  rBuilder, BCFG2, CohesiveFT, Puppet, and many more…

•  Emergent efforts to compare

performance

11/13/12   13  

(14)

Cost,  U,liza,on,  and  Price  

•  Most science today is

done in batch

–  Not very responsive…

–  … but very efficient!

•  Challenge:

On-demand

catch-22 for providers

–  You can overprovision

(Expensive!)

–  Or you can reject requests

(Not really on-demand)

•  Utilization -> Cost -> Price

UVlizaVon  on  the  Fusion  cluster  @  ANL     courtesy  of  Ray  Bair,  LCRC  

(15)

NIMBUS www.nimbusproject.org  

Cost,  U,liza,on,  and  Price  

•  Solution 1:

–  Backfill with volunteer VMs

•  Benefits:

–  Up to 100% utilization! –  Significantly lower cost

•  Solution 2:

–  Spot pricing

–  Support for auctions

•  Open Source community

contribution

•  Preparing for running of production

workloads on FG @ U Chicago

•  Extension to Nimbus Workspace

Service RM available since

Nimbus release 2.7

•  CCGrid 2011 paper, Marshal et al.

11/13/12   15   !"#$"%&'""(& )*+,-.&!("-$& /*01&2& /*01&3& 4"%5.6781&/1%9*81& :;;&)"$1.& ;7.01%& 4"%51%.& 4"%51%.& &<=#797*(7,(1>& :;;&?& =.1%& :;& :;;&@& =.1%& :;& =.1%& :;& :;;&A& =.1%& :;& :;;&B& 2785C((& :;& D#*E701&"%&F1%+*#701& 4"%5.6781.& /-,+*0&G",&<B&F7.5.>& 2785C((& :;& H*.6708I&F7.5.& H*.6708I&F7.5& 2785C((& :;& G"*#&'""(& D#*E701&"%& F1%+*#701& 4"%5.6781.& J7-#8I&2785C((& )"$1.& =.1%&3& =.1%&2&

(16)

Par,ng  Thoughts  

•  Infrastructure clouds are a disruptive innovation

–  Opportunity: on-demand availability

–  But also many challenges!

•  “Crossing the chasm”

–  We are living in the age of early adopters

–  Broadening the set:

•  Complexity and scale

•  New models

•  Performance and features

–  The challenges are in crossing the chasm

•  Impact on Applications

–  The ability to absorb resources is critical

–  Fault-tolerance is increasingly a requirement

(17)

NIMBUS www.nimbusproject.org  

Ques,ons?  

All tutorial slides will be available at:

www.nimbusproject.org/docs/sc12

(18)

Acknowledgements  

NSF  OCI    

“FutureGrid”  

DOE  ASCR  

References

Related documents

Pada penelitian ini penulis mensimulasikan menggunakan mobil simulasi (remote control) dihubungkan dengan sensor ultrasonik HY-SRF05 yang digunakan untuk mendeteksi dan

Regarding the high prevalence of depression, anxiety and stress in dialysis patients and the results of this study, which confirmed the effects of positive group psychotherapy

Also, traditional blogs have continued to enjoy better credibility ratings than mainstream media agencies (Johnson, Kaye, Bichard &amp; Wong, 2008; Johnson &amp; Kaye, 2010).

School of Education Department of Educational Leadership SAMPLE of MSA program requirements with CGSC Distance Distance Learning Courses in Management and Human Resources Boston

ASFAC attended the Annual Eurofinas/Leaseurope Conference which took place in Edinburgh, in October 2007. The Association was invited to present its projects

De acuerdo con los valores de precipitación mayores y las áreas actuales sembradas en papa, es aconsejable ampliar es- tos cultivos que tienen consumo de agua de 376,3 mm/anuales

While latent space proposals assist in making meaningful and efficient transitions within a Markov Chain, PL-MCMC ultimately relies on the auxiliary distribution, q, and