PHD Dissertation Proposal Defense
Wes J. Lloyd
August 7, 2012
Colorado State University, Fort Collins, Colorado USA
Outline
Research Problem
Research Problem
Challenges
Approaches & Gaps
Research Goals
Research Questions & Experiments
Research Questions & Experiments
Research Contributions
Preliminary Results
August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense 2
Traditional Application Deployment
Data
Business
Logging
App
Data
Spatial DB
rDBMS
Business
Services
Services
Logging
Tracking
DB
pp
Server
Apache
Tomcat
Research Problem 3DODB /
NOSQL
Services
Services
Object
Store
Single Server
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Infrastructure as a Service (IaaS)
Cloud Computing
Server partitioning of multi‐core servers
Server partitioning of multi‐core servers
Hardware virtualization
Service isolation
Resource elasticity
Research Problem 5 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Problem Statement
Autonomic deployment of multi‐tier
Autonomic deployment of multi‐tier
applications to IaaS clouds
Component composition
Collocation and interference of components
Scaling infrastructure to meet demand
Scaling infrastructure to meet demand
August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense Research Problem 6
7 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Application Component Composition
Application
Components
Virtual Machine (VM)
App Server
Component
Deployment
Virtual Machine (VM)
Images
rDBMS r/o
File Server
Log Server
Load Balancer
Image 2
rDBMS write
Image 1
App Server
File Server
Log Server
rDBMS write
rDBMS r/o
Load Balancer
Problems & Challenges 8
Application
“Stack”
PERFORMANCE
. . .
Image n
Load Balancer
Dist. cache
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Bell’s Number
n = number
of
application
n k
Model
Component
Deployment
application
components
VM deployments
Database
File Server
Log Server
config 1
M D
F L
config 2
M
F
L
config n
M L
D
1 VM : 1..n components
4
15
5
52
6
203
7
877
8
4,140
D
Problems & Challenges 9
Application
“Stack”
# of Configurations
. . .
k= # of
possible
configs
F
9
21,147
n . . .
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Provisioning Variation
Request(s) to
launch VMs
VMs Share PM
CPU / Disk / Network
VM
Physical Host Physical Host
Physical Host
VM VM
VM
Ambiguous
Mapping
VM
VM VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
VM
Problems & Challenges
10
Physical Host Physical Host Physical Host
VMs Reserve
PM Memory Blocks
PERFORMANCE
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Service Requests
Infrastructure Management
• Scale Services
• Tune Application
Application ServersLoad Balancer
Load Balancer
distributed cache• Tune Application
Parameters
• Tune Virtualization
Parameters
noSQL data stores
rDBMS
Problems & Challenges 11 Wes J. Lloyd PHD Dissertation Proposal Defense
Virtual Machine (VM) Placement
as “Bin Packing Problem”
Bins= physical machines (PMs)
Bins physical machines (PMs)
Items= virtual machines (VMs)
Dimensions
CPU time
VM RAM, hard disk size, # cores
Disk read/write throughput
Disk read/write throughput
Network read/write throughput
PM capacities vary dynamically
VM resource utilization varies
Approaches & Gaps 13
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
NP-Hard
Related Work
Multivariate performance models
Multivariate performance models
Regression models
Machine learning
Feedback loop control
Hybrid approaches
Formal approaches
Integer linear programming
Case based reasoning
Approaches & Gaps 14
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Gaps in Related Work
Existing approaches do not consider
VM image composition
VM image composition
Complementary component placements
Interference among components
Minimization of resources (# VMs)
Load balancing of physical resources
Performance models ignore
Performance models ignore
Disk I/O
Network I/O
VM and component location
Approaches & Gaps 15
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Why Gaps Exist
Public clouds
Public clouds
Research is cost prohibitive
Users concerned with performance not in control
Private clouds: systems still evolving
Performance models (large problem space)
Performance models (large problem space)
Virtualization misunderstood or overlooked
Approaches & Gaps 16
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
17 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Research Goals
RG1: Support VM component composition
RG2: Support virtual infrastructure management
Determine and execute VM placement
Scale infrastructure for application demand
August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense Research Goals 18
Performance Objectives
Primary: Maximize application throughput
Primary: Maximize application throughput
Secondary: Minimize resource cost (# of VMs)
Minimize modeling time
Support high responsiveness to change in
application demand
application demand
Methodology Evaluation
CSIP: USDA‐NRCS platform for model services
p
Models as multi‐tier application surrogates
RUSLE2 – Soil erosion model
WEPS – Wind Erosion Prediction System
Hydrology models: SWAT, AgES
Other models: STIR SCI
Other models: STIR, SCI…
Eucalyptus IaaS cloud(s)
Amazon EC2 compatible
XEN & KVM hypervisors
Research Questions & Methodology 21 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Component
Composition
Infrastructure
Management
RQ1
RQ3
RQ4
August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense Research Questions & Methodology 22
RQ2
RQ5
RQ1: Which independent variables best help
model application performance (throughput)
to guide autonomic component composition?
Total (all VMs) resource utilization
Total (all VMs) resource utilization
CPU time, disk I/O, network I/O, …
Individual VM and PM resource utilization
Component and VM location
VM Configuration: number of cores RAM
VM Configuration: number of cores, RAM,
hypervisor type (KVM, XEN...)
Research Questions & Methodology 23 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
RQ1: Which independent variables best help
model application performance (throughput)
to guide autonomic component composition?
Total (all VMs) resource utilization
Methodology
Exploratory performance modeling
Total (all VMs) resource utilization
CPU time, disk I/O, network I/O, …
Individual VM/PM resource utilization
Component / VM location
VM Configuration: number of cores RAM
Exploratory performance modeling
•Investigate independent variables
•Investigate modeling techniques
• Multiple linear regression (MLR)
• Artificial neural networks (ANNs)
VM Configuration: number of cores, RAM,
hypervisor type (KVM, XEN...)
Research Questions & Methodology 24 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
• Artificial neural networks (ANNs)
• Others
RQ2: Can component resource classifications
and behavioral rules predict performance of
component compositions?
Support simplification of the search space
n k
4
15
5
52
6
203
Support simplification of the search space
Support applications with large # of
components
Bell’s number
7
877
8
4,140
9
21,147
n . . .
Research Questions & Methodology 25 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
RQ2: Can component resource classifications
and behavioral rules predict performance of
component compositions?
Methodology
Investigate autonomic component composition
Support simplification of the search space
n k
4
15
5
52
6
203
Support simplification of the search space
Support applications with large # of
components
e.g. Bell’s number:
Investigate autonomic component composition
approach(es)
•Performance modeling
•Heuristics to classify
• Component resource utilization
7
877
8
4,140
9
21,147
n . . .
Research Questions & Methodology 26 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
• Component resource utilization
• Component dependencies
Evaluation Metrics:
Component Composition
C
i i
f
Composition performance
Average throughput of configurations
Resource packing density
# components/# VMs for compositions
D i ti
d
Derivation speed
Average wall clock time to produce compositions
Research Questions & Methodology 27 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
RQ3: Does performance of
component compositions change
when scaled up?
Single provisioned application VMs
Multiply provisioned application VMs
Investigate collocation of new VMs
Intelligent vs. ad‐hoc placement
Load balance physical resources
Research Questions & Experiments 28 Wes J. Lloyd PHD Dissertation Proposal Defense
RQ3: Do performance rankings of
component compositions change
when scaled up?
Methodology
Single provisioned application VMs
Multiply provisioned application VMs
Investigate collocation of new VMs
Intelligent vs. ad‐hoc placement
Methodology
Scale up compositions and benchmark
performance
•Investigate impact of VM placement for
infrastructure scaling
Load balance physical resources
Research Questions & Methodology 29 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
infrastructure scaling
RQ4: How rapidly can VMs be launched
in response to application demand?
Determine upper bound of VM launch speed
Determine upper bound of VM launch speed
Devise workarounds to improve performance
VM prelaunch and suspension
Reserve RAM, other resources multiplexed
Enforced caching of VM data on PMs
Reassign duties of existing VMs
Others ?
Research Questions & Methodology 30 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
RQ4: How rapidly can VMs be launched
in response to application demand?
Determine upper bound of VM launch speed
Determine upper bound of VM launch speed
Devise workarounds to improve performance
VM prelaunch and suspension
Reserve RAM, other resources multiplexed
Enforced caching of VM data on PMs
Methodology
Benchmark VM launch performance and
investigate potential improvements
Reassign duties of existing VMs
Others ?
Research Questions & Methodology 31 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
RQ5: Which independent variables best
support application performance modeling
for autonomic infrastructure management?
Virtual infrastructure
Virtual infrastructure
Number of VMs (1 to n) per application VM
VM RAM, # cores
VM location data
Application specific parameters
pp
p
p
Number of worker threads
Number of database connections
Number of app server concurrent connections
Research Questions & Methodology 32 Wes J. Lloyd PHD Dissertation Proposal Defense
RQ5: Which independent variables best
support application performance modeling
for autonomic infrastructure management?
Methodology
When resources are scaled ↑↓
↑↓
Virtual infrastructure
Number of VMs (1 to n) per application VM
VM RAM allocation
VM core allocation
Location of VMs
l
f
Explore autonomic infrastructure management
approach(es)
•Performance model based
•Feedback control
Application specific parameters
Number of worker threads
Number of database connections
Number of app server concurrent connections
Research Questions & Methodology 33 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
•Hybrid approach
Evaluation Metrics:
Infrastructure Management
Percentage of service requests completed
Percentage of service requests completed
Responsiveness
Max supported load acceleration without
dropping requests
Adaptation time
Adaptation time
Time window with dropped requests
Failure recovery time
Research Questions & Methodology 34 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
35 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Expected Contributions (1/2)
Novel intelligent approaches for IaaS cloud
Novel, intelligent approaches for IaaS cloud
Application deployment
Infrastructure management
Move IaaS infrastructure management
beyond simple management of VM pools
beyond simple management of VM pools
Expected Contributions (2/2)
Autonomic component composition
(RQ1, RQ2)
Autonomic infrastructure management
(RQ3, RQ4, RQ5)
Improve application performance modeling
For component composition (RQ1)
New independent variables (RQ1)
Heuristics (RQ2)
For infrastructure management (RQ5)
Support load balancing of physical resources
Contributions 37Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Non‐goals
Support for stochastic applications
Support for stochastic applications
Only applications with stable resource utilization
characteristics supported
External interference
From non‐application VMs
Hot‐spot detection
Contributions 38Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
39 Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012
Component Compositions
SC2
M D
F
L
SC
6
M
D F
L
SC11
M F
D
L
Rusle 2 Model Fastest Compositions
F
Service Isolation Overhead
Preliminary Results 40
XEN
KVM
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Component Compositions
SC2
M D
F
L
SC6
M
D F
L
SC11
M F
D
L
Rusle 2 Model Fastest Compositions
• Determining fastest compositions
F
Service Isolation Overhead
g
p
• Not intuitive
• Testing / prediction required
• Service Isolation
• Was not fastest
• Adds overhead
Preliminary Results 41XEN
KVM
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
• Results in maximum hosting costs (# of VMs)
Component Composition
Resource Utilization Diversity
Preliminary Results 42
Wes J. Lloyd PHD Dissertation Proposal Defense August 7, 2012
Component Composition
Resource Utilization Diversity
• Resource utilization varies
• Component/VM placements
• VM memory size allocations
• Hypervisor type KVM/XEN
• Testing required to identify resource utilization
• Intuition is insufficient
Preliminary Results 43Wes J. Lloyd PHD Dissertation Proposal Defense
August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense Preliminary Results 45 August 7, 2012 Wes J. Lloyd PHD Dissertation Proposal Defense Preliminary Results 46