• No results found

YouChoose: A Performance Interface Enabling Convenient and Efficient QoS Support for Consolidated Storage Systems

N/A
N/A
Protected

Academic year: 2021

Share "YouChoose: A Performance Interface Enabling Convenient and Efficient QoS Support for Consolidated Storage Systems"

Copied!
27
0
0

Loading.... (view fulltext now)

Full text

(1)

Wayne State University Cluster and Internet Computing Laboratory

YouChoose: A Performance Interface Enabling

Convenient and Efficient QoS Support for

Consolidated Storage Systems

Xuechen Zhang, Yuehai Xu,

and

Song Jiang

Department of Electrical and Computer Engineering

Wayne State University

(2)

2

Data-intensive Applications using

Consolidated of Storage Systems

ƒ Applications become more data intensive

– Scientific applications may analyze large data sets.

– Internet search and E-commerce rely on efficient data access. – Applications’ performance highly depends on I/O service quality.

ƒ Advantages of consolidated storage system

– High utilization due to resource sharing.

– Cost-effectiveness of centralized management.

– Lower operating cost.

ƒ Each user essentially reserves a virtual storage device.

– Contractual quality of services (QoS) requirements (SLA).

(3)

ƒ Amazon EC2 is a web-based service that provides resizable computing capacity in the cloud.

– Create an Amazon Machine Image (AMI), including your programs and data.

– Upload the AMI into Amazon data storage facility. – Choose the computing capacity (instance type).

An Example Issue: Amazon Elastic

Compute Cloud (Amazon EC2)

Equivalent CPU capacity of a 1.0-1.2 GHz 2007 Opteron or 2007 Xeon processor.

(4)

Internet Consolidated storage server E-Commerce Server TPC server Search engine server Computing Server Service subscribers Service Level Agreement

(e.g., response time<100ms, throughput>100 trans./s) (e.g., execution time < 5 minutes) 100MB/s or 10MB/s throughout ?

System Structure

100ms or 10ms latency?

(5)

5

Issues with the Use of Fixed I/O Bounds

ƒ

I/O intensity can change from time to time.

– Requests in the burst period share the same latency bound with those in the quiet period?

– If the bound is determined according to requests in the quiet period, how much resources are demanded to meet it during the busy period?

ƒ

Request size can be highly variable.

– One common latency bound for small and large requests?

– If the throughput is in form of MB/s, any incentive to aggregate small requests into one large one?

ƒ

Spatial locality of requests can vary substantially.

– One common throughput bound for random and sequential requests? – Shall the bound be determined according to random requests or

(6)

6

Implications of Fixed I/O Bounds

ƒ

They may not reflect applications’ real QoS needs.

ƒ

They may discourage programmers’ efforts on the

optimization of I/O requests.

ƒ

They can pose highly variable resource demands on the

storage system.

(7)

7

Our Solution: Use Reference Storage

System as Performance Interface

ƒ Assume that a user can receive satisfactory application performance with use of a dedicated storage system.

– He wants to keep the performance after outsourcing I/O service to a shared storage system.

ƒ The dedicated storage system is used as its performance interface.

– The interface is called Reference Storage System (RSS)

– By implementing the interface, the user will receive performance at least as good as that received on the RSS.

ƒ The RSS interface is not subject to variation of I/O behaviors.

– The interface is tangible to end users and is more relevant to application performance.

– The interface can easily bound the resource demand on the shared storage system.

(8)

Interconnection Fabric

controller controller controller

Application server 1 Application server 2 Application server n

…..

HDD disk array SSD disk array Hybrid disk array Request Streams QoS-aware Scheduler iSCSI iSCSI FibreChannel VSD VSD VSD VSD VSD VSD VSD VSD VSD VSD Virtual Storage Device

(9)

Interconne controller contro Application server 1 Application server 2

HDD disk array SSD dis Request Streams QoS-aware Scheduler iSCSI iSCSI VSD VSD VSD

Dedicated Local Storage

Application

RSS Performance

Interface

(10)

Interconnection Fabric

controller controller controller

Application server 1 Application server 2 Application server n

…..

HDD disk array SSD disk array Hybrid disk array Request Streams QoS-aware Scheduler iSCSI iSCSI FibreChannel RSS RSS RSS RSS RSS Performance Interface VSD VSD VSD VSD VSD

(11)

11

YouChoose: Implementation and challenges

ƒ Interpret RSS for the I/O scheduler to implement the interface

– Predict what the latency of an arriving request is if it was received by RSS.

– It’s a challenge with different access patterns and system configurations.

ƒ Efficiently implement the RSS interface.

– Meet simultaneously RSS requirements for different VSDs

– Able to exploit request locality for system efficiency.

ƒ Migrate virtual storage devices (VSDs) for high device utilization.

– Different disk arrays exhibit various efficiency in hosting VSDs.

(12)

Prediction with the CART Tool

ƒ

The CART (Classification And Regression Trees) Tool

– Known for its efficiency and accuracy.

ƒ

Model Training

r

1

r

2

r

3

r

n

.

.

.

Training Workload RT1 RT2 RT3 RTn

.

.

.

FV1 FV2 FV3 FVn

.

.

.

Request Feature Vector (request size, location, sequentiality, R/W)

Reference System Response Time (RT) Model Construction Trained Storage Model 12

(13)

Prediction with the CART Tool (cont’d)

ƒ

Use the Model

r

1

r

2

r

3

r

n

.

.

.

PRT1 PRT2 PRT3 PRTn

.

.

.

Request Feature Vector

(request size, location, sequentiality, R/W)

Trained Storage Model 13 Real Workload Predicted Response Time (PRT)

(14)

Stream 1 Stream 2 Stream 3 Reference Clock 1 Reference Clock 2 Reference Clock 3 Wall Clock

YouChoose Request Scheduling

ƒ We can predict a request’s service time on RSS (ref_time)

ƒ N+1 clocks:

– One wall clock (wall_clock) – N reference clocks (ref_clock).

ƒ When the stream is considered for scheduling:

– If its request is dispatched, then

ref_clock += ref_timereq – No pending requests, then

(15)

VSD 1 VSD 2 VSD 3

Requested data spread over multiple disks Slot K

Slot K+1 Slot K+2

(16)

Performance Evaluation

ƒ Disk arrays simulated by DiskSim

– Fast disks: QUANTUM TORNADO (10025RPMs, 1.245ms) – Slow disks: SEAGATE ST32171W (7200RPMs, 1.943ms) ƒ Synthetic traces

– Request size: 4KB

– Spatial locality x% [0%-100%]: the probability of two consecutive requests for contiguous data.

ƒ Real-world I/O traces

– Financial : traces from OLTP applications at two large financial institutions.

– WebSearch: traces from a popular search engine.

– OpenMail: collected on a production e-mail system running the HP OpenMail

(17)

Accuracy of the RSS Interface interpreted by CART

17

Relative Error Rate

(%)

WebSearch

(18)

Estimation Accuracy for Time Windows

for Individual Requests for 0.04s Time Window for 0.08s Time Window

More than 85% of relative errors are smaller than 15%

(19)

Impact of Spatial locality (on Dedicated RSS)

(20)

Impact of Spatial locality

(on Shared Storage w/ YouChoose)

(21)

Impact of Spatial locality

(on Shared Storage using the 100 IOPS Bound)

(22)

Performance Isolation

(on Dedicated RSS)

22 100% Locality 75% Locality 0% Locality 75% Locality

(23)

Performance Isolation

(on Shared Storage w/ YouChoose)

23 100% Locality 75% Locality 0% Locality 75% Locality

(24)

Performance Isolation

(on Shared Storage using the 100 IOPS Bound)

(25)

Performance Isolation

(Real-world workloads on dedicated RSS)

25 WebSearch

OpenMail

WebSearch

(26)

Performance Isolation (Real-world workloads)

(27)

27

Conclusions

ƒ

Introduce reference storage system as the

performance interface.

Dynamic access behavior is well accommodated in the

interface.

Resource demand is well capped.

ƒ

Use the machine learning technique to implement the

RSS interface.

ƒ

Achieve system efficiency with batched request

scheduling.

References

Related documents

En cualquier caso, lo más significativo es que se trata de la única des- cripción que conocemos de la cuadrícula urbana de la ciudad de Corrientes durante la guerra contra el

Download and to list template book report receipt sandwhich toy neighborhood with these cute, and feel free printables for books either for visiting this project!. Given an email

maximum stress in cell B176 equal to to the maximum allowable stress in cell B53 by changing the minimum recommended thickness in cell E126!. THIS MUST BE DONE IN ORDER TO ASSURE

For all phenotypes tested, the WC model predicts that all the variants passing the genome-wide significance threshold in the discovery study should pass the Bonferroni corrected

The increase in sea level (and decrease in wave orbital velocities) still increased deposi- tion in the shallowest depth interval relative to scenario B, but a reduction in

Co-eluted His-tagged RadA and un-tagged RadB proteins were observed at a similar ratio

Speaker, “Pecha Kucha In the Legal Research Classroom” Legal Writing Institute Workshop, Wake Forest University School of Law, Winston-Salem, NC (December 3, 2010). Speaker,

2 Past Influence Past influences affecting Seeker’s present state.. 3 Probable Outcome Seeker’s aspirations, hopes, fears, and/or most probable outcome for the