• No results found

Distributed Block-level Storage Management for OpenStack

N/A
N/A
Protected

Academic year: 2021

Share "Distributed Block-level Storage Management for OpenStack"

Copied!
28
0
0

Loading.... (view fulltext now)

Full text

(1)

Distributed Block-level Storage

Management for OpenStack

OpenStack APAC Conference 2012

Daniel Lee

CCMA/ITRI

Cloud Computing Center for Mobile Applications

Industrial Technology Research Institute

(2)

Outline

Overview

Brief on ITRI Cloud OS

Cloud storage system in ITRI Cloud OS

Distributed Main Storage System

Distributed Secondary Storage System

Integration of Cloud OS Storage System with

OpenStack

Summary

(3)

Overview

Data is growing exponentially and moving on to Could. It

needs to be available at all time, secured and protected.

911 in Twin Towers, 311 earthquake in Japan and 420 outage

of Amazon datacenters are disasters.

– Block-level and fully redundant

– Backup / restore + wide-area

– Thin provisioning

– De-duplication

– Others

(4)

ITRI Cloud OS

(An All-in-one IaaS Total Solution)

Physical

Resource

Management

(

HP/Dell

)

Inter-PM Load Balancing

and VM Fail-over (

VMWare

)

Virtual Data Center

Provisioning (

VMWare

)

Power Management

(

IBM

)

Inter-VM Load

Balancing (

F5

)

Security

(

Checkpoint

)

Virtual Data Center Management

Physical Data

Center

Management

(

IBM

)

Physical Compute Servers

All-layer-2 Network

Distributed Main/Secondary Storage

Primary/Secondary

Storage Management

(

EMC

)

(5)

ITRI Cloud OS

Service Interface

Photo

Sharing

VDC

Provision

and Deploy

Video Streami ng VDC

Web

Confere

nce

VDC

Physical

Cluster

Virtual Data Center Management Physical Data Center Management

Multiplexing VDC’s in a Physical DC

•Cloud Application Developer •Cloud Service Provider •Cloud Service Infrastructure Administrator •Carrier

(6)
(7)
(8)
(9)

Cloud Storage System in Cloud OS

Cloud storage aims at

cloud-scale

data centers,

and is designed to be

scalable, available and

low-cost

.

Main components

– Distributed Main Storage subsystem – DMS

Provide current image I/O request processing

– Distributed Secondary Storage subsystem – DSS

Block-level incremental metadata only backup system

Take volume snapshot / restore, de-duplication engine,

(10)

Key Features of Cloud Storage

System from Users’ Perspective

Use like

raw, unformatted

block devices

Support volume size from

1 GB to 2 TB

(64TB architecture-wise)

Can be

attached by different instances

(one at a time for now)

• Multiple volumes

can be mounted to the same instance

Data is

automatically replicated

on different physical storage server

(N=3)

Ability to

clone volumes

and/or create point-in-time

snapshots

of

volumes

• Wide-area backup

for disaster recovery

Snapshot can be scheduled by assigning a

backup policy

with

time,

frequency, retention

, and

WADB

option

(11)

Key Features of Cloud Storage

System from Providers’ Perspective

Use of

commodity hardware

– reducing cost

(JBOD)

Disk space management for up to

multiple

petabytes

• Add disk, remove disk

without interruption

• Thin provisioning

Enables you to provision a specific level of I/O

performance if desired, by setting

I/O throttle

(12)

DMS in Cloud OS

• DMS - Distributed Main Storage

• Goal: A

cloud scale network storage

solution that provide

high capacity, high reliability, high availability and high

performance

• Features:

– Volume operations:

Create / Attach / Detach / Delete

High Reliability

: data replication (N=3)

High Availability

: No single point of failure

High Performance

: client-side metadata/data cache for performance

Thin provisioning

is utilized to optimize utilization of available storage.

– Fast

Volume Cloning

Save Image; FastBoot

(13)

Compute Node

DMS In-kernel client

VM

Volume

Service Node

Storage Node

VM Volume Volume VM NameNode Metadata Data

DMS System Architecture

DMS three components: 1. Namenode RR

(14)

In-Kernel Client Block Devices

..

...

IOCTL

Domain U Kernel Space

Compute Node Applications Read/Write Sectors /dev/hda1 /dev/dms01 Block Frontend Driver

Domain U User Space

Xen / KVM

I/O Virtualization

DataNodes NameNode

Attach a disk to Dom U

User-space Daemon RPM Attach/Detach Volume Metadata Payload Block Backend Driver

DMS Compute Node

Architecture

(15)

Compute Node

DMS In-kernel Client

VM

Volume Service Node

Storage Node

NameNode

2. query metadata: (VolID, LBID) 3. HBID, N-locations

4. Read Data

7. Report failed locations

Read Flow

Storage Node Storage Node

6. Response

(payload) 1. Read request

5. Read Data from another datanode if previous-try failed

(16)

Compute Node

DMS In-kernel Client

VM

Volume Service Node

Storage Node

NameNode

DataNode

2. Request for new space:(VolID, LBID) 3. HBID, N-locations

4. Write Data

7. Commit (Success, or report failed locations)

Write Flow – Memory-Ack Approach

Storage Node DataNode Storage Node DataNode 5. Memory Ack 6. Disk Ack

6. Response 1. Write request

• Memory-ack approach: In client, I/O response to VM(step6) is right after receiving memory ack from

(17)

DSS in Cloud OS

Share the

homogeneous space

with DMS

Block-level Incremental backup

– Copy-on-write (

COW

) and

volume cloning

techniques

– Backup is just a

snapshot of meta-data

Support

volume-level restore

Block-level

de-duplication

• Garbage collection of expired, un-used and redundant disk

blocks

• Wide-area data backup / restore and near-realtime

(18)
(19)
(20)

DSS – WADB

Backing up the data to a remote location at

the time volume snapshot is taken…

Network /

VPN

Beijing

Shang-Hai

(21)
(22)

De-duplication in DropBox

1

2

3

DMS/DSS

1

2

3

(23)

DSS Software Architecture Diagram

Backup/Restore

Request Backup Policy

SOAP server

Snapshot

Garbage Collection

Thread Manager / Flow Control

DSS Daemon Tread DSS main Classes RPC DMS client

protocol RS client Lib

DB manager De-dup Scheduler Restore Timer Fired DSS Manager Quartz Scheduler WADB

(24)

OpenStack

Compute

Storage

Network

Identity Security

Web Interface

Object

(Swift)

Block

File

(25)

Integration of Cloud OS Storage

System with OpenStack

User portal

SECS

VDCM

PDCM

VMM

VM1 VM2 VMn

DMS

volume

PRM

L2

LOG

User

PDCM User

Glance

Nova Compute

Nova Volume

Nova Manager

Nova API

DB

Nova Network

Web Services (OpenStack API enabled)

(26)

Summary of Advantages

Feature

Benefits

Unlimited storage

highly scalable read/write access

Leverages commodity hardware (JBOD)

No lock-in, lower price/GB

Built-in replication

Fully redundant (default: N=3)

Used as system volume

Save Image, Fast Boot, etc.

Thin Provisioning

Optimize utilization of available storage

Highly available (HA) storage servers

(DMS/DSS)

Server failover for high available

Snapshot and backup for block volumes

Data protection and recovery for VM data

Policy based backup

Free of worry – set it and forget about it

Wide-area data backup

For disaster recovering

Block-level data de-duplication

Store more data in the limited space

(27)

More Feature in V2.0

Feature

Benefits

Volume Cloning

Fast cloning without copying data

Share virtual disk within cluster

Support sharing of volume and cluster file system

Disk bandwidth QoS guarantee

Dynamic adjustment at I/O request level

Storage space optimization

N-way replication for write-intensive blocks

N-parity-group for read-intensive blocks

Wide-area data replication

Near real-time wide-area replication

Integration with Compute / Volume

Fully integrated to Compute for attaching block

(28)

Q&A

References

Related documents

Decentering Reduces Reactivity to Food Cues Decentering may be an especially effective tool for reducing reactivity to food cues because it targets its underlying mech- anism,

Chapter 7 Knowledge transfer within a multinational company Chapter 8 HRM in international joint ventures Chapter 9 Foreign assignment Chapter 10 HRM in the global village:

La economía de escala economía de escala se refiere al poder se refiere al poder que tiene una empresa cuando alcanza un que tiene una empresa cuando alcanza un nivel óptimo

The experiment is based on the vibration of the mountain bicycle on the various road surfaces and weight. By including all the parameters requires the use of National

Mean viable and total length (±standard errors) of ERM produced by Funneliformis mosseae IMA1 and Rhizoglomus irregulare IMA6 in symbiosis with Cichorium intybus plants, at

The residential part of the palace occupies the whole of the eastern end contained by the outside enclosing wall, and the main entrance is central with the pond, on

recommendations to Customer on the type(s) of backups and storage that fit the Customer’s goal and objectives. Consultants will take into consideration the amount of storage space

Primary Block Storage Primary File Storage Archival Storage Disk Based Backup Storage Tape Infrastructure & Management Replicated Storage for Disaster Recovery Offsite