• No results found

Database storage management with object-based storage devices

N/A
N/A
Protected

Academic year: 2021

Share "Database storage management with object-based storage devices"

Copied!
24
0
0

Loading.... (view fulltext now)

Full text

(1)

Database storage management

with object-based storage devices

Sami Iren Seagate Research

Steve Schlosser

(2)

Outline

• New hardware: Object-based Storage Devices (OSD)

• Vision: Database-aware storage systems

• Example: Transparent device-specific data placement

(3)

New hardware: Object-based Storage Devices (OSD)

Highly integrated, single disk I.e. prototype Seagate OSD

“Smart” disk for objects I.e. Panasas storage blade Disk array/server subsystem I.e. LLNL units with Lustre

OSD Interface

Storage Device

Block I/O Manager File System Storage Component File System User Component File System Storage Component Applications System Call Interface

Storage Device

Sector/LBA Interface

Block I/O Manager

CPU

Applications

File System User Component System Call Interface

CPU

• Same hardware, new interface …

(4)

Root Object

OSD

U4 U3 U2 U1 C1 C2 P4 U4 U3 U2 U1 C1 C2 P3 U4 U3 U2 U1 C1 C2 P2 Partition Objects P1

OSD: The Objects

User Objects U4 U3 U2 Collection Objects C1 C2 Object ID User Data Attributes Metadata U1

(5)

OSD: The Command Set

Basic Protocol

• READ • WRITE • CREATE • REMOVE • GET ATTR • SET ATTR

Specialized

• APPEND – write w/o offset • CREATE & WRITE – save msg • FLUSH OBJ – force to media • LIST – recovery of objects

Security

• Authorization – on each request • Integrity – for args & data

• SET KEY

• SET MASTER KEY

Groups

• CREATE COLLECTION • REMOVE COLLECTION • LIST COLLECTION

Management

• FORMAT OSD • CREATE PARTITION • REMOVE PARTITION

Very Basic

Space Mgmt

Attributes

• opaque • internal • shared

(6)

What disks can do

OSD: Shared attributes

• Mechanism to push application information into storage

What Applications want

Traditional

OSD

(7)

OSD: Benefits

Improved performance

– Hints, QoS, Differentiated Services

– Data can be differentiated at the device

• Improved device and data sharing

– Platform-dependent metadata moved to device – Systems need only agree on naming

Improved scalability & security

– Devices directly handle client requests

– Object security w/ application-level granularity – Finer granularity than LUN-based security

Improved storage management

– Self-managed, policy-driven storage

– Storage devices become more autonomous

Volumes

Blocks Objects

(8)

Outline

• New hardware: Object-based Storage Devices (OSD)

• Vision: Database-aware storage systems

• Example: Transparent device-specific data placement

(9)

Vision: Storage that cooperates with databases

• DBMS storage managers do a lot to optimize storage access,

with insufficient information

• Storage subsystem front-ends do a lot to optimize under the

covers, without enough information about the application

• Wouldn’t it be better if we could just get along?

• OSD can provide this mechanism

We have a unique opportunity to make DB software and

storage more cooperative

(10)

Solution: OSD for databases

Buffer pool manager Storage manager

Storage device

Block interface

Block I/O manager

Object (OSD) interface

Storage device

Block I/O manager Storage manager

DBMS

Query processor Buffer pool manager

DBMS

Query processor

(11)

Database-aware storage systems

1. Create object for relation 2. Attach schema attribute 3. Populate relation

• Goal 1: OSD uses its parameters to optimize layout under the covers • Goal 2: Access method

should be independent of storage optimizations Schema attribute DBMS OSD

(12)

Database-aware storage systems

• Goal 2: Access method should be independent of storage optimizations

• DBMS should just specify the data it wants, and the destination address • DBMS shouldn’t worry about – Data placement – Addressing blocks/bytes Schema attribute DBMS OSD Memory buffer

(13)

Outline

• New hardware: Object-based Storage Devices (OSD)

• Vision: Database-aware storage systems

• Example: Transparent device-specific data placement

(14)

The Fates project at CMU

• Collaboration between Database Group and Parallel Data Lab

– Ongoing involvement of researchers at Intel Research and EMC – Lachesis: Improved communication between storage subsystem

and DB storage manager (VLDB 2003)

– Clotho: Retool database software to fetch query-specific data (VLDB 2004)

– Atropos: Leverage detailed disk information for improved 2D data placement (FAST 2004)

• Key idea: Storage informs database of its characteristics

(15)

2D data structure access

• On-disk storage requires serialization • Access along one dimension is efficient

– i.e., sequential

• Access along the other is inefficient

– i.e., random I/O

– Or, read entire relation and discard unwanted attributes

R

e

co

rd

s

Attributes

Disk

(16)

Intelligent data placement in Fates

R

e

co

rd

s

Attributes

Disk

• One column per disk track

• Column-major access is efficient • Row-major access is inefficient

– One block per rotation

• Semi-sequential layout enables efficient row-major access

(17)

High-level Fates result

Table Scan

0 50 100 150 200 250 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Query projectivity [# of attributes touched]

R

u

n

ti

m

e

[

s

]

Slotted pages (NSM) Cache-conscious (PAX) Vertical decomposition (DSM) CSM

Table: CREATE TABLE R (FLOAT a1, …, FLOAT a15) (1GB) Query: SELECT a1, a2, …, FROM R WHERE a1 < Hi

Clotho Storage Model (CSM):

• Best performance at both ends

(18)

Solving the “parameter problem”

• Relying on storage vendors to expose parameters is problematic

– Measuring parameters is difficult and fragile (but not impossible)

• Key idea: OSD can solve this problem

– Expose schema to storage subsystem via shared attributes – Storage subsystem can do data placement under the covers

(19)

Outline

• New hardware: Object-based Storage Devices (OSD)

• Vision: Database-aware storage systems

• Example: Transparent device-specific data placement

(20)

Linearization problem remains

R

e

co

rd

s

Attributes

• Placement within an object can be optimized

Object

(21)

Linearization problem remains

R

e

co

rd

s

Attributes

• Placement within an object can be optimized

• However, OSD objects are still addressed as a linear array of bytes

– Query A: Two requests

– Query B: Many requests, DBMS must calculate byte offsets

• Violates our goal!

– DBMS should be independent of storage optimizations

Object Query A

(22)

Solution #1: Two-dimensional object interface

• Provide two-dimensional READ and

WRITE commands

– READ (offsetX, lenX, offsetY, lenY) – WRITE (offsetX, lenX, offsetY, lenY)

• Query A: Two requests

• Query B: One request

• Violates our goal!

– DBMS must still translate byte offsets

R

e

co

rd

s

Attributes

Query B Query A

(23)

Solution #2: Relational object interface

• Provide relational READ and WRITE

commands

– READ (record offset, len, field bitmap) – WRITE (record offset, len, field bitmap)

• Bitmap specifies fields to be accessed

– Query A: READ(0, 9, 101) – Query B: READ(1, 1, 111)

• Meets our goal

– DBMS addresses object in terms of the relation’s schema

R

e

co

rd

s

Attributes

Query B Query A

(24)

Conclusion

• Object-based storage devices allow applications to provide

storage subsystems with high-level knowledge about data

• Demonstrated how database systems can take advantage of

modern placement techniques using OSD interfaces

• Placement is just the first step

• In the future, how can database systems cooperate better with

storage systems?

References

Related documents

This paper analyses the response of labor input to technology shocks in an estimated two-stage production framework with both price and wage stickiness and stage-speci…c shocks

In order to provide one such tool, we have developed a custom modification of the Weka data mining software, which implements the calculation of Hosmer-Lemeshow groups of risk and

HIT expands access to the affordable, quality and cost-effective patient care while improving the delivery of healthcare in US (The Office of National

Only self-rated health status, work experience, overworking more than official working hours, and working 14 or more consecutive hours in the past week were

2 back-up batteries type WSA 012, programmable, can be expanded with fieldbus WSC 320 P 0202 Smoke control panel 20A, 10 smoke zones / 10 daily comfort groups, 12 inputs for keypads,.

2019 Review of Melissa Schoenberger, Cultivating Peace: The Virgilian Georgic in English, 1650–1750. This is an open-access article distributed under the terms of the Creative

Mount the supplied ball joints with the 5/16” bolts, flat washers, and locknuts provided in Hardware Package 9 033 to either the top or bottom of the ball joint pocket of the

Coming of Age: The Municipal Role in Caring for Ontario’s Seniors An AMO Report, June 2011.. Assistance to