• No results found

System Architecture. In-Memory Database

N/A
N/A
Protected

Academic year: 2021

Share "System Architecture. In-Memory Database"

Copied!
25
0
0

Loading.... (view fulltext now)

Full text

(1)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

Are SSDs Ready for Enterprise Storage Systems

Anil Vasudeva, President & Chief Analyst,

IMEX Research

System Architecture

for

In-Memory Database

Anil Vasudeva

President & Chief Analyst

[email protected]

408-268-0800

© 2007-13 IMEX Research

All Rights Reserved

Copying Prohibited

Contact IMEX for authorization

(2)

IMEX

RESEARCH.COM

2 2

IT Industry Dynamics - Roadmap

Cloudization

On-Premises > Private Clouds > Public Clouds DC to Cloud-Aware Infrast. & Apps. Cascade migration to SPs/Public Clouds.

Integrate Physical Infrast./Blades to meet CAPSIMS®IMEX

Cost, Availability, Performance, Scalability, Inter-operability, Manageability & Security

Integration/Consolidation

Standard IT Infrastructure- Volume Economics HW/Syst SW (Servers, Storage, Networking Devices, System Software(OS, MW & Data Mgmt. SW)

Standardization

Virtualization

Pools Resources. Provisions, Optimizes, Monitors

Shuffles Resources to optimize Delivery of various Business Services

Automatically Maintains Application SLAs

(Self-Configuration, Self-Healing©IMEX, Self-Acctg. Charges etc.)

Automation/SDDC

IT Industry

Roadmap

Source: IMEX Research

Analytics – BI

Predictive Analytics - Unstructured Data

From Dashboards Visualization to Prediction Engines using Big Data.

(3)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

3 3

IT Infrastructure: DataCenters & Cloud

Enterprise VZ Data Center

On-Premise Cloud

Home Networks Web 2.0 Social Ntwks. Facebook, Twitter, YouTube… Cable/DSL… Cellular Wireless

Internet

ISP Core Optical Edge ISP ISP ISP ISP ISP Supplier/Partners Remote/Branch Office

Public CloudCenter

© Servers VPN IaaS, PaaS SaaS Vertical Clouds ISP Tier-3 Data Base Servers Tier-2 Apps Management Directory Security Policy

Middleware Platform Switches: Layer 4-7, Layer 2, 10GbE, FC Stg Caching, Proxy, FW, SSL, IDS, DNS, LB, Web Servers Application Servers HA, File/Print, ERP, SCM, CRM Servers Database Servers, Middleware, Data Mgmt Tier-1 Edge Apps FC/IPSANs, NAS

Source:: IMEX Research - Cloud Infrastructure Report ©2009-11

(4)

IMEX

RESEARCH.COM

4 4

IT Industry Dynamics - Roadmap

Cloudization

On-Premises > Private Clouds > Public Clouds DC to Cloud-Aware Infrast. & Apps. Cascade migration to SPs/Public Clouds.

Integrate Physical Infrast./Blades to meet CAPSIMS®IMEX

Cost, Availability, Performance, Scalability, Inter-operability, Manageability & Security

Integration/Consolidation

Standard IT Infrastructure- Volume Economics HW/Syst SW (Servers, Storage, Networking Devices, System Software(OS, MW & Data Mgmt. SW)

Standardization

Virtualization

Pools Resources. Provisions, Optimizes, Monitors

Shuffles Resources to optimize Delivery of various Business Services

Automatically Maintains Application SLAs

(Self-Configuration, Self-Healing©IMEX, Self-Acctg. Charges etc.)

Automation/SDDC

IT Industry

Roadmap

Source: IMEX Research

Analytics – BI

Predictive Analytics - Unstructured Data

From Dashboards Visualization to Prediction Engines using Big Data.

(5)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

*IOPS for a required response time ( ms) *=(#Channels*Latency-1)

(RAID - 0, 3) 500 100

MB/sec

10 1 5 50

Data

Warehousing

OLAP

Big Data/ Bus.Intelligence (RAID - 1, 5, 6)

IO

PS*

(* L at en cy -1) Data Streaming

Audio

Video

Scientific Computing

Imaging

HPC

Workloads: Mapped on Infrastructure Metrics

10K 100 K 1K 100 10 1000 K OLTP/Database

eCommerce

Transaction

Processing

(6)

IMEX

RESEARCH.COM

Storage performance, management and costs

are big issues in running Databases

 Data Warehousing Workloads

are I/O intensive

• Predominantly read based with low hit ratios on buffer pools

• High concurrent sequential and random read levels

 Sequential Reads requires high I/O Bandwidth (MB/sec)  Random Reads require high IOPS

• Write rates driven by life cycle management and sort operations

 OLTP Workloads

are strongly random I/O intensive

• Random I/O is more dominant

 Read/write ratios of 80/20 are most common but can be 50/50

 Difficult to build out test systems with sufficient I/O characteristics

 Batch Workloads

(Hadoop) are more write intensive

• Sequential Writes requires high I/O Bandwidth (MB/sec)

 Backup & Recovery

times are critical for these workloads

• Backup operations drive high level of sequential IO • Recovery operation drives high levels of random I/O

Workloads: I/O Characteristics

(7)

IMEX

RESEARCH.COM

Driver : Need Real Time Analytics

(8)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

Issue: Server to Storage I/O Gap

P

er

fo

rm

an

ce

1980 1990 2000 2010 A 7.2K/15k rpm HDD can do 100/140 IOPS

For Each Disk Operation,

Millions of CPU Opns. or Thousands of

Memory Opns. can be accomplished

PCIe used for HBAs to connect to (External Shared Storage) via Storage Switches/Fabric as SAN/NAS on HDDs front ended by DRAM Cache creating a gap of 100,000x latency gap

Memory - MultiSlots

Connect to Storage as Direct Attached Storage (Internal Storage)

8 I/O

(9)

IMEX

RESEARCH.COM

© 2010-11 IMEX Research, Copying prohibited. All rights reserved.

9 9 For a targeted query response time in DB & OLTP applications, many more concurrent users can be added cost-effectively when using SSDs or SSD +

HDDs storage vs. adding more HDDs or short-stroking HDDs

Solution: SSDs Improving DB Query Responses

HDDs

14 Drives

HDDs

112 Drives

w short

stroking

12

SSDs

Drives

$$$$$$$$

$$

$$$

IOPS (or Number of Concurrent Users)

Q u e ry R es p o n s e T im e (m s) 0 8 2 4 6 0 10,000 20,000 30,000 40,000

Conceptual Only – Not to Scale

Hybrid HDD/SSD

36 Drives

$$$$

(10)

IMEX

RESEARCH.COM

Industry Trends: Impact on Infrastructure

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

Servers Storage Network Software Services 2011 2012 2013 2014 2015 2016

Systems & Services Market Revenues $B

Workloads need Infrastructure - Optimized for Cost, Availability, Performance …

Lower Cost DRAM NVMe based Flash Memory

Big Data/RealTm Analytics PCIe Servers based SSD BYOD / Boot Storms Client Images on Servers

Cloud Computing New Protocols / REST, HTTP..

Server/Stg. Price/Perf. New Storage Class Memory

Serviceable PCIe SSDs Same Connector - PCIe+SAS/SATA

Scale Out Clustering Distributed Memory Architecture

Dense Blades Fast, Low Power Memory

Multicores Flash for Concurrent Multitasking

Virtualization New Infrastrecture for Multi-VMs

Power Efficiency Green Memory

64 bit Computing Larger Size Memories

Impact on Infrastructure Industry Dynamics

(11)

IMEX

RESEARCH.COM

11

Solution: SSDs Filling Price/Perf Gap

HDD

Tape

DRAM

CPU SDRAM

Performance I/O Access Latency HDD becoming

Cheaper, not faster

DRAM getting

Faster (to feed faster CPUs) & Larger (to feed Multi-cores & Multi-VMs from Virtualization) SCM NOR NAND PCIe SSD SATA SSD Price $/GB

Source: IMEX Research SSD Industry Report ©2010-12

SSD segmenting into

 PCIe SSD Cache

- as backend to DRAM &  SATA SSD

- as front end to HDD

Best Opportunity to fill the gap is for storage to be close to Server CPU.

(12)

IMEX

RESEARCH.COM

Innovations Roadmap: HW Technologies

(13)

IMEX

RESEARCH.COM

Innovations Roadmap – DB SW Technologies

13 OLTP Database Innovation Progress 1985 1990 1995 2000 2005 2010 2015 0.01 0.1 1 10 100 1,000 10,000 $/TPMc TPMc/ Processor 10^4 10^3 10^2 10^-1 10^1 10^0 10^5

(14)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved. 14

(15)

IMEX

RESEARCH.COM

Technology: Disk vs InMem DB Architecture

(16)

IMEX

RESEARCH.COM

Technology: RDBMS vs. In-Memory DBMS

(17)

IMEX

RESEARCH.COM

Technology:Legacy vs In-Memory Computing

(18)

IMEX

RESEARCH.COM

Oracle vs SAP vs IBM DB

(19)

IMEX

RESEARCH.COM

Competition: Oracle DB Architecture

(20)

IMEX

RESEARCH.COM

Competition: SAP/HANA (Multi-Applications)

20

A Converged DB System

• In-memory database combining transactional data processing, analytical data

processing, and application logic processing functionality in memory.

• A full DBMS with a standard SQL interface, high availability, transactional

isolation and recovery (ACID properties)

• both row-based and column-based stores within the same engine (row-based

storage is good for transactional applications, while column-based storage is better for reports and analytics. Column-based storage compresses the better too.)

• massively parallel execution using multicore processors, SAP HANA optimizes the

SQL which scales well with the number of cores. Aggregation operations by spawning a number of threads that act in parallel, each of which has equal access to the data resident on the memory on that node

• Additional functions - freestyle search (as SQL extensions). BI applications using MDX for Microsoft Excel & Consumer Services plus internal I/F for BusinessObjects

• prepackaged algorithms in the predictive analysis library

of SAP HANA to

perform advanced statistical calculations

• built-in text support, from its predecessor BI Accelerator that was based on the TREX

(21)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved. 21

• supports distribution across hosts, where large tables may be partitioned to be processed in parallel. DB “engine” of the SAP HANA Analytics appliance as well

• HANA’s combination of a row and column store is fundamentally different from any other database engine on the market today, which allows it to perform OLTP and analytics processing in memory, at the same time.

• Avoids CPU waiting info from Memory through its unique CPU-cache-aware algorithms and data structures that there is as much useful data in the CPU caches as possible,. • it uses late materialization to decompress columnar structures as late as possible, or to

run operations directly on the compressed data

• also sold as an appliance on Intel Xeon CPUs leveraging insights into Intel’s HyperThreading, Turbo Boost and Threading Building Blocks

• High Performance Analytic Appliance can perform large-scale data analyses on 500 billion records in less than a minute, taking analytics to an entirely new dimension

• represents a complete data warehouse in RAM, and as a result, much accelerated real-time analytics.

• .

(22)

IMEX

RESEARCH.COM

Technology: In-Memory Computing

(23)

IMEX

RESEARCH.COM

Trends: In-Memory Computing Adoption

(24)

IMEX

RESEARCH.COM

Trends: In-Memory DB Computing

© 2010-12 IMEX Research, Copying prohibited. All rights reserved. 24

61 61 48 60 62 41 39 32 51 50 9 8 9 4 10 11 10 9 7 6 15 10 19 16 6 21 21 26 12 11 4 4 4 1 3 3 1 4 3 2 2 5 6 4 3 6 7 8 4 5 9 12 14 15 16 18 22 21 23 26 ERP

Finance & Accounting Order Mgmt HR Mgmt SCM CRM Project Mgmt Info & Knowledge… Enterprise Asset Mgmt

SRM

Primary DB for Each Application

(25)

IMEX

RESEARCH.COM

© 2010-12 IMEX Research, Copying prohibited. All rights reserved.

Are SSDs Ready for Enterprise Storage Systems

Anil Vasudeva, President & Chief Analyst,

IMEX Research

System Architecture

for

In-Memory Database

Anil Vasudeva

President & Chief Analyst

[email protected]

408-268-0800

© 2007-13 IMEX Research

All Rights Reserved

Copying Prohibited

Contact IMEX for authorization

References

Related documents

Stracke, C.M., 2016 Visionary Open innovations Dimensions of Openness Open standards Open resources Open licensing Open recognition Open availability Open

The objective of this study was to explore Grade 9 teachers’ experiences of teaching Economic and Management Sciences (EMS) in three selected schools in the

Ship-based counts of murres at sea in 2008 indicated that high densities were present in this area (Box 3 in Boertmann & Mosbech 2011), so it is likely that Kippaku

Pr Gentry Shelton, prof, ol religioua education, and four Hnte College graduate students attended a meeting nf the 1 hns- dan Education Divitioa of the National Council

This classification scheme was developed by the National Health and Medical Research Council (NHMRC) in the Code of Practice for the Near-Surface Disposal of

It provides the order fulfillment, service assur- ance, and billing capabilities that wireless service providers need to operate the network efficiently, and allows them to

Such services already include mobile Internet, mobile music, wireless broadband, social networking, mobile advertising, mobile commerce, video related services and social

Retrieving signalling history information based on user identity (MSISDN, IMSI) is often used by Customer Service Agents.. The user interface contains a flexible and