Solid State Drives
Solid State Drive (Flash Technology)
Processors
Memory
SSD
Disk
Very, very,
very, very,
very fast
Very, very,
very fast
Very, very slow
comparatively
Fast
Access Speed
1,000,000
-8,000,000 ns
~200,000 ns
~100 ns
< 10’s ns
New Solid State Drive (Flash Technology)
Processors
Memory
SSD
Disk
Very, very,
very, very,
very fast
Very, very,
very fast
Very, very slow
comparatively
Fast
Access Speed
1,000,000
-8,000,000 ns
~200,000 ns
~100 ns
< 10’s ns
Human Time Context
~ 12.5
hours
~33
minutes
~1 second
Basic Problem --- Disk “Slowing” Down (Relatively)
Capacity growing ok (35% per year), but Read/Seek -1% & Data Rate only
15% per year
While processors & memory speed up and add threads and cache
Seagate 15k RPM/3.5" Drive Specifications
73
450
171
75
3.4
3.6
2002
2008
Capacity (GB)
Max Sustained
DR (MB/s)
Read Seek (ms)
+35%
+15%
-1%
SSD Performance versus HDD
Drive to Drive comparisons
SSD offers up to 33x – 125x
more I/O Operations Per
Second (IOPS)
HDD SSD
HDD is much slower
access -- typically 5X to
40X access time*
125X
33X
HDD SSD
5X
40X
1X
1X
Power Solid State Drive
Sweet spots
1.
Batch window reduction for disk bound applications
You can cut up to 40-50% off window
2.
Response time - transaction/data base for disk bound applications
Internal drives or perhaps even SAN drives
Key points
--
A modest quantity of SSD can make a big difference
-- Both write-heavy and read-heavy work is fine for today’s SSD –
Processors
Memory
SSD
Disk
Very, very,
very, very,
very fast
Very, very,
very fast
Very, very slow
comparatively
Fast
Access Speed
1,000,000
-8,000,000 ns
~200,000 ns
~100 ns
< 10’s ns
SSD Client - Batch Window Reduction Example 1
Associated Bank needed to reduce month end
batch run time from 4+ hours to under 3
hours
SSDs cut 1.5 hours from batch run time
Plus a 16% reduction in # of disk drives
Placed eight DB2 Objects (table, index, view)
on SSD
Batch Performance Runs
0
1
2
3
4
5
H
o
u
rs
72 HDD 72 HDD + 8 SSD 60 HDD + 4 SSDSSD run 2
SSD run 1
Base run
2:48
4
60
2:43
8
72
4:22
0
72
Batch
Run Time
# of
SSDs
# of SAS
Disk Drives
40% Reduction
SSD Client - Batch Window Reduction Example 2
Customer in health care industry
needed to reduce batch windows
significantly
Daily batch running 10+ hours
Monthly batch running 30+ hours
Added 12 SSDs to 168 HDDs
Cut 50% from daily run time
Cut 50% from monthly run time
.
H
o
u
rs
50% Reduction
with SSD
Monthly
Daily
Batch Windows
SSD Client – Data Warehouse Example
SAP Customer using Business
Intelligence Warehouse (BW 7.0)
installed an all-SSD configuration of
about 800GB.
Huge improvement in aggregate
builds / compression
Significant reductions in queries
60-87% reductions with
SSD
Sample query 1
Sample query 2
Sample build/compression
7 min
5 min
65%
reduction
60%
reduction
20 min
2 min
4 days
1/2 day
HDD
SSD
HDD
SSD
87%
reduction
HDD
SSD
SSD Client Example – IPL Reduction
IBM Development has projected
modest improvement for IBM i and
for AIX, but …..
A USA IBM i POWER6 520 client with
16 drives (4 SSD and 12 HDD)
Client put load source on SSD
Now reporting 3 minute IPLs
A European IBM i client also
reporting 3 minute IPLs
Less downtime
What is your cost per minute or cost per hour for down
time? How long is your typical IPL?
Many Systems Buy “Extra” Disk Arms for Performance
HDD % utilization of capacity is held low to help ensure higher I/O performance
and more consistent response time - arm movement, spinning platter an issue
Write cache and read cache attempt to buffer the impact
SSD % utilization of capacity not restricted
Often less than 30-50% full for performance sensitive workloads
Mixed SSD + HDD Can be Great Solution
Hot data may be only 10-20% capacity, but represent 80-90% activity
SSD offers best price performance when focused on “hot” data
HDD offers best storage cost, so focus it on “cold” data …. a hierarchical approach
It is typical for data bases to have a large percentage of data which is infrequently
Save Space/Energy/Cooling
A few SSD can often replace many HDD
Fewer total drives
Maybe fewer controllers
Fewer I/O drawers
Fewer cables
I/O drawer maintenance reduction
can offset SSD maintenance
Should You Use SAN/SVC SSD or Internal SSD?
yes
yes
yes
yes
Pro’s
Con’s
Use
Internal
SSD
Use
SAN/SVC
SSD
SSD - SAN/SVC or Internal …. Or Mixed?
Pro’s
Con’s
Use
Internal
SSD
Use
SAN/SVC
SSD
1. Lower latency - internal is about
0.3 milliseconds faster (per access
.. it can add up) (can be larger)
2. If don’t already have SAN/SVC,
lower cost & less learning
3. Multiple Config options
4. Handle just like other internal
storage
1. If already have SAN/SVC, adds
complexity of having internal
PLUS SAN/SVC
2. Reduces value of SAN/SVC high
function capabilities – Flash
Copy, MetroMirror, HA, partition
mobility, etc
1. Leverage SAN/SVC well-known
value proposition of
manageability, control,
functionality, performance, etc
2. If already have SAN/SVC, less
learning, reduced cost to add
3. DS8000 & V7000 EasyTier
1. If don’t have SAN/SVC, need
initial SAN/SVC investment in
hardware, software, skills
2. Many medium/entry SAN/SVC
don’t offer SSD
MIXED: If using a SAN/SVC which doesn’t have SSD, OR if not using the higher
function capability of a SAN/SVC (for example you do backup/restore functions just like
it was an internal drive); THEN combining internal Power Systems SSD with SAN/SVC
storage can be a good thing.
SSD Technology “Myth”
“Beware SSD as they wear out as they have a limited number of
writes”
Key facts to correct myth …. this is not a concern for IBM Power
Systems SSD
IBM Power Systems SSD are industrial/enterprise drives designed to
handle this. Design points are many years of 24x7x365 heavy write
workloads. These are not “PC-grade” or “consumer-grade” flash drives.
Even if you do somehow “use up all the writes”, the SSD reports status prior
to it being a real problem to the server and server sends a message to the
operator or to IBM for a future scheduled repair action. This is just like a
disk drive reporting a weakening status. Replacement SSD is covered
under IBM Maintenance.
Just like disk drives, you want to protect contents using system mirroring or
eMLC – Enterprise Multi-Level Cell - Technology
enterprise
class performance and reliability
PLUS more cost effective MLC
Flash technology
Compared to the 2009 69GB SSD offering from Power Systems
Better cost on a per GB basis,
More dense physical packaging on a per GB basis
About 50% less energy consumption and heat per drive
Comparable performance
eMLC
Implementing “Hot” and “Cold” Data
IBM i
#1
Best integrated, automated
capability available in the industry
today
“Trace and Balance” function part of
IBM i – automated in i 7.1
Monitors by partition or ASP (Aux
Storage Pool) to determine hot/cold
Upon request, automatically moves hot
data to SSD, cold to HDD
Can re-monitor & rebalance any time
A few key OS files can automatically
be placed on SSD
Can specify specific data base
objects to be placed on SSD
See white paper for additional insights
Additional enhancements being
developed
AIX
Ability to granularly/flexibly select and
locate hot files on SSD
Key AIX performance tools are filemon
& iostat.
Database vendors provide hot data
analysis tools, example: DB2 Snapshot.
Migrating hot data
Migratepv is often useful
Softek Migration Tool provides a nice
suite of functions and can be combined
with IBM Services
See Total Storage Productivity Center
SSD white paper for additional
insights
IBM i Load Balancer
Industry leading automated capability
Monitors partition/ASP using “trace”
User turns trace on during a peak time
User turns trace off after reasonable
sample time
Negligible performance impact expected
Tool monitors “reads” to identify hot data
Looks at 1MB stripes of data
Upon command, automatically moves hot
data to SSD, cold data to HDD
Minimal performance impact, done in
background
Can remonitor and rebalance any time
Probably a weekly or monthly activity
Perhaps less often if data not volatile
IBM i intelligent hot/cold placement
makes a big difference vs normal IBM
striping / scattering of data across all
drives.
This example 72 HDD + 16 SSD
Trans/min A p p li c a ti o n R e s p o n s e t im e 72 HDD + 16 SSD No Balance 72 HDD + 16 SSD Data BalancedPredicting/analyzing what % of data is hot for presale analysis to
help size the number of SSD required:
•
Use PEX tool/output
IBM i SSD Balancing One of 5 ASP Balancing Types
Balance data between busy units and idle units (STRASPBAL
TYPE(*USAGE)
)
Make all of the units in the ASP have the same percent full
(STRASPBAL TYPE(*CAPACITY))
Drain the data from a disk, to prepare unit it to be removed from
the configuration (STRASPBAL TYPE(*MOVDTA))
(Almost obsolete) move hot data off of a compressed disk, and
move cold data to the compressed disk (STRASPBAL
TYPE(*HSM))
Requires specific disk controllers with compression
capability – feats #2741/2748/2778. Compression only allowed in user ASPs
Move cold data to HDDs and move
hot data to SSDs
Using TRCASPBAL to place hot data on SSDs – IBM i
Trace ASP balance counts the read operations based
on 1MB stripes
TRCASPBAL SET(*ON) ASP(1) TIMLMT(*NOMAX)
Start ASP balance moves the data
STRASPBAL TYPE(*HSM) ASP(1) TIMLMT(*NOMAX)
Target is 50% of read operations to be on SSD
Cold data is moved (multiple threads) to HDDs, hot data is
moved (single thread) to SSD
HDD3
HDD4
SSD
HDD2
HDD1
100 500 100 2000 6000 10000 1200 6000 3000 100 300 800 500 900 900 0 4000 300 400 300 200 600 700 1000 100 100 300 900 100 100 300 900 100 10000 6000 4000 10000 6000 4000How to Find Hot Tables and Indexes - IBM i
Performance Explorer
BY FAR the best solution
Perform analysis based on read complete and write
complete events
DB2 maintains statistics about the number of
operations on a table or index
Statistics are zeroed on each IPL
Statistics only identify candidates (logical operations include
both random and sequential operations)
Available via:
Display file description (DSPFD)
Application programming interface (API) QUSRMBRD
System i Navigator Health Center (V6R1 only)
Power Systems SSD Configuration Options
SAS-bay-based
Option introduced 2009 & 1H 2011
PCIe-based
Introduction 2H 2010
PCI SAS
controller
SAS Bays
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
PCIe SAS
controller
S
S
D
S
S
D
S
S
D
S
S
D
69 GB SSD & 177 GB SSD
177 GB SSD
Can include imbedded SAS controller
S
S
Configuring SAS-Bay-Based SSD
SSD leverages SAS HDD infrastructure
Located in a SAS drive bay – same as SAS HDD would
use
Run by SAS adapter/controller – same as HDD would use
Contents can be protected by RAID or mirroring same as
HDD
RAID-5, RAID-6, RAID-10, mirroring, hot spare
Drives can use hot swap capability of SAS bay
PCIe-Based SSD
PCIe SAS Adapter / Double-wide card
4 SSD bays on card / 1, 2 or 4* SSD modules per adapter
177 GB per SSD module / Up to 708 GB per card
Supported OS: AIX 5.3 or later, IBM i 7.1, REHL 5.5 or later,
SLES 10 or later
Supported servers: 710/720/730/740/750/770/780 (not 795)
177GB
SSD
SAS
Cntrl
177GB
SSD
177GB
SSD
177GB
SSD
Power Systems SSD Statements of Direction
Provided August 2010
IBM plans to enhance its Power Systems Enterprise Class
SSD solutions with technology designed to continue to
provide significant improvements in performance and
storage density over time. IBM plans for these IBM Power
Systems enhancements to include both SAS-bay-based
and PCIe-based SSD product offerings that will leverage
IBM's investments in its SSD optimized Enterprise Class
RAID Storage Controllers.
Roadmap: SSD Configuration 2011 Options
SAS-bay-based
PCIe-based
PCI SAS
controller
SAS Bays
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
S
S
D
PCIe SAS
controller
S
S
D
S
S
D
S
S
D
S
S
D
Double-wide PCIe card
(need pair for hot plug)
Up to 4 SSD modules
Larger capacity SSD
Continue Hot plug SSD
More SSD modules / controller
New PCIe card
(pair for redundancy)
Large cache, high performance
S
S
D
S
S
D
S
S
D
See SODs made August 2010. All statements regarding IBM's future direction & intent are subject to change or withdrawal
without notice, and represents goals & objectives only.
SSD Analyzer Tool for IBM i
SSD ANALYSIS TOOL (ANZSSDDTA)
Type choices, press Enter.
PERFORMANCE MEMBER . . . *DEFAULT__ Name, *DEFAULT
LIBRARY . . . __________ Name
Additional Parameters
REPORT TYPE . . . *SUMMARY *DETAIL, *SUMMARY, *BOTH
TIME PERIOD::
START TIME AND DATE::
BEGINNING TIME . . . *AVAIL__ Time, *AVAIL
BEGINNING DATE . . . *BEGIN__ Date, *BEGIN
END TIME AND DATE::
ENDING TIME . . . *AVAIL__ Time, *AVAIL
ENDING DATE . . . *END____ Date, *END
NUMBER OF RECORDS IN REPORT . . 50__ 0 - 9999
Bottom
F3=Exit F4=Prompt F5=Refresh F12=Cancel F13=How to use this display
F24=More keys
• Quick, easy, no-charge analysis looks at standard performance report output
• Provides “probably yes”, “probably no”, or “maybe
Solid State Drives
Where do these new drives fit in
Trademarks and Disclaimers
8IBM Corporation 1994-2010. All rights reserved.
References in this document to IBM products or services do not imply that IBM intends to make them available in every country.
Trademarks of International Business Machines Corporation in the United States, other countries, or both can be found on the World Wide Web at
http://www.ibm.com/legal/copytrade.shtml.
Adobe, Acrobat, PostScript and all Adobe-based trademarks are either registered trademarks or trademarks of Adobe Systems Incorporated in the United States, other countries, or both.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.
Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both.
IT Infrastructure Library is a registered trademark of the Central Computer and Telecommunications Agency which is now part of the Office of Government Commerce. ITIL is a registered trademark, and a registered community trademark of the Office of Government Commerce, and is registered in the U.S. Patent and Trademark Office.
UNIX is a registered trademark of The Open Group in the United States and other countries.
Cell Broadband Engine and Cell/B.E. are trademarks of Sony Computer Entertainment, Inc., in the United States, other countries, or both and are used under license therefrom.
Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others.
Information is provided "AS IS" without warranty of any kind.
The customer examples described are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics may vary by customer.
Information concerning non-IBM products was obtained from a supplier of these products, published announcement material, or other publicly available sources and does not constitute an endorsement of such products by IBM. Sources for non-IBM list prices and performance numbers are taken from publicly available information, including vendor announcements and vendor worldwide homepages. IBM has not tested these products and cannot confirm the accuracy of performance, capability, or any other claims related to non-IBM products. Questions on the capability of non-IBM products should be addressed to the supplier of those products.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Some information addresses anticipated future capabilities. Such information is not intended as a definitive statement of a commitment to specific levels of performance, function or delivery schedules with respect to any future products. Such commitments are only made in IBM product announcements. The information is presented here to communicate IBM's current investment and development activities as a good faith effort to help with our customers' future planning.
Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput or performance improvements
This document was developed for IBM offerings in the United States as of the date of publication. IBM may not make these offerings available in
other countries, and the information is subject to change without notice. Consult your local IBM business contact for information on the IBM
offerings available in your area.
Information in this document concerning non-IBM products was obtained from the suppliers of these products or other public sources. Questions
on the capabilities of non-IBM products should be addressed to the suppliers of those products.
IBM may have patents or pending patent applications covering subject matter in this document. The furnishing of this document does not give
you any license to these patents. Send license inquires, in writing, to IBM Director of Licensing, IBM Corporation, New Castle Drive, Armonk, NY
10504-1785 USA.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives
only.
The information contained in this document has not been submitted to any formal IBM test and is provided "AS IS" with no warranties or
guarantees either expressed or implied.
All examples cited or described in this document are presented as illustrations of the manner in which some IBM products can be used and the
results that may be achieved. Actual environmental costs and performance characteristics will vary depending on individual client configurations
and conditions.
IBM Global Financing offerings are provided through IBM Credit Corporation in the United States and other IBM subsidiaries and divisions
worldwide to qualified commercial and government clients. Rates are based on a client's credit rating, financing terms, offering type, equipment
type and options, and may vary by country. Other restrictions may apply. Rates and offerings are subject to change, extension or withdrawal
without notice.
IBM is not responsible for printing errors in this document that result in pricing or information inaccuracies.
All prices shown are IBM's United States suggested list prices and are subject to change without notice; reseller prices may vary.
IBM hardware products are manufactured from new parts, or new and serviceable used parts. Regardless, our warranty terms apply.
Any performance data contained in this document was determined in a controlled environment. Actual results may vary significantly and are
dependent on many factors including system hardware configuration and software design and configuration. Some measurements quoted in this
document may have been made on development-level systems. There is no guarantee these measurements will be the same on
generally-available systems. Some measurements quoted in this document may have been estimated through extrapolation. Users of this document
should verify the applicable data for their specific environment.
Revised September 26, 2006
IBM, the IBM logo, ibm.com AIX, AIX (logo), AIX 6 (logo), AS/400, Active Memory, BladeCenter, Blue Gene, CacheFlow, ClusterProven, DB2, ESCON, i5/OS, i5/OS (logo), IBM Business Partner (logo), IntelliStation, LoadLeveler, Lotus, Lotus Notes, Notes, Operating System/400, OS/400, PartnerLink, PartnerWorld, PowerPC, pSeries, Rational, RISC System/6000, RS/6000, THINK, Tivoli, Tivoli (logo), Tivoli Management Environment, WebSphere, xSeries, z/OS, zSeries, AIX 5L, Chiphopper, Chipkill, Cloudscape, DB2 Universal Database, DS4000, DS6000, DS8000, EnergyScale, Enterprise Workload Manager, General Purpose File System, , GPFS, HACMP, HACMP/6000, HASM, IBM Systems Director Active Energy Manager, iSeries, Micro-Partitioning, POWER, PowerExecutive, PowerVM, PowerVM (logo), PowerHA, Power Architecture, Power Everywhere, Power Family, POWER Hypervisor, Power Systems, Power Systems (logo), Power Systems Software, Power Systems Software (logo), POWER2, POWER3, POWER4, POWER4+, POWER5, POWER5+, POWER6, POWER7, pureScale, System i, System p, System p5, System Storage, System z, Tivoli Enterprise, TME 10, TurboCore, Workload Partitions Manager and X-Architecture are trademarks or registered trademarks of International Business Machines Corporation in the United States, other countries, or both. If these and other IBM trademarked terms are marked on their first occurrence in this information with a trademark symbol (® or ™), these symbols indicate U.S. registered or common law trademarks owned by IBM at the time this information was
published. Such trademarks may also be registered or common law trademarks in other countries. A current list of IBM trademarks is available on the Web at "Copyright and trademark information" at www.ibm.com/legal/copytrade.shtml
The Power Architecture and Power.org wordmarks and the Power and Power.org logos and related marks are trademarks and service marks licensed by Power.org. UNIX is a registered trademark of The Open Group in the United States, other countries or both.
Linux is a registered trademark of Linus Torvalds in the United States, other countries or both.
Microsoft, Windows and the Windows logo are registered trademarks of Microsoft Corporation in the United States, other countries or both.
Intel, Itanium, Pentium are registered trademarks and Xeon is a trademark of Intel Corporation or its subsidiaries in the United States, other countries or both. AMD Opteron is a trademark of Advanced Micro Devices, Inc.
Java and all Java-based trademarks and logos are trademarks of Sun Microsystems, Inc. in the United States, other countries or both. TPC-C and TPC-H are trademarks of the Transaction Performance Processing Council (TPPC).
SPECint, SPECfp, SPECjbb, SPECweb, SPECjAppServer, SPEC OMP, SPECviewperf, SPECapc, SPEChpc, SPECjvm, SPECmail, SPECimap and SPECsfs are trademarks of the Standard Performance Evaluation Corp (SPEC).
NetBench is a registered trademark of Ziff Davis Media in the United States, other countries or both. AltiVec is a trademark of Freescale Semiconductor, Inc.
Cell Broadband Engine is a trademark of Sony Computer Entertainment Inc.
InfiniBand, InfiniBand Trade Association and the InfiniBand design marks are trademarks and/or service marks of the InfiniBand Trade Association. Other company, product and service names may be trademarks or service marks of others.