• No results found

Doc. Code. OceanStor VTL6900 Technical White Paper. Issue 1.1. Date Huawei Technologies Co., Ltd.

N/A
N/A
Protected

Academic year: 2021

Share "Doc. Code. OceanStor VTL6900 Technical White Paper. Issue 1.1. Date Huawei Technologies Co., Ltd."

Copied!
33
0
0

Loading.... (view fulltext now)

Full text

(1)

Doc. Code

OceanStor VTL6900 Technical

White Paper

Issue 1.1

Date 2012-07-30

(2)
(3)

Copyright © Huawei Technologies Co., Ltd. 2012. All rights reserved.

No part of this document may be reproduced or transmitted in any form or by any means without prior written consent of Huawei Technologies Co., Ltd.

Trademarks and Permissions

and other Huawei trademarks are trademarks of Huawei Technologies Co., Ltd.

All other trademarks and trade names mentioned in this document are the property of their respective holders.

Notice

The purchased products, services and features are stipulated by the contract made between Huawei and the customer. All or part of the products, services and features described in this document may not be within the purchase scope or the usage scope. Unless otherwise specified in the contract, all statements, information, and recommendations in this document are provided "AS IS" without warranties, guarantees or representations of any kind, either express or implied.

The information in this document is subject to change without notice. Every effort has been made in the preparation of this document to ensure accuracy of the contents, but all statements, information, and recommendations in this document do not constitute a warranty of any kind, express or implied.

Huawei Technologies Co., Ltd.

Address: Huawei Industrial Base Bantian, Longgang Shenzhen 518129

People's Republic of China Website: http://www.huawei.com

Email: [email protected]

Tel: 0755-28560000 4008302118

(4)
(5)

OceanStor

Technical White Paper for OceanStor VTL6900 Contents

Contents

1 Executive Summary ... vii

2 Introduction... 2-1

3 Solution ... 3-1

3.1 Deduplication ... 3-1 3.1.1 Introduction to Deduplication ... 3-1 3.1.2 Deduplication Principle of the VTL6900 ... 3-2 3.2 High Availability Cluster ... 3-5 3.2.1 Introduction to the High Availability Cluster ... 3-5 3.2.2 High Availability of the VTL6900 ... 3-6 3.3 IP Replication ... 3-8 3.4 Tape Caching ... 3-12 3.4.1 Data Migration Policies ... 3-13 3.4.2 Space Reclamation Policy ... 3-14 3.5 Tape Encryption ... 3-15 3.6 Energy Saving and Consumption Reduction ... 3-16 3.6.1 Energy Saving by Deduplication ... 3-16 3.6.2 Energy Saving by Disk Spin-Down ... 3-16

4 Experience ... 4-1

4.1 Typical Application and Benefit to Customers ... 4-1 4.1.1 VTL Backup System ... 4-1

(6)
(7)

OceanStor

Technical White Paper for OceanStor VTL6900 1 Executive Summary

1

Executive Summary

(8)
(9)

OceanStor

Technical White Paper for OceanStor VTL6900 2 Introduction

2

Introduction

With the explosive data increase, the backup and recovery speeds of traditional tape backup systems cannot meet the backup requirements of customers. As disk technologies develop rapidly, the capacity of disk media greatly increases, and the price of disk storage devices significantly decreases in the unit capacity. Backup systems that use Serial Advanced Technology Attachment (SATA) disks are widely deployed in IT environments of customers and gain favor from customers with the high backup and recovery performance. VLT products are representatives of the backup systems. VTL products inherit advantages such as high performance, easy maintenance, and cutting-edge mature media management of disk devices. Therefore, they keep a good development trend since the launching and have bright market prospects.

For mid-range and high-end customers, the data amount involved in one time of full backup may reach tens of terabytes. Provided that the backup window is eight hours, the backup system must provide a backup rate of at least 1000 Mbit/s. Provided that incremental backup is performed once a day, full backup is performed once a week, and the storage period is three months, the storage capacity of backup devices must reach hundreds of terabytes. The construction, operation, and management of such a large-scale disk backup system require large investments in storage, power consumption, and management. In addition, mid-range and high-end customers have high requirements for the availability of both production systems and backup systems. Common single-engine VTL products cannot meet the high availability requirements.

(10)
(11)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

3

Solution

The VTL6900 is oriented at mid-range and high-end customers. It uses deduplication, high availability clusters, and disk spin-down to solve the problems facing customers.

This chapter describes three features of the VTL6900, including deduplication, high availability clusters, IP replication,tape caching and energy saving and consumption reduction.

3.1 Deduplication

3.1.1 Introduction to Deduplication

The deduplication technology eliminates repetitive data through hardware or software to reduce the occupied storage space.

In a backup system, data is transferred from a backup client (source end) to a backup device (destination end) under the control of the backup server. Based on the location where deduplication is performed, deduplication is divided into the deduplication at the source end and deduplication at the destination end. In the deduplication at the source end, a backup client processes data before transferring the data to a backup device. It aims to eliminate repetitive data and send unique data to the backup device. In the deduplication at the destination end, a backup client transfers data to a backup device without processing the data. After receiving the data, the backup device performs deduplication.

Based on the time when deduplication is performed, deduplication at the destination end is divided into in-line deduplication and post processing deduplication). In the in-line deduplication, a backup device performs deduplication while receiving backup data. The deduplication is complete when the backup data reception is complete. In the post processing deduplication, a backup device performs deduplication after the backup is complete, that is, the backup device receives all backup data and then performs deduplication against the received backup data at a specified time point.

(12)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-2 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

space. Therefore, index-based comparison can be directly performed within the memory space, providing a higher efficiency than content-based comparison. At present, index-based comparison has been widely applied to various deduplication technologies.

3.1.2 Deduplication Principle of the VTL6900

The VTL 6900 supports both post processing deduplication and in-line deduplication. SIR stands for single instance repository.

Post Processing Deduplication

Index-based comparison is applied to the post processing deduplication supported by the VTL6900.

Figure 3-1 Principle of post processing deduplication

In post processing deduplication, the VTL6900 software consists of two modules: VTL module and SIR module. The storage space of the VTL6900 is logically divided into two parts, which are respectively occupied by the VTL module and SIR module. The VTL storage space is also called cache, and the SIR storage space is also called repository. After receiving the backup data, the VTL6900 stores it in the VTL storage space. The SIR module obtains the backup data at a specified time point, when the backup is complete, or when the storage level reaches the specified value, and compares the backup data with existing data blocks in the SIR storage space. The SIR module divides original backup data into kilobyte-size data blocks and calculates a hash value (also called an index) for each data block based on the secure hash algorithm 1 (SHA-1). By comparing the hash values of new data blocks and those of existing data blocks, the SIR module identifies repetitive data blocks. Repetitive data blocks will be discarded, with pointers to the data blocks retained. Unique new data blocks will be stored in the SIR storage space. See Figure 3-1.

(13)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

Figure 3-2 Original virtual tape data on the VTL6900

Before the SIR engine performs deduplication, backup data is stored on virtual tapes in the VTL storage space, as shown in Figure 3-2. After the deduplication is complete, data stored on the virtual tapes is replaced with pointers, and the virtual tapes are called virtual index tapes (VITs). Each pointer points to a single-instance data block in the SIR storage space. The released VTL storage space is used to store new backup data. See Figure 3-3.

Figure 3-3 Data distribution on the VTL6900 after the deduplication

Figure 3-4 Usage of VTL6900 storage space in post processing deduplication

(14)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-4 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

deduplication. New backup data stored in the cache is compared with data blocks on the SIR data disk, to identify repetitive data. The SIR index disk is used to store the indexes (namely SHA-1 hash values) of all data blocks on the SIR data disk. The capacity of the SIR index disk increases with the capacity of the SIR data disk. The SIR module reads all indexes (index table) from the SIR index disk and writes them to the memory of the SIR engine, providing quick search in the index table. The required memory capacity of the SIR engine increases with the capacity of the SIR index disk.

In-line Deduplication

Index-based comparison is applied to the in-line deduplication supported by the VTL6900. Figure 3-5 shows the in-line deduplication process.

Figure 3-5 Principle of in-line deduplication

(15)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

Figure 3-6 Usage of VTL6900 storage space in inline deduplication

 If errors occur during the backup in in-line deduplication, most backup software will write the backup data to a new tape.

 If errors occur at the beginning of the backup in in-line deduplication, post processing deduplication will be triggered instead.

3.2 High Availability Cluster

3.2.1 Introduction to the High Availability Cluster

If a node in a high availability cluster becomes faulty and cannot work properly, another node in the high availability cluster will take over the work of the faulty node.

A high availability cluster consists of an active node and standby nodes. The active node is a node that is executing tasks. A standby node is a backup of the active node. When the active node becomes faulty, a standby node will take over the work of the active node.

A high availability cluster is implemented based on resource switchover. The resource refers to the collection of information concerning the work taken over by a standby node when the active node in a high availability cluster becomes faulty. The standby node operates properly after taking over the resource of the faulty node, minimizing the impacts on the client.

(16)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-6 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

This section is prepared by referring to the Storage Overview.

3.2.2 High Availability of the VTL6900

Both VTL and SIR software modules of the VTL6900 support the high availability feature. That is, the high availability design is applied to both the VTL engine and SIR engine.

High Availability of the VTL Engine

The VTL6900 supports two VTL engines, which can work in bidirectional active-standby mode (bidirectional failover).

Figure 3-7 High availability configuration of the VTL engines

As shown in Figure 3-7, when VTL A and VTL B operate properly, both of them provide external VTL services. In unidirectional failover mode, provided that VTL A is the active node and VTL B is the standby node, VTL B monitors the status of VTL A and takes over the work of VTL A if VTL A becomes faulty. However, if VTL B becomes faulty, VTL A does not take over the work of VTL B. In bidirectional failover mode, VTL A and VTL B monitor each other. If either of them becomes faulty, the other VTL takes over the work of the faulty VTL and provides external VTL services.

(17)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

machine after the takeover. After the takeover, the VTL A software application accesses the original storage memory (VTL storage memory) through the shared storage and continues to provide VTL services for the original backup server through the standby host port. The shared storage functions as follows: Both the VTL A physical machine and VTL B physical machine are connected to the storage unit of the VTL6900 through physical channels. When running on the VTL B physical machine, the VTL A software application can still access its original storage memory by using the physical channel of the VTL B physical machine. The standby host port functions as follows (take the fiber channel [FC] as an example): VTL A provides external VTL services by using FC port Target wwpn1. Correspondingly, VTL B provides an FC port Standby wwpn3 as a standby port for Target wwpn1. In actual configurations, FC ports Target wwpn1 and Standby wwpn3 are connected to the backup server of the VTL A through FC channels. When running on the VTL B physical machine, the VTL A software application can still provide VTL services for its original backup server by using port Standby wwpn3.

In bidirectional failover mode, VTL A and VTL B software applications monitor the status of each other through the heartbeat network. If either of them cannot provide external VTL services properly due to software, hardware, or channel (connected to the storage unit) faults, failover will be automatically triggered. The normal node will take over the work of the faulty node. The takeover process takes about four minutes. After the faulty node recovers, failback will be triggered automatically (or manually), and the takeover terminates.

High Availability of the SIR Engine

The VTL6900 supports three SIR engines. Two of the SIR engines function as active nodes, and the rest one functions as the standby node.

Figure 3-8 High availability configuration of the SIR engines

(18)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-8 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

standby SIR physical machine. After SIR 1 recovers, it automatically functions as the standby SIR engine after being powered on.

In the high availability configurations of SIR engines, all SIR engines can access the SIR storage memory through physical channels. Each SIR engine can access its own storage unit and the storage units of other SIR engines. This ensures that all SIR software applications can access their own storage units when they run on any SIR physical machines. In addition, all SIR engines are interconnected with VTL engines through physical channels. Therefore, SIR engines can read original backup data from the VTL storage memory for deduplication, and VTL engines can read data from the SIR storage memory for recovery. Generally, the physical connections of all SIR engines are the same, ensuring free switchover between the active and standby nodes.

Moreover, SIR software modules also provide the high performance feature. In high availability configurations, three SIR engines of the VTL6900 work in 2+1 active-standby mode. During actual operation, the two active SIR engines perform deduplication. They constitute a high performance cluster, improving the deduplication performance and efficiency. The VTL6900 also supports the configuration of one SIR engine or two SIR engines. In this case, the SIR engines do not provide the high availability feature. When two SIR engines are configured, they constitute a high performance cluster.

3.3 IP Replication

Replication is a common technology used for disaster recovery. Data replication refers to copying data from one medium onto another medium and generating a data copy by using the data replication software.

The traditional disaster recovery generally uses the transportation method. The backup software copies data onto a physical tape library, and the physical tape library is transported to a remote place for preservation. During the transportation, tapes may get lost or damaged; thus, the effect of disaster recovery cannot be ensured.

Over an IP network, the local VTL6900 copies data on virtual tapes to the remote VTL6900. Through this method, the VTL6900 utilizes the convenience and high speed of the network to save the transportation cost. The local VTL6900 encrypts the tape data by using the

encryption algorithm before data transfer. Then the remote VTL6900 decrypts the data after receiving it. As a result, the data security during transfer is ensured.

The VTL6900 provides four options for the IP replication:  Remote Copy

 Automatic Replication  IP Replication

 Replication upon De-duplication.

(19)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

Table 3-1 Four options of IP replication O p t i o n T y p e Description A u t o R e p l i c a t i o n A u t o m a t i c

When a virtual tape is exported from the VTL, the system automatically copies the data on the virtual tape to another VTL6900. R e m o t e C o p y M a n u a l

The data on a virtual tape is copied to another VTL as required. I P R e p l i c a t i o n A u t o m a t i c

Within the specified interval and according to the

(20)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-10 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11) O p t i o n T y p e Description R e p l i c a t i o n u p o n D e -d u p l i c a t i o n A u t o m a t i c

When the de-duplication function is enabled, the deletion policy is integrated with the replication policy. The changed data is copied to another VTL6900 according to the replication policy.

These four options differ mainly in the replication triggering mechanism.

 Auto Replication is triggered by the backup software. If the VTL is set Auto Replication, the replication of the virtual tape is triggered when the VTL receives the eject command from the backup software (For a physical tape library, the eject command for the backup software means to eject the tape out of the physical tape library; for a virtual tape library, this command means to put the virtual tape into the virtual vault).

(21)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

in the remote VTL6900 must not have the same name as any virtual tape in the local VTL6900.

 IP Replication is triggered based on the policy. The policy can be:

− Data increment-based replication policy.

The VTL6900 can identify the amount of the data backed up to the tape each time. If the data increment exceeds the pre-set threshold, the replication is automatically triggered after the copy.

− Time point-based replication.

The user can specify the time point for the first replication and the replication interval for each virtual tape. Then, the data on the virtual tape will be copied according to the specified time point. The remote virtual tape that adopts IP Replication must be promoted manually before use.

 Replication upon De-duplication is manually triggered based on the policy.

The triggering condition can be the specific date or time point, or upon the completion of the backup operation. The local VTL6900 transfers the data after de-duplication to the remote VTL6900 over an IP network. After de-duplication, data blocks instead of data are transferred during the IP replication. The bandwidth occupation decreases and the transfer efficiency increases. As a result, the remote data-level disaster recovery can be implemented with low costs, easy deployment, and high efficiencies.

The remote IP replication has the following scenarios:  One VTL6900 copies data to the remote VTL6900. Figure 3-9 Networking of one-to-one remote disaster recovery

(22)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-12 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11) Figure 3-10 Networking of many-to-one remote disaster recovery

3.4 Tape Caching

Tape Caching is an advanced function of the VTL6900. This function uses the high-speed VTL6900 as the high-speed cache of the physical tape library. The backup data is written to the VTL6900 first. After the backup operation is complete, the VTL6900 migrates the backup data to the physical tape library according to the preset policy. In this way, the hierarchical storage architecture forms.

The VTL6900 can shorten the backup window and quickly recover data. Physical tape libraries are suitable for large-capacity offline data. Therefore, the VTL6900 can be combined with physical tape libraries to implement the hierarchical storage. The principles of the hierarchical storage include:

 The data that needs to be archived for a long time is stored on the physical tape libraries.  The frequently-used data is stored in the VTL.

The VTL takes over the physical tape libraries.

Physical tape libraries have the slow backup speed and disks are unsuited for seldom-accessed data for a long time. The hierarchical storage eliminates the shortcomings of physical tape libraries and disks.

(23)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

Figure 3-11 Networking of the hierarchical storage

Data can be recovered directly from the VTL or physical tape library. To fully utilize the high-speed cache, the VTL6900 provides various migration triggering policies and space reclaiming policies.

3.4.1 Data Migration Policies

Tape Caching provides two policies for triggering data migration between the VTL6900 and the physical tape library: 1) time-based migration; 2) intelligent migration. Table 3-2 and Table 3-3 list the two policies.

Table 3-2 Time-based migration policy Policy Name Description Certain time

point each day

Migration is performed in a one-day cycle. The VTL6900 starts data migration at the specified time point each day.

Certain time point

each week

Migration is performed in a one-week cycle. The VTL6900 starts data migration at the specified time point each day from Monday to Saturday.

Table 3-3 Intelligent migration policy Policy Name Description

And/Or Conjunction/disjunction of the intelligent policy. The option And means migration is triggered only when all conditions are met; or means that migration is triggered when any condition is met.

Data storage period

(24)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-14 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

Policy Name Description

Watermark Migration is triggered when the usage of the disk space of the VTL6900 reaches 90%.

After backup (tape space used out)

Migration is triggered after each backup. "Tape space used out" is the additional policy for "after backup". If the two options are chosen, the VTL6900 checks the usage of the virtual tape when a virtual tape is ejected out of the tape drive. If the space of this tape is used out, migration is triggered.

Postponed to a certain time point

Migration is postponed to a specific time point after the condition is met this time. This policy must be used together with the preceding three policies. When the condition of any preceding policy is met, migration can be postponed to a specific time point.

The time-based migration policy and intelligent migration policy cannot be used simultaneously. For the time-based migration policy, "Certain time point each day" and "Certain time point each week" cannot be used at the same time. The user can only select either for the condition of triggering migration. Multiple options of the intelligent policy can be chosen simultaneously. The options can be combined to meet different requirements of migration.

3.4.2 Space Reclamation Policy

To fully utilize the cache, the VTL6900 provides two space reclamation policies to ensure the space utilization: 1) intelligent reclamation; 2) reclamation upon de-duplication. Table 3-4 lists the reclamation methods.

Table 3-4 Reclamation methods Policy Name Description Intelligent

reclamation

The space occupied by the virtual tapes of the VTL6900 used as the cache is reclaimed. That is, the data on these virtual tapes is deleted and only the indexes to the physical tapes are reserved.

Reclamation upon

de-duplication

Through the de-duplication algorithm, the duplicate data is deleted to release the storage space of the VTL6900.

Table 3-5 lists the methods of triggering space reclamation. Table 3-5 Methods of triggering space reclamation

(25)

OceanStor

Technical White Paper for OceanStor VTL6900 3 Solution

Policy Name Description Immediate

reclamation

After the migration is complete, the space originally occupied by the migrated data is reclaimed.

Watermark When the remaining disk space accounts for less than 10% of the total space, the space originally occupied by the migrated data is reclaimed. This trigger method is available only under intelligent reclamation.

Users do not need to worry about data loss. The VTL6900 only reclaims the space originally occupied by the migrated data. The space occupied by the other data will not be reclaimed. Thus, the data security and consistency are ensured.

3.5 Tape Encryption

To ensure the security of the data stored on tapes, the VTL6900 encrypts tapes when data is transferred to physical tape libraries.

Figure 3-12 Tape encryption

(26)

3 Solution

OceanStor Technical White Paper for OceanStor VTL6900

3-16 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

3.6 Energy Saving and Consumption Reduction

According to the statistics made by the International Energy Agency (IEA) in 2008, global energy consumption had increased by 73% from 1973 to 2006. Since 1970s, the price of energy has been increasing, causing a cost pressure in industry and manufacturing. Therefore, the whole society attaches more importance to energy saving. In the context of energy saving promotion, users pay more attention to the energy saving effect of storage products. At present, many organizations and enterprises consider the energy saving performance as an indispensable factor for selecting storage products. To a certain extent, the energy saving performance determines the success or failure of a product.

3.6.1 Energy Saving by Deduplication

Deduplication significantly reduces the storage investments, management costs, and power consumption of storage systems for users. Provided that the amount of data to be stored is the same, users can save more data in the same storage capacity by using deduplication. With the same storage density, decrease in the storage capacity indicates decrease in the number of the storage devices required, which therefore reduces the power consumption.

3.6.2 Energy Saving by Disk Spin-Down

In addition to deduplication, disk spin-down can also reduce the power consumption of storage systems. The operating principle of disk spin-down is as follows: Disks that are not accessed in a long time enters the spin-down or even power-off state, for the purpose of saving energy and extending the service life of disks. On storage devices that use the disk spin-down technology, disks without read/write operations are in the spin-down state, and disks with read/write operations are in the running state. If read/write operations are performed on the disks in the spin-down state, the disks will be spun up and enter the running state. After the read/write operations are complete, disks in the running state will enter different levels of spin-down states as required. Therefore, the disk spin-down technology applies to large-capacity near-line storage devices and tiered storage devices that provide a low access frequency and require low instant availability for data. It especially applies to data backup and archiving devices oriented at data recovery, for example, VTL devices. For a VTL device that supports disk spin-down, within the hours of backup window each day, all disks on the VTL device are in the running state under extreme conditions. In the rest time of the day, all disks on the VTL device are in the spin-down state.

(27)

OceanStor

Technical White Paper for OceanStor VTL6900 4 Experience

4

Experience

The VTL6900 supports clustered VTL and SIR engines, providing the backup performance of 8600 MB/s and a raw storage capacity of 2304TB. In addition, the VLT6900 provides deduplication and disk spin-down, meeting the requirements of mid-range and high-end users for high performance, large capacity, energy saving and consumption reduction, and high availability.

This chapter describes the VTL6900 solution and benefit to customers in two typical application scenarios: VTL backup system and remote backup system. For details about the solution and benefit to customers in other application scenarios such as tiered backup and remote disaster recovery, see the Technical White Paper for the VTL6900.

4.1 Typical Application and Benefit to Customers

4.1.1 VTL Backup System

Application Scenario

The VTL backup system applies to the following scenarios:

A backup system needs to be built: No backup system is available or the existing backup system needs to be improved, and therefore the customer needs to select new backup devices.

 The VTL6900 is used to replace the existing physical tape library in the backup system of a customer: The customer has constructed a backup system using the physical tape library. However, the physical tape library needs to be replaced with a new backup device for superior performance, reliability, and management.

Solution

(28)

4 Experience

OceanStor Technical White Paper for OceanStor VTL6900

4-2 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)

The VTL6900 is connected to the backup server through FC storage area network (SAN) or IP SAN.

Benefit to Customers

The VTL backup system provides the following benefits for customers:

 High performance, meeting customers' requirements for the backup window: When only one VTL engine is configured, the VTL6900 provides the backup performance of 9TB/hr and can back up 63 TB data within eight hours. When two engines are configured, the VTL6900 provides the backup performance of 31TB/hr and can back up 239 TB data within eight hours. The VTL6900 meets the backup window requirements of customers whose data amount in one backup does not exceed 239 TB.

Large capacity, meeting customers' requirements for the storage capacity: When two VTL engines are configured, the VTL6900 provides a maximum raw capacity of 2304 TB, with an available capacity of 1690 TB. When the deduplication function is configured, the VTL6900 provides a maximum capacity of 220 TB for storing the deduplication data. Provided that the deduplication ratio is 20:1, the VTL6900 can store 4 PB backup data. This meets the backup capacity requirements of mid-range and high-end customers.

Deduplication, reducing power consumption and investment in storage: The VTL6900 supports deduplication and disk spin-down. This significantly reduces the disk storage capacity required by the backup system and therefore reduces the power consumption and investment in storage.

(29)

OceanStor

Technical White Paper for OceanStor VTL6900 4 Experience

Remote Backup System

Application Scenario

The remote backup system applies to the following scenarios:  Network bandwidth resources of a customer are insufficient.

 Besides the data stored at the data center, data on multiple remote branch nodes of a customer needs to be backed up.

 No backup system is available for a customer. Alternatively, the customer has a backup system, but the available bandwidth resources used between branch nodes and the data center are insufficient. As a result, the branch nodes provide low data backup performance and cannot meet customers' requirements for the backup window.

Solution

Figure 4-1 shows the networking for the remote backup system. Figure 4-1 Remote backup system

 An integrated or All In One VTL6900 device that supports deduplication is deployed on each branch node. The VTL6900 that supports deduplication is configured in high availability mode at the data center. For details, see the VTL6900 Product Description. Data on the branch nodes is backed up to the local integrated or All In One VTL6900

device and then replicated to the VTL6900 at the data center by using the deduplication-based remote replication function. Data at the data center is backed up to the local VTL6900.

Benefit to Customers

(30)

4 Experience

OceanStor Technical White Paper for OceanStor VTL6900

4-4 Huawei Proprietary and Confidential

Copyright © Huawei Technologies Co., Ltd. Issue 1.0 (2012-05-11)  Local VTL backup, meeting customers' backup window requirements for the branch nodes: Data on the branch nodes is backed up to the local integrated or All In One VTL6900 device. This is an application scenario of the VTL backup system. For details about the benefits to customers, see section 4.1.1 "VTL Backup System." VTL products provide the high performance feature, for example, the integrated VTL6900 device provides a maximum backup rate of 2.34TB/hr, and the All In One VTL6900 device provides a maximum backup rate of 9TB/hr. Therefore, even if the amount of backup data on the branch nodes reaches 63 TB, the backup time does not exceed eight hours. This meets the customers' backup window requirements for the branch nodes.

 Deduplication-based remote replication, significantly reducing customers' requirements for the network bandwidth and investment in the network bandwidth: Backup data on the branch nodes is deduplicated on the local VTL and then replicated to the VTL at the data center through the wide area network (WAN). During the replication, only deduplicated data blocks not stored at the data center are transferred. This significantly reduces the bandwidth required for the replication compared with the method of transferring backup data without deduplication, reducing customers' requirements for the network bandwidth and investment in the network bandwidth.

(31)

OceanStor

Technical White Paper for OceanStor VTL6900 5 Conclusion

5

Conclusion

The VTL6900 supports deduplication, disk spin-down, and high availability clustering of VTL engines and SIR engines. Therefore, it can properly solve the problems facing mid-range and high-end customers in terms of low performance, insufficient capacity, high power consumption, and low availability of backup systems.

The VTL6900 meets the backup window requirements of customers whose data amount in one backup does not exceed 239 TB.

Provided that the deduplication ratio is 20:1, the VTL6900 can store 4 PB backup data. This meets customers' requirements for the backup capacity.

 By using deduplication and disk spin-down, the VTL6900 reduces the power consumption of customers by over 50%.

(32)
(33)

OceanStor

Technical White Paper for OceanStor VTL6900 6 Acronyms and Abbreviations

6

Acronyms and Abbreviations

Table 6-1 Acronyms and abbreviations related to the VTL6900

Acronym or Abbreviation Full Name

VTL Virtual tape library

SATA Serial Advanced Technology Attachment

SIR Single instance repository

FC Fiber channel

IPMI Intelligent Platform Management Interface

LUN Logical unit number

References

Related documents

[r]

How the study was conducted The researchers used a 3-D global atmospheric download to predict how the radioactive material download move over earth and a health-effects model to see

To restore data from the secondary physical tape copy, it must either be migrated back to virtual tape in the VTL or placed into a tape library NetBackup controls, and then

organisasjonslæring, arbeidsplasslæring, uformell og formell læring, læring gjennom praksis, sosial praksis og så videre vil derfor være nyttige når man skal foreta en studie

Quality: We measure quality (Q in our formal model) by observing the average number of citations received by a scientist for all the papers he or she published in a given

The load balancing service working based on the node throughput selects node 2 at the T1 point in time to process services, node 1 at the T2 point in time to process services,

With deduplication, hibernation, and HA cluster configuration of VTL and SIR engines, the VTL6000 is an ideal solution for medium- and high-end users to solve such problems as the

The VTL responsible for the movement of data between virtual and physical tape must ensure that this streaming occurs and that application or backup server bottlenecks when