Copyright © 2011 EMC Corporation. All Rights Reserved.
CLASSIC DATA CENTER
The course website for Cloud Infrastructure and Services is:
https://sites.google.com/site/ciscse3802/
1
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
UNIT 2: CLASSIC DATA CENTER (CDC)
Upon completion of this unit, you should be able to:
• Describe the key elements of a CDC (compute, storage, and network)
• Describe the common storage networking technologies in a CDC
• Explain business continuity technologies commonly used in a CDC
• Discuss CDC management
2
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Application and DBMS
• Physical and logical components of a compute system
• Storage device options
• RAID technology and Intelligent storage system
Application, DBMS, Compute, and Storage
3
UNIT 2: CLASSIC DATA CENTER (CDC)
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
CLASSIC DATA CENTER (CDC)
Core elements of CDC Application
Database Management System (DBMS) Compute
Storage Network
A CDC is a facility containing physical IT resources including compute, network, and storage
CDC
Classic Data Center
4
Copyright © 2011 EMC Corporation. All Rights Reserved.
The core elements of a CDC:
Application: An application is a computer program that provides the logic for computing operations. Applications may use a DBMS, which uses operating system services to perform store/retrieve operations on storage devices.
DBMS: DBMS provides a structured way to store data in logically organized tables that are interrelated. A DBMS optimizes the storage and retrieval of data.
Compute: Compute is a physical computing machine that
runs operating systems, applications, and databases.
Copyright © 2011 EMC Corporation. All Rights Reserved.
Storage: Storage refers to a device that stores data persistently for subsequent use.
Network: Network is a data path that facilitates communication between clients and compute systems or between compute systems and storage.
These core elements are typically viewed and managed as separate entities.
But, all these elements must work together to address data processing
requirements. Other elements of a CDC are power supplies and environmental
controls, such as air conditioning and fire suppression.
Data Center Infrastructure - 7
An Example
Consider an order processing system consisting of:
• Application for order entry.
• Database Management System (DBMS) to store customer and product information.
• Server/Operating System on which the Application and Database programs are run.
• Networks that provide
• Connectivity between Clients and the Application/Database Server
• Connectivity between the Server and the Storage system.
• Storage Array.
Local Area Network
Storage Area Network
Storage Array Client
Server
Application User
Interface Database
Data Center Infrastructure - 8
An Example ..Closer Look
• A customer order is entered via the Application User Interface on a client.
Local Area Network
Storage Area Network
Client
Server
Application User Interface
Storage Array
Data Center Infrastructure - 9
An Example ..Closer Look
• A customer order is entered via the Application User Interface on a client
• The client accesses the Server over a Local Area Network.
Storage Area Network
Client
Server
Storage Array
Local Area Network
Data Center Infrastructure - 10
An Example ..Closer Look
• A DBMS uses the operating system on the server to read and write this data to the physical location on a disk.
Storage Area Network
Client
Server
O/S and DBMS
Storage Array
Local Area Network
Data Center Infrastructure - 11
An Example ..Closer Look
• A DBMS uses the operating system on the server to read and write this data to the physical location on disk.
• A Network provides the communication link between the server and the
storage array, and transports the read/write commands and data between the server and the storage array.
Storage Area Network
Client
Server
Storage Array
Local Area Network
Data Center Infrastructure - 12
An Example ..Closer Look
• A DBMS uses the operating system on the server to read and write this data to the physical location on disk.
• A Network provides the communication link between the client and the
server, and transports the read/write commands and data between the server and the storage array.
• A storage array receives the read/write commands and data from the server and performs the necessary operations to store the data on the physical disks.
Storage Area Network
Client
Server
Storage Array
Local Area Network
Database
Data Center Infrastructure - 13
An Example.. Optimal Order Processing
• The Application should be optimized for fast interaction with the DBMS.
• The tables in the Database should be constructed with care so that the number of read/write operations can be minimized.
• The Server should have sufficient CPU and memory resources to satisfy Application and DBMS needs.
• The Networks should provide fast communication between Client and Server, as well as Server and Storage Array.
• The Storage Array should service the read/write requests from the
Server for optimal performance.
Data Center Infrastructure - 14
An Example.. A Final Look at Data Access
When the DBMS receives a request from the Application:
• It first searches the Server memory. If data is found there, the operation takes, perhaps, a millisecond.
• If not, it then uses the Operating System to request the data from the Storage Array.
• Dedicated high speed networks transport this request to the Storage Array.
• Intelligent Storage Arrays can deliver the requested data within a few
milliseconds. They are also typically configured to protect data in the
event of drive failures.
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
KEY REQUIREMENTS OF A DATA CENTER
Availabilit y
Data Integrity
Performanc e
Flexibilit y
Scalabilit y
Securit y
Manageabilit y
5
Copyright © 2011 EMC Corporation. All Rights Reserved.
A data center infrastructure should meet the following attributes to ensure that data is accessible to its users effectively and efficiently all the time :
1. Availability: All data center elements should be designed to ensure accessibility. The inability of users to access data can have a significant negative impact on a business.
2. Performance: All the elements of the data center should provide optimal performance and service all processing requests at high speed.
3. Scalability/flexibility:
o Data center operations should be able to allocate additional processing capabilities or storage space on demand, without interrupting business operations.
o Business growth often requires deploying more servers, new applications, and additional databases.
o The infrastructure should be flexible enough to meet the changing business requirements.
o Data center should be able to provide additional resources on demand without
interrupting availability, or, at the very least, with minimal disruption.
Copyright © 2011 EMC Corporation. All Rights Reserved.
4. Security/Data integrity:
o It is important to establish policies, procedures, and proper integration of the data center elements to prevent unauthorized access to information.
o Data integrity ensures that data is unaltered. Any variation in data during its retrieval implies corruption, which may affect the operations of the
organization.
5. Manageability:
o Manageability can be achieved through automation and reduction of manual intervention in common tasks.
o The additional resource requirements should primarily be managed by
reallocating or improving utilization of existing resources, rather than by adding new resources.
o The cost of data center management is one of the key concerns. Organizations are looking towards optimizing their IT expenditure on data center
maintenance, so that they can invest on innovation and new application
deployment.
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
APPLICATION
Commonly deployed applications in a CDC
Business applications
E-mail, Enterprise Resource Planning (ERP), Decision Support System (DSS), Data Warehouse (DW)
Management applications
Resource management, performance tuning
Data protection applications
Backup, replication
Security applications
Authentication, antivirus
Key I/O characteristics of an application
Read intensive vs. write intensive Sequential vs. random
18
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
DATABASE MANAGEMENT SYSTEM (DBMS)
Database is a structured way to store data in logically organized tables that are interrelated
Helps to optimize the storage and retrieval of data
DBMS is a collection of computer programs that control the creation, maintenance, and use of databases
Processes an application’s request for data
Instructs the OS to retrieve the appropriate data from storage
Popular DBMS examples are MySQL, Oracle RDBMS, SQL Server, etc.
19
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMPUTE
Compute consists of physical components (hardware devices) and logical components (software and protocols)
Physical components of compute are CPU, Memory, and Input/Output (I/O)
devices
I/O devices facilitate the following types of communication:
User to compute: Handled by basic I/O devices such as keyboard, mouse, etc.
Compute to compute/storage: Enabled using host controller or host adapter
20
A resource that runs applications with the help of underlying computing components
Compute
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
EXAMPLES OF COMPUTE SYSTEM
Examples of compute systems:
Laptops/Desktops Blade servers
Complex cluster of servers Mainframes
Bladed server technology is commonly used to deploy compute systems in a CDC
Blade servers were developed in response to the growing need of a computing power that does not consume a large floor space.
Consolidates power- and system-level function into a single, integrated chassis
Enables the addition of server modules as hot-pluggable components Provides increased server performance and availability without
increase in size, cost, or complexity
21
Copyright © 2011 EMC Corporation. All Rights Reserved.
BLADE SERVER
A server architecture that houses multiple server modules ("blades") in a single chassis.
It is widely used in data centers to save space and improve system management. Either self-standing or rack mounted, the chassis provides the power supply, and each blade has its own CPU, RAM and storage.
Redundant power supplies may be an option. Blade
servers generally provide their own management systems
and may include a network or storage switch
Blade Servers
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
SERVER CLUSTERING
Multiple servers (nodes ) are brought together in a cluster to improve availability and performance
When a failure occurs on one node in a cluster, resources and workload are redirected to another node
The cluster service is the software that connects the nodes in the cluster and provides a single-system view to the clients that are using the cluster.
Multiple servers (nodes) are brought together in a cluster to improve availability and performance. These nodes communicate with each other over a private network. Communication between the nodes of a cluster enables the cluster service to detect node failures and status changes, and manages the cluster as a single entity.
Exchange heartbeat is a checkup mechanism between two nodes
To see whether a node is up and running A failover is initiated, if heartbeat fails
In such cases, another node in the cluster will take over the workload of the failed node.
24
Advantages of Server Clustering • High Availability : Server clustering is specifically designed for high availability solution. In case, if a server is having a problem another server from the cluster takes over the server having issue. This ensures the high availability of the web applications.
• Scalability: It offers scalability by enabling clusters to expand non-disruptively.
• Load balancing: It also provides load balancing by
distributing the application load evenly among multiple
servers within the cluster.
Copyright © 2011 EMC Corporation. All Rights Reserved.
Ease of Management: it allows non-disruptive maintenance of server resources. If
a dedicated server from the cluster requires maintenance, it can be stopped while
other servers handles its load. Thus, makes the maintenance more easier. The
clustered servers can be configured and managed easily without any trouble.
Copyright © 2011 EMC Corporation. All Rights Reserved.
DISADVANTAGES OF SERVER CLUSTERING
Server clustering usually requires more servers and hardware to manage and monitor, thus, increases the infrastructure. Some web hosting providers may afford it.
Server clustering is not much flexible, as not all the server types can be clustered. There are many applications which are not supported by the clustered design.
It is not a cost-effective solution, as it needs a good server design which can be a bit expensive.
Though there are some disadvantages of server clustering, it is still a great way to ensure
round the clock availability for your business which is affordable only by large organizations
and businesses.
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center 11
LOGICAL COMPONENTS OF COMPUTE
Application s
Operating System Volume Manager Device Drivers
File System
Logical Components
Storage
Copyright © 2011 EMC Corporation. All Rights Reserved.
File System
A file system is the methods and data structures
that an operating system uses to keep track of files
on a disk or partition; that is, the way the files are
organized on the disk.
Copyright © 2011 EMC Corporation. All Rights Reserved.
FILE SYSTEM
A file is a collection of related records or data stored as a unit with a name. A file system is a hierarchical structure of files.
File systems enable easy access to data files residing within a disk drive, a disk partition, or a logical volume.
A file system needs compute-based logical structures and software routines that control access to files.
It provides users with the functionality to create, modify, delete, and access files. Access to
the files on the disks is controlled by the permissions given to the file by the owner. These
permissions are also maintained by the file system.
Copyright © 2011 EMC Corporation. All Rights Reserved.
OPERATING SYSTEM
An operating system controls all aspects of a computing environment. It works between the application and physical components of the compute system.
One of the services it provides to the application is data access. The operating system also monitors and responds to user actions and the environment.
It organizes and controls hardware components and manages the allocation of hardware resources.
It provides basic security for the access and usage of all managed resources. An
operating system also performs basic storage management tasks while managing
other underlying components, such as the file system, volume manager, and device
drivers.
Copyright © 2011 EMC Corporation. All Rights Reserved.
DEVICE DRIVER
A device driver is special software that permits the operating system to interact with a specific device, such as a printer, a mouse, or a hard drive.
A device driver enables the operating system to recognize the
device and to use a standard interface (provided as an
application programming interface or API) to access and control
devices.
Copyright © 2011 EMC Corporation. All Rights Reserved.
VOLUME MANAGER
Logical Volume Managers (LVMs) introduce a logical layer between the operating system and physical storage.
LVM is a software that runs on the host computer and manages the logical and
physical storage.
Copyright © 2011 EMC Corporation. All Rights Reserved.
34
STORAGE
Data created by individuals/businesses must be stored for further processing
The type of storage device used is based on the type of data and the rate at which it is created and used
A storage device may use magnetic, optical, or solid state media
Examples: Disk drive (magnetic), CD (optical), Flash drive (solid state)
It is a resource that stores data persistently for subsequent use.
Storage
Classic Data Center
Storage Device options
• Tape Drivers
Copyright © 2011 EMC Corporation. All Rights Reserved.
TAPE DRIVE
Tapes are a popular storage option for backup purposes because of their relatively low cost and transportability.
However, tape has its limitations; data is stored on the tape linearly along the length of the tape. Search and retrieval of data is done sequentially, taking several seconds to access the data.
On a tape drive, the read/write head touches the tape surface, which causes the
tape to degrade or wear out after repeated use. Also, the overhead associated with
managing tape media is significant.
• Optical Disk
Copyright © 2011 EMC Corporation. All Rights Reserved.
Optical disk storage is popular in small, single-user computing environments.
Optical disks have limited capacity and speed, which limits the use of optical media as a business data storage solution.
The capability to Write Once and Read Many (WORM) is an advantage of optical disk
storage. Optical disks, to some degree, guarantee that the content has not been
altered. Hence, they can be used as low-cost alternatives for long-term storage of
relatively small amounts of fixed content.
Copyright © 2011 EMC Corporation. All Rights Reserved.
STORAGE DEVICE OPTIONS
Tape Drive
• Low cost solution for long term data storage
• Sequential data access, physical wear and tear, and storage/retrieval overheads
Optical Disk
• Write Once and Read Many (WORM): CD, DVD
• Limited in capacity and speed
• Popular in small, single-user environments
Disk Drive
• Random read/write access
• Uses mechanical parts for data access
• Most popular storage device with large storage capacity
Solid State Drive
• Provides ultra high performance required by mission-critical applications
• Very low latency per I/O, low power requirements, and very high throughput per drive
39
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
40
REDUNDANT ARRAY OF INDEPENDENT DISKS (RAID)
Overcomes limitations of disk drives Improves storage system performance
By serving I/Os from multiple disks simultaneously
RAID techniques that form the basis for defining various RAID levels are:
Striping, mirroring, and parity
RAID controller helps implementing these RAID techniques
A technology which utilizes multiple disk drives as a set to provide protection, capacity, and/or performance benefits
RAID
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
41
RAID TECHNIQUES
A
A1 A2 A3
A
A A
A
A1 A2 Ap
RAID
Controller
RAID
Controller
RAID
Controller
Striping Mirrorin
g
Parit
y
• Striping is a technique of spreading data across multiple drives in order to use the drives in parallel. All read-write heads work simultaneously. This allows more data to be processed in a shorter time. Consequently, performance increases, when compared to writing/retrieving data to/from one disk at a time.
• Mirroring is a technique where data is stored on two different disk drives, yielding two copies of data. In the event of one drive failure, the data is intact on the surviving drive, and the controller continues to service the compute system’s data requests from the surviving disk of the mirrored pair.
o Mirroring improves read performance because read requests
are serviced by both disks. However, write performance
deteriorates because each write request manifests as two writes
on the disk drives.
o Mirroring is expensive because it involves duplication of data — the amount of storage capacity required is twice the amount of data being stored. Mirroring can be implemented with striped RAID, by mirroring entire stripes of disk set to stripes on the other disk set. This is known as nested RAID.
▪ Parity is a method of protecting striped data from disk failure without the cost of mirroring. An additional disk drive is added in the strip set to hold parity, a mathematical construct that allows re-creation of the missing data.
▪ The use of parity bits is a common method of detecting errors in data transmission and storage.
▪ While data is being written to a RAID-5 volume, parity is calculated by
doing an exclusive OR (XOR) procedure on the data. The resulting parity is
then written to the volume.
Parity computations are used in RAID drive arrays for fault
tolerance by calculating the data in two drives and storing
the results on a third. The parity is computed by XOR'ing a
bit from drive 1 with a bit from drive 2 and storing the result
on drive 3 (to learn about XOR, see OR).
o Parity RAID is less expensive than mirroring because parity overhead is only a fraction of the total capacity.
o Parity information can be stored on separate, dedicated disk drives or distributed across all the drives in a RAID set.
o Parity calculation is a bitwise XOR operation. Calculation of parity
is a function of the RAID controller. If one of the disks fails in a
RAID set, the value of its data is calculated by using the parity
information and the data on the surviving disks. The value is
calculated using XOR operation.
Nested RAID
• Most data centers require data redundancy and performance from their RAID arrays.
• RAID 0+1 and RAID 1+0 combine the performance benefits of RAID 0 with the redundancy benefits of RAID 1.
• They use striping and mirroring techniques and combine their benefits.
• These types of RAID require an even number of disks, the minimum
being four.
RAID 10 (RAID 1+0) : Combining RAID 1 and RAID 0, this level is often
referred to as RAID 10, which offers higher performance than RAID 1, but at a much higher cost. In RAID 1+0, the data is mirrored and then both copies of data are striped across multiple HDDs in a RAID set.
Note: When replacing a failed drive, only the mirror is rebuilt.
RAID 01 (RAID 0+1):
• RAID 0+1 is similar to RAID 1+0, except the data organization method is slightly different.
• RAID 0+1 creates a stripe set and then mirrors the stripe set.
Copyright © 2011 EMC Corporation. All Rights Reserved.
RAID LEVELS
RAID Levels Definition
RAID 0 Striping with no fault tolerance RAID 1 Disk mirroring
Nested Combinations of RAID levels; Example: RAID 1 + RAID 0 RAID 3 Parity RAID with dedicated parity disk
RAID 5 Parity RAID with distributed parity across all the disks in the set
RAID 6 Distributed parity RAID with dual parity
49
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
50
INTELLIGENT STORAGE SYSTEM
Is a RAID array highly optimized for I/O processing
Have large amounts of cache for improving I/O performance Have operating environments that provide:
Intelligence for managing cache
Optimal management, allocation, and utilization of storage resources
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center 18
COMPONENTS OF AN INTELLIGENT STORAGE SYSTEM
Compute Front-en
d
Back-en Cache d
Physical Disks Intelligent Storage
System
Netw
ork
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center 19
PRESENTING STORAGE TO COMPUTE SYSTEM
LUN 0
LUN 1
Front-en d
Back-en d Cache
Physical Disks Intelligent Storage
System
LUN Comput 0
e LUN
1
Controller RAID Set
Networ
k
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Compute to compute communication
• Compute to storage communication
•
Direct Attached Storage (DAS)
•
Fibre Channel SAN (FC-SAN)
Lesson 2: Storage Networking Technologies -1
53
MODULE 2: CLASSIC DATA CENTER (CDC)
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
COMPUTE TO COMPUTE COMMUNICATION
Typically uses Ethernet or TCP/IP protocol
LAN, MAN, and WAN
Communication is enabled using various components:
Network Interface Card (NIC)
Has unique MAC address
Switches and routers
Switch provides scalability and interconnection between multiple compute systems
Routers allow different networks to communicate with each other
Cables
Twisted pair, co-axial cable, optical fiber
54
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMPUTE TO STORAGE COMMUNICATION
Communication is enabled using various hardware components (HBA, CNA, NIC, switch, router, gateway ,and cables) and protocols
Communication between compute and storage can be done using channel or network technologies
55
Channel Technology Network Technology
Compute system and peripheral devices are connected through channel
Compute system and peripheral
devices are connected over a network
Provides low protocol overhead due to tight coupling
High protocol overhead due to network connection
Supports transmission only over short distances
Supports transmission over long distances
Protocol examples: PCI, IDE/ATA, SCSI, etc.
Protocol examples: iSCSI( SCSI over IP), FCoE ( Fibre Channel over
Ethernet), and FC
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMMUNICATION PROTOCOLS
Peripheral Component Interconnect (PCI)
Provides interconnection between CPU and attached devices Latest PCI Express bus provides throughput of 133 MB/sec
Integrated Device Electronics/Advanced Technology Attachment (IDE/ATA)
Popular protocol to connect to disk drives Supports 16-bit parallel transmission
Serial version is called Serial ATA (SATA)
Both versions offer good performance at a relatively low cost
56
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMMUNICATION PROTOCOLS (CONTD.)
Small Computer System Interface (SCSI)
Preferred storage connectivity option for high-end environments
Improved performance, scalability, and high cost when compared to ATA Serial version is called Serial Attached SCSI (SAS)
Transmission Control Protocol/Internet Protocol (TCP/IP)
Traditionally used for compute to compute communication Now used for compute to storage communication also
iSCSI (SCSI over IP) and FCoE (Fibre Channel over Ethernet) are examples
57
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
DATA ACCESS BY COMPUTE
Classic Data Center 25 Applicatio
n
File System
Storag e
Applicatio n
File System
Storag e
Networ
k Networ
k
File level Access Block level Access Compute
Compute
Storage
Storage
Block level Request
File level Request
Copyright © 2011 EMC Corporation. All Rights Reserved.
DIRECT ATTACHED STORAGE (DAS)
DAS is classified as internal or external based on the location of the storage device with respect to the compute system
Benefits:
Simple to deploy and ideal for local data provisioning Low capital expense and less complexity
Challenges:
Limited scalability
Limited ability to share resources
Islands of over and under utilized storage resources
59
An internal or external storage device, which connects directly to a compute system DAS
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
60
EMERGENCE OF STORAGE NETWORKING TECHNOLOGIES
Just-in-time information for business users Flexible and resilient storage architecture
DAS is inefficient to fulfill these requirements
Storage networking technologies emerged as a solution
Fibre Channel SAN (FC SAN)
Network Attached Storage (NAS) Internet Protocol SAN (IP SAN)
Fibre Channel over Ethernet (FCoE) Object Based storage
Unified storage
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
WHAT IS FC SAN ?
Dedicated high speed network of
compute systems and shared storage devices
Uses SCSI over FC protocol
Provides block level data access
Storage Array Storage
Array Server
s
• Enables storage consolidation and sharing
• Enables centralized Management
• Provides scalability and high performance
• Reduces storage and administration cost
Benefits
FC SAN
Server s
Server s
61
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMPONENTS OF FC SAN
Node ports Cables
Connectors
Interconnecting Devices Storage Arrays
SAN Management software
62
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
FIBRE CHANNEL FABRIC
In a switched fabric, the link between any two switches is called Inter Switch Link (ISL)
Classic Data Center 30
One or more interconnected FC switches through which multiple SAN nodes can communicate
FC Fabric
Inter switch links
Storage Array
Storage Array
Server
Servers
Server
Copyright © 2011 EMC Corporation. All Rights Reserved.
PORT TYPES
Comput e
N-Por t
FC Switch F-Por
t
FC Switch
Storage Array N-Por
t F-Por
E-Por t t
E-Por t
Storage
Array N-Por
t
F-Por t
Classic Data Center 31
Copyright © 2011 EMC Corporation. All Rights Reserved.
FC SAN ADDRESSING
Fibre Channel Address
Used to communicate between nodes within SAN Similar in functionality to an IP address on NICs 24 bit address, dynamically assigned
World Wide Name: Unique 64 bit identifier
Static to the port, similar to NIC’s MAC address
Used to physically identify ports or nodes within SAN
65
World Wide Name – Array
5 010
1 0 000
0 0 000
0 6 011
0 0 000
0 1 000
1 6 011
0 0 000
0 0 000
0 0 000
0 6 011
0 0 000
0 0 000
0 1 000
1 B 101
1 2 001 24 0
bit s
Model Seed
32 bit s
Classic Data Center
Classic Data Center 33
Zoning
WWN 10:00:00:00:C9:20:DC:40
WWN 10:00:00:00:C9:20:DC:56
WWN 10:00:00:00:C9:20:DC:82
WWN 50:06:04:82:E8:91:2B:9E
Zone 1 (WWN Zone) = 10:00:00:00:C9:20:DC:82 ; 50:06:04:82:E8:91:2B:9E Zone 2 (Port Zone) = 15,5 ;
15,7 Zone 3 (Mixed Zone) = 10:00:00:00:C9:20:DC:56 ; 15,12
Serv er 1
Storage Array Port 5
Port 1
Port 7
Port 9
Port 12 Zone 3
Zone 1 Zone 2 Switch Domain ID = 15
Serv er 2
Server
3
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Internet Protocol SAN (IP-SAN)
• Fibre Channel over Ethernet (FCoE)
• Network Attached Storage (NAS)
Lesson 3: Storage Networking Technologies -2
67
MODULE 2: CLASSIC DATA CENTER (CDC)
Copyright © 2011 EMC Corporation. All Rights Reserved.
IP-SAN
IP is being positioned as a storage transport because:
Offers easier management
Allows existing network infrastructure to can be leveraged Reduces cost compared to new SAN hardware and software Supports multi-vendor interoperability
Many long-distance disaster recovery solutions already leverage IP-based networks
Many robust and mature security options are available for IP networks
IP-SAN
It is a technology that provides transfer of block level data over an IP network.
Classic Data Center
68
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
BLOCK STORAGE OVER IP – PROTOCOL OPTIONS
iSCSI (SCSI over IP)
Encapsulation of SCSI data in IP packets
Ethernet NIC card TOE card
iSCSI HBA
Hardware-based gateway to Fibre Channel storage Used to connect compute
systems FCIP
Fibre Channel-to-IP
bridge/tunnel (point-to- point)
Fibre Channel end points
Used in DR
FC Port (b) FCIP Implementation
FC HBA
Storage Array iSCSI Port
FC Port iSCSI Gateway (a) iSCSI Implementation
Storage Array
iSCSI HBA
Storage Array Server
I P
I P FCIP
gatewa y
FCIP gatewa y
69
Storage Array
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
ISCSI TOPOLOGIES
Native
No FC components
iSCSI Initiators connect directly to the Array Bridged
Translates iSCSI/IP to FC iSCSI initiator configured
with bridge as target Bridge acts as virtual FC
initiator
FC HBA Servers
iSCSI HBA
Storage Array
FC Port iSCSI Gateway
Storage Array Native iSCSI Port
Servers
iSCSI HBA
FC SAN I
P
I P
Servers
70
Copyright © 2011 EMC Corporation. All Rights Reserved.
FIBRE CHANNEL OVER IP (FCIP)
IP-based storage networking technology
Combines advantages of Fibre Channel and IP
Creates virtual FC link that connects devices in a different fabric Distance extension solution
Used for data sharing over geographically dispersed SAN
Hos t
FC SAN I
P
FC SAN FCIP gateway FCIP gateway
Classic Data Center
71
Copyright © 2011 EMC Corporation. All Rights Reserved.
FIBRE CHANNEL OVER ETHERNET (FCOE)
A protocol that encapsulates Fibre Channel frames for transport over Enhanced Ethernet networks
Enables the consolidation of SAN traffic and Ethernet traffic onto a common 10 Gigabit Ethernet infrastructure
Consolidates compute to compute and compute to storage communication over a single channel
• Lowers capital expenditure
• Reduces power and cooling requirements
• Enables consolidation of network infrastructure
• Lowers Total Cost of Ownership (TCO)
Benefits
Classic Data Center
72
Copyright © 2011 EMC Corporation. All Rights Reserved.
I/O CONSOLIDATION WITH FCOE
Classic Data Center
73
Host s
…………
Storage Arrays
…………
FCoE Switch
FCoE links
LA
N
Copyright © 2011 EMC Corporation. All Rights Reserved.
COMPONENTS OF FCOE
Converged Network Adapter(CNA)
Multi function adapter
Performs the data networking of NIC and storage networking of HBA
FCoE Switch
Contains Ethernet bridge and Fibre Channel Forwarder (FCF)
FCF encapsulates FC frames into FCoE frames and de-capsulates FCoE frames to FC frames
Converged Enhanced Ethernet (CEE)
Extensions to conventional Ethernet standard to eliminate its lossy nature
74
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
75
FILE SHARING TECHNOLOGY EVOLUTION
Network Attached Storage (NAS)
Stand Alone PC
Networked File Sharing
Networked PCs
Portable Media for File Sharing
Copyright © 2011 EMC Corporation. All Rights Reserved.
76
WHAT IS NETWORK ATTACHED STORAGE(NAS)?
Client s
Applicatio n Server
Print Serve
r
NAS
Device NAS
A storage device connected to a network that provides file level data access to heterogeneous clients
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
77
BENEFITS OF NAS
Supports comprehensive access to information Provides improved efficiency
Provides improved flexibility Provides centralized storage Simplifies management
Enables scalability
High availability – through native clustering
Provides security integration to environment (user authentication
and authorization)
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
COMPONENTS OF NAS
Network Interface NF
S
CIF
NAS Device OS S Storage Interface NAS Head
NF S
CIF S UNIX
Window
s Storage
Array
NA S I
P
78
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Object Based Storage
• Unified Storage
Lesson 4: Object Based and Unified Storage Technologies
79
MODULE 2: CLASSIC DATA CENTER (CDC)
Copyright © 2011 EMC Corporation. All Rights Reserved.
OBJECT BASED STORAGE
Object Based Storage combines data with rich metadata to create an “object”
Object Based Storage stores data in a flat address space
There are no hierarchies or nested directories
Each object is identified by an unique ID (Object ID)
Generated by a hashing function
Classic Data Center 47 Object
Object Object
Object Object Object Object
Object OID
Metadat a
Data
Copyright © 2011 EMC Corporation. All Rights Reserved.
OBJECT BASED STORAGE (CONTD.)
Object Based Storage uses HTTP communication as its standard interface
SOAP and REST are the protocols commonly used in object based communication in Cloud
Simple Object Access Protocol (SOAP ) is used for communication between peers in a distributed environment
Uses Extensible Markup Language ( XML) framework
Representational State Transfer (REST) is used to retrieve information from a Website by reading Web pages
81
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
• Increasing amount of unstructured data
• SAN is highly scalable and supports data access at a block level
• Not a good option for data sharing
• NAS is a good option for applications which need to share data
• Limited scalability due to hierarchical structure
• Object Based approach potentially eliminates SAN and NAS limitations
• Highly scalable with data sharing capabilities
WHY OBJECT BASED STORAGE ?
82
Copyright © 2011 EMC Corporation. All Rights Reserved.
BENEFITS OF OBJECT BASED STORAGE
Automates and simplifies storage management Ensures data integrity
Ensures compliance and auditability Enables easy data migration
Enables self healing
Facilitates intelligent replication Allows flexible scalability
83
Classic Data Center
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
UNIFIED STORAGE
Provides Consolidated Management interface for NAS, SAN, iSCSI, FCoE, and object based technologies
FC FCoE NFS and
CIFS
SOAP and REST
LAN/WA N
Unified Storage
51
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
BENEFITS OF UNIFIED STORAGE
Provides consolidated multi-protocol storage
File: NFS, CIFS
Block: iSCSI, FC, FCoE Object: REST, SOAP
Simplifies administration
Integrated management interface
Reduces cost of storage assets, along with power, cooling, and space
Provides a highly scalable architecture
52
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Business Continuity (BC) Terminologies
• Backup Granularity
• Backup Components and Operation
• Deduplication: Types and methods
Lesson 5: Business Continuity Overview and Backup
MODULE 2: CLASSIC DATA CENTER (CDC)
53
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Key Management activities in a CDC
• Information Lifecycle Management (ILM)
Lesson 7: CDC Management
MODULE 2: CLASSIC DATA CENTER (CDC)
82
Copyright © 2011 EMC Corporation. All Rights Reserved.
OVERVIEW OF CDC MANAGEMENT ACTIVITIES
Key management activities in a CDC:
1.Monitoring and Alerting 2. Reporting
3. Availability Management 4. Capacity Management
5. Performance Management 6. Security Management
Classic Data Center 83
Copyright © 2011 EMC Corporation. All Rights Reserved.
1. MONITORING: HELPS TO ANALYZE THE STATUS AND UTILIZATION OF VARIOUS STORAGE INFRASTRUCTURE COMPONENTS
Compute systems, storage, and networks are the key components to be monitored
Key Parameters to be Monitored Description
Accessibility Availability of a component to perform a desired operation . For example, a port may go down resulting in a chain of availability alerts.
Capacity Amount of storage infrastructure resources available
For ex: free space available on a file system or RAID group, mailbox quota allocated to users, no.of ports available on switch etc.
It ensures uninterrupted data availability and scalability.
Performance How efficiently different components are performing Ex: The number of I/Os to disks, application response time, network utilization, and server CPU utilization
Security Mechanisms to track and prevent unauthorized access
and login failures. Physical security of a storage
infrastructure is also continuously monitored using badge
readers, biometric scans, or video cameras. Classic Data Center 84
Copyright © 2011 EMC Corporation. All Rights Reserved.
MONITORING (CONTD.)
Classic Data Center 85
This slide is intentionally left blank
Copyright © 2011 EMC Corporation. All Rights Reserved.
ALERTING OF EVENTS
• Alerting is an integral part of monitoring.
• Monitoring tools enables administrators to assign different severity levels for different alerts
• Whenever a condition with a particular severity level occurs, an alert is sent to the administrator to initiate a corrective action.
• Continuous monitoring, in conjunction with automated alerting, enables administrators to respond to failures quickly and proactively.
Levels of Alerts based on Severity Description
Information alert Provides useful information and may not require administrator intervention
For ex: creation of zone or LUN, Unauthorized access in system
Warning alert Require administrative attention
For ex: file systems becoming full
Fatal alert Require immediate administrative attention
because the condition could affect overall performance or availability
For ex: power failures/disk failures/memory
failures Classic Data Center 86
Copyright © 2011 EMC Corporation. All Rights Reserved.
2. REPORTING
• Reporting on CDC resources involves keeping track and gathering information from various components/processes.
• This information is compiled to generate reports for:
Type of Report Description
Capacity Planning Provides current and historic information about utilization of storage, file system, database
tablespace, ports, and so on
Chargeback Provides information about the allocation or utilization of the CDC infrastructure
components by various departments or user groups
Performance Provides details about the performance of various infrastructure components in a CDC
Classic Data Center 87
Copyright © 2011 EMC Corporation. All Rights Reserved.
.
3. AVAILABILITY MANAGEMENT
• Its task is to establish guidelines for all configurations to achieve high availability based on service level requirements
Ensures high availability by:
Eliminating single points of failure by configuring
RAID protection
Redundant Fabrics ( Connecting the server to the storage array using at least two independent fabrics and switches )
Performing data backup and replication
Classic Data Center 88
Copyright © 2011 EMC Corporation. All Rights Reserved.
4. CAPACITY MANAGEMENT
Ensures adequate availability of resources for all services based on their service level requirements
Manages resource allocation Key activities:
Trend Analysis ( actual utilization of allocated storage) and Capacity analysis ( compares the allocated storage to the forecasted
storage on a regular basis)
Storage provisioning
Classic Data Center 89
Copyright © 2011 EMC Corporation. All Rights Reserved.
5. PERFORMANCE MANAGEMENT
Is configure/design for optimal operational efficiency Performance analysis activity helps in :
Identify bottlenecks
Fine tuning for performance enhancement
Classic Data Center 90
Copyright © 2011 EMC Corporation. All Rights Reserved.
6. SECURITY MANAGEMENT
Prevents unauthorized activities or access Key activities:
Compute
Creation of user logins and user privileges
Storage Array
LUN masking prevents data corruption on the storage array by restricting compute access to a defined set of logical devices
SAN
Configuration of zoning to restrict unauthorized HBAs
Classic Data Center 91
Copyright © 2011 EMC Corporation. All Rights Reserved.
MANAGING INFORMATION IN CDC – CHALLENGES
In order to frame an effective information management policy, organisations need to consider the following key challenges of information management:
• Exploding digital universe
Multifold increase of information growth
• Increasing dependency on information
The strategic use of information plays an important role in determining the success of an organization
• Changing value of information
Information that is valuable today may become less important tomorrow
Classic Data Center 92
Copyright © 2011 EMC Corporation. All Rights Reserved.
Classic Data Center
Topics covered in this lesson:
• Business Continuity (BC) Terminologies
• Backup Granularity
• Backup Components and Operation
• Deduplication: Types and methods
Lesson 5: Business Continuity Overview and Backup
MODULE 2: CLASSIC DATA CENTER (CDC)
53
Copyright © 2011 EMC Corporation. All Rights Reserved.
BUSINESS CONTINUITY
• Business Continuity solutions address unavailability and degraded application performance
• Business Continuity is an integrated and enterprise wide process and set of activities to ensure “information availability”
Processes and/or procedures for ensuring continued business operations
BC
Classic Data Center 54
Copyright © 2011 EMC Corporation. All Rights Reserved.