ORACLE BIG DATA APPLIANCE X4-2
BIG DATA FOR THE ENTERPRISE
OPEN, SECURE AND INTEGRATED
KEY FEATURES
Massively scalable, open infrastructure to store and manage big data
Industry-leading security, performance and the most comprehensive big data tool set on the market all bundled in an easy to deploy appliance.
Big Data Connectors delivers load rates of up to 15TB per hour between Big Data Appliance and Oracle Exadata
Cloudera’s comprehensive software suite including Cloudera Distribution including Apache Hadoop (CDH) delivers managed and proven Hadoop components to the enterprise
Oracle Enterprise Manager combined with Cloudera Manager simplifies management of the entire Big Data Appliance
Advanced analytics with Oracle R on Hadoop data
Handle low-latency unstructured workloads with the pre-installed and configured Oracle NoSQL Database Community Edition
InfiniBand connectivity between nodes and across appliances as well as to Oracle Exadata
Flexible configuration choices for optimizing both floor space and growth path for Hadoop and Oracle NoSQL Database
KEY BENEFITS
Optimized, Complete and Secure Big Data Solution
Most comprehensive big data tool set integrated in a single appliance
Integrated with Oracle Exadata to analyze all your data
Risk-free installation and rapid time to value
Simplified operations, updates and patch management though a single
Oracle Big Data Appliance X4-2 is a comprehensive Big Data platform, engineered for secure data processing with a low overall total cost of ownership. It is optimized for both batch and real-time processing – utilizing Cloudera’s Distribution for Apache Hadoop, Oracle NoSQL Database, Cloudera Impala and Cloudera Search to satisfy diverse computing requirements. Built using industry-standard hardware from Sun, Big Data Appliance X4-2 delivers the perfect balance between compute power, I/O bandwidth and memory footprint – offering 33% more storage capacity than the previous generation appliance. Big Data Appliance X4-2 provides a highly optimized platform with integrated management capabilities that allows you to derive value quickly with lower risk.
Comprehensive Big Data Platform
Oracle Big Data Appliance is an open, multi-purpose big data platform. It is optimized to run a diverse set of workloads – including batch processing jobs as well as interactive
applications. Apache Hadoop’s MapReduce framework powers the batch capabilities – processing massive volumes of data with linear scalability. There are several options for interactive applications – each with their own unique properties. Oracle NoSQL Database is a distributed key-value database. It is designed to be highly available and extremely scalable with predictable levels of throughput and latency. Cloudera Impala provides real-time SQL queries over data stored in HDFS – enabling business intelligence tools to access data in Hadoop without requiring MapReduce processing. Finally, Cloudera Search offers full-text interactive search over data stored in HDFS – with results delivered using a faceted navigation model.
In addition to providing the full Cloudera software platform, Big Data Appliance utilizes Oracle Big Data Connectors to simplify data integration and analytics. Big Data Connectors provide high speed access to data in Hadoop from Oracle Exadata and Oracle Database – with data transfer rates in the order of 15 TB/hour. Big Data Connectors also enable integrated, highly scalable analytics to run on Big Data Appliance – providing native access to Hadoop data and parallel processing using Oracle R Distribution. Finally, Oracle XQuery for Hadoop is a new capability that enables standard XQuery operations to process and transform documents in various formats (JSON, XML, Avro and others), executing in parallel across the Hadoop cluster.
The big data domain is marked by continuous innovation; Big Data Appliance embraces these innovations by providing an open environment without compromising tight integration and enterprise-level support. Organizations are free to deploy external software to support new functionality – such as graph analytics, natural language processing and fraud detection –to meet the needs of the application. Support for non-Oracle components is delivered by their respective support channels and not by Oracle.
Big Data Appliance X4-2 Software Integrated Software
Oracle Linux 6.4 with Unbreakable Enterprise Kernel
Oracle Java – JDK 7
command utility of the entire stack (OS, Java, Oracle NoSQL Database and the Cloudera stack)
Single Management Console integrating Big Data Appliance hardware and software monitoring
Single-vendor support for your entire big data solution covering both hardware and software
RELATED PRODUCTS AND SERVICES Oracle Big Data Appliance brings a low risk, highly scalable big data platform to the enterprise.
RELATED PRODUCTS The following are related products available from Oracle:
Oracle Exadata
Oracle Big Data Connectors
Oracle NoSQL Database
Oracle Exalytics
Oracle Business Intelligence Enterprise Edition
Oracle Endeca Information Discovery
Oracle Data Integrator
Oracle Enterprise Manager
RELATED SERVICES
The following services are available from Oracle Support Services:
Advanced Customer Services
Product Support Services
Consulting Services
Oracle University Courses
Cloudera Software
Cloudera’s Distribution including Apache Hadoop (CDH)
Impala
HBase (as well as support for Accumulo)
Search
Cloudera Manager including:
Cloudera Back-up and Disaster Recovery (BDR)
Cloudera Navigator
Oracle R Distribution
Oracle NoSQL Database Community Edition*
Oracle Big Data Appliance Enterprise Manager Plug-In Optional Software (separately licensed)
Oracle Big Data Connectors
Oracle SQL Connector for Hadoop
Oracle Loader for Hadoop
Oracle XQuery for Hadoop
Oracle R Advanced Analytics for Hadoop
Oracle Data Integrator Application Adapter for Hadoop
Oracle Audit Vault and Database Firewall for Hadoop Auditing Oracle Data Integrator
Oracle NoSQL Database Enterprise Edition
* Support for Oracle NoSQL Database Community Edition is not a part of Big Data Appliance. It is a separately purchased component
Lower TCO than Do-it-Yourself Hadoop
Oracle Big Data Appliance lowers the total cost of ownership of a big data platform when compared to a DIY system. Not only are the costs of an initial deployment lower with Big Data Appliance, but more significantly, so are the ongoing costs of maintenance, optimization and system growth.
Big Data Appliance provides unique pricing to dramatically reduce the three to four year TCO when compared to a DIY big data platform. Big Data Appliance bundles the hardware (servers, high-speed networking, power distribution units and peripherals), OS support and subscription costs for the Cloudera software into a single price for the life of the system. A single support license covers both the hardware and the integrated software.
Organizations do not want to spend valuable intellectual capital assembling and tuning an optimized Hadoop/NoSQL infrastructure, especially when these resources can be applied to delivering high value business solutions. Big Data Appliance delivers a pre-configured, highly tuned environment out-of-box for Apache Hadoop and Oracle NoSQL Database. This optimized environment enables companies to focus their resources on developing compelling business applications – lowering the risk for the solution. Additionally, the pre-tuned environment avoids extensive ramp-up time for new applications due to performance and production issues.
Simplified Operations
Oracle Enterprise Manager provides a single entry point for managing the entire system – both hardware and software – providing continuity across other Oracle products in the
organization. To provide deep management capabilities for Hadoop, Enterprise Manager enables a context-aware integration with Cloudera Manager.
Big Data Appliance simplifies day-to-day operations by providing a simple one-command
installation, update, patch and expansion utility – Mammoth – which enables rapid
deployment updates (typically quarterly) to the frequently evolving Hadoop stack without
incurring significant downtime. Mammoth also enables Oracle-tested, seamless upgrades
between Hadoop versions and automated service management to ensure the best balance between Hadoop Master Nodes and Data Nodes.
Big Data Appliance is supported by Oracle, giving organizations a single point of support for their hardware, all integrated software (including all Cloudera software) and any additional Oracle software installed.
Comprehensive Security
Securing data is critical to Big Data solutions in the enterprise; Big Data Appliance provides strong authentication, authorization and auditing of data in Hadoop out of the box.
Strong authentication is provided using Kerberos. This ensures that all users are who they claim to be – and that rogue services are not added to the system.
Big Data Appliance leverages Apache Sentry (an open-source project of which Oracle is a founding member) to authorize SQL access via tools like Hive and Impala. By delivering and developing Sentry, Oracle delivers Big Data Appliance with the highest data security levels currently available for Hadoop.
To ensure security and data access compliance, Big Data Appliance integrates with Oracle Audit Vault and Database Firewall. An Oracle Audit Vault agent is pre-installed on Big Data Appliance to track and audit data access on the Hadoop system. By leveraging Oracle Audit Vault and Database Firewall, all auditing across the organization is consolidated into a single audit repository ensuring a comprehensive view across all data.
Flexible Configurations
Big Data Appliance is designed to expand as your data and requirements grow. Initial big data implementations may start with Big Data Appliance Starter Rack. This six server rack comes fully equipped with a complete set of switches and power distribution units (PDU) required for a full rack. This allows the appliance to easily and efficiently expand in six node hardware increments to larger configurations using the Oracle Big Data Appliance In-Rack Expansion.
In addition to upgrading within a rack, multiple racks can be connected using the integrated InfiniBand fabric to form even larger configurations; up to 18 racks can be connected in a non-blocking manner by connecting InfiniBand cables without the need for any external switches. Larger non-blocking configurations are supported with additional external InfiniBand switches, larger blocking network configurations can be supported without additional switches.
Big Data Appliance is multitenant; it can be configured as a single cluster or as a set of clusters. This provides the flexibility customers need when deploying development, test and production clusters.
Big Data Appliance X4-2 Hardware
Full Rack Starter Rack In-Rack Expansion
18 x compute/storage nodes
6 x compute/storage nodes
6 x compute/storage nodes
Per Node:
2 x Eight-Core Intel ® Xeon ® E5-2650 V2 Processors
64 GB Memory (expandable to 512 GB)
Disk Controller HBA with 512MB Battery backed write cache
12 x 4TB 7,200 RPM High Capacity SAS Disks
2 x QDR (40Gb/s) Ports
4 x 10 Gb Ethernet Ports
1 x ILOM Ethernet Port
2 x 32 Port QDR InfiniBand Switch
32 x InfiniBand ports
8 x 10Gb Ethernet ports
Leverages the leaf switches from the Starter Rack
1 x 36 Port QDR InfiniBand Switch
36 x InfiniBand Ports
Leverages the spine switch from the Starter Rack
Additional Hardware Components included:
Ethernet Administration Switch
2 x Redundant Power Distributions Units (PDUs)
42U rack packaging
Leverages the administration switch, PDUs and base rack from the Starter Rack
Spares Kit Included:
2 x 4 TB High Capacity SAS disk
InfiniBand cables
Leverages the spares kit from the Starter Rack
Big Data Appliance X4-2 Expansions
Multi-Rack Connection In-Rack Expansion
Upgradeability:
Field upgrade leveraging either a single (6 nodes) or two (2 x 6 nodes) In-Rack Expansions. Expansion supports multiple generations of hardware
Additional hardware include with each In-Rack Expansion:
6 x Compute node with direct attached storage as shown earlier
InfiniBand and Ethernet cables to connect all of the components
Up to 18 racks can be connected without requiring additional InfiniBand switches
InfiniBand cables to connect 3 racks are included in the rack Spares Kits
Additional optical InfiniBand cables required when connecting 4 or more racks
Memory Expansions
Expand the memory in any number of nodes from 64GB per node to 512GB per node.
Big Data Appliance X4-2 – Environmental Specificaions Physical Dimensions
Height Width Depth
42U, 78.66” - 1998 mm 23.62” - 600mm 47.24” - 1200 mm Weight
Starter Rack 1037 Lbs
Starter Rack + In- 1400 Lbs
Rack Expansion
Full Rack 1800 Lbs
Power
Starter Rack
Maximum 4.2 KW
Typical
13.0 KW
Starter Rack + In- Rack Expansion
Maximum 7.7 KW
Typical
15.4 KW
Full Rack
Maximum 10.0KW
Typical
17.0 KW
Cooling
Starter Rack
Maximum 14,052 BTU/hour
Typical 9,836 BTU/hour
Starter Rack + In- Rack Expansion
Maximum 26,411 BTU/hour
Typical 18,487 BTU/hour
Full Rack
Maximum 34,142 BTU/hour
Typical 23,940 BTU/hour
Airflow
2Starter Rack Maximum 676 CFM
Typical 473 CFM
Starter Rack + In- Rack Expansion
Maximum 1223 CFM
Typical 856 CFM
Full Rack
Maximum 1,573 CFM
Typical 1,103 CFM
Further Environmental Specifications
Operating temperature/humidity: 5 ºC to 32 ºC (41 ºF to 89.6 ºF), 10% to 90% relative humidity, non-condensing
Altitude Operating: Up to 3,048 m, max. ambient temperature is de-rated by 1° C per 300 m above 900 m
Regulations
3Safety: UL 60950-1 2nd Ed, EN60950-1:2006 2nd Ed, CB Scheme with all country differences
RFI/EMI: FCC CFR 47 Part 15 Subpart B Class A, EN 55022:2006+A1:2007 Class A, EN 61000-3-11:2000, EN 61000-3-12:2005, ETSI EN 300 386 V1.4.1 (2008) Immunity: EN 55024:1998+A1:2001:+A2:2003
Certifications
3Safety: UL/cUL, CE, BSMI, GOST R, S-Mark, CSA C22.2 No. 60950-1-07 2nd Ed, CCC EMC: CE, FCC, VCCI, ICES, KCC, GOST R, BSMI Class A, AS/NZ 3548, CCC Other: Complies with WEEE Directive (2002/96/EC) and RoHS Directive (2002/95/EC)
1
Typical power usage varies by application workload
2
Airflow must be front to back
3
In some cases, as applicable, regulatory and certification compliance were obtained at
the component level
Contact Us
For more information about Oracle Big Data Appliance, visit oracle.com or call +1.800.ORACLE1 to speak to an Oracle representative.
Copyright © 2013, Oracle and/or its affiliates. All rights reserved.
This document is provided for information purposes only and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose.
We specifically disclaim any liability with respect to this document and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.
Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners. Cloudera, Cloudera CDH, and Cloudera Manager, Cloudera Navigator and Cloudera BDR are registered and unregistered trademarks of Cloudera, Inc.,
Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark licensed through X/Open Company, Ltd. 0611
Big Data Appliance Support Services
Hardware Warranty: 1 year with a 4 hour web/phone response during normal business hours (Mon-Fri 8AM-5PM), with 2 business day on-site
response/Parts Exchange
Oracle Premier Support for Systems: Oracle Linux and integrated software support and 24x7 with 2 hour on-site hardware service response (subject to proximity to service center)
Oracle Premier Support for Operating Systems
Oracle Customer Data and Device Retention
System Installation Services
Software Configuration Services
System Expansion Support Services including hardware installation and software configuration
Quarterly on-site patch deployment service