• No results found

Fujitsu Interstage Big Data Parallel Processing Server V1.0

N/A
N/A
Protected

Academic year: 2021

Share "Fujitsu Interstage Big Data Parallel Processing Server V1.0"

Copied!
5
0
0

Loading.... (view fulltext now)

Full text

(1)

Page 1

Datasheet

Fujitsu

Big Data Parallel Processing Server V1.0

Availability

featuring Fujitsu's proprietary distributed file system

Fujitsu Big Data Software

An organizations ability to collect and analyze large volumes of data is fundamental to their capability to predict trends, forecast sales and make crucial business decisions.

In the era of Big Data, the amount of data is increasing exponentially and is becom more and more diverse. The Collect, Analyze Decide and Act cycle, has become too difficult to process through conventional IT systems and has resulted in the emergence of Big Data technology.

Big Data technology can shorten the, afore mentioned, cycle

in real time, thus making it possible for organizations to engage new opportunities not previously conceived. It is critical that companies can utilize Big Data to better understand their customers and stay ahead of the competit

Until now,

well leveraged. Fujitsu Data Software

times and an ability to combine

conventional business data. With Interstage, organizat

efficiencies in enterprise processing and create new business opportunities.

*1: Apache Hadoop: Developed and released by

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data. 1 of 5

Datasheet

Fujitsu

Big Data Parallel Processing Server V1.0

vailability and processing performance enhanced with an Apache Hadoop solution

featuring Fujitsu's proprietary distributed file system

Fujitsu Big Data Software

rganizations ability to collect and analyze large volumes of data is fundamental to their capability to predict trends, forecast sales and make crucial business decisions.

In the era of Big Data, the amount of data is increasing exponentially and is becom more and more diverse. The Collect, Analyze Decide and Act cycle, has become too difficult to process through conventional IT systems and has resulted in the emergence of Big Data technology.

Big Data technology can shorten the, afore mentioned, cycle by performing the analysis in real time, thus making it possible for organizations to engage new opportunities not previously conceived. It is critical that companies can utilize Big Data to better understand their customers and stay ahead of the competition.

Until now, Big Data technology has n well leveraged. Fujitsu

Data Software offers reductions in analysis times and an ability to combine

conventional business data. With Interstage, organizations can benefit from greater efficiencies in enterprise processing and create new business opportunities.

Apache Hadoop: Developed and released by

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

Datasheet

Fujitsu Interstage

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

featuring Fujitsu's proprietary distributed file system

Fujitsu Big Data Software

rganizations ability to collect and analyze large volumes of data is fundamental to their capability to predict trends, forecast sales and make crucial business decisions.

In the era of Big Data, the amount of data is increasing exponentially and is becom more and more diverse. The Collect, Analyze Decide and Act cycle, has become too difficult to process through conventional IT systems and has resulted in the emergence of Big

Big Data technology can shorten the, afore by performing the analysis in real time, thus making it possible for organizations to engage new opportunities not previously conceived. It is critical that companies can utilize Big Data to better understand their customers and stay ahead of

technology has n well leveraged. Fujitsu’s new Interstage Big

offers reductions in analysis times and an ability to combine Big conventional business data. With Interstage,

ions can benefit from greater efficiencies in enterprise processing and create new business opportunities.

Apache Hadoop: Developed and released by

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

Interstage

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

featuring Fujitsu's proprietary distributed file system

rganizations ability to collect and analyze large volumes of data is fundamental to their capability to predict trends, forecast sales and

In the era of Big Data, the amount of data is increasing exponentially and is becoming more and more diverse. The Collect, Analyze Decide and Act cycle, has become too difficult to process through conventional IT systems and has resulted in the emergence of Big

Big Data technology can shorten the, afore by performing the analysis in real time, thus making it possible for organizations to engage new opportunities not previously conceived. It is critical that companies can utilize Big Data to better understand their customers and stay ahead of

technology has not been Interstage Big offers reductions in analysis

ig Data with conventional business data. With Interstage,

ions can benefit from greater efficiencies in enterprise processing and

Interstage Server

Interstage Big Data Parallel Processing Server is a parallel distributed processing softwa platform that supports improved data reliability. It combines the Apache Hadoop with Fujitsu’s

By combining with Apache Hadoop,

improves data integrity without requiring d

servers, thereby achieving substantially better processing performance. Fujitsu’s enterprise systems By the Easy enables to setup

eliminates setup mistakes

Apache Hadoop: Developed and released by the Apache Software Foundation

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

Interstage

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

featuring Fujitsu's proprietary distributed file system

Interstage Big Data Parallel Processing Server

Interstage Big Data Parallel Processing Server is a parallel distributed processing softwa platform that supports improved data reliability. It combines the Apache Hadoop with Fujitsu’s Distributed

By combining Fujitsu's with Apache Hadoop,

improves data integrity without requiring data to be transferred to Hadoop processing servers, thereby achieving substantially better processing performance.

Fujitsu’s strong track record in mission enterprise systems

By the Easy setup enables to reduce

setup and simplifies operations for scale eliminates setup mistakes

Apache Software Foundation

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

featuring Fujitsu's proprietary distributed file system

Big Data Parallel Processing Interstage Big Data Parallel Processing Server is a parallel distributed processing softwa platform that supports improved data reliability. It combines the Apache Hadoop

istributed File S Fujitsu's Distributed with Apache Hadoop, the new solution improves data integrity without requiring

ata to be transferred to Hadoop processing servers, thereby achieving substantially better processing performance.

strong track record in mission enterprise systems supports this technology

etup technology, Organizations educe time for initial

implifies operations for scale eliminates setup mistakes.

(ASF).

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

http://www.fujitsu.com/global/services/software/interstage

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

Big Data Parallel Processing Interstage Big Data Parallel Processing Server is a parallel distributed processing software platform that supports improved data reliability. It combines the Apache Hadoop(*1)

System. istributed File System

new solution improves data integrity without requiring

ata to be transferred to Hadoop processing servers, thereby achieving substantially better

strong track record in mission-critical supports this technology. technology, Organizations time for initial software implifies operations for scale-out,

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

http://www.fujitsu.com/global/services/software/interstage

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

Interstage Big Data Parallel Processing Server

(*1)

ystem

ata to be transferred to Hadoop processing servers, thereby achieving substantially better

critical . technology, Organizations out,

Usage Scenario

Retail - Previously the data of an individual store was not able to be analyzed and therefore product

at the discretion of the store managers. Big Data technology has enabled the application of batch processing to individual stores. It is now possible to accurately stock the best-selling products leading to higher sales and pro

Apache Hadoop is open source software for efficiently performing parallel distributed processing of massive volumes of data.

http://www.fujitsu.com/global/services/software/interstage ttp://ts.fujitsu.com/

Big Data Parallel Processing Server V1.0

and processing performance enhanced with an Apache Hadoop solution

Usage Scenario

Previously the data of an individual store was not able to be analyzed and therefore product range decisions were made at the discretion of the store managers. Big Data technology has enabled the application of batch processing to individual stores. It is now possible to accurately stock

selling products leading to higher sales and profits.

http://www.fujitsu.com/global/services/software/interstage ttp://ts.fujitsu.com/Interstage Previously the data of an individual store was not able to be analyzed and

range decisions were made at the discretion of the store managers. Big Data technology has enabled the application of batch processing to individual stores. It is now possible to accurately stock

selling products leading to higher

http://www.fujitsu.com/global/services/software/interstage Interstage Previously the data of an individual

range decisions were made at the discretion of the store managers.

application of batch processing to individual stores. It is now possible to accurately stock

(2)

DatasheetFujitsu Interstage Big Data Parallel Processing Server V1.0

Page 2 of 5 http://www.fujitsu.com/global/services/software/interstage

ttp://ts.fujitsu.com/Interstage

Main features Benefits

High Performance Distributed File System

■Fujitsu‘s Distributed File System for Hadoop, which has been proven in mission-critical enterprise systems

■Data sharing with existing systems eliminates the need to transfer files

■Easy data access via the standard Linux file interface

■Reduces the time required for Big Data analysis

■Reduces the time required for enterprise batch processing ■Simplifies access to Big Data

■Enhances productivity through flexible and efficient data sharing between applications and Hadoop

High Availability

■Built-in high availability management features to realize a secure Hadoop environment

 Master server automatic fail-over for Hadoop Task Tracker  High availability Distributed File System

 Redundant network between the master and slave servers

■Distributed File System on top of the high-availability storage system.

■Enables a higher availability Hadoop environment, eliminating the single point of failure

■Relieves users from the need to deal with the underlying high availability design and configuration

■Ensures higher availability of data

Easy Setup and Scale-out

■Reduces time for initial installation and setup of system. ■Simplifies operations for scale-out, eliminates setup mistakes

■Enables easy/automated configuration of high performance and high-availability Hadoop environment

■Reduces troubles due to setup mistakes

■Enables easier and quicker scale-out of system to handle more data and/or to reduce data processing time

(3)

Page 3

High Performance Distributed File System handling

In recent years, the amount of data produced in our daily lives continues to grow at an explosive pace

technologies

devices, and various kinds of sensors. At the same time, there is also

to use activities and forecasts sources, and then

information in new business areas or to improve operations.

Apache Hadoop is

standard open source software for achieving distributed processing of large volumes of unstructured data.

Parallel Pro

systematized platform with

enhanced data/operation management and high availability technologies. This enables Big Data

harnessed for use in enterprise information systems.

Interstage Big Data Parallel Processing Server V1.0 enables enhanced

processing performance while shortening deployment times. This makes use of Data in enterprise systems more feasible. Fujitsu

In addition to the standard Hadoop Distributed File System (HDFS), this new solution features Fujitsu's proprietary distributed file system

strong track record in mission

enterprise systems, enabling the Hadoop system to achieve h

availability

Topics

3 of 5

High Performance Distributed File System handling Big Data

In recent years, the amount of data produced in our daily lives continues to grow at an explosive pace. This is due to the technologies such as

devices, and various kinds of sensors. At the same time, there is also

e Big Data in corporate

activities—for instance, to discover trends forecasts from various logs and data sources, and then to leverage

information in new business areas or to improve operations.

Apache Hadoop is considered the

standard open source software for achieving distributed processing of large volumes of unstructured data. Interstage Big Data Parallel Processing Server

systematized platform with

enhanced data/operation management and high availability technologies. This enables Big Data and mission

harnessed for use in enterprise information systems.

erstage Big Data Parallel Processing Server V1.0 enables enhanced

processing performance while shortening deployment times. This makes use of

in enterprise systems more feasible. Fujitsu’s Distributed F

dition to the standard Hadoop Distributed File System (HDFS), this new solution features Fujitsu's proprietary distributed file system

strong track record in mission

enterprise systems, enabling the Hadoop system to achieve high performance and high availability.

opics

High Performance Distributed File System In recent years, the amount of data produced in our daily lives continues to grow at an

. This is due to the spread of smartphones, tablet devices, and various kinds of sensors. At the same time, there is also significant demand

corporate business for instance, to discover trends

from various logs and data leverage such information in new business areas or to

considered the industry standard open source software for achieving distributed processing of large volumes of

Interstage Big Data cessing Server combines this in a systematized platform with Fujitsu’s enhanced data/operation management and high availability technologies. This enables

mission-critical systems harnessed for use in enterprise information

erstage Big Data Parallel Processing Server V1.0 enables enhanced availability

processing performance while shortening deployment times. This makes use of

in enterprise systems more feasible. File System dition to the standard Hadoop Distributed File System (HDFS), this new solution features Fujitsu's proprietary distributed file system (DFS), which boasts a strong track record in mission-critical enterprise systems, enabling the Hadoop

igh performance and high High Performance Distributed File System for In recent years, the amount of data produced in our daily lives continues to grow at an

spread of smartphones, tablet devices, and various kinds of sensors. At the

demand business for instance, to discover trends

from various logs and data information in new business areas or to

industry standard open source software for achieving distributed processing of large volumes of

Interstage Big Data combines this in a

Fujitsu’s enhanced data/operation management and high availability technologies. This enables

critical systems to be harnessed for use in enterprise information

erstage Big Data Parallel Processing Server and processing performance while shortening deployment times. This makes use of Big in enterprise systems more feasible.

dition to the standard Hadoop Distributed File System (HDFS), this new solution features Fujitsu's proprietary

, which boasts a critical enterprise systems, enabling the Hadoop

igh performance and high

With DFS, data processing can be performed using Hadoop by directly accessing data stored within the storage system. Unlike the standard Apache Hadoop format, which temporarily transfers the required data to HDFS befo

does not require data to be transferred, thereby significantly reducing

A Fujitsu internal evaluation showed reduction in

of HDFS

server memory cache enable I/O performance.

[Hadoop standard file system]

[Fujitsu’s distributed file system]

DFS allows applications to access files using a st

their existing too

data to the Hadoop system, including and printing without modification. Thus, the entire system configuration becomes simpler, reducing the time and costs required to tailor applications for Hadoop.

With DFS, data processing can be performed using Hadoop by directly accessing data stored within the storage system. Unlike the standard Apache Hadoop format, which temporarily transfers the required data to HDFS before processing, Fujitsu's software does not require data to be transferred, thereby significantly reducing

A Fujitsu internal evaluation showed reduction in data processing time to 1/5 of HDFS(*1). Additionally

server memory cache enable I/O performance.

[Hadoop standard file system]

[Fujitsu’s distributed file system]

DFS allows applications to access files using a standard Linux interface. Users can employ their existing too

data to the Hadoop system, including and printing without modification. Thus, the entire system configuration becomes simpler, reducing the time and costs required to tailor applications for Hadoop.

With DFS, data processing can be performed using Hadoop by directly accessing data stored within the storage system. Unlike the standard Apache Hadoop format, which temporarily transfers the required data to

re processing, Fujitsu's software does not require data to be transferred, thereby significantly reducing

A Fujitsu internal evaluation showed data processing time to 1/5 Additionally, efficient use of s server memory cache enabled

I/O performance.

[Hadoop standard file system]

[Fujitsu’s distributed file system]

DFS allows applications to access files using a andard Linux interface. Users can employ their existing tools to provide and retrieve data to the Hadoop system, including and printing without modification. Thus, the entire system configuration becomes simpler, reducing the time and costs required to tailor applications for Hadoop.

http://www.fujitsu.com/global/services/software/interstage With DFS, data processing can be performed

using Hadoop by directly accessing data stored within the storage system. Unlike the standard Apache Hadoop format, which temporarily transfers the required data to

re processing, Fujitsu's software does not require data to be transferred, thereby significantly reducing processing time. A Fujitsu internal evaluation showed a

data processing time to 1/5 that , efficient use of slave

d 10 times faster [Hadoop standard file system]

[Fujitsu’s distributed file system]

DFS allows applications to access files using a andard Linux interface. Users can employ

to provide and retrieve data to the Hadoop system, including backup and printing without modification. Thus, the entire system configuration becomes simpler, reducing the time and costs required to tailor

http://www.fujitsu.com/global/services/software/interstage With DFS, data processing can be performed

stored within the storage system. Unlike the

rocessing time.

that lave 10 times faster

DFS allows applications to access files using a

backup and printing without modification. Thus, the entire system configuration becomes simpler, reducing the time and costs required to tailor

Apache Hadoop's single

be eliminated through redundant operations. A master server that employs Fujitsu clustering

availability

storing data in the storage system. Easy Setup

Fujitsu’s Easy deployment time. Apache Hadoop and software components DFS) are automated numerous

fast, secure

high availability Hadoop environment realized.

Sometime

system may require more computing power as the size of

Easy Setup

servers to be added easily, constructed system image multiple servers

system and server deployment operations

dynamic disk add system.

*1: Measured by Fujitsu

http://www.fujitsu.com/global/services/software/interstage ttp://ts.fujitsu.com/

Apache Hadoop's single

be eliminated through redundant operations. A master server that employs Fujitsu

ing technology enables high availability. Availability is also improved by storing data in the storage system.

Setup and Scale-out

Easy Setup reduces installation and deployment time. Installation and setup of Apache Hadoop and the

software components (e.g., are automated. By eliminating numerous error-prone manual

fast, secure setup of a high performance and high availability Hadoop environment

Sometime after initial configuration system may require more computing power

size of the data set Setup enables addit

to be added easily, constructed system image multiple servers in a single pass system and server deployment operations. Fujitsu DFS also dynamic disk addition to

*1: Measured by Fujitsu

http://www.fujitsu.com/global/services/software/interstage ttp://ts.fujitsu.com/Interstage

point of failure can be eliminated through redundant operations. A master server that employs Fujitsu

technology enables high is also improved by storing data in the storage system.

out

Setup reduces installation and Installation and setup of

the underlying Fujitsu (e.g., clustering and . By eliminating prone manual operations,

high performance and high availability Hadoop environment is

fter initial configuration, the system may require more computing power

ta set increases. Fujitsu enables additional Hadoop slave to be added easily, allowing pre constructed system images to be deployed

in a single pass for quick system and server deployment with just a few

also allows for o scale-up the file

http://www.fujitsu.com/global/services/software/interstage Interstage point of failure can be eliminated through redundant operations.

is also improved by

Setup reduces installation and Installation and setup of

underlying Fujitsu clustering and ons, a high performance and

is

he system may require more computing power

Fujitsu Hadoop slave

pre-to be deployed on

for quick with just a few

(4)

DatasheetFujitsu Interstage Big Data Parallel Processing Server V1.0

Page 4 of 5 http://www.fujitsu.com/global/services/software/interstage

ttp://ts.fujitsu.com/Interstage

Server Hardware PRIMERGY RX(*1), (*2) , TX(*2)

Two PRIMERGY RX are required for master server clustering

Operating Systems Red Hat(R) Enterprise Linux(R) 5.6 (for Intel64)

Red Hat(R) Enterprise Linux(R) 5.7 (for Intel64) Red Hat(R) Enterprise Linux(R) 5.8 (for Intel64) Red Hat(R) Enterprise Linux(R) 6 (for Intel64) Red Hat(R) Enterprise Linux(R) 6.1 (for Intel64) Red Hat(R) Enterprise Linux(R) 6.2 (for Intel64)

Storage Hardware ETERNUS

iSCSI connection with 10 Gbit Ethernet is required(*3)

Other Software Prerequisites ServerView Agent for Linux V4.50.12 or later IPMI driver

ETERNUS Multi-path driver Interstage Application Server V10

User Interface English

*1: PRIMERGY RX900 is not supported for this product *2: One or more (up to four) dual core processors required *3: FC connection is not supported

(5)

Page 5 of 5 http://www.fujitsu.com/global/services/software/interstage ttp://ts.fujitsu.com/Interstage

Fujitsu Platform Solutions More information Copyright

In addition to Fujitsu Interstage Big Data Parallel Processing Server V1, Fujitsu provides a wide range of platform solutions. These solutions combine reliable Fujitsu products with the best in services, know-how and worldwide partnerships. Dynamic Infrastructures

With the Fujitsu Dynamic Infrastructures approach, Fujitsu offers a full portfolio of IT products, solutions and services, ranging from clients to datacenter solutions, Managed Infrastructure to Infrastructure as a Service. How much you benefit from Fujitsu technologies and services depends on the level of cooperation you choose. This takes IT flexibility and efficiency to the next level.

Computing products

www.fujitsu.com/global/services/computing/ ■PRIMERGY: Industrial standard server ■SPARC Enterprise: UNIX server ■PRIMEQUEST: Mission-critical IA server ■ETERNUS: Storage system

Software

www.fujitsu.com/software/

■Interstage: Application infrastructure software

■Systemwalker: System management software

To learn more about Fujitsu Interstage Big Data Parallel Processing Server V1, please contact your Fujitsu sales

representative, Fujitsu business partner, or visit the Fujitsu website.

www.fujitsu.com/software

 Copyright 2012 FUJITSU Limited. Fujitsu, the Fujitsu logo, and Interstage are trademarks or registered trademarks of Fujitsu Limited in Japan and other countries.

Linux is a registered trademark of Linus Torvalds in the United States and other countries.

Apache Hadoop, Hadoop, HDFS, HBase, Hive, and Pig are registered trademarks of the Apache Software Foundation in the United States and other countries. Other company product and service names may be trademarks or registered

trademarks of their respective owners. Fujitsu Green Policy Innovation

Fujitsu Green Policy Innovation is our worldwide project to reduce the burdens placed on the environment. Using our global know-how, we aim to resolve environmental energy efficiency issues through the use of IT. More information is available at:

www.fujitsu.com/global/about/environment/

Disclaimer

Technical data is subject to modification and delivery is subject to availability. Any liability that the data and illustrations contained herein are complete, actual or correct is hereby disclaimed. Designations used herein may be trademarks and/or copyrights of their respective manufacturers, the use of which by third parties for their own purposes may infringe the rights of such owners.

More Information

Contact Information

FUJITSU LIMITED

Address: Shiodome City Center,

1-5-2 Higashi-Shimbashi, Minato-Ku, Tokyo 105-7123, Japan

Website:www.fujitsu.com

References

Related documents

Hammer-ons and pull-offs are indicated by slurs (the context tells

Apache Hadoop is an open-source software framework written in Java platform that offers an efficient and effective method for storing and processing massive amounts of

or on a weekend, please report to the Public Relations office which is located across the street from the Mission’s main office, behind the gift shop.. If no one from Public

Charleston Community Hospital.[36] There, the Supreme Court of Illinois held that "the jury could have found a hospital negligent, inter alia, in failing to have a sufficient

Currently, a scalable simulation-based back-end, a back-end based on symbolic ex- ecution, and a formal back-end exploiting functional equivalences be- tween a C program and a

Data collection included pre-operative de- mographic data (sex, age, smoking history, body mass index [BMI] – kg/m 2 ), comorbidities, radiological assessment (diameter of

Correlation between Breast Self-Examination Practices and Demographic Characteristics, Risk Factors and Clinical Stage of Breast Cancer among Iraqi Patients..

Greetings Potential 2015 Ultimate Chicago Sandblast Event Sponsorship Partner, From creating brand awareness and increasing sales, Ultimate Chicago Sandblast is the ideal marketing