• No results found

# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis

N/A
N/A
Protected

Academic year: 2021

Share "# Not a part of 1Z0-061 or 1Z0-144 Certification test, but very important technology in BIG DATA Analysis"

Copied!
49
0
0

Loading.... (view fulltext now)

Full text

(1)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

1

Section 9 : Case Study #

Objectives of this Session

The Motivation For Hadoop

What problems exist with traditional large-scale computing systems

What requirements an alternative approach should have

How Hadoop addresses those requirements

Hadoop: Basic Concepts

What Is Hadoop?

The Hadoop Distributed File System (HDFS)

How Google MapReduce Algorithm works

Anatomy of a Hadoop Cluster

Who uses Hadoop ?

db.suven.net

# Not a part of 1Z0-061 or 1Z0-144 Certification test , but very important technology in BIG DATA Analysis

(2)

• Hadoop Solutions

– The most common problems Hadoop can solve

– The types of analytics often performed with Hadoop

– Where the data comes from ?

– The benefits of analyzing data with Hadoop

– How some real-world companies use Hadoop

• Hadoop Ecosystem

• Cloudera Software (All Open-Source)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

2

Objectives of this Session … contd…

(3)

The Motivation For Hadoop

compiled by Rocky Jagtiani Tech Head for 3 SCTPL , 9892544177

(4)

*

MPI: Message Passing Interface

PVM: Parallel Virtual Machine

compiled by Rocky Jagtiani Tech Head for 4 SCTPL , 9892544177

(5)

Major

Problem

compiled by Rocky Jagtiani Tech Head for 5 SCTPL , 9892544177

(6)

1 GB = 1000 MB , 1 TB = 1000 GB , 1 PT = 1000 TB , 1 Exabyte = 1000 PT

PT => petabyte , TB => teraByte

compiled by Rocky Jagtiani Tech Head for 6 SCTPL , 9892544177

(7)

compiled by Rocky Jagtiani Tech Head for 7 SCTPL , 9892544177

(8)

The Motivation For Hadoop

compiled by Rocky Jagtiani Tech Head for 8 SCTPL , 9892544177

(9)

1.

2.

compiled by Rocky Jagtiani Tech Head for 9 SCTPL , 9892544177

(10)

3.

4.

5.

10 compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177

(11)

Hadoop History

compiled by Rocky Jagtiani Tech Head for 11 SCTPL , 9892544177

(12)

Core Hadoop Concepts

compiled by Rocky Jagtiani Tech Head for 12 SCTPL , 9892544177

(13)

Hadoop Components

compiled by Rocky Jagtiani Tech Head for 13 SCTPL , 9892544177

(14)

HDFS

compiled by Rocky Jagtiani Tech Head for 14 SCTPL , 9892544177

(15)

HDFS

Concepts

compiled by Rocky Jagtiani Tech Head for 15 SCTPL , 9892544177

(16)

HDFS : How Files Are Stored ?

compiled by Rocky Jagtiani Tech Head for 16 SCTPL , 9892544177

(17)

How Files Are Stored: Example

compiled by Rocky Jagtiani Tech Head for 17 SCTPL , 9892544177

(18)

IMP :

How MapReduce Work ?

compiled by Rocky Jagtiani Tech Head for 18 SCTPL , 9892544177

(19)

MapReduce: The Mapper

19 compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

(20)

Example :

compiled by Rocky Jagtiani Tech Head for 20 SCTPL , 9892544177

(21)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 21

(22)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 22

(23)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 23

(24)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

24

(25)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 25

Anatomy of a Hadoop Cluster :

(26)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 26

(27)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 27

(28)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

28

Who uses Hadoop ?

(29)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 29

Hadoop Solutions

(30)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 30

A

(31)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 31

B What is Problem if the data is coming ?

(32)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 32

C

(33)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 33

The most common problems Hadoop can solve :

We understand how each problem is solved using Hadoop in brief

D

(34)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 34

(35)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 35

(36)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 36

(37)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 37

(38)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 38

(39)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 39

(40)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 40

(41)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 41

(42)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 42

How some real-world companies use Hadoop

E

(43)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 43

Hadoop Ecosystem

(44)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177 44

Cloudera Software (All Open-Source)

(45)

compiled by Rocky Jagtiani Tech Head for

SCTPL , 9892544177 45

*enterprise data

warehouse (EDW)

Conclusion :

(46)

1) Input to mapper is

"Google is one of the richest companies "

"one who works with the Google is technical expert "

what will be the out put after reducing ?

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

46

Questions

(47)

2) Input to mapper is

"Cat is eating milk"

"Cat is very sweet and she likes milk"

"milk is in bottle"

what will be the out put after reducing ?

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

47

(48)

3) Input to mapper is

"Dollar is national currency for USA"

"Rupee is national currency for India"

"Dollar is ahead of Rupee in economy"

"India is developing country"

what will be the out put after Mapping ?

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

48

(49)

compiled by Rocky Jagtiani Tech Head for SCTPL , 9892544177

49

what will be the out put after reducing ?

what will be the out put after shuffling?

References

Related documents

Greetings Potential 2015 Ultimate Chicago Sandblast Event Sponsorship Partner, From creating brand awareness and increasing sales, Ultimate Chicago Sandblast is the ideal marketing

Examples are pulse width modulation (PWM) converters and cycloconverters. Interharmonics generated by them may be located anywhere in the spectrum with respect to

Purpose: To evaluate whether rehabilitation of muscle force or balance improves postural stability and quality of life (QoL), and whether self-administered training

Cisco Devices Generic DoS 367 ICMP Remote DoS Vulnerabilities 367 Malformed SNMP Message DoS Vulnerability 369. Examples of Specific DoS Attacks Against Cisco Routers 370 Cisco

Travel and Evacuation Insurance: Each participant MUST have Travel Medical and Evacuation insurance to attend this course.. This insurance is available from

This research focuses on clustering of genes with similar expres- sion patterns using Hidden Markov Models (HMM) for time course data because they are able to model the

Re- sults therefore suggest that meaningful growth in residential PV capacity in Norway depends greater knowledge among households, continued advances in technology, clarity with

This study, therefore, recommended 25% inclusion level of ANLE in boar semen extension up to 48 h as indicated by observed mean values of all parameters, which