• No results found

BIG DATA SERIES: HADOOP DEVELOPER TRAINING PROGRAM. An Overview

N/A
N/A
Protected

Academic year: 2021

Share "BIG DATA SERIES: HADOOP DEVELOPER TRAINING PROGRAM. An Overview"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

BIG DATA SERIES: HADOOP DEVELOPER

TRAINING PROGRAM

An Overview

(2)

Contents

Contents

... 1

BIG DATA SERIES: HADOOP DEVELOPER TRAINING PROGRAM ... 1

Program Overview ... 4

Curriculum ... 5

Module 1: Big Data: Hadoop Introduction... 5

Module 2: Deep dive Hadoop ... 5

Module 3: All Things MapReduce ... 6

Module 4: All Things Pig ... 6

Module 5: All Things Hive ... 7

Module 6: All Things HBase ... 7

Module 7: Impala and Zookeeper ... 7

Module 8: All things Oozie ... 8

Module 9: MR Unit Testing Framework ... 8

Module 10: Bringing It All-together ... 9

(3)
(4)

Section

1

Program Overview

Big Data is the new rage. Here is an opportunity to join the revolution. As the data is growing beyond one’s imagination, Enterprises are finding it even harder to manage and run their Businesses effectively. Thankfully Apache Hadoop arrives on the scene, at a perfect time to solve the problems.

Join us at eduBrake to fully enable your development teams in address everything they need to know to successfully realize ROI on your Big Data investments. Knowledge and training investments pay dividends for years to come. Get all the training you need from our experts on Big Data.

Big Data Series: Hadoop Developer Training Program is exactly suited for those who want to learn all things Hadoop. The training enables your team to successfully understand Hadoop architecture, write Map Reduce Programs, bring in and handle data, learn other Hadoop ecosystem applications such as Pig, Hbase, Hive, Flume, Pig, Oozie etc. The training program provides a direct hands on experience along with 40 hrs exercise and certification project.

The program is designed to be an Instructor led Virtual training along with a hands on exercise assignments. The student will gain access to an Online Learning Management System to manage their progress and submit the assignments. The students will have weekly office hours for helping them on their exercises.

(5)

Section

2

Curriculum

Hadoop Developer training program’s curriculum is as given below.

Module 1: Big Data: Hadoop Introduction.

In this module the students will gain complete understanding of what a Big Data technology offers and the architectural consideration for Hadoop and its eco-system components.

Topics covered:

 Need for Big Data.

 Grand Entry of Hadoop – The Hero.

 Hadoop Architecture.

 Hadoop Ecosystem.

 Home Work Assignment – Setup and install Hadoop VM

 Home Work Assignment – Load Preliminary Data + Run Hadoop Commands.

Module 2: Deep dive Hadoop

In this module, the students will learn a deeper understanding of Hadoop along with the techniques needed to load the data in to the Hadoop

Infrastructure.

Topics covered

(6)

MapReduce Introduction

Flume

Sqoop

Module 3: All Things MapReduce

In this module, everything about MapReduce will be taught. This module will involve heavy hands-on exercises. Depending upon the speed with which the students cruise through, we may decide even to spend more time on this module. If you get all of this, consider you are already on the path of becoming a Map Reduce expert.

Topics covered

 Java Map Reduers

 Streaming Map Reducers

 Combiners

 Partitioners and Reducers

 Algorithms

Module 4: All Things Pig

In this module, everything about Pig data flow engine. The students will get a clear understanding on the Pig and Pig Latin, the data flow scripting language.

Topics covered

 Intro to Pig

 Installing and Running Pig

 Grunt

 Types and Schemas

(7)

 Developing and Testing

Module 5: All Things Hive

In this module, everything about Hive will be taught. The students will learn how Hive provides an SQL dialect for querying the data stored in Hadoop cluster.

Topics covered

 Intro to Hive

 Hive Architecture

 HiveQL : Data definition and Data Manipulation

 And everything else about Hive.

Module 6: All Things HBase

In this module, everything about HBase is discussed.

Topics covered

 Intro to HBase

 HBase Usecases

 HBase Datamodel

 HBase Schema Design.

Module 7: Impala and Zookeeper

In this module, the students will get their understanding around Impala and Zookeeper

(8)

Topics covered

 Intro to Impala

 Intro to Zookeeper

Module 8: All things Oozie

In this module, the students will learn everything about Oozie – the workflow processor for Hadoop. This module presents lot of

opportunities for hands-on. The students will configure Oozie workflow with different kinds of actions and test them together on the Hadoop Infrastructure.

Topics covered

 Intro to Oozie.

 Configuring Oozie workflows.

Module 9: MR Unit Testing Framework

In this module, the students will learn how to test Map Reduce programs on and off the Hadoop infrastructure. Expect again, a lot of hands-on experience on this class.

Topics covered

 Map Reduce revisited

 Standalone Testing of Map Reduce

 Mapper Testing

 Reducer Testing

 Counter Testing

(9)

Module 10: Bringing It All-together

In this module, the students will re-cap everything they have learned. All of the skills being taught are put in to practice in the form of a

Certification Project. Students are required to work on the Certification Project on their own. They will be given 2 weeks of time to submit their Certification Project, which earns them an “eduBrake Certified Hadoop Professional” title.

(10)

Section

3

Conclusion

eduBrake believes strongly in imparting world class high quality

professional training for the students. Our instructor employ most cutting edge learning techniques and tools to ensure that the students get a very engaging learning experience. We use different learning methodologies like Mind Mapping techniques, visual story boards etc to breakdown a complex concept in to something that can be easily understood.

The training will be delivered in an Instructor Led Virtual Class room. If your organization has specific training needs, please email or call us. The details are given below:

eduBrake.com

2355 Lebanon Rd, #10102

Frisco, TX 75034

T: 1-972-370-3089

References

Related documents

Because LTSS encounters will likely not be reflected in the primary care medical record, states should consider the broader definition of medical record as inclusive of all

When deciding for an eCommerce Solution that works with SAP Business ONE you can measure the available features against those success factors.. In order to do this in a

both models were statistically significant in a number of subgroups that are considered to be low-to-moderate-risk using traditional metrics; e.g., patients with a TIMI risk score ≤

requirements and specifications which are to provide, install, and support a high- speed, fiber optic-based 10Gb District Wide Area Network (WAN) service for Albany City

We base our so-called G-Net on a vehicle detection followed by a wheel localization phase on the cropped image of the vehicle, both based on a recurrent neural network [7, 11]

 Web Based UI  HBase Architecture  HBase Components  Zookeeper  Compaction  HBase Hands-on  Mapreduce integration  Pig Integration  Hive

character in the Game of Thrones series, Jon Snow’s entire role in the show serves as the source of his description, which begins in the first season when he is the bastard of

Dari hasil pengujian dengan simulasi komputer menunjukkan bahwa rangkaian boost-up chopper efektif untuk mendapatkan tegangan DC yang stabil dari sistem panel surya dan tegangan