BIG DATA SERIES: HADOOP DEVELOPER
TRAINING PROGRAM
An Overview
Contents
BIG DATA SERIES: HADOOP DEVELOPER TRAINING PROGRAM
Program Overview
Curriculum
Module 1: Big Data: Hadoop Introduction
Module 2: Deep dive Hadoop
Module 3: All Things MapReduce
Module 4: All Things Pig
Module 5: All Things Hive
Module 6: All Things HBase
Module 7: Impala and ZooKeeper
Module 8: All Things Oozie
Module 9: MRUnit Testing Framework
Module 10: Bringing It All Together
Section 1
Program Overview
Big Data is the new rage, and here is an opportunity to join the revolution. As data grows beyond imagination, enterprises are finding it ever harder to manage that data and run their businesses effectively. Apache Hadoop has arrived at the right time to address these problems.
Join us at eduBrake to equip your development teams with everything they need to know to realize ROI on your Big Data investments. Investments in knowledge and training pay dividends for years to come. Get all the training you need from our Big Data experts.
The Big Data Series: Hadoop Developer Training Program is suited to those who want to learn all things Hadoop. The training enables your team to understand the Hadoop architecture, write MapReduce programs, bring in and handle data, and learn other Hadoop ecosystem applications such as Pig, HBase, Hive, Flume, and Oozie. The program provides direct hands-on experience, along with 40 hours of exercises and a certification project.
The program is designed as instructor-led virtual training with hands-on exercise assignments. Students gain access to an online Learning Management System to track their progress and submit their assignments. Weekly office hours are available to help students with their exercises.
Section 2
Curriculum
The Hadoop Developer training program's curriculum is given below.
Module 1: Big Data: Hadoop Introduction
In this module, the students will gain an understanding of what Big Data technology offers and of the architectural considerations for Hadoop and its ecosystem components.
Topics covered:
Need for Big Data.
Grand Entry of Hadoop – The Hero.
Hadoop Architecture.
Hadoop Ecosystem.
Homework Assignment – Set up and install the Hadoop VM.
Homework Assignment – Load preliminary data and run Hadoop commands.
Module 2: Deep dive Hadoop
In this module, the students will gain a deeper understanding of Hadoop, along with the techniques needed to load data into the Hadoop infrastructure.
Topics covered
MapReduce Introduction
Flume
Sqoop
Module 3: All Things MapReduce
In this module, everything about MapReduce will be taught, with heavy hands-on exercises. Depending on how quickly the students progress, we may decide to spend more time on this module. If you master all of it, you are well on the path to becoming a MapReduce expert.
Topics covered
Java Map Reducers
Streaming Map Reducers
Combiners
Partitioners and Reducers
Algorithms
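To give a feel for the streaming topic above, here is a minimal word-count sketch in Python. It is an illustration only, not course material: the function names (`map_lines`, `reduce_lines`, `run_locally`) are hypothetical, and the local sort merely stands in for the shuffle step that Hadoop performs between the map and reduce phases.

```python
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record per word."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reduce_lines(sorted_records):
    """Reducer: sum the counts for each word; input must be sorted by key."""
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_locally(lines):
    """Stand-in for the framework: map, sort by key (the shuffle), then reduce."""
    return list(reduce_lines(sorted(map_lines(lines))))


print(run_locally(["the cat", "The hat"]))
```

In an actual streaming job, each function would typically be wrapped in its own script that reads records from standard input and prints its output records. Because summing is associative, the same reduce logic could also serve as a combiner.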
Module 4: All Things Pig
In this module, everything about the Pig data flow engine will be taught. The students will gain a clear understanding of Pig and of Pig Latin, its data flow scripting language.
Topics covered
Intro to Pig
Installing and Running Pig
Grunt
Types and Schemas
Developing and Testing
Module 5: All Things Hive
In this module, everything about Hive will be taught. The students will learn how Hive provides an SQL dialect for querying the data stored in a Hadoop cluster.
Topics covered
Intro to Hive
Hive Architecture
HiveQL: Data Definition and Data Manipulation
And everything else about Hive.
Module 6: All Things HBase
In this module, everything about HBase is discussed.
Topics covered
Intro to HBase
HBase Use Cases
HBase Data Model
HBase Schema Design
Module 7: Impala and ZooKeeper
In this module, the students will gain an understanding of Impala and ZooKeeper.
Topics covered
Intro to Impala
Intro to ZooKeeper
Module 8: All Things Oozie
In this module, the students will learn everything about Oozie, the workflow processor for Hadoop. This module presents many opportunities for hands-on work. The students will configure Oozie workflows with different kinds of actions and test them together on the Hadoop infrastructure.
Topics covered
Intro to Oozie.
Configuring Oozie workflows.
Module 9: MRUnit Testing Framework
In this module, the students will learn how to test MapReduce programs on and off the Hadoop infrastructure. Once again, expect a lot of hands-on experience in this class.
Topics covered
Map Reduce revisited
Standalone Testing of Map Reduce
Mapper Testing
Reducer Testing
Counter Testing
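The idea behind standalone testing is that a mapper, a reducer, and their counters can each be checked with fixed inputs, without a running cluster. The Python sketch below illustrates that idea only. It does not use MRUnit, which is a Java library, and the names `word_mapper`, `sum_reducer`, and `BLANK_LINES` are hypothetical.

```python
from collections import Counter


def word_mapper(line, counters):
    """Hypothetical mapper: emit (word, 1) pairs and count blank input lines."""
    if not line.strip():
        counters["BLANK_LINES"] += 1
        return []
    return [(word.lower(), 1) for word in line.split()]


def sum_reducer(key, values):
    """Hypothetical reducer: total the values seen for one key."""
    return (key, sum(values))


def test_mapper():
    # Mapper test: one input line, a known list of output pairs.
    assert word_mapper("Big Data big", Counter()) == [("big", 1), ("data", 1), ("big", 1)]


def test_reducer():
    # Reducer test: one key with its grouped values.
    assert sum_reducer("big", [1, 1]) == ("big", 2)


def test_counter():
    # Counter test: a blank line should increment the counter and emit nothing.
    counters = Counter()
    assert word_mapper("   ", counters) == []
    assert counters["BLANK_LINES"] == 1


for test in (test_mapper, test_reducer, test_counter):
    test()
```

Each test feeds one unit a small, fixed input and compares the result with a known output, which is the same pattern the module's mapper, reducer, and counter topics describe.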
Module 10: Bringing It All Together
In this module, the students will recap everything they have learned. All of the skills taught are put into practice in the form of a Certification Project. Students are required to work on the Certification Project on their own. They will be given two weeks to submit it, and successful completion earns them the “eduBrake Certified Hadoop Professional” title.
Section 3
Conclusion
eduBrake believes strongly in imparting world-class, high-quality professional training to its students. Our instructors employ cutting-edge learning techniques and tools to ensure that students get an engaging learning experience. We use learning methodologies such as mind mapping and visual storyboards to break down a complex concept into something that can be easily understood.
The training will be delivered in an instructor-led virtual classroom. If your organization has specific training needs, please email or call us. The details are given below:
eduBrake.com
2355 Lebanon Rd, #10102
Frisco, TX 75034
T: 1-972-370-3089