Big Data & Hadoop Training Course Content

Overview / Description: Hadoop developer training builds your development skills in big data domain. Hadoop is a free source, Java based programming framework that assists the processing and storage of highly large data sets in a distributed computing environment. Hadoop developer jobs is same as the software developer and their roles are also similar but the domain in which they work is different.

Prerequisites / Eligibility: There are no prerequisites for Hadoop knowledge. This course is designed for people proficient with Object Oriented Programming language (OOPS) and JAVA.

Detailed Course Content :

Chapter 01:Introduction

  • 1.1 Bigdata Introduction
  • 1.2 Distributed System
  • 1.3 BigData Use cases
  • 1.4 Various Solutions
  • 1.5 Overview of Hadoop Echo System

Chapter 02: ZooKeeper

  • 2.1 ZooKeeper -Race Condition
  • 2.2 ZooKeeper – Dead Lock
  • 2.3 Use Cases
  • 2.4 When not to Use

Chapter 03: HDFS

  • 3.1 Why HDFS and Why not existing File system
  • 3.2 HDFS – Name Node and Data Node
  • 3.3 Advanced HDFS concepts (HA, Federation)
  • 3.4 Data Locality (RAC awareness

Chapter 04: YARN

  • 4.1 YARN – Why not existing tools
  • 4.2 YARN – Evolution from MapReduce 1.0
  • 4.3 Resource Management – YARN Architecture
  • 4.4 Advanced Concept – Speculative Execution

Chapter 05: MapReduce Basics

  • 5.1 MapReduce-Understanding Sorting
  • 5.2 MapReduce-Overview
  • 5.3 Example 1- Word Frequency Problem without MapReduce
  • 5.4 Example2- Only Mapper – Image Resizing
  • 5.5 Example3- Word Frequency Problem
  • 5.6 Exampe4 – Temperature Problem
  • 5.7 Example5 – Multiple Reducer
  • 5.8 Example6 – Java MapReduce walk through

Chapter 06: MapReduce – Advanced

  • 6.1 Writing MapReduce Code Using Java
  • 6.2 Building MapReduce project using Ant
  • 6.3 Concept – Associative and Commutative
  • 6.4 Example1- Combiner
  • 6.5 Example2- Hadoop Streaming
  • 6.6 Example3 – Advanced Problem Solving – Anagrams
  • 6.7 Example4 -Advanced Problem Solving – Same DNA
  • 6.8 Example5 -Advanced Problem Solving – Similar DNA
  • 6.9 Example6-Joins – Voting
  • 6.10. Limitations of MapReduce

Chapter 07: Analyzing Data with Pig

  • 7.1 Pig – Introduction
  • 7.2 Pig – Modes
  • 7.3 Getting Started
  • 7.4 Example – NYSE (NewYork Stock Exchange)
  • 7.5 Concept – Lazy Evolution

Chapter 08: rocessing Data with Hive

  • 8.1 Hive Introduction
  • 8.2 Hive – Data Type
  • 8.3 Getting Started
  • 8.4 Loading Data in Hive Tables
  • 8.5 Example: Movelens Data Processing
  • 8.6 Advanced Concepts – Views
  • 8.7 Connecting Tableau and Hive Server2
  • 8.8 Connecting Microsoft Excel and Hive Server2
  • 8.9 Project: Sentiment Analysis of Twitter Data
  • 8.10. Advanced – Partitioned Tables
  • 8.11. Understanding HCatelog & Impala

Chapter 09: NoSQL and HBase

  • 9.1 NoSQL – Scaling Out /Up
  • 9.2 NoSQL – ACID properties and RDBMS
  • 9.3 CAP Theorem
  • 9.4 HBase Architecture – Region Servers
  • 9.5 HBase Data Model – Column Family Orientedness
  • 9.6 Getting Started – Creating tables and adding Data
  • 9.7 Example – Google Link Storage
  • 9.8 Bloom Filter – Concept
  • 9.9 Comparison of NoSQL Databases.

Chapter 10: importing Data with Sqoop, Flume and Oozie

  • 10.1 sqoop – Introduction
  • 10.2 Sqoop Import – MySQL to HDFS
  • 10.3 Exporting from MySQL to HDFS
  • 10.4 Unbound Dataset Processing or Stream processing
  • 10.5 Flume Overview –
  • 10.6 Source , Sink, Channel
  • 10.7 Example1: Data from Local Network service into HDFS
  • 10.8 Example2: Extracting Twitter Data
  • 10.9 Creating workflow with Oozie

Chapter 11: Fundamentals of Scala

  • 11.1 Scala – Quick Introduction
  • 11.2 Scala – Quick Introduction – Variables and Methods
  • 11.3 Getting Started: Interactive, Compilation, SBT
  • 11.4 Types, Variables & Values
  • 11.5 Functions
  • 11.6 Collections
  • 11.7 Classes
  • 11.8 Parameters

Chapter 12: Spark Basics

  • 12.1 Spark Introduction – Why Spark?
  • 12.2 Using the Spark Shell
  • 12.3 Example 1 – Performing Word Count
  • 12.4 Understanding Spark Cluster Modes on YARN
  • 12.5 RDDs (Resilient Distributed Datasets)
  • 12.6 General RDD Operations: Transformations & Actions
  • 12.7 RDD lineage
  • 12.8 RDD Persistence Overview
  • 12.9 Distributed Persistence
  • Member Testimonials

    Vinodh GV

    one million likes from my side.. I'm just a beginner and so It helped me a lot... _/\_
    2017-06-30T12:08:52+00:00
    one million likes from my side.. I'm just a beginner and so It helped me a lot... _/\_

    Divya M

    "I got a full time offer at Availity and today was my first day there .First of all I have to thank You all for... Read More
    2017-06-30T12:44:12+00:00
    "I got a full time offer at Availity and today was my first day there .First of all I have to thank You all for all the classes and the projects which gave me immense confidence because of which I am here.Also I would like to thank you all for the extra effort for providing reference for my work with Atomic77. Thank you Karthik,Manoj & Saqib for always insisting on practice.During my interview,the moment I started talking in detail about developing keyword driven framework which I worked on at SLP project, I guess the interview panel were almost sure their search has ended. I would definitely suggest ITelearn to my friends who are looking to learn things in the right way.Being a part of your learning experience not only teaches us confidence but most importantly teaches us to learn and debug anything the right way...I will always be Thankful to ITelearn team and will continue attending few of your ongoing courses. Thanks again and I wish ITelearn to spread more confidence and happiness to all its attendees..Will keep in touch. "

    Khurram Mehmood

    Awesome!! I actually felt like in a class, superb presentation
    2017-06-30T12:40:27+00:00
    Awesome!! I actually felt like in a class, superb presentation

    Andre Alexei

    "I agree, no Selenium. Otherwise, excellent JUnit tutorial. Very smooth delivery and build/code on the fly. Karthik. Good work. "
    2017-06-30T12:19:45+00:00
    "I agree, no Selenium. Otherwise, excellent JUnit tutorial. Very smooth delivery and build/code on the fly. Karthik. Good work. "

    Jianfeng Sun

    It is really good training for QTP. Thanks so much. Recently, google has changed the gmail system. More ajax and elements become more difficult to... Read More
    2017-06-30T12:28:39+00:00
    It is really good training for QTP. Thanks so much. Recently, google has changed the gmail system. More ajax and elements become more difficult to identify. Can you give a training about the same testing purpose on new gmail system?
  •  

Leave a Reply

Your email address will not be published. Required fields are marked *