Hadoop, MapReduce for Big Data problems - Part 1

Location
Online
Dates
Can be taken anytime
Course Type
Professional Training Course
Accreditation
Yes
Language
English
Price
$10

Course Overview

This course covers Hadoop and MapReduce in depth and pairs the material with hands-on practice. You'll learn how to set up your own cluster using both VMs and the cloud. All the major features of MapReduce are covered, including advanced topics like Total Sort and Secondary Sort. You will also learn how to build your Hadoop cluster and customize your own MapReduce jobs. By the end of Parts 1 and 2 of this course, you can expect an in-depth understanding of Hadoop, MapReduce and Big Data problems.

Coupon code - WIISEGT

Who should take this course

This course is open to all learners.

Accreditation

WIISE

Course content

The course outline is as follows:

Introduction:

  • Introduction

Importance of Big Data:

  • The Big Data Prototype
  • Serial vs Distributed Computing

Hadoop Overview:

  • What is Hadoop?
  • HDFS or the Hadoop Distributed File System
  • MapReduce Introduction
  • YARN or Yet Another Resource Negotiator

Hadoop Installation:

  • Install Modes
  • Standalone Mode Install
  • Pseudo-Distributed Mode Install

The MapReduce "Hello World":

  • The basic philosophy underlying MapReduce
  • Visualized And Explained
  • Digging a little deeper at every step
  • "Hello World" in MapReduce
  • The Mapper
  • The Reducer
  • The Job

Run a MapReduce Job:

  • Get comfortable with HDFS
  • Run your first MapReduce Job

Combiners, Shuffle and Sort and The Streaming API:

  • Combiners Usage
  • Not all Reducers are Combiners
  • Mappers and Reducers
  • Parallelizing Reduce using Shuffle And Sort
  • Introducing the Streaming API
  • Python for MapReduce

HDFS:

  • Protecting against data loss using replication
  • NameNodes and why they're critical
  • Checkpointing to back up NameNode information

YARN:

  • Basic Components
  • Submitting a Job to YARN
  • Plug in Scheduling Policies
  • Configure the Scheduler

MapReduce Customizations For Finer Grained Control:

  • Setting up your MapReduce to Accept Command Line Arguments
  • The Tool, ToolRunner and GenericOptionsParser
  • Configuring Properties of the Job Object
  • Customizing the Partitioner, Sort Comparator and Group Comparator

About Course Provider

WIISE is a 'Professional Learning Network' with global reach that helps anyone learn anything to achieve personal and professional goals.

We bring our learners top-rated interactive courses and certifications from across the world, delivered through respected global academic institutes and industry experts.

WIISE for Teams is a smart training solution for growing businesses (SMBs) that delivers cost-effective, on-demand online training, staff engagement and upskilling to their employees and customers. WIISE incorporates the latest micro-learning and social-learning techniques to provide fast, engaging training at a fraction of the cost of traditional training methods.

WIISE is brought to you by PositiveShift Group, a respected learning services and skill development company based in Silicon Valley, CA, USA and India (www.positiveshift.in). The company has been awarded a unique innovation partnership with the National Skill Development Corporation (NSDC) and the Ministry of Skill Development and Entrepreneurship, Govt of India.
