click to enable zoom
Loading Maps
We didn't find any results
View Roadmap Satellite Hybrid Terrain My Location Fullscreen Prev Next
Advanced Search

₹ 0 to ₹ 100,000

We found 0 results. Do you want to load the results now ?
Advanced Search

₹ 0 to ₹ 100,000

we found 0 results
Your search results

Hadoop Developer at digITs Technologies Institute





₹ 16,000  
#703, 30th Main, 1st Phase, 2nd Stage, ,
add to favorites
699

About The Course:-

This course will cover concept such as HDFS, Hadoop Cluster,   Hadoop Architecture etc.,.

Who Should Take this?

Systems administrators, linux administrators, windows administrators, Infrastructure engineers, Big Data Architects, DB Administrators, IT managers and Mainframe Professionals.

Pre-requisites for this training
This course requires no prior knowledge of Java, Hadoop Cluster Administration or Apache Hadoop. Fundamental knowledge of Linux basics is necessary as Hadoop runs on Linux.

Course Objectives:-

After completion of the course you should be able to unserstand :

  • The core technologies of Hadoop
  • How to populate HDFS from external sources
  • How to plan your Hadoop cluster hardware and software
  • How to deploy a Hadoop cluster
  • What issues to consider when installing Pig, Hive, and Impala
  • What issues to consider when deploying Hadoop clients
  • How Cloudera Manager can simplify Hadoop administration
  • How to configure HDFS for high availability
  • What issues to consider when implementing Hadoop security
  • How to schedule jobs on the cluster
  • How to maintain your cluster
  • How to monitor, troubleshoot, and optimize the cluster
  • Management and monitoring tools

Introduction

The Motivation for Hadoop

·         Problems with traditional large-scale systems

·         Requirements for a new approach

Hadoop: Basic Concepts

·         An Overview of Hadoop

·         The Hadoop Distributed File System

·         Hands-On Exercise

·         How MapReduce Works

·         Hands-On Exercise

·         Anatomy of a Hadoop Cluster

·         Other Hadoop Ecosystem Components

Writing a MapReduce Program

·         The MapReduce Flow

·         Examining a Sample MapReduce Program

·         Basic MapReduce API Concepts

·         The Driver Code

·         The Mapper

·         The Reducer

·         Hadoop’s Streaming API

·         Using Eclipse for Rapid Development

·         Hands-on exercise

·         The New MapReduce API

Delving Deeper Into The Hadoop API

·         More about ToolRunner

·         Testing with MRUnit

·         Reducing Intermediate Data With Combiners

·         The configure and close methods for Map/Reduce Setup and Teardown

·         Writing Partitioners for Better Load Balancing

·         Hands-On Exercise

·         Directly Accessing HDFS

·         Using the Distributed Cache

·         Hands-On Exercise

Common MapReduce Algorithms

·         Sorting and Searching

·         Indexing

·         Machine Learning With Mahout

·         Term Frequency – Inverse Document Frequency

·         Word Co-Occurrence

·         Hands-On Exercise

Using HBase

·         What is HBase?

·         HBase Architecture

·         HBase API

·         Managing large data sets with HBase

·         Using HBase in Hadoop applications

·         Hands-on exercise

Using Hive and Pig

·         Hive Basics

·         Pig Basics

·         Hands-on exercise

Practical Development Tips and Techniques

·         Debugging MapReduce Code

·         Using LocalJobRunner Mode For Easier Debugging

·         Retrieving Job Information with Counters

·         Logging

·         Splittable File Formats

·         Determining the Optimal Number of Reducers

·         Map-Only MapReduce Jobs

·         Hands-On Exercise

More Advanced MapReduce Programming

·         Custom Writables and WritableComparables

·         Saving Binary Data using SequenceFiles and Avro Files

·         Creating InputFormats and OutputFormats

·         Hands-On Exercise

Joining Data Sets in MapReduce

·         Map-Side Joins

·         The Secondary Sort

·         Reduce-Side Joins

Good Course
  • Content
  • Instructor
  • Institute
4.7
User Rating 0 (0 votes)
Sending
Comments Rating 0 (0 reviews)
hadoop admin digits technologies Bangalore course
Price: ₹ 16,000
Start-End Dates: Contact Institute
Course Duration: 30 days
Instructional Level: Appropriate for All
Certification
Quizzes
Live Projects
Doubt Clearing Sessions
Reading Material
EMI Option
Online Support
Post completion course access
Practice Exams
Placement assistance
Refund Policy
Post completion support

Compare courses

Leave a Reply

digITs Technologies

Bangalore
+91 80-42105164
+91 9108498070
[email protected]

Contact Us