Hadoop Admin Online Training - PowerPoint PPT Presentation

About This Presentation
Title:

Hadoop Admin Online Training

Description:

The Glory IT Technologies Hadoop Administration Online training course is designed to provide the requisite knowledge, skills for you to become a successful Hadoop architect, big data engineer or Hadoop administrator. It begins with tutorials on the fundamental concepts of Apache Hadoop and Hadoop Cluster. It enables you to deploy, configure, manage, monitor, and secure a Hadoop Cluster. The course will also provide a brief on Hive & HBase Administration. There course will also include many challenging, practical and focused hands-on exercises. Towards end of the course, you will be able to understand and solve real industry-relevant problems that you will encounter while working on Hadoop Cluster. – PowerPoint PPT presentation

Number of Views:37

less

Transcript and Presenter's Notes

Title: Hadoop Admin Online Training


1
Hadoop Admin Online Training
  • Glory IT
    Technologies

2
Prerequisites
  • Knowledge of Hadoop and Distributed Computing.

3
Module 1 Introduction to Hadoop
  • The amount of data processing in todays life
  • What Hadoop is why it is important?
  • Hadoop comparison with traditional systems
  • Hadoop history
  • Hadoop main components and architecture

4
Module 2 Hadoop Distributed File System (HDFS)
  • HDFS overview and design
  • HDFS architecture
  • HDFS file storage
  • Component failures and recoveries
  • Block placement
  • Balancing the Hadoop cluster

5
Module 3 Planning your Hadoop cluster
  • Planning a Hadoop cluster and its capacity
  • Hadoop software and hardware configuration
  • HDFS Block replication and rack awareness
  • Network topology for Hadoop cluster

6
Module 4 Hadoop Deployment
  • Different Hadoop deployment types
  • Hadoop distribution options
  • Hadoop competitors
  • Hadoop installation procedure
  • Distributed cluster architecture

7
Module 5 Working with HDFS
  • Ways of accessing data in HDFS
  • Common HDFS operations and commands
  • Different HDFS commands
  • Internals of a file read in HDFS
  • Data copying with distcp

8
Module 6 -Mapreduce Abstraction
  • What MapReduce is and why it is popular
  • The Big Picture of the MapReduce
  • MapReduce process and terminology
  • MapReduce components failures and recoveries
  • Working with MapReduce

9
Module 7 Hadoop Cluster Configuration
  • Hadoop configuration overview and important
    configuration file
  • Configuration parameters and values
  • HDFS parameters MapReduce parameters
  • Hadoop environment setup
  • Include and Exclude configuration files

10
Module 8 Hadoop Administration and Maintenance
  • Namenode/Data node directory structures and files
  • File system image and Edit log
  • The Checkpoint Procedure
  • Namenode failure and recovery procedure
  • Safe Mode
  • Metadata and Data backup
  • Potential problems and solutions / what to look
    for
  • Adding and removing nodes

11
Module 9 Hadoop Monitoring and Troubleshooting
  • Best practices of monitoring a Hadoop cluster
  • Using logs and stack traces for monitoring and
    troubleshooting
  • Using open-source tools to monitor Hadoop cluster

12
Module 10 Job Scheduling
  • How to schedule Hadoop Jobs on the same cluster
  • Default Hadoop FIFO Schedule
  • Fair Scheduler and its configuration

13
Module 11 Hadoop Multi Node Cluster Setup and
Running Map Reduce Jobs on Amazon Ec2
  • Hadoop Multi Node Cluster Setup using Amazon ec2
    Creating 4 node cluster setup
  • Running Map Reduce Jobs on Cluster

14
Contact us free Demo
  • We stay with you until you get the results you
    want.
  • If you really interested, please let me know .
  • We will arrange the Demo Session.
  • Feel Free to call us any time
  • Thanks RegardsSrinivasGloryITTechnologiesEmai
    lInfo_at_gloryittechnologies.comPhone91-903281345
    6/91-9160177789Skype ID gloryittechnologies

15
  • THANK YOU
Write a Comment
User Comments (0)
About PowerShow.com