Hadoop online training fro EasylearningGuru - PowerPoint PPT Presentation

About This Presentation
Title:

Hadoop online training fro EasylearningGuru

Description:

easylearningguru provides online classes on Hadoop ,witch will help u to lean hadoop easy and fast – PowerPoint PPT presentation

Number of Views:357

less

Transcript and Presenter's Notes

Title: Hadoop online training fro EasylearningGuru


1
Welcome to the World of Big Data Hadoop
2
Agenda
  • What is Big Data ?
  • Different Kinds of Big Data
  • Big Data Global Market
  • Hadoop Global job trends
  • What is Hadoop ?

3
What is Big Data?
  • Big data is the term for a collection of data
    sets so large and complex that it becomes
    difficult to process using on-hand database
    management tools or traditional data processing
    applications.

4
Types of Big Data ?
Semi-Structured Data
Traditional RDBMS deals with only Structured data.
Need of a technology which deals with
Semi-structured data, Unstructured data and
Structured data as well
5
The 3Vs of Big Data
6
Sources of Data
Mobile Devices (Tracking all the objects all the
time)
Social Media Networks (All of us are generating
data)
Sensor Technology Networks (Measuring all kinds
of data)
Scientific Instruments (Collecting all sorts of
data)
7
Where Big Data is used ?
8
Facebook Scenario

Facebook on an average generates 70 thousand MB
in 1 minute.
1 hour 70,000 MB 60 4.2 Million
MB 1 Day 4.2 Million 24 MB 10.8
Billion MB 98438 GB 1 week 6.9 thousand
GB 690 TB 4 weeks 690 TB 4 2756 TB
2.7 PB 52 weeks 2.7 PB 52 143.3 PB
And thats aloooooooooot of data !
9
Various Bigdata Technologies
10
Big Data Global Market
Sources Dice, LinkedIn.
11
Hadoop Global Job Trends
More than 17,000 employees with Hadoop skill
across these companies
Top Hadoop Technology Companies
Sources Dice, LinkedIn.
12
Hadoop Global Job Trends
Sources Dice, LinkedIn.
13
What is Hadoop ?
Hadoop was created by Doug Cutting and Mike
Cafarella. Hadoop provides the reliable shared
storage and analysis system. It is designed to
scale up from a single server to thousand of
machines, with a high degree of fault
tolerance.
14
Hadoop History
15
Hadoop Core Components
  • Core Hadoop has two main systems
  • Hadoop Distributed File System The Hadoop file
    system is a Distributed file system which holds
    the large amount of data across multiple nodes in
    a cluster.
  • MapReduce MapReduce is a distributed programming
    paradigm used to analyze the data in the HDFS. 

16
Hadoop Distributed File System (HDFS)
  • A given file is broken down into blocks
    (default64MB), then blocks are replicated
    across cluster (default3).
  • Optimized for throughput.
  • HDFS allows you to put/get/delete files.
  • Follows the philosophy
  • Write Once and Read Multiple times
  • Block Replication for
  • - Durability, High Availability and
    Throughput.

17
MapReduce Flow
18
MapReduce Framework
Map Reduce works by breaking the processing into
two phases Map Phase and Reduce Phase.
19
(No Transcript)
20
What we offer
21
(No Transcript)
22
Syllabus
  • Introduction
  • Big Data
  • Hadoop
  • Hadoop
  • HDFS
  • MapReduce
  • PIG
  • Pig 1
  • Pig 2
  • Hive
  • Hive 1
  • Hive 2
  • Hbase
  • Zookeeper
  • Sqoop
  • Yarn
  • Project Class

23
Thank you for watching the Live Demo for
Hadoop. You can always contact us on Your
queries are always welcome.
  • Phone 91 124 4763660 (India)
  • Email contact_at_easylearning.guru
  • Skype Id easylearning.guru
  • Website www.easylearning.guru
Write a Comment
User Comments (0)
About PowerShow.com