Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson,Jean - PowerPoint PPT Presentation

1 / 22
About This Presentation
Title:

Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson,Jean

Description:

Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson,Jean. Michel L eon, Yawei Li, Alexander Lloyd, Vadim Yushprakh – PowerPoint PPT presentation

Number of Views:280
Avg rating:3.0/5.0
Slides: 23
Provided by: engUokAc9
Category:

less

Transcript and Presenter's Notes

Title: Jason Baker, Chris Bond, James C. Corbett, JJ Furman, Andrey Khorlin, James Larson,Jean


1
MegastoreScalable Highly Available Storage for
Interactive Systems
  • Jason Baker, Chris Bond, James C. Corbett, JJ
    Furman, Andrey Khorlin, James Larson,Jean
  • Michel Leon, Yawei Li, Alexander Lloyd, Vadim
    Yushprakh

Presented By Hamid Seyedmoradi Ayoub
Hamidi Ehsan Mohamad Nezamian Advanced Database
Systems SRBIAU, Kurdistan Campus 10May2012
2
Megastore
  • Motivation
  • Introduction
  • NoSQL RDBMS
  • Megastore
  • Paxos

3
Megastore
  • wow! more than 3 billion write
  • and 20 billion read daily
  • key Contribution
  • Data Model and Storage System
  • Paxos Replication
  • Report on Experience

4
AVAILABILITY SCALE
  • Replication
  • For Availability, we implemented a
    synchronous, fault tolerant log replicator
    optimized for long distance links
  • Partitioning and Locality
  • for scale, we partitioned data into a vast
    space of small databases

5
AVAILABILITY SCALE
  • Replication
  • Strategies
  • Asynchronous Master/Slave
  • Synchronous Master/Slave
  • Optimistic Replication
  • We decided to use Paxos

6
Technology Options
7
Technology Options
8
AVAILABILITY SCALE
  • Partitioning and Locality
  • Replication

Datacenters
ACID semantics within an entity group
Entity Groups Partition the datastore
Looser consistency across entity groups
Each entity group is synchronously replicated
across datacenters
Entity group data and replication metadata stored
in scalable NoSQL datastores
9
AVAILABILITY SCALE
  • Partitioning and Locality
  • Operations

Entities (Units of data)
Most transactions are within a single entity
group
Entity Group 1
Local Index
Global Indexes span entity groups but have
weaker consistency
Cross Entity group transactions supported via Two
Phase Commit
receive
queue
Send
Local Index
Asynch communication between entity groups
supported by Queues
Entity Group 2
10
AVAILABILITY SCALE
  • Partitioning and Locality
  • Entity Groups
  • Selecting Entity Group Boundaries
  • Example
  • Email
  • Blogs
  • Physical Layout

11
Megastore
  • API Design Philosophy
  • Data Model
  • Pre-Joining with Keys
  • SCATTER
  • Indexes
  • Storing Clause
  • Repeated Indexes.
  • Inline Indexes
  • Mapping to Bigtable

12
Megastore

13
Megastore
  • Transactions and Concurrency Control
  • Read
  • current
  • snapshot
  • inconsistent
  • Transaction Lifecycle
  • 1-Read 3-Commit
    5-Clean up
  • 2-Application logic 4-Apply

14
Megastore
  • Queues
  • Two
  • Phase
  • Commit

15
REPLICATION
  • Brief Summary of Paxos
  • Megastores Approach
  • Fast Reads
  • Fast Writes
  • Replica Types
  • Witness Replica
  • Architecture

16
Architecture
17
Data Structures and Algorithms
  • Replicated Logs

18
Data Structures and Algorithms
  • Reads
  • Query Local
  • Find Position
  • Local read
  • Majority read
  • Catchup
  • Validate
  • Query Data

19
Data Structures and Algorithms
20
Data Structures and Algorithms
  • Writes
  • Accept Leader
  • Prepare
  • Accept
  • Invalidate
  • Apply

21
Feedback
22
END
  • With Thanks
  • Question
  • ?
Write a Comment
User Comments (0)
About PowerShow.com