Introduction to MIMD Architectures Sima, Fountain and Kacsuk Chapter 15 - PowerPoint PPT Presentation

1 / 17
About This Presentation
Title:

Introduction to MIMD Architectures Sima, Fountain and Kacsuk Chapter 15

Description:

Replicate the processor/memory pairs. Connect them via ... rB. rB. B. B. 16. David Abramson, 2004. Material from Sima, Fountain and Kacsuk, Addison Wesley 1997 ... – PowerPoint PPT presentation

Number of Views:172
Avg rating:3.0/5.0
Slides: 18
Provided by: david2523
Category:

less

Transcript and Presenter's Notes

Title: Introduction to MIMD Architectures Sima, Fountain and Kacsuk Chapter 15


1
Introduction to MIMD ArchitecturesSima,
Fountain and KacsukChapter 15
  • CSE462

2
Architectural Concepts
  • Distributed Memory MIMD
  • Replicate the processor/memory pairs
  • Connect them via an interconnection network
  • Shared Memory MIMD
  • Replicate the processors
  • Replicate the memories
  • Connect them via an interconnection network

3
Distributed Memory Machine
  • Access to local memory module is much faster than
    remote
  • Hardware remote accesses via
  • Load/Store primitive
  • Message passing layer
  • Cache memory for local memory traffic
  • Message
  • Memory-memory
  • Cache-cache

4
Advantages of Distributed Memory
  • Local memory traffic less contention than in
    shared memory
  • Highly scalable
  • Dont need sophisticated synchronization features
    like monitors, semaphores. Message passing serves
    dual purpose
  • To send the data
  • Provide synchronization

5
Problems of Distributed Memory
  • Load balancing
  • Message passing can lead to synchronization
    failures, including deadlock
  • BlockingSend -gt BlockingReceive
  • BlockingReceive -gt BlockingSend
  • Intensive data copying of whole structures
  • Small message overheads are high

6
Shared Memory Architecture
  • All processors have equal access to shared memory
    modules
  • Local Caches reduce
  • Memory traffic
  • Network traffic
  • Memory access time
  • IP Synchronisation
  • Indivisible load/store

7
Advantages of Shared Memory
  • No need to partition code or data
  • Occurs on the fly
  • No need to move data explicitly
  • Dont need new programming languages or compilers.

8
Disadvantages of Shared Memory
  • Synchronization is difficult
  • Lack of scalability
  • IPC becomes bottleneck
  • Scalability can be addressed by
  • High throughput, low latency network
  • Cache Memories
  • Causes coherence problem
  • Distributed shared memory architecture

9
Distributed Shared Memory
  • Three design choices
  • Non-uniform memory access (NUMA)
  • Like Cray T3D
  • Cache coherent non-uniforms memory access
    (CC-NUMA)
  • Convex SPP, Stanford DASH
  • Cache-only memory access (COMA)
  • Like KSR-1

10
Non-uniform memory access (NUMA)
Interconnection Network
11
Cache coherent non-uniforms memory access
(CC-NUMA)
Interconnection Network
12
Cache-only memory access (COMA)
Interconnection Network
13
Classification of MIMD Computers
14
Problems of Scalable Computers
  • Tolerate and hide the latency of remote loads
  • Worse if output of one computation relies on
    another to complete
  • Tolerate and hide idling due to synchronization
    among processors

15
Tolerating Remote Loads
PEn
PE0
PE1
Pn
P0
P1
rA
B
rB
A
Result
Mn
M0
M1
Load A
Interconnection Network
Load B
Result A B
16
Tolerating Latency
  • Cache memory
  • Simply lowers the cost of remote access
  • Introduces cache coherence problem
  • Prefetching
  • Already present, so cost is low
  • Increases network load
  • Threads fast context switching
  • Accept that it will take a long time and cover
    the overhead
  • These solutions dont solve synchronization
    issues
  • Latency tolerant algorithms

17
Design issues of scalable MIMD
  • Processor Design
  • Pipelining, parallel instruction issue
  • Atomic data access, prefetching, cache memory,
    message passing, etc
  • Interconnection network design
  • Scalable, high bandwidth, low latency
  • Memory design
  • Shared memory design
  • Cache coherence
  • IO Subsystem
  • Parallel IO
Write a Comment
User Comments (0)
About PowerShow.com