Lecture 19: Networks for Large Cache Design - PowerPoint PPT Presentation

About This Presentation
Title:

Lecture 19: Networks for Large Cache Design

Description:

Title: PowerPoint Presentation Author: Rajeev Balasubramonian Last modified by: Administator Created Date: 9/20/2002 6:19:18 PM Document presentation format – PowerPoint PPT presentation

Number of Views:75
Avg rating:3.0/5.0
Slides: 15
Provided by: RajeevBalas187
Category:

less

Transcript and Presenter's Notes

Title: Lecture 19: Networks for Large Cache Design


1
Lecture 19 Networks for Large Cache Design
  • Papers
  • Interconnect Design Considerations for Large
    NUCA
  • Caches, Muralimanohar and Balasubramonian,
    ISCA07
  • Design and Management of 3D Chip Multiprocessors
    using
  • Network-in-Memory, Li et al., ISCA06
  • A Domain-Specific On-Chip Network Design for
    Large Scale
  • Cache Systems, Jin et al., HPCA07
  • Nahalal Cache Organization for Chip
    Multiprocessors,
  • Guz et al., Comp. Arch. Letters, 2007

2
Traditional Networks
Example designs for contiguous L2 cache regions
3
NUCA Delays
Cache Controller
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
4
Explorations for Optimality
5
Early and Aggressive Look-Up
Cache Controller
  • Address packet can only contain
  • LSB and can use latency-optimized
  • wires (transmission lines / fat wires)
  • Data packet also contains tags and
  • can use regular wires
  • The on-chip network can now have
  • different types of links for address
  • and data

MSB
LSB
R
6
Hybrid Network
Cache Controller
Data Network
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
R
7
Hybrid Network
Cache Controller
Address Network
R
R
R
R
8
Results
9
3D Designs, Li et al., ISCA06
  • D-NUCA first search in cylinder, then multicast
    search everywhere
  • Data is migrated close to requester, but need
    not jump across layers

10
Halo Network, Jin et al., HPCA07
  • D-NUCA Sets are distributed across columns
  • Ways are distributed across
    rows

11
Halo Network
12
Nahalal, Guz et al., CAL07
13
Nahalal
  • Block is initially placed in cores private bank
    and then swapped into
  • the shared bank if frequently accessed by
    other cores
  • Parallel search across all banks

14
Title
  • Bullet
Write a Comment
User Comments (0)
About PowerShow.com