CS184a: Computer Architecture (Structures and Organization) - PowerPoint PPT Presentation

About This Presentation
Title:

CS184a: Computer Architecture (Structures and Organization)

Description:

(2) Crossbar. Avoid bottleneck. Every output gets its own interconnect channel ... Can't afford full crossbar. Need to exploit locality. Can't have everything close ... – PowerPoint PPT presentation

Number of Views:31
Avg rating:3.0/5.0
Slides: 51
Provided by: andre57
Category:

less

Transcript and Presenter's Notes

Title: CS184a: Computer Architecture (Structures and Organization)


1
CS184aComputer Architecture(Structures and
Organization)
  • Day11 October 30, 2000
  • Interconnect Requirements

2
Last Time
  • Saw various compute blocks
  • Role of automated mapping in exploring design
    space
  • To exploit structure in typical designs we need
    programmable interconnect
  • All reasonable, scalable structures
  • small to moderate sized logic blocks
  • connected via programmable interconnect
  • been saying delay across programmable
    interconnect is a big factor

3
Today
  • Interconnect Design Space
  • Dominance of Interconnect
  • Simple things
  • and why they dont work
  • Interconnect Implications
  • Characterizing Interconnect Requirements

4
Dominant Area
5
Dominant Time
6
Dominant Time
7
Dominant Power
XC4003A data from Eric Kusse (UCB MS 1997)
8
For Spatial Architectures
  • Interconnect dominant
  • area
  • power
  • time
  • so need to understand in order to optimize
    architectures

9
Interconnect
  • Problem
  • Thousands of independent (bit) operators
    producing results
  • true of FPGAs today
  • true for LIW, multi-uP, etc. in future
  • Each taking as inputs the results of other (bit)
    processing elements
  • Interconnect is late bound
  • dont know until after fabrication

10
Design Issues
  • Flexibility -- route anything
  • (w/in reason?)
  • Area -- wires, switches
  • Delay -- switches in path, stubs, wire length
  • Power -- switch, wire capacitance
  • Routability -- computational difficulty finding
    routes

11
(1) Shared Bus
  • Familiar case
  • Use single interconnect resource
  • Reuse in Time
  • Consequence?

12
Shared Bus
  • Consider operation yAx2 Bx C
  • 3 mpys
  • 2 adds
  • 5 values need to be routed from producer to
    consumer
  • Performance lower bound if have design w/
  • m multipliers
  • u madd units
  • a adders
  • i simultaneous interconnection busses

13
Viewpoint
  • Interconnect is a resource
  • Bottleneck for design can be in availability of
    any resource
  • Lower Bound on Delay
  • Logical Resource / Physical Resources
  • May be worse
  • dependencies
  • ability to use resource

14
Shared Bus
  • Flexibility ()
  • routes everything (given enough time)
  • can be trick to schedule use optimally
  • Delay (Power) (--)
  • wire length O(kn)
  • parasitic stubs knn
  • series switch 1
  • O(kn)
  • sequentialize I/B
  • Area ()
  • kn switches
  • O(n)

15
Term Bisection Bandwidth
  • Partition design into two equal size halves
  • Minimize wires (nets) with ends in both halves
  • Number of wires crossing is bisection bandwidth

16
(2) Crossbar
  • Avoid bottleneck
  • Every output gets its own interconnect channel

17
Crossbar
18
Crossbar
19
Crossbar
  • Flexibility ()
  • routes everything (guaranteed)
  • Delay (Power) (-)
  • wire length O(kn)
  • parasitic stubs knn
  • series switch 1
  • O(kn)
  • Area (-)
  • Bisection bandwidth n
  • kn2 switches
  • O(n2)

20
Crossbar
  • Too expensive
  • Switch Area kn22.5Kl2
  • Switch Area/LUT kn 2.5Kl2
  • n1024, k4 gt 10M l2
  • What can we do?

21
Avoiding Crossbar Costs
  • Typical architecture trick
  • exploit expected problem structure

22
Avoiding Crossbar Costs
  • Typical architecture trick
  • exploit expected problem structure
  • We have freedom in operator placement
  • Designs have spatial locality
  • gtplace connected components close together
  • dont need full interconnect?

23
Exploit Locality
  • Wires expensive
  • Local interconnect cheap
  • 1D versions?
  • (explore on hmwrk)

24
Exploit Locality
  • Wires expensive
  • Local interconnect cheap
  • Use 2D to make more things closer
  • Mesh?

25
Mesh Analysis
  • Can we place everything close?

26
Mesh Closeness
  • Try placing everything close

27
Mesh Analysis
  • Flexibility - ?
  • Ok w/ large w
  • Delay (Power)
  • Series switches
  • 1--?n
  • Wire length
  • w--?n
  • Stubs
  • O(w)--O(w?n)
  • Area
  • Bisection BW -- w?n
  • Switches -- O(nw)
  • O(w2n) linear pop
  • larger on homework

28
Mesh
  • Plausible
  • but Whats w
  • and how does it grow?

29
Characterize Locality
  • Want to exploit locality
  • How much locality do we have?
  • Impact on resources required?

30
Bisection Bandwidth
  • Bisect design
  • Bisection bandwidth of design
  • gt lower bound on network bisection bandwidth
  • Design with more locality
  • gt lower bisection bandwidth
  • Enough?

N/2
cutsize
N/2
31
Characterizing Locality
  • Single cut not capture locality within halves
  • Cut again
  • gt recursive bisection

32
Regularizing Growth
  • How do bisection bandwidths shrink (grow) at
    different levels of bisection hierarchy?
  • Basic assumption Geometric
  • 1
  • 1/?
  • 1/?2

33
Geometric Growth
  • (F,a)-bifurcator
  • F bandwidth at root
  • geometric regression a at each level

34
Good Model?
Log-log plot gt straight lines represent
geometric growth
35
Rents Rule
  • Long standing empirical relationship
  • IO CNP
  • 0?P ?1.0
  • compare (F,a)-bifurcator
  • a 2P
  • Captures notion of locality
  • some signals generated and consumed locally
  • reconvergent fanout

36
Monday class stopped here
37
Rents Rule
  • Typically consider
  • 0.5?P ?0.75
  • High-Speed Logic P0.67
  • Memory (P0.1-0.2)
  • Example (i10)
  • max C7, P0.68
  • avg C5, P0.72

38
What tell us about design?
  • Recursive bandwidth requirements in network

39
What tell us about design?
  • Recursive bandwidth requirements in network
  • lower bound on resource requirements
  • N.B. necessary but not sufficient condition on
    network design
  • I.e. design must also be able to use the wires

40
What tell us about design?
  • Interconnect lengths
  • Intuition
  • if pgt0.5, everything cannot be nearest neighbor
  • as p grows, so wire distances

41
What tell us about design?
  • Interconnect lengths
  • IO(n2)P cross distance n
  • dIO/dn end at exactly distance n
  • E(l)Integral 0 to n?N
  • of n(dIO/dn)/n2
  • assume iid sources
  • E(l)O(N(p-0.5))
  • pgt0.5

42
What Tell us about design?
  • IO?NP
  • Bisection BW?NP
  • side length ?NP
  • N if plt0.5
  • Area ?N2p
  • pgt0.5

N.B. 2D VLSI world has natural Rent of
P0.5 (area vs. perimeter)
43
Rents Rule Caveats
  • Modern systems on a chip -- likely to contain
    subcomponents of varying Rent complexity
  • Less I/O at certain natural boundaries
  • System close
  • (Rents Rule apply to workstation, PC, PDA?)

44
Area/Wire Length
  • Bad news
  • Area O(N2p)
  • faster than N
  • Avg. Wire Length O(N(p-0.5))
  • grows with N
  • Can designers/CAD control p (locality) once
    appreciate its effects?
  • I.e. maybe this cost changes design
    style/criteria so we mitigate effects?

45
What Rent didnt tell us
  • Bisection bandwidth purely geometrical
  • No constraint for delay
  • I.e. a partition may leave critical path weaving
    between halves

46
Critical Path and Bisection
Minimum cut may cross critical path multiple
times. Minimizing long wires in critical path gt
increase cut size.
47
Rent Weakness
  • Not account for path topology
  • ? Can we define a Temporal Rent which takes
    into consideration?
  • Promising research topic

48
Finishing Up...
49
Big IdeasMSB Ideas
  • Interconnect Dominant
  • power, delay, area
  • Can be bottleneck for designs
  • Cant afford full crossbar
  • Need to exploit locality
  • Cant have everything close

50
Big IdeasMSB Ideas
  • Rents rule characterize locality
  • gt Area growth O(N2p)
  • pgt0.5 gt interconnect growing faster than
    compute elements
  • expect interconnect to dominate other resources
Write a Comment
User Comments (0)
About PowerShow.com