Thermal Analysis of a 3D FPGA for use in a PlaceandRoute algorithm' - PowerPoint PPT Presentation

1 / 35
About This Presentation
Title:

Thermal Analysis of a 3D FPGA for use in a PlaceandRoute algorithm'

Description:

Thermal Analysis of a 3-D FPGA for use in a Place-and-Route algorithm. Amir Hirsch and Vikram Chandrasekhar. Overview. Motivation. Thermal Model. Power Extraction ... – PowerPoint PPT presentation

Number of Views:96
Avg rating:3.0/5.0
Slides: 36
Provided by: Amir85
Category:

less

Transcript and Presenter's Notes

Title: Thermal Analysis of a 3D FPGA for use in a PlaceandRoute algorithm'


1
Thermal Analysis of a 3-D FPGA for use in a
Place-and-Route algorithm.
  • Amir Hirsch and Vikram Chandrasekhar

2
Overview
  • Motivation
  • Thermal Model
  • Power Extraction
  • Temperature Profiles
  • Leakage
  • Conclusions

3
Why Model Temperature?
4
Leakage vs Temperature
VT kT/q
HotLeakage A Temperature-Aware Model of
Subthreshold and Gate Leakage for Architects.
UNIV. OF VIRGINIA DEPT. OF COMPUTER SCIENCE TECH.
REPORT CS-2003-05
5
3-D FPGA structure
6
3-D FPGA switch matrix
7
Need for a thermal model
  • Tiles are much closer vertically than in a single
    layer
  • Vertical thermal resistances much smaller than
    horizontal thermal resistances
  • Important to understand how the temperature
    distribution in each layer, changes across layers

8
Thermal Physics
  • Heat Equation

C volumetric heat capacity (J / Km3) Specific
Heat mass density ? heat conductivity (W /
Km)Q heat generation density (W / m3)
Method 1. Discrete time and Discrete space. Use
MATLAB.Load thermal conductivity (K) from a GIF
layout file.Load heat generation density (Q)
from a GIF excitation file.
9
Experimenting with the Model
  • Wire in Insulator, thermally excited in center

Model
Excitation
After 10,000 iterations
10
Experimenting with the Model
  • Metal MIT in Insulator, Excited with varying heat

Model
Excitation
After 10,000 iterations
11
Problems with Method 1
  • Extraction of Q(x,y,z) is difficult
  • Too slow to perform without a supercomputer
    (10,000 iterations of a 5000 node model took 2
    hours)

A material model of a layout provides for a
highly accurate simulation of but is impractical
to simulation entire 3-D layouts
12
Lumped RC Model
We did not couple leakage into node power
13
Benefits of this Model
  • Extract the model to a SPICE simulation using
    thermal nodes as sub-circuits (speed is the SPICE
    simulators problem).
  • We can only control the tile level layout in a
    place-and-route algorithm
  • Useful for transient simulations

DC simulation is equivalent to solvingFast
matrix method for thermostatic equilibrium
inPlacement and Routing in 3D Integrated
Circuits, Univ. of Minnesota
14
Switching power dissipation
  • Fully-buffered switch architecture
  • Bi-directional switches using tri-state buffers
  • Every driver sees one input capacitance and one
    output capacitance for every switch

15
Switching power in a tile
  • Wires driven by tri-state buffers in a switch
    matrix
  • Switching in input and output decoders
  • Power dissipated in LUTs
  • Each wire segment has its own switching activity
  • Depends on current placement and routing of the
    circuit

16
Computing switching activity
  • Topologically sort the LUT network
  • Assign random input probabilities to primary
    inputs
  • Propagate the input probabilities across each
    level of LUTs till the primary outputs
  • In every LUT, sum up probabilities of cells
    storing '1' being accessed
  • Switching activity Probability of a net making
    a 0?1 transition

17
Extraction from routing
  • Switching activities need to be computed only
    once for a circuit
  • Need to extract the following for every tile in
    the current routing
  • Which are the nets routed through this tile ?
  • What are their switching activities ?
  • What is the total switching power dissipated in
    this tile ?

18
Obtaining temperature profiles
  • Get the relative switching power dissipated by
    the tiles
  • Provides the node power for every node in the
    lumped thermal RC model
  • The number of tiles used by the routing and the
    total switching power do not vary much
  • But the temperature distribution depends on
    relative placement of high activity tiles

19
Switching power distribution
Top layer of alu4 mapped to a 4 layer FPGA
20
Switching power across layers
Layer 1 switching power
Layer 2 switching power
Layer 4 switching power
Layer 3 switching power
21
Comparison of single and multiple layer FPGAs
Switching power for single layer FPGA
Switching power for top layer of four layer FPGA
22
Temperature Profile
ALU4 mapped onto top layer only
Temperature across FPGA (Top Layer)
23
Temperature Profile
ALU4 mapped to four layers
Top Layer Temperature
24
Temperature Profile
ALU4 mapped to four layers
Top Layer Temperature
25
Interlayer Gradient
Interlayer Temperature Gradient
Mean percentage temperature difference across
layers for ALU4 mapped to top layer
26
Interlayer Gradient
Interlayer Temperature Gradient
Mean percentage temperature difference across
layers for ALU4 mapped to four layers
27
Leakage
28
ALU4 mapped to one layer. Layer 1 Leakage
29
ALU4 mapped to one layer. Layer 2 Leakage (3,4
look same)
30
Total Leakage on all four layers .0024
If unused cells do not leak 1.2857 x 10-4
31
ALU4 mapped to four layers. Layer 1 Leakage
32
ALU4 mapped to four layers. Layer 2 Leakage
Total Leakage on all four layers .0034
If unused cells do not leak 6.8338 x 10-4
33
Conclusions
  • Its useful to think about thermal issues using
    an equivalent lumped RC method
  • Temperature differs little between layers so we
    stack high activity nodes to minimize the hotspot
    area this does not necessarily reduce leakage

34
Use in a 3-D FPGA CAD tool
  • Have to integrate the thermal model into the
    place-and-route tool
  • Switching activity computation can be performed
    once for a circuit
  • How should the switching activity be integrated
    into the cost functions ?
  • Depends on the resulting temperature distributions

35
Future work
  • High activity tiles being placed far apart not
    necessarily a good solution
  • Can even be a high-level option form multiple
    hot spots or a single hotspot
  • Leakage power loopback to be done
  • VDD scaling to be performed use of multiple
    layers reduces critical path delay
  • Slow the circuit down by lowering VDD and observe
    the new temperature profiles
Write a Comment
User Comments (0)
About PowerShow.com