Title: Farhan Mohamed Ali W21 Jigar Vora W22 Sonali Kapoor W23 Avni Jhunjhunwala W24
1Presentation 4 MAD MAC 525
Farhan Mohamed Ali (W2-1)Jigar Vora
(W2-2)Sonali Kapoor (W2-3) Avni Jhunjhunwala
(W2-4)
W2
Design Manager Zack Menegakis
15th February, 2006 Gate Level Design
Project Objective Design a crucial part of a GPU
called the Multiply Accumulate Unit (MAC) which
will revolutionize graphics.
2MAD MAC 525 Status
- Project chosen
- Specifications defined
- Architecture
- Design
- Behavioral Verilog
- Testbenches
- Verilog Gate Level Design
- Floor plan (revised Updated)
- Schematic (adder)
- To be done
- Layout
- Extraction, LVS, post-layout simulation
3Recap - MAD MAC 525
- Multiply Add (MAD) / Multiply Accumulate Unit
(MAC) - Executes function ABC on 16 bit floating point
inputs - Multiply and add in parallel to greatly speed up
operation - Rounding is only performed only once so greater
accuracy than individual multiply and add
functions. - MAD MAC accelerates FP16 blending to enable true
HDR graphics - Bright things can be really bright
- Dark things can be really dark
- And the details can be seen in both
4Design Decisions
- Using n pass shifters instead of regular gates
for the muxes - Increases speed --- ?
- Reduces transistor count
- Reduces area
- Complexity of the project remains the same
5(No Transcript)
6(No Transcript)
7Block Diagram
Input
Input
Input
16
16
16
5
RegArray A
RegArray B
RegArray C
10
10
10
5
5
Multiplier
Exp Calc
Align
5
14
22
35
Control Logic Sign Dtrmin
Leading 0 Anticipator
Adder/Subtractor
36
4
Normalize
14
5
Round
10
5
1
Reg Y
Output
16
8- Updated Estimated Transistor Count
- n-pass gates
- Registers (I/O, pipelining, threading)
1800 1800 - Carry-Save Multiplier 3500 3500
- Carry-Select Adder/Subtractor 3700 3700
- Alignment Shifter 530 1500
- Leading 0 Anticipator 350 350
- Normalize 900 3400
- Rounding 300 300
- Exponents 700 700
- Total 11780 15250
9- Estimated area (in um sq)
- n-pass
- Registers 9000
- Multiplier 25000
- Adder 26500
- Align 3800
- Leading zero counter 2500
- Normalize 6500
- Round 2000
- Exponent calc 5000
-
- Total 80300
10Main Floorplan
Multiplier
Reg A
Reg C
Exp Calc
Reg B
Align C
Pipeline Reg
Pipeline Reg
Adder
Ld Zero
Pipeline Reg
Round
Normalize
Reg Y
11 12Input From Reg A
10
Input from Reg B
And Array 726 transistors
10
Full Adder Array 2640 transistors
Input To Adder
22
13Schematics
- Multiplier 11 x 11 Carry-Save Multiplier
14Schematics
- Leading Zero Counter Carry-Save Adder to count
the leading zeroes of C
15Schematics
- Align Exponents N-pass shifter
16Schematics
- I bit N-pass shifter used in the align block
17Schematics
- Normalize n-Pass Shifter to shift the result of
the adder by the amount given by the Leading Zero
Counter
18- Shifter for the Normalize
19Schematics
- Round Incrementer and Shifter
20Critical Path
Input
Input
Input
16
16
16
5
RegArray A
RegArray B
RegArray C
10
10
10
5
5
Multiplier
Exp Calc
Align
5
22
14
35
Pipeline Reg
Pipeline Reg
Control Logic Sign Dtrmin
Leading 0 Anticipator
Adder/Subtractor
Pipeline Reg
36
4
Normalize
14
5
Round
10
5
1
Reg Y
16
21