CS3104: ?????? Principles of Digital Design [project2] floating-point number addition

About This Presentation

Title:

CS3104: ?????? Principles of Digital Design [project2] floating-point number addition

Description:

Title: Lecture 1: Course Introduction and Overview Author: Chung-Ta King Last modified by: VLSI Created Date: 8/12/1995 11:37:26 AM Document presentation format – PowerPoint PPT presentation

Number of Views:76

Avg rating:3.0/5.0

Slides: 13

Provided by: Chun111

Category:

more less

Transcript and Presenter's Notes

Title: CS3104: ?????? Principles of Digital Design [project2] floating-point number addition

1
CS3104 ??????Principles of Digital
Designproject2 floating-point number addition

??? ??
?? ???
????????????
??????????

2
Scientific Notation
exponent
decimal point
Sign, magnitude
23
-24
6.02 x 10 1.673 x 10
radix (base)
Mantissa
Sign, magnitude
e - 127
IEEE F.P. (SP)?1.M x 2

Issues
Arithmetic (, -, , / )
Representation, Normal form
Range and Precision
Rounding
Exceptions (e.g., divide by zero, overflow,
underflow)
Errors
Properties ( negation, inversion, if A B then
A - B 0 )

3
Floating-Point Representation
Representation of floating point numbers in IEEE
754 standard single precision
1
8
23
S
E
sign
M
mantissa sign magnitude, normalized binary
significand w/ hidden integer bit 1.M
exponent excess 127 binary integer
actual exponent is e E - 127
0 lt E lt 255
S
E-127
N (-1) 2 (1.M)
Magnitude of numbers that can be represented is
in the range
-126
127
23
)
2
(1.0)
(2 - 2
to
2
which is approximately
-38
38
to
3.40 x 10
1.8 x 10
(integer comparison valid on IEEE Fl.Pt. numbers
of same sign!)
4
Floating-Point Representation

Examples

-0.75 -0.11x20 -1.1x2(-1) 1 01111110
10000 i.e. (-1) x (1 .1000 0000 0000 0000 0000
000) x (2 ) 0 0 00000000 0 . . . 0
-1.5 1 01111111 10 . . . 0
1
126
5
Floating-Point Addition

Basic addition algorithm
Illustrate 9.999ten x 10 1.610ten x 10
Assume that 4 decimal digits of the significand
and two
decimal digits of the exponent.
STEP 1 compute Ye - Xe (getting ready to align
binary point)
STEP 2 right shift the smaller number, e.g., Xm,
that many
positions to form Xm 2
EX 1.610ten x 10 0.1610ten x 10
0.01610ten x 10
If we can represent only four
decimal digits, so the
number is really
0.016ten x 10
SETP 3 compute Xm 2 Ym EX
9.999ten
0.016ten
10.015ten
The sum is 10.015ten x 10

-1
1
Xe-Ye
-1
0
1
1
Xe-Ye
1
6
Floating-Point Addition

Basic addition algorithm (continue)
if representation demands normalization, then
normalize
STEP 4 left shift result, decrement result
exponent (e.g.,0.001xx)
right shift result, increment
result exponent (e.g., 101.1xx)
EX we pick up the normalized form
10.015ten x 10
1.0015ten x 10
NOTE check overflow or underflow during the
shift
STEP 5 round the mantissa continue until MSB of
data is 1 (NOTE Hidden bit in IEEE
Standard)
EX we must round the number
1.0015tenx10 1.002
x 10

1
2
2
2
7
Extra Bits for Accuracy

Extra bits during intermediate calculations to
help rounding, to get closer to actual number
Two extra bits on the right in IEEE 754 guard
and round
E.g., base 10 precision 3 bit
Guard bits digits to the right of mantissa to
guard against loss of digits gt can later be
shifted left into mantissa during normalization
(especially when the two numbers are very close,
or when multiplication)
Round bits after the guards being shifted into
mantissa, the result can be rounded according to
the round bits
Sticky bit additional bit to the right of the
round digit to better fine tune rounding

2-bias
0 2 1.69
1.6900 10 - .0785 10 1.6115 10
2-bias
0 0 7.85
-
2-bias
0 2 1.61
d0 . d1 d2 d3 . . . dp-1 0 0 0 0 . 0 0
X . . . X X X S
X X S
Sticky bit set to 1 if any 1 bits fall off
the end of the round digit

8
Four Rounding Modes

Always round up (toward ? )
Always round down (toward -?)
Truncate
Round-to-nearest-even

Round-to- nearest-even(x)
100. 011. 010. 001. 000.
x
00.0 00.1 01.0 01.1 10.0 10.1 11.0 11.1
9
Example with G, R, and S bits
sign exponent
mantissa
GRS A 0 1000 0011(131) 1.1000
0010 1100 0000 0000 000 000 B 0 0111
1111(127) 1.0000 0011 0000 0101 1001 010 000 B
aligned 0 1000 0010(131) 0.0001 0000 0011
0000 0101 100 101 A-B 0 1000 0010(131)
1.0111 0010 1000 1111 1010 011
011 Postnormalization 0 1000 0001(131) 1.0111
0010 1000 1111 1010 011 011
RS Rounding 0 1000 0001(131)
1.0111 0010 1000 1111 1010 011 01 Result
0 1000 0001(131) 1.0111 0010 1000
1111 1010 011 Note1 The original G bit can
serve an R bit, and the original R and S bits
must be ORed in order to generate a
new sticky bit. Note2 If the Boolean
expression RS R SL R (S
L) where L is LSB of the resultant
significand equals 1, we must round
up.
10
Flow of Floating-Point Addition
Start
1. Compare the exponents of the two
numbers. Shift the smaller number to right until
its exponent would match the larger exponent.
2. Add the significands
3. Normalize the sum, either shifting right and
incrementing the exponent or shifting left and
decrementing the exponent.
Overflow or Underflow?
Yes
Exception
No
4. Round the significand to the appropriate
number of bits
Still normalized?
Yes
No
Done
11
Architecture Example
Sign Exponent Significand
Sign Exponent Significand
Small ALU
0 1
0 1
Exponent difference
0 1
Shift right
Control
Big ALU
Increment or Decrement
Shift left or right
Rounding Hardware
Sign Exponent Significand
12
Schedule and score rules

Schedule
Project 2-1 (deadline on 11/30) Step1 and 2
Project 2-2 (deadline on 12/21) Step3
(Post-normalization)
Project 2-3 (deadline on 01/11) Step4
(Rounding)
Score rules
(1). Completeness.
(2). Modulize.
(3). Architecture.

Write a Comment

User Comments (0)

About PowerShow.com

CS3104: ?????? Principles of Digital Design [project2] floating-point number addition - PowerPoint PPT Presentation

CS3104: ?????? Principles of Digital Design [project2] floating-point number addition

Title: Lecture 1: Course Introduction and Overview Author: Chung-Ta King Last modified by: VLSI Created Date: 8/12/1995 11:37:26 AM Document presentation format – PowerPoint PPT presentation