EEL 5722 FPGA Design Fall 2003 SelfTimed FPGAs Part I PowerPoint PPT Presentation

presentation player overlay
1 / 25
About This Presentation
Transcript and Presenter's Notes

Title: EEL 5722 FPGA Design Fall 2003 SelfTimed FPGAs Part I


1
EEL 5722FPGA DesignFall 2003Self-Timed
FPGAsPart I
2
Asynchronous vs. Synchronous Circuits
3
Asynchronous vs. Synchronous Circuits
4
Advantages of Asynchronous Circuits(Al Davis,
Asynch 94)
Asynchronous circuits (AC) achieve average case
performance by exploiting data dependencies. ACs
perform better if the difference between the
average and worst case is large. Care must be
taken not to spend excessive time in computation
completion detection.
5
Power Consumption of ACs(Al Davis, Asynch 94)
ACs consume power only when needed on-demand
power consumption. CMOS implementations of ACs
consume power only during transitions. ACs
eliminate the clock which performs countless
transitions.
6
Power Consumption of ACs(Al Davis, Asynch 94)
7
Modular Composition of ACs(Al Davis, Asynch 94)
The design of ACs lends itself to modular
composition (building by blocks). This design
process allows also incremental
improvements. Building by blocks can be modeled
by using an object-oriented design approach for
hardware.
8
Absence of Clocked Interfaces(Al Davis, Asynch
94)
ACs do not require clock alignment at subsystem
interfaces. Indeed, the synchronization of an
incoming signal to a clock requires great care
and is time consuming. By using ACs,
metastability can be prevented to a large
extent. Without a clock, ACs can adapt naturally
to a variety of data rates.
9
Clock Distribution(Al Davis, Asynch 94)
Without a clock, ACs can avoid clock distribution
problems. Clock distribution problems are a
major drain on design time and power budgets. In
addition, clock lines tend to consume a large
chip area.
10
Global Synchrony(Al Davis, Asynch 94)
Some state that global synchrony does not exist
anyway. It is only a useful abstraction, not
reality. With increased density and chip area,
the problems of clock distribution and clock
speed prevent a design from having a single
global clock. Furthermore, these problems are
exacerbated by the requirement of heterogeneous
timing for new chips. As a result, new directions
are being explored to overcome these problems
such as Globally Asynchronous Locally Synchronous
(GALS) approach.
11
Concurrency in ACs(Al Davis, Asynch 94)
ACs offer a natural way to exploit
concurrency. Concurrency in AC design happen
naturally instead of supporting it through
interleaved execution.
12
Low Noise and Low Emission
A digital circuit can emit electromagnetic
radiation at its clock frequency. A neighboring
radio receiver circuit may mistake this radiation
for a radio signal. Due to the absence of a
clock, ACs may have better noise and
electromagnetic properties than synchronous
circuits.
13
Low Noise and Low Emission
14
Timing Models of ACs
A circuit is modeled as a closed network of
gates. A closed network is a network in which
all inputs are connected to outputs and
vice-versa. The state of the circuit is the set
of all gate outputs.
15
Speed-Independent (SI) ACs
A speed-independent (SI) circuit is a circuit
that operates correctly assuming positive,
bounded but unknown delays in gates and ideal
zero-delay in wires. Assuming zero-delay in
wires is not realistic in todays CMOS processes.
16
Delay-Insensitive (DI) ACs
A delay-insensitive (DI) circuit is a circuit
that operates correctly assuming positive,
bounded but unknown delays in gates and
wires. Although these circuits can be quite
robust, their number is very small.
17
Quasi Delay-Insensitive (QDI) ACs
A quasi delay-insensitive (QDI) circuit is a DI
circuit with isochronic forks. An isochronic
fork is a wire fork whose wire delays on both
wire branches of the fork are identical. In
practice, these forks are implemented in the gate
implementations of basic building blocks.
18
Data Signaling Protocols of ACs
The bundled-data protocol refers to a situation
where the data signals use normal Boolean levels
to encode information, and where separate request
and acknowledge wires are bundled with the data
signals. Bundled-data protocols are also
called single-rail encoding.
19
Dual-Rail Data Signaling
The dual-rail protocol uses two wires to
represent one bit, but the information is encoded
as transitions or events.
20
Control Signaling in ACs
The most basic control protocol is the
Request/Acknowledge protocol. This protocol can
be layered on top of data signaling protocol.
The sender issues a Request to the receiver
before sending the data. The receiver replies by
sending an Acknowledge to the sender. Then the
sender sends the data. If the sender initiates
the data transfer, the transfer channel is a
push-channel. On the other hand if the receiver
initiates the transfer, the channel is a
pull-channel.
21
The Four-Phase Protocol
  • The four-phase protocol refers to the number of
    communication actions
  • The sender issues data and sets Req to high
  • (ii) The receiver absorbs the data and sets Ack
    to high
  • (iii) The sender responds by setting Req to low
  • (iv) The receiver acknowledges by setting Ack to
    low
  • This protocol can be used with both single and
    dual-rail encoding.

22
The Four-Phase Protocol
23
The Two-Phase Protocol
The two-phase protocol follows these steps (i)
The sender sends data and produces a Req
event (ii) The receiver absorbs data and produces
an Ack event In some cycles, an event can be a
rising transition and in some it can be a falling
transition. There is no difference between
them. The three events, data change, request
event, and acknowledge event always follow in
cyclic order. Successive cycles may take
different amounts of time. This protocol
eliminates the superfluous return-to-zero transiti
ons in the four-phase protocol.
24
The Two-Phase Protocol
25
Circuit Implementation Styles
Four-phase bundled-data which most closely
resembles the design of synchronous circuits and
which normally leads to the most efficient
circuits, due to the extensive use of timing
assumptions (example Amulet 2 processor).
Two-phase bundled-data known also as
micropipelines and introduced by Ivan Sutherland
in his 1988 Turing Award lecture (example Amulet
1 processor) Four-phase dual-rail the
classic approach introduced by Mullers
pioneering work in the 1950s. Two-phase
dual-rail such as Level-Encoded two-phase
Dual-Rail scheme (LEDR).
Write a Comment
User Comments (0)
About PowerShow.com