Principles of Adiabatic Processes - PowerPoint PPT Presentation

About This Presentation
Title:

Principles of Adiabatic Processes

Description:

Scattering of ballistic electrons from lattice imperfections, causing Ohmic resistance ... diodes in the charge-return path. Forgetting to obey one of the ... – PowerPoint PPT presentation

Number of Views:1083
Avg rating:3.0/5.0
Slides: 128
Provided by: Jam123
Learn more at: https://www.cise.ufl.edu
Category:

less

Transcript and Presenter's Notes

Title: Principles of Adiabatic Processes


1
Principles of Adiabatic Processes
2
Adiabatic Processes - overview
  • Adiabatic steps in the reversible Carnot cycle
  • Evolution of the meaning of adiabatic
  • Time-proportional reversibility (TPR) of
    quasi-adiabatic processes
  • Adiabatic theorem of quantum mechanics
  • Adiabatic transitions of a two-state system
  • Logic memory in irreversible and adiabatic
    processes.

3
The Carnot Cycle
  • In 1822-24, Sadi Carnot analyzed the efficiency
    of an ideal heat engine all of whose steps were
    reversible, and furthermore proved that
  • Any reversible engine (regardless of details)
    would have the same efficiency (TH?TL)/TH.
  • No engine could have greater efficiency than a
    reversible engine w/o producing work from nothing
  • Temperature itself could be defined on a
    thermodynamic scale based on heat recoverable by
    a reversible engine operating between TH and TL

4
Steps of Carnot Cycle
P
  • Isothermal expansion at TH
  • Adiabatic (without flow ofheat) expansion TH?TL
  • Isothermal compression at TL
  • Adiabatic compression TL?TH

TH
TL
V
Adia-batic
Iso- thermal
Iso- thermal
Adia-batic
Reser-voir
Reser-voir
Reser-voir
Reser-voir
5
Carnot Cycle Terminology
  • Adiabatic (Latin) literally Without flow of
    heat
  • I.e., no entropy enters or leaves the system
  • Isothermal At the same temperature
  • Temperature of system remains constant as entropy
    enters or leaves.
  • Both kinds of steps, in the case of the Carnot
    cycle, are examples of isentropic processes
  • at the same entropy
  • I.e., no (known) information is transformed into
    entropy in either process
  • But, the usage of the word adiabatic in applied
    physics has mutated to essentially mean
    isentropic.

6
Old and New Adiabatic
  • Consider a closed system where you just lose
    track of its detailed evolution
  • Its adiabatic (no net heat flow),
  • But its not adiabatic (not isentropic)
  • Consider a box containing some heat,flying
    ballistically out of the system
  • Its not adiabatic, (no heat flow)
  • because heat is flowing out of the system
  • But its adiabatic (no entropy is generated)

The System
Box o Heat
7
Justifying the Modern Usage
  • In an adiabatic process following a desired
    trajectory through configuration space,
  • No heat flows in or out of the subsystem
    consisting of those particular degrees of freedom
    whose variation carries out the motion along the
    desired trajectory.
  • E.g., the computational degrees of freedom in a
    computational process.
  • No heat flow ? no entropy flow
  • Heat is just energy whose configuration info. is
    entropy
  • No entropy flow ? no sustained entropy generation
  • Since bounded systems have a maximum entropy

8
Quasi-Adiabatic Processes
  • Complete adiabaticity means absolutely zero rate
    of entropy generation
  • Requires infinite degree of isolation of system
    from uncontrolled external environment!
  • ? Impossible to completely achieve in practice.
  • Real processes are only adiabatic to the extent
    that their entropy generation approaches zero.
  • Term quasi-adiabatic emphasizes imperfection
  • Asymptotically adiabatic designs conceptually
    approach 0 in the limit of variation of specified
    technology design parameter(s)
  • E.g., low device frequency, large device size

9
Quasi-Adiabatic Processes
  • No real process is completely adiabatic
  • Because some outside system may always have
    enough energy to interact with disturb your
    systems evolution - e.g., cosmic ray, asteroid
  • ? Evolution of system state is never perfectly
    known
  • Unless you know the exact quantum state of the
    whole universe
  • Entropy of your system always increases.
  • Unless it is already at a maximum (at
    equilibrium)
  • Cant really be at complete equilibrium with its
    surroundings
  • unless whole universe is at utterly stable heat
    death state.
  • Systems at equilibrium are sometimes called
    static.
  • Non-equilibrium, quasi-adiabatic processes are
    sometimes also called quasi-static
  • Changing, but near a local equilibrium otherwise

10
Quantifying Adiabaticity
  • An appropriate metric for quantifying the degree
    of adiabaticity of any process is just to use the
    quality factor Q of that process.
  • Q isnt just for oscillatory processes any more
  • Q is generally the ratio Etrans / Ediss between
    the
  • Energy Etrans involved in carrying out a process
  • transitioning between states along a trajectory
  • Amount Ediss of energy dissipated during the
    process.
  • Normally also matches the following ratios
  • Physical information content / entropy generated
  • Quantum computation rate / decoherence rate
  • Decoherence time / quantum-transition time

11
Degree of Reversibility
  • The degree of reversibility (a.k.a.
    reversibility, a.k.a. thermodynamic efficiency)
    of any quasi-adiabatic process is defined as the
    ratio of
  • the total free energy at the start of the process
  • ? by the total energy spent in the process
  • Or, equivalently
  • the known, accessible information at the start
  • ? by the amount that is converted to entropy
  • This same quantity is referred to as the
    (per-cycle) quality factor Q for any resonant
    element (e.g., LC oscillator) in EE.

12
The Adiabatic Principle
  • Claim Any ideal quasi-adiabatic process
    performed over time t has a thermodynamic
    efficiency that is proportional to t,
  • in the limit as t?0.
  • We call processes that realize this idealization
    time-proportionally reversible (TPR) processes.
  • Note that the total energy spent (?Espent), and
    the total entropy generated (?S), are both
    inversely proportional to t in any TPR process.
  • The slower the process, the more energy-efficient.

13
Proving the Adiabatic Principle
  • (See RevComp memo M14)
  • Assume free energy is in generalized kinetic
    energy of motion Ek of system through its
    configuration space. Ek ½mv2 ? v2 (?/t)2 ?
    t?2 for m, ? const.
  • Assume that every tf time, on average (mean free
    time), a constant fraction f of Ek is thermalized
    (turned into heat)
  • Whole process thermalizes energy f(t/tf)Ek ?
    t?t?2 t?1. Constant in front is ½ fm?2/tf ?
    ??2, where ?½fm/tf is the effective viscosity.

14
Example Electrical Resistance
  • We know PspentI2R(Q/t)2R, or Espent Pt
    Q2R/t. ? Note scaling with 1/t
  • Charge transfer through a resistor obeys the
    adiabatic principle!
  • Why is this so, microscopically?
  • In most situations, conduction electrons have a
    large Fermi velocity or thermal velocity relative
    to drift velocity.
  • ? Scatter off of lattice-atom cross-sections with
    a mean free time tf that is fairly independent of
    drift velocity
  • Each scattering event thermalizes the electrons
    drift kinetic energy - a frac. f of currents
    total Ek
  • Therefore assumptions in prev. proof apply!

15
The Adiabatic Theorem
  • A result in basic quantum theory
  • Proved in many quantum mechanics textbooks
  • Paraphrased A system initially in its ground
    state (or more generally, its nth energy
    eigenstate) will, after subjecting it to a
    sufficiently slow change of applied forces,
    remain in the corresponding state, with high
    probability.
  • Result has been recently shown to be very
    general.
  • Amount of leakage out of desired state is
    proportional to speed of transition, at low
    speeds.
  • ?Quantum systems obey the adiabatic principle!

16
Some Loss-Inducing Interactions
  • For ordinary voltage-coded electronics
  • Interactions whose dissipation scales with speed
  • Parasitic EM emission from dynamic (C,L)
    reactances
  • Scattering of ballistic electrons from lattice
    imperfections, causing Ohmic resistance
  • Interactions having different scaling laws
  • Interference from outside EM sources
  • Thermally-activated leakage of electrons over
    potential energy barriers
  • Quantum tunneling of electrons through narrow
    barriers (sub-Fermi wavelength)
  • Losses due to intentional treatment of known
    physical information as entropy (bit erasure)

17
Some Ways to Reduce Losses
  • EM interference / emission Add shielding, use
    high-Q MEMS/NEMS oscillators
  • Scattering/resistance Ballistic FETs,
    superconductors
  • Thermal leakage avoid low VT and/or high temps
  • Tunneling thick tunnel barriers, high-?
    dielectrics, conductors w. low Fermi-level/high
    electron affinity, vacuum-gap barriers?
  • Intentional bit erasure reduce voltages, use
    mostly-reversible adiabatic logic designs

18
Adiabatic Circuits Reversible Computing
  • Myths, Controversies Misconceptions

19
Some Claims Against Reversible Computing Eventual Resolution of Claim
John von Neumann, 1949 Offhandedly remarks during a lecture that computing requires kT ln 2 dissipation per elementary act of decision (bit-operation). No proof provided. Twelve years later, Rolf Landauer of IBM tries valiantly to prove it, but succeeds only for logically irreversible operations.
Rolf Landauer, 1961 Proposes that the logically irreversible operations that can be seen to necessarily cause dissipation are irreducible. Landauers argument for irreducibility of logically irreversible operations was conclusively refuted by Bennetts 1973 paper (partially presaged by Lecerf).
Bennetts 1973 construction is criticized for using too much memory. Bennett devises a more space-efficient version of the algorithm in 1989.
Bennetts models criticized by various parties for depending on random Brownian motion, and not making steady forward progress. Fredkin and Toffoli at MIT, 1980, provide ballistic billiard ball model of reversible computing that makes steady progress.
Various parties including Zurek note that Fredkins original classical-mechanical billiard-ball model is chaotically unstable. Zurek, 1984, shows that quantum models can avoid the chaotic instabilities. (Though there are workable classical ways to fix the problem also.)
Various parties propose that classical reversible logic principles wont work at the nanoscale, for unspecified or vaguely-stated reasons. Drexler, 1980s, designs various mechanical nanoscale reversible logics and carefully analyzes their energy dissipation.
Carver Mead, CalTech, 1980 Attempts to show that the kT bound is unavoidable in electronic devices, via a collection of counter-examples. No general proof provided. Later he asked Feynman about the issue in 1985 Feynman provided a quantum-mechanical model of reversible computing.
Various parties point out that Feynmans model of reversible computing only supports serial computation. Margolus at MIT, 1990, demonstrates a parallel quantum model of reversible computingbut only with 1 dimension of parallelism.
People question whether the various theoretical models can be validated with a working electronic implementation. Seitz and colleagues at CalTech, 1985, demonstrate working energy recovery circuits using adiabatic switching principles.
Seitz, 1985Has some working circuits, but unsure if arbitrary logic is possible. Koller Athas, Hall, and Merkle (1992) separately devise general reversible combinational logics.
Koller Athas, 1992 Conjecture reversible sequential feedback logic impossible. Younis Knight _at_MIT do reversible sequential, pipelineable circuits in 1993-94.
Some computer architects (including anonymous ISCA reviewers) wonder whether the constraint of reversible logic leads to unreasonable design convolutions. Vieri, Frank and coworkers at MIT, 1995-99, refute these qualms by demonstrating straightforward designs for fully-reversible and scalable gate arrays, microprocessors, and instruction sets.
Some computer science theorists suggest that the algorithmic overheads of reversible computing might outweigh their practical benefits. Frank, 1997-2003, publishes a variety of rigorous theoretical analysis refuting these claims for the most general classes of applications.
Various parties point out that high-quality power supplies for adiabatic circuits seem difficult to build electronically. Frank, 2000, suggests microscale/nanoscale electromechanical resonators for high-quality energy recovery with desired waveform shape and frequency.
Frank, 2002Briefly wonders if synchronization of parallel reversible computation in 3 dimensions (not covered by Margolus) might not be possible. Later that year, Frank devises a simple mechanical model showing that parallel reversible systems can indeed be synchronized locally in 3 dimensions.
20
Adiabatic Circuits and Reversible Computing
  • Commonly Encountered Myths, Fallacies, and
    Pitfalls
  • (in the Hennessy-Patterson tradition)

21
Myths about Adiabatic Circuits Reversible
Computing
  • Someone proved that computing with ltltkT
    free-energy loss per bit-operation is
    impossible.
  • Physics isnt reversible.
  • An energy-efficient adiabatic clock/power supply
    is impossible to build.
  • True adiabaticity doesnt require reversible
    logic.
  • Sequential logic cant be done adiabatically.
  • Adiabatic circuits require many clock/power
    rails and/or voltage levels.
  • Adiabatic design is necessarily difficult.

22
Fallacies about Adiabatic Circuits and Reversible
Computing
  • Since speed scales with energy dissipation in
    adiabatic circuits, they arent good for
    high-performance computing.
  • If I tried and failed to invent an efficient
    adiabatic logic, it must be impossible.
  • The algorithmic overheads of reversible
    computing mean it can never be cost-effective.
  • Since leakage gets worse in nanoscale devices,
    adiabatics is doomed.

23
Pitfalls in Adiabatic Circuits and Reversible
Computing
  • Using diodes in the charge-return path.
  • Forgetting to obey one of the transistor rules.
  • Using traditional models of computational
    complexity.
  • Restricting oneself to an asymptotically
    inefficient design style.
  • Assuming that the best reversible and
    irreversible algorithms are similar.
  • Failing to optimize the degree of reversibility
    of a design.
  • Ignoring charge leakage in low-power/adiabatic
    design.

24
Adiabatic/Reversible Computing
  • Basic Models and Concepts

25
Bistable Potential-Energy Wells
  • Consider any system having an adjustable,
    bistable potential energy surface (PES) in its
    configuration space.
  • The two stable states form a natural bit.
  • One state represents 0, the other 1.
  • Consider now the P.E. well havingtwo adjustable
    parameters
  • (1) Height of the potential energy
    barrierrelative to the well bottom
  • (2) Relative height of the left and rightstates
    in the well (bias)

(Landauer 61)
0
1
26
Possible Parameter Settings
  • We will distinguish six qualitatively different
    settings of the well parameters, as follows

BarrierHeight
Direction of Bias Force
27
One Mechanical Implementation
Stateknob
Rightwardbias
Barrierwedge
Leftwardbias
spring
spring
Barrier up
Barrier down
28
Possible Adiabatic Transitions
  • Catalog of all the possible transitions in these
    wells, adiabatic not...

(Ignoring superposition states.)
1states
1
1
1
leak
0
0states
0
leak
0
BarrierHeight
N
1
0
Direction of Bias Force
29
Ordinary Irreversible Logics
  • Principle of operation Lower a barrier, or not,
    based on input. Series/parallel combinations of
    barriers do logic.
    Major dissipation
    in at least one of the possible transitions.

1
Input changes, barrier lowered
0
  • Amplifies input signals.

Example Ordinary CMOS logics
Outputirreversiblychanged to 0
0
30
Ordinary Irreversible Memory
  • Lower a barrier, dissipating stored information.
    Apply an input bias. Raise the barrier to latch
    the new informationinto place. Remove
    inputbias.

Retractinput
1
1
Dissipationhere can bemade as low as kT ln 2
Retractinput
Barrierup
0
0
Barrier up
(3)
(1)
Input1
Input0
ExampleordinaryDRAM
N
1
0
(2)
(2)
31
Input-Bias Clocked-Barrier Logic
  • Cycle of operation
  • (1) Data input applies bias
  • Add forces to do logic
  • (2) Clock signal raises barrier
  • (3) Data input bias removed

Can amplify/restore input signalin the
barrier-raising step.
(3)
1
1
(4)
Can reset latch reversibly (4) given copy
ofcontents.
(3)
0
0
(2)
(4)
(4)
(2)
(4)
Examples AdiabaticQDCA, SCRL latch, Rod logic
latch, PQ logic,Buckled logic
(1)
(1)
N
1
0
(4)
(4)
32
Input-Barrier, Clocked-Bias Retractile
  • Barrier signal amplified.
  • Must reset output prior to changing input.
  • Combinational logic only!
  • Cycle of operation
  • (1) Inputs raise or lower barriers
  • Do logic w. series/parallel barriers
  • Clock applies bias force, which changes state, or
    not

0
0
0
(1) Input barrier height
ExamplesHalls logic,SCRL gates,Rod logic
interlocks
N
1
0
(2) Clocked force applied ?
33
Input-Barrier, Clocked-Bias Latching
  • ? Cycle of operation
  • Input conditionally lowers barrier
  • Do logic w. series/parallel barriers
  • Clock applies bias force conditional bit flip
  • Input removed, raising the barrier locking in
    the state-change
  • Clockbias canretract

1
(4)
(4)
0
0
0
(2)
(2)
(3)
(1)
Examples Mikes4-cycle 2-level adiabaticCMOS
logic (2LAL)
(2)
(2)
N
1
0
34
Full Classical-Mechanical Model
  • Claim The following components are sufficient
    for a complete, scalable, parallel, pipelinable,
    linear-time, stable, classical reversible
    computing system
  • (a) Ballistically rotating flywheel driving
    linear motion.
  • (b) Scalable mesh to synchronize local flywheel
    phases in 3-D.
  • (c) Sinusoidal to flat-topped waveform shape
    converter.
  • (d) Non-amplifying signal inverter (NOT gate).
  • (e) Non-amplifying OR/AND gate.
  • (f) Signal amplifier/latch.

Sleeve
(a)
(c)
(b)
(f)
(d)
Primary drawback Slow propagationspeed of
mechanical (phonon) signals.
(e)
cf. Drexler 92
35
Adiabatic electronics CMOS implementations
36
Conventional Gates are Irreversible
  • Logic gate behavior (on receiving new input)
  • Many-to-one transformation of local state!
  • Required to dissipate bT, by Landauer principle
  • Incurs ½CV2 dissipation in 2 out of 4 cases.

Transformation of local state
Example Static CMOS Inverter
in
out
37
(No Transcript)
38
(No Transcript)
39
Exact formulafor frequency reduction f ? RC/t
40
(No Transcript)
41
Common Mistakes to Avoid
  • In Adiabatic Design

42
Common Mistakes to Avoid
  • Dont use diodes in charge-return path!
  • The built-in voltage drop kills adiabaticity.
  • Dont disobey adiabatic transistor rules by
    either
  • Turning on transistor with voltage across it
  • Turning off transistor with current thru it!
  • This one is often neglected!
  • Use mostly-reversible logic!
  • Optimize degree of reversibility for application
  • Dont over-constrain the design family!
  • Asymptotically efficient circuits should be
    possible

43
Adiabatic Rules for Transistors
  • Rule 1 Never turn on a transistor if it has a
    nonzero voltage across it!
  • I.e., between its source drain terminals.
  • Why This erases info. causes ½CV2 disspation.
  • Rule 2 Never apply a nonzero voltage across a
    transistor even during any on?off transition!
  • Why When partially turned on, the transistor has
    relatively low R, gets high PV2/R dissipation.
  • Corollary Never turn off a transistor if it has
    a nonzero current going through it!
  • Why As R gradually increases, the VIR voltage
    drop will build, and then rule 2 will be violated.

44
Adiabatic Rules, continued
  • Transistor Rule 3 Never suddenly change the
    voltage applied across any on transistor.
  • Why So transition will be more reversible
    dissipation will approach CV2(RC/t), not ½CV2.
  • Adiabatic rules for other components
  • Diodes Dont use them at all!
  • There is always a built-in voltage drop across
    them!
  • Resistors Avoid moderate network resistances, if
    poss.
  • e.g. stay away from range gt10 k? and lt1 M?
  • Capacitors Minimize, reliability permitting.
  • Note Dissipation scales with C2!

45
Transistor Rules Summarized
Legal adiabatic transitions in green. (For n- or
p-FETs.)Dissipative states and transitions in
red.
off
high
low
off
off
high
high
low
low
off
high
low
on
on
high
low
high
low
on
on
low
low
high
high
46
SCRL Split-level Charge Recovery Logic
  • The First Pipelined Fully-Adiabatic CMOS
    Logic(Younis Knight, MIT, 94)

47
?
Transformation of local state
48
Input-Barrier, Clocked-Bias Retractile
Must reset outputprior to input.
Combinational logiconly!
  • Cycle of operation
  • Inputs raise or lower barriers
  • Do logic w. series/parallel barriers
  • Clock applies bias force which changes state, or
    not

0
0
0
ExamplesHalls logic,SCRL gates,Rod logic
interlocks
Input barrier height
N
1
0
Clocked force applied ?
49
(No Transcript)
50
Retractile Logic w. SCRL gates
  • Simple combinational logic of any depth N
  • Requires N timing phases
  • Non-pipelined
  • No sequential reuse ofHW (even worse)
  • We needsequentiallogic!

Time ?
51
Sequential Retractile Logic
  • Approach 1 (Hall 92)
  • After every N stages, invoke an irreversible
    latch
  • stores the output of the last stage
  • Then, retract all the stages,
  • and begin a new cycle
  • Problems
  • Reduces dissipation by at most a factor of N
  • Also reduces HW efficiency by order N!
  • In worst case, compared to a pipelined,
    sequential circuit
  • Approach 2 (Knight Younis, 93)
  • The store output stage can also be reversible!
  • Gives fully-adiabatic, sequential, pipelined
    circuits!
  • N can be as small 1 or 2 still have arbitrarily
    high Q

52
Simple Reversible CMOS Latch
  • Uses a standard CMOS transmission gate
  • Sequence of operation
  • (1) input initially matches latch contents
    (output)
  • (2) input changes?output changes (3) latch
    closes (4) input removed

Before Input Inputinput arrived removedin out
in out in outa a a a a a b b a b
P
in
out
53
Resetting a Reversible Latch
  • Can reversibly unlatch data as follows (exactly
    the reverse of the latching process)
  • (1) Data value d stored on memory node M.
  • (2) Present an exact copy of d on input.
  • (3) Open the latch (connecting input to M).
  • No dissipation since voltage levels match
  • (4) Retract the copy of d from the input.
  • Retracts copy stored in latch also.

54
Input-Bias Clocked-Barrier Logic
  • Cycle of operation
  • Data input applies bias
  • Add forces to do logic
  • Clock signal raises barrier
  • Data input bias removed

Can amplify/restore input signalin clocking step.
Retractinput
1
1
Retractinput
Clockbarrierup
Can reset latch reversibly givencopy of
contents.
0
0
Clock up
Input1
Input0
Examples AdiabaticQDCA, SCRL latch, Rod logic
latch, PQ logic,Buckled logic
N
1
0
55
(No Transcript)
56
(No Transcript)
57
SCRL 6-tick clock cycle
Initial state All gates off, all nodes neutral.
in
out
58
SCRL 6-tick clock cycle
Tick 1 Input goes valid, forward T-gate opens.
in
out
59
SCRL 6-tick clock cycle
Tick 2 Forward gate charges, output goes
valid.(Tick 1 of subsequent gate.)
in
out
60
SCRL 6-tick clock cycle
Tick 3 Forward T-gate closes, reverse gate
charges.
in
out
61
SCRL 6-tick clock cycle
Tick 4 Reverse T-gate opens, forward gate
discharges.
in
out
62
SCRL 6-tick clock cycle
Tick 5 Reverse gate discharges, input goes
neutral.
in
out
63
SCRL 6-tick clock cycle
Tick 6 Reverse T-gate closes, output goes
neutral.Ready for next input!
in
out
64
24 ticks/cyclein this version-includes
2-levelretractile stages
65
(No Transcript)
66
(No Transcript)
67
Some Timing Terminology
  • For sequential adiabatic circuits
  • 1 Tick Time for a single ramp transition
  • adiabatic speed fraction f times the RC gate
    delay.
  • 1 Phase Latency for a data value to propagate
    forward by 1 pipeline stage.
  • 1 Cycle Minimum period for all timing
    information to return back to its initial state.
  • Diadic Two retractile levels per gate
  • permits inverting or non-inverting logic.
  • Dual rail Two wires per logic value
  • permits universal logic with monodic gates

Monadiconly 1 level
68
Some Figures of Demerit
  • Some quantities we may wish to minimize
  • Ticks/phase
  • proportional to logic propagation latency
  • Ticks/cycle
  • reciprocal to rate of data throughput
  • Transistor-ticks/cycle
  • reciprocal to HW cost-efficiency
  • Number of required clock/power input signals
  • supplying these may be a significant component of
    system cost
  • Number of distinct voltage levels required
  • may affect reliability/power tradeoff

69
Some Interesting Questions
  • About pipelined, sequential, fully-adiabatic CMOS
    logic
  • Q Does it require an intermediate voltage level?
  • A No, you can get by with only 2 different
    levels.
  • Q What is the minimum number of externally
    provided timing signals you can get away with?
  • A ?4 (?12 if split levels are used)
  • Q Can the order-N different timing signals
    needed for long retractile cascades be internally
    generated within an adiabatic circuit?
  • A Yes, but not statically, unless N2 hardware is
    used
  • where N is the number of stages per full
    sequential cycle
  • We now demonstrate these answers.

70
Some Timing Examples
  • See next slide for some detailed timing diagrams.
  • N-level retractile cascades
  • 2N ticks/phase 1 phase/cycle 2N ticks/cycle
  • 3-phase fully-static diadic SCRL
  • 8 ticks/phase 3 phases/cycle 24 ticks/cycle
  • 2-phase fully-static monadic SCRL
  • 5 ticks/phase 2 phases/cycle 10 ticks/cycle
  • 2-phase fully-static diadic SCRL
  • 6 ticks/phase 2 phases/cycle 12 ticks/cycle
  • 6 tick/cycle dynamic SCRL detailed previously
  • 1 tick/phase 6 phases/cycle 6 ticks/cycle

71
Some SCRL timing diagrams
72
Reversible / Adiabatic Chips Designed _at_ MIT,
1996-1999
By the author and other then-students in the MIT
Reversible Computing group,under AI/LCS lab
members Tom Knight and Norm Margolus.
73
2LAL 2-Level Adiabatic Logic
  • A Novel Alternative to SCRL

74
2LAL 2-level Adiabatic Logic
(Implementable using ordinary CMOS transistors)
P
P
  • Use simplified T-gate symbol
  • Basic buffer element
  • cross-coupled T-gates
  • Only 4 timing signals,4 ticks per cycle
  • ?i rises during tick i
  • ?i falls during tick (i2) mod 4

?
?1
Tick
in
0 1 2 3
?0
?1
out
?2
?0
?3
75
2LAL Cycle of Operation
Tick 0
Tick 1
Tick 2
Tick 3
?1?1
in?1
in?0
?1?0
out?1
in
?0?1
?0?0
?1?1
in0
out?0
out0
?0?1
?0?0
76
2LAL Shift Register Structure
  • 1-tick delay per logic stage
  • Logic pulse timing propagation

?1
?2
?3
?0
in
out
?0
?1
?2
?3
0 1 2 3 ...
0 1 2 3 ...
in
in
77
More complex logic functions
  • Non-inverting Boolean functions
  • For inverting functions, must use quad-rail logic
    encoding
  • To invert, justswap the rails!
  • Zero-transistorinverters.

?
?
A
B
A
A
B
A?B
AB
A 0
A 1
A0
A0
A1
A1
78
Hardware Efficiency issues
  • Hardware efficiency How many logic operations
    per unit hardware per unit time?
  • Hardware spacetime complexity How much hardware
    for how much time per logic op?
  • Were interested in minimizing( of
    transistors) ( of ticks) / (gate cycle)
  • SCRL inverter, w. return path
  • (8 transistors) ? (6 ticks) 48 transistor-ticks
  • Quad-rail 2LAL buffer stage
  • (16 transistors) ? (4 ticks) 64 transistor-ticks

79
More SCRL vs. 2LAL
  • SCRL reversible NAND, w. all inverters
  • (23 transistors) ? (6 ticks) 138 T-ticks
  • Quad-rail 2LAL AND
  • (48 transistors) ? (4 ticks) 192 T-ticks
  • Result of comparison Although 2LAL minimizes
    of rails, and ticks/cycle, it does not minimize
    overall spacetime complexity.
  • The question of whether 6-tick SCRL minimizes
    per-op spacetime complexity among pipelined
    adiabatic CMOS logics is still open.

80
Minimizing Power-Clock Signals
  • How many external clock signals required?
  • N-level-deep retractile cascade logic
  • 2N waveforms 1 phase 2N signals
  • 6 tick/cycle, 6-phase dynamic SCRL
  • 6 waveforms 6 phases 36 signals
  • 24 tick/cycle, 3-phase static SCRL
  • 12 waveforms 3 phases 36 signals
  • 4 tick/cycle, 2LAL
  • 1 waveform 4 phases 4 signals!
  • It turns out that 12 signals are sufficient to
    implement any combination of 2-level or 3-level
    logics (including retractile) on-chip!

81
How to Do It
  • Circular 2LAL shifter pulse-gated clocks

Tick
0 1 2 3
P0
P1
P2
P3
P0
P1
in
0
P2
P3
out
?0
P0
P1
P2
P3
?2
?1
?2
?2
?3
2
82
12-rail system pros cons
  • Pros
  • Completely solves adiabatic timing design problem
  • Enables mixtures of retractile, SCRL, and other
    logic styles on 1 chip
  • Enables simple fully-adiabatic SRAM DRAM
  • Cons
  • Timing signals are dynamic
  • Known fully-static alternatives use order N2
    gates and signals for N-tick-long cycles
  • N can be large in a chip that includes deep
    retractile networks
  • Energy waste in driving the source/drain junction
    capacitances of all the T-gates even when timing
    pulse isnt present
  • SOI reduces these parasitics

83
GCAL General CMOS Adiabatic Logic
  • A general CMOS adiabatic design methodology
  • Currently under development at UF
  • Combines best features of SCRL, 2LAL, and
    retractile logics
  • Permits designs attaining asymptotically optimal
    cost-efficiency
  • For any combination of time, space, spacetime,
    energy costs
  • Arbitrarily high degree of reversibility
  • Permits using minimal 2-level and 3-level
    adiabatic gates
  • Requires only 4 externally supplied clock/power
    signals for 2-level logic
  • And only 12 total for mixed 2-level 3-level
    logic
  • Supports mixtures of fully-pipelined and
    retractile logic.
  • Supports quiescent dynamic/static latches RAM
    cells
  • Tools currently under development
  • A new HDL specialized for describing adiabatic
    designs
  • Digital circuit simulator with adiabaticity
    checker
  • Adiabatic logic synthesis tool, with automatic
    legacy design converter

84
GCAL DRAM/SRAM cells
  • GCAL DRAM cell
  • 4 transistors
  • 4 word lines/row
  • 2 bit lines/col (or 1)
  • GCAL SRAM cell
  • 8 transistors
  • 6 word lines/row
  • 2 bit lines/col (or 1)

85
DRAM Cell Write Cycle
  • All nodes initially ½.
  • T-gate initially closed (off).
  • Transmission gate opens.
  • Internal node is connected to bit-line (at
    matching voltage).
  • Bit line transitions to 0 or 1.
  • Pulls internal node to matching level.
  • Transmission gate closes.
  • Internal node latched to new level.
  • Bit line transitions back to ½.
  • Prepares for a new cycle.
  • Use the reverse sequence of operations to unwrite.

86
DRAM Cell Read Cycle
  • All external nodes initially ½.
  • T-gate initially off.
  • Internal node contains data.
  • Inverter rails split.
  • Bit line set to (inverted) data.
  • T-gate at end of column latches bit-line data.
  • Inverter rails merge.
  • Bit line restored to ½ level.
  • Can use the reverse sequence of operations to
    unread copy of data available at end of column.

87
Fully-Adiabatic DRAM cell
  • 6T, 6 lines/row, 1 line/column (in/out together)
  • Read cycle
  • Initially ? lines neutral, out neutral, R off
  • R for desired row turns on
  • ? for desired row splits, driving out column
  • R turns off, out is read
  • ? merges, out is reset
  • Write cycle
  • First, do read cycle.
  • in is set to out
  • W turns on
  • in changed to new value...

88
Fully-Adiabatic SRAM
  • 10-T, 10 lines/row, 1 line/column
  • Operation similar to DRAM, except
  • Read-out
  • T2 off N2 retracts T3 on N2 asserts T2 on, T3
    off
  • Write
  • T2 off N2 retracts N1 retracts, copy of M
    presented on input T1 on inchanges T1 off,
    N1asserts N2 asserts T2 on

N1
N2
M
T1
T2
T3
out
in
89
Limits of Adiabatics
90
Structured Systems
  • A structured system is defined as a system about
    whose state we have some knowledge.
  • Some of its physical information is known.
  • ? Its entropy is not at a maximum (by defn.).
  • ? It is not at equilibrium (by defn.).
  • For states with a given energy E,
  • we say the systems energy is distributed among
    those states, in proportion to their probability.

All statesof the abstractsystem havingenergy E
The systemsenergy isin here
States w.prob. gt 0
91
Desired Trajectories
  • Any structured systemwe build to servesome
    purposehas somedesiredtrajectory, or set
    oftrajectories, through its configuration space
    that we would ideally like it to follow at all
    times.
  • Think of any given state as having a specific
    desirability at any given time.

Time
Config-uration
Desired trajectories
92
Energy Losses
  • Energy dissipation can be viewed as a departure
    of part of the systems energy away from the
    systems desired trajectory.
  • E.g., 1 of 106 electronsleaks out of aDRAM cell
    systems energy hasdeparted from
    desiredtrajectory (all 106 stay)by a small
    amount

Time
Config-uration
Energy that hasdeparted from desiredtrajectories
93
Limits of Adiabatics IFriction
94
Generalized Friction
  • Any force leading to departure from desired
    trajectory that obeys the adiabatic principle
  • I.e., force strength ( total energy loss) is
    proportional to velocity along trajectory at low
    velocities
  • Examples
  • Ordinary sliding friction
  • Fluid viscosity
  • Electrical resistance
  • Forces causing electromagnetic radiative losses
  • Forces causing losses in inelastic collisions

95
Ways to Quantify Friction
  • Normal friction measures referring to length,
    mass, etc. may not apply to all processes.
  • For a given mechanism executing a specified
    process (i.e., following a specified desired
    trajectory or -ies) over a time t
  • Energy coefficient cE ?Elostt ?Elost/q
  • Energy dissipated from traj. per unit of
    quickness
  • Note quickness q 1/t has units like Hz
  • Entropy coefficient cS ?Smadet ?Smade/q
  • New entropy generated per unit of quickness
  • Note that cE cST at temperature T.

What matters!
96
Energy Coefficient in Electronics
  • For charging capacitive load C by voltage V
    through effective resistance R cE ?Elostt
    (CV2RC/t)t C2V2R
  • If the resistances are voltage-controlled
    switches with gain factor k controlled by the
    same voltage V, then effective R ? 1/kV cE
    C2V/k
  • In constant-field-scaled CMOS, k ? 1/dox ? ?, C ?
    ?, and V ? ?, so cE ? ?3/? ?4 ?Elost cE/t
    ? ?4/? ?3 (like CV2
    energy)

97
Degree of Reversibility of CMOS
  • What is the Q of a min-size CMOS transistor?
  • Q Efree/?Ediss
  • Efree/(cE/t) ½CV2/(C2V2R/t)
    ½(t/RC) ½ s (s slowdown factor)
  • Note Using transistors wider than minimum-size
    (larger C, smaller R) wouldnt change RC or Q,
    and would increase overall dissipation by
    increasing cE.

98
Lower Bounds on Friction?
  • No general (technology-independent) lower bounds
    on friction coefficients for interesting types of
    processes (e.g. computation) are currently known.
  • Clever engineering may eventually reduce the
    friction in desired processes to values as small
    as is desired.
  • Some ways
  • Reduce number of moving parts (or particles)
  • Isolate moving parts of system from unwanted
    interactions w. environment

99
Entropy coefficients of some reversible logic
gate operations
  • From Frank, Ultimate theoretical models of
    nanocomputers (Nanotechnology, 1998)
  • SCRL, circa 1997 1 b/Hz
  • Optimistic reversible CMOS 10 b/kHz
  • Merkles quantum FET 1.2 b/GHz
  • Nanomechanical rod logic .07 b/GHz
  • Superconducting PQ gate 25 b/THz
  • Helical logic .01 b/THz

How low can you go? We dont really know!
100
Is Adiabatic Limit Achievable?
  • Even if there is some lower bound on cS, it seems
    we can have ?Smade? 0 as t ? ?.
  • What factors may prevent this?
  • Any lower bound gt0 on the number of irreversible
    bit-operations performed. (Each has ?Smade ? 1.)
  • Fortunately, the lower bound can always be made
    0.
  • Any lower bound on the rate of energy leakage,
    even when system is completely stopped.
  • Any upper bound on the Q of the clocking
    synchronization system.
  • The system dissipates Efree/Q on every cycle.
  • No technology-independent upper bounds on Q known

101
Some Synonyms
  • Leakage of energy or (equivalently) probability
    mass out of a desired configuration or
    trajectory.
  • Occurrence of errors in the desired analog or
    digital state of a system. (Motion away from
    desired states.)
  • Decay of structure of a structured system. (The
    state departs from desired state.)

Leakage Error Decay
102
Perfect Mechanisms?
  • If a structured system is perfectly closed,
  • I.e. non-interacting with other systems, at all!
  • And if its internal interactions are perfectly
    known,
  • Then, and only then, is its (von Neumann) entropy
    going to be a constant.
  • Otherwise, its entropy will continuously increase
    as we lose track of its state.
  • In this case, no mechanism is perfect, in that
    some of its energy (i.e. some probability mass)
    is always leaking away from the desired
    trajector(y/ies) at some nonzero base rate, even
    when the rate of systems progress along its
    trajectory is zero.

103
Leakage Limits
  • Claim No real, structured system can have
    absolutely zero rate of energy leakage out of its
    desired trajectories, even if not moving.
  • However No general,technology-independentlower
    bound onleakage ratesis known (otherthan
    zero.)
  • Engineering advances mightmake leakage as small
    as desired.

Time
Config-uration
Energy that hasleaked from desiredconfiguration
104
Quantifying Leakage
  • For a given structured system
  • Leakage power Pleak dEleak / dt
  • Spontaneous entropy generation rate Sleak
    dSleak / dt
  • Again, note Pleak Sleak T at temperature T.



105
Ways to Decrease Leakage
  • Have high potential-energy barriers
  • slows down thermally excited leakage
    exponentially
  • Have thick potential-energy barriers
  • slows down quantum tunneling exponentially
  • Example Older generations of CMOS!
  • Mechanical (clockwork) systems have high
    potential energy barriers, for their size
  • Decay may require atoms to diffuse out of
    tightly-bonded spots.
  • Mechanisms that avoid making/breaking contacts
    (e.g. buckled logic) avoid losses due to stiction.

106
Limits of Adiabatics IILeakage
107
Minimum Losses w. Leakage
Etot Eadia Eleak
Eleak Pleaktr
Eadia cE / tr
108
Minimum Loss Derivation
109
Leakage in CMOS
  • In a given technology with constant-field
    scaling, leakage becomes worse at smaller scales
    because
  • Energy barriers between states are lower
  • Higher rates of thermally-induced leakage, at
    given T
  • Higher rates of quantum tunneling
    (temp.-independent)
  • Energy barriers between states are narrower
  • Higher rates of quantum tunneling
  • These effects get worse exponentially with
    1/length (doubly-exponentially with time)
  • Need alt. technologies w. high energy barriers!

110
Future Techs. w. Low Leakage?
  • How can we achieve low entropy coefficients in
    minimum-scale (atomic-scale) devices?
  • Need high energy barriers
  • Can achieve using atomic (not just electronic)
    interactions
  • E.g. mechanical logics (rod logic, buckling
    logic)
  • If strong bonds (e.g. C-C) are used in structure,
    rates of unwanted bond breakage can made be very
    low.
  • Rate for an atom passing through another one
    (e.g. knobs in rod logic) is extremely low due to
  • height of barrier strength of Coulombic
    fermionic repulsion between electrons,
  • width of barrier large number of particles
    involved
  • Other possibilities?

111
Minimum Dissipation with Variable V
  • Notice that this function of V approaches 0
    exponentially as V?8,
  • This is even true if we scale C?V!
  • Thus, there is no lower limit to the energy
    dissipation of adiabatic field-effect circuits!
  • The key is to make devices larger, not smaller!
  • Device sizes need grow only logarithmically.

112
Maximum Q factor in terms of V
  • The maximum logic Q factor is the maximum ratio
    between the energy involved in carrying out a
    logical transition, and the energy dissipated by
    the circuit during the transition.
  • We just calculated the minimum energy dissipated.
  • Thus, Qmax Einvolved/Ediss,min
    ½CV2/(2CV2exp(-V/2fT)) (1/4)exp(V/2fT)
  • Note that the maximum logic Q-factor goes up
    exponentially with the logic-swing voltage V.

113
Minimum energy Roff/Ron ratio
  • (A simpler version of earlier derivation.) Note
    that cE C2V2Ron and if the dominant leakage
    mode is source/drain, then Pleak V2/Roff
  • So putting the two together cEPleak
    C2V4(Ron/Roff) Emin 2(cEPleak)1/2
    2CV2(Ron/Roff)1/2
  • So we can rederive the maximum logic Q as
    follows Qmax ½CV2 / (2CV2(Ron/Roff)1/2)
    ¼(Roff/Ron)1/2 ¼(Imax/Ileak)1/2
    ¼(ron/off)1/2

114
Limits of Adiabatics IIIClock/Power Supplies
  • See transparencies.

115
Timing in Adiabatic Systems
  • When multiple adiabatic devices interact, the
    relative timing must be precise, in order to
    ensure that adiabatic rules are met.
  • There are two basic approaches to timing
  • Global (a.k.a. clocked, a.k.a. synchronous)
    timing
  • This is the approach in nearly all conventional
    irreversible CPUs.
  • Also is the basis for all practical adiabatic and
    quantum computing mechanisms that have been
    proposed to date.
  • Local (a.k.a. self-timed, a.k.a. asynchronous)
    timing
  • Implemented in a few commercial irreversible
    chips.
  • Feynman 86 showed that a self-timed serial
    reversible computation was implementable in QM,
    in principle.
  • Margolus 90 extended this to a 2-D model with
    1-D of parallelism. - Can it still work in a full
    3-D, fully-quantum-mechanical?
  • Indications from considering classical-mechanical
    3D meshes of coupled oscillators is yes.

116
Global Timing
  • Examples of adiabatic systems designed on the
    basis of global, synchronous timing
  • Adiabatic CMOS with external power/clock rails
  • Superconducting parametric quantron (Likharev)
  • Adiabatic Quantum-Dot Cellular Automaton (Lent)
  • Adiabatic mechanical logics (Merkle, Drexler)
  • All proposed quantum computers
  • A potential problem Synchronous timing may not
    scale well to large machine sizes.
  • Work by Janzig others raises issues of possible
    limits on timing systems due to quantum
    uncertainty.
  • Issue is still unresolved.

117
Clock/Power Supply Desiderata
  • Here are some requirements for a good adiabatic
    timing signal / power supply for driving
    voltage-coded logic
  • Generate a trapezoidal voltage waveform with very
    flat high/low regions.
  • Needed to avoid current through transistors when
    turning them off
  • The flatness of the signal limits the maximum Q
    factor of the logic.
  • Waveform during the high?low transitions should
    ideally be linear,
  • But this does not affect the maximum logic Q,
    only the energy coefficient.
  • So long as ramp slope scales down everywhere with
    transition time.
  • Operate resonantly with the logic circuit, with a
    high Q factor.
  • The power supplys Q will limit the overall
    system Q
  • If possible, scale Q ? t (cycle time)
  • Required to be considered an adiabatic mechanism.
  • May conflict w. inductor scaling laws!
  • At the least, Q should still be high at
    leakage-limited speed
  • Have a reasonable cost, compared to the logic it
    powers.
  • Be scalable to large meshes of mutually
    synchronized devices.

118
Supply concepts in my research
  • Superpose several sinusoidal signals from
    phase-synchronized oscillators at harmonics of
    fundamental frequency
  • Weight these frequency components as per Fourier
    transform of desired waveform
  • Create relatively high-L integrated inductors via
    vertical, helical metal coils
  • Only thin oxide layers between turns
  • Use mechanically oscillating, capacitive MEMS
    structures in vacuo as high-Q (10k) oscillator
  • Use geometry to get desired wave shape directly

119
Early supply concepts
  • Inductors switches.
  • See transparency.
  • Stepwise charging.
  • See transparency.

120
Newer Supply Concepts
  • Transmission-line-based adiabatic resonators.
  • See transparency.
  • MEMS-based resonant power supply
  • See next couple of slides
  • Ideal adiabatic supplies - Can they exist?
  • Idealized mechanical model See transparency.
  • But, there may be quantum limits to
    reusability/scalability of global timing signals.
  • This is a very fundamental issue!

121
MEMS/NEMS Resonators
  • A Novel Clock/Power Supply Technology for
    Adiabatic Circuits

122
MEMS/NEMS Resonators
  • State of the art of technology demonstrated in
    lab
  • Frequencies up to the 100s of MHz, even GHz
  • Qs gt10,000 in vacuum, several thousand even in
    air!
  • Rapidly becoming technology of choicefor
    commercial RF filters, etc., in
    communicationsSoC (Systems-on-a-Chip) e.g. for
    cellphones.

U. Mich., poly, f156 MHz, Q9,400
34 µm
123
A MEMS Supply Concept
  • Energy storedmechanically.
  • Variable couplingstrength ? customwave shape.
  • Can reduce lossesthrough balancing,filtering.

124
Limits of AdiabaticsSummary
  • Some final remarks.

125
Summary of Limiting Factors
  • When considering adiabaticizing a system, ask
  • What fraction of system power is in logic? fL
  • Vs. Displays, transmitters, propulsion.
  • What fraction of logic is done adiabatically? fa
  • Can be all, but w. cost-efficiency overheads.
  • How large is the Ion/Ioff ratio of switches?
Write a Comment
User Comments (0)
About PowerShow.com