EnergyAware Memory Access Scheduling - PowerPoint PPT Presentation

1 / 19
About This Presentation
Title:

EnergyAware Memory Access Scheduling

Description:

Row activation and precharge energy: 20nJ. Column access energy for 64 bytes: 26nJ ... results: Number of row activations (biu=8) Simulation results: ... – PowerPoint PPT presentation

Number of Views:22
Avg rating:3.0/5.0
Slides: 20
Provided by: KEB83
Category:

less

Transcript and Presenter's Notes

Title: EnergyAware Memory Access Scheduling


1
Energy-Aware Memory Access Scheduling
  • Yongkui Han and Israel Koren
  • Electrical and Computer Engineering
  • University of Massachusetts, Amherst
  • January 30, 2004

2
3-D structure of contemporary DRAM chips
Memory data location (bank,row,column) Three
steps in accessing the memory data bank
precharge, row access (row activation), column
access. Once a row has been accessed, a new
column access can issue each cycle until the bank
is precharged. It is faster to access data
located in the same row than in different rows.
3
Motivation
  • DRAM devices are accessed in units of L2 cache
    block size, e.g., 64 bytes.
  • Energy values for Micron MT48LC16M16A2 SDRAM
    device
  • Row activation and precharge energy 20nJ
  • Column access energy for 64 bytes 26nJ
  • If we reorder the memory accesses to put together
    accesses to the same row, we can remove some
    unnecessary row activations and save DRAM energy.
  • Next we compare a performance-aware scheduling
    policy with an energy-aware scheduling policy.

4
Example 1 critical word first
1. Original accesses order (0,0,0) (0,0,1)
(0,0,2) (0,0,3) (0,1,0) (0,1,1) (0,1,2) (0,1,3)
energy 2026420264 196nJ (0,0,2) and
(0,1,3) contain critical words, e.g. on a cache
miss, the word causing the miss is the critical
word. 2. Performance-aware scheduling order
(0,0,2) (0,1,3) (0,0,1) (0,0,2) (0,0,3) (0,1,0)
(0,1,1) (0,1,2) energy 202620262026320
263 236nJ 3. Energy-aware scheduling order
(0,0,2) (0,0,0) (0,0,1) (0,0,3) (0,1,3) (0,1,0)
(0,1,1) (0,1,2) energy 2026420264
196nJ We can save 40nJ compared to
performance-aware scheduling.
5
Example 2 read-bypass-write
1. Original accesses order (0,0,0)r (0,1,0)w
(0,0,4)w (0,1,4)r (0,2,4)r energy (2026)5
230nJ 2. Performance-aware scheduling order
(0,0,0)r (0,1,0)r (0,2,4)r (0,0,4)w (0,1,0)w
energy (2026)5 230nJ 3. Energy-aware
scheduling order (0,0,0)r (0,0,4)w (0,1,4)r
(0,1,0)w (0,2,4)r energy 203265 190nJ We
can save 40nJ compared to performance-aware
scheduling.
6
Example 3 putting together column accesses
Precharge 3 cycles,row access 3 cycles, column
access 1 cycles 1. original accesses order
(0,0,0)r (0,1,0)w (0,0,4)r (0,1,8)w(0,0,8)r(0,2,3)
w(0,1,12)w cycles (331)7 49 cycles,
energy (2026)7 322nJ 2. performance-aware
scheduling order (0,0,0)r (0,0,4)r (0,0,8)r
(0,1,0)w (0,1,8)w(0,1,12)w(0,2,3)w cycles
(33)317 25 cycles, energy 203267
242nJ 3. energy-aware scheduling identical to
the above, saving 24 cycles, and 80nJ.
7
Three Scheduling Policies
  • FCFS First Come First Serve
  • this is the simplest one, just follow the
    original memory access order.
  • RIFF Read or Instruction Fetch First
  • give higher priority to memory read over memory
    write. this is a performance-aware scheduling
    policy, suggested in previous papers.
  • SRAF Same Row Access First
  • give higher priority to memory accesses to the
    same row as the current one. this is an
    energy-aware scheduling policy we suggest.

8
Policy Implementation
  • All three policies are implemented at the BIU
    (Bus Interface Unit), which is located in the
    processor, before accesses go to the DRAM system
    from the processor.

9
Experimental Setup
  • Simulator sim-mase (included in the simplescalar
    v4.0 test release) developed by UMich, which
    includes a DRAM simulator developed by UMD.
  • 5 floating point SPEC2000 benchmarks.
  • Simulation fastforward 1 billion instructions,
    then simulate the next 100 million instructions
    in detail.
  • Policy implementation overhead Since the memory
    access scheduling operations can be overlapped
    with memory access operations, there is no
    performance overhead, and the energy overhead is
    negligible.

10
Benchmarks SPEC CFP2000
11
Simulation parameters
12
DRAM system configuration
13
Simulation results Number of row activations
(biu8)
14
Simulation results Performance (biu8)
15
Simulation results Energy consumption (biu8)
16
Simulation results Energy consumption (biu32)
17
Conclusion
  • Memory access order greatly affects the energy
    consumption of DRAM memory devices.
  • By putting together accesses to the same row, we
    can remove many unnecessary row activations.
  • We can save considerable energy through memory
    access scheduling, especially for memory access
    intensive applications.

18
Questions?
19
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com