Runahead Execution: An Alternative to Very Large Instruction Windows for Out-of-order Processors - PowerPoint PPT Presentation

1
Runahead Execution: An Alternative to Very Large
Instruction Windows for Out-of-order Processors
  • Onur Mutlu, The University of Texas at Austin
  • Jared Stark, Microprocessor Research, Intel Labs
  • Chris Wilkerson, Desktop Platforms Group, Intel
    Corp.
  • Yale N. Patt, The University of Texas at Austin
  • Presented by Mark Teper

2
Outline
  • The Problem
  • Related Work
  • The Idea: Runahead Execution
  • Details
  • Results
  • Issues

3
Brief Overview
  • Instruction Window
  • The set of in-order instructions that have not yet
    been committed
  • Scheduling Window
  • The set of unexecuted instructions waiting to be
    selected for execution
  • What can go wrong?

[Diagram: program flow through the instruction window, the scheduling windows, and the execution units]
4
The Problem
[Diagram: instruction window along the program flow, showing unexecuted, executing, long-running, and committed instructions]
5
Filling the Instruction Window
[Chart: IPC vs. instruction window size]
6
Related Work
  • Caches
  • Alter size and structure of caches
  • Attempt to reduce unnecessary memory reads
  • Prefetching
  • Attempt to fetch data into nearby cache before
    needed
  • Hardware and software techniques
  • Other techniques
  • Waiting instruction buffer (WIB)
  • Long-latency block retirements

7
Runahead Execution
  • Continue executing instructions during long
    stalls
  • Disregard the speculative results once the data
    returns

[Diagram: instruction window during runahead, with a checkpoint taken at the long-running instruction while later instructions continue to execute]
8
Benefits
  • Acts as a highly accurate prefetcher
  • Software prefetchers have less runtime information
  • Hardware prefetchers can't analyze code as well
  • Pre-trains (biases) branch predictors
  • Makes use of cycles that are otherwise wasted

9
Entering Runahead
  • The processor can enter runahead mode at any point
  • The paper uses L2 cache misses as the trigger
  • The architecture must be able to checkpoint and
    restore register state
  • Including the branch-history register and return
    address stack

10
Handling the Avoided Read
  • The load that triggers runahead returns immediately
  • Its result value is marked INV (invalid)
  • The processor continues fetching and executing
    instructions

[Diagram: register file R1–R3 with INV bits while executing:]
  • ld r1, r2
  • add r3, r2, r2
  • add r3, r1, r2
  • move r1, 0
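A minimal sketch of the INV-bit bookkeeping this sequence implies (class and method names are illustrative, not from the paper): a missing load returns immediately with its destination marked INV, any instruction that reads an INV source produces an INV result, and writing a known value clears the bit.

```python
class RunaheadRegs:
    """Register file with per-register INV bits (illustrative sketch)."""

    def __init__(self):
        self.value = {}   # register name -> value (meaningless when INV)
        self.inv = set()  # registers currently marked INV

    def load_miss(self, dst):
        # The runahead-triggering load "returns" at once with a bogus value.
        self.value[dst] = 0
        self.inv.add(dst)

    def add(self, dst, src1, src2):
        # A result is INV iff any source is INV.
        if src1 in self.inv or src2 in self.inv:
            self.inv.add(dst)
        else:
            self.value[dst] = self.value.get(src1, 0) + self.value.get(src2, 0)
            self.inv.discard(dst)

    def move_imm(self, dst, imm):
        # Writing a known value clears the INV bit.
        self.value[dst] = imm
        self.inv.discard(dst)


# The slide's sequence, assuming r2 already holds a valid value:
r = RunaheadRegs()
r.move_imm("r2", 5)
r.load_miss("r1")        # ld r1, r2      -> r1 is INV
r.add("r3", "r2", "r2")  # add r3, r2, r2 -> valid
r.add("r3", "r1", "r2")  # add r3, r1, r2 -> INV (depends on r1)
r.move_imm("r1", 0)      # move r1, 0     -> r1 valid again
```

Only r3 ends up INV: the dependence on the missing load is tracked, while the final move makes r1 usable again.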
11
Executing Instructions in Runahead
  • Instructions are fetched and executed as normal
  • Instructions are committed (pseudo-retired) out of
    the instruction window in program order
  • If an instruction's source registers are INV, it
    can be retired without executing
  • No results are ever observable outside the CPU

12
Branches during Runahead
  • Divergence points: branches mispredicted because
    they depend on INV values

13
Exiting Runahead
  • Occurs when the stalling memory access finally
    returns
  • The checkpointed architectural state is restored
  • All instructions in the machine are flushed
  • The processor resumes fetching at the instruction
    that caused runahead execution
  • The paper presents an optimization in which
    fetching restarts slightly before the stalled
    access returns
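The enter/exit protocol described on this and the "Entering Runahead" slide can be sketched as a tiny model (names illustrative; the real checkpoint also covers the branch-history register and return address stack):

```python
class Core:
    """Toy model of runahead entry/exit (illustrative, not the paper's design)."""

    def __init__(self):
        self.mode = "normal"
        self.pc = 0
        self.regs = {}
        self.checkpoint = None

    def enter_runahead(self, miss_pc):
        # On the L2-miss trigger: checkpoint architectural state, keep going.
        self.checkpoint = (miss_pc, dict(self.regs))
        self.mode = "runahead"

    def exit_runahead(self):
        # The miss returned: flush everything, restore the checkpoint, and
        # refetch starting at the instruction that caused runahead.
        miss_pc, saved_regs = self.checkpoint
        self.regs = saved_regs
        self.pc = miss_pc
        self.checkpoint = None
        self.mode = "normal"
```

Everything computed in runahead mode is discarded; its lasting effects are only the prefetches and predictor training it performed.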

14
Biasing Branch Predictors
  • Runahead can cause the branch predictor to be
    trained twice on the same branch
  • Several alternatives:
  • Always train the branch predictor
  • Never train the branch predictor during runahead
  • Keep a list of branches predicted during runahead
  • Use a separate branch predictor for runahead
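To see the double-training problem concretely, here is a sketch using a 2-bit saturating-counter predictor with a flag implementing the "never train during runahead" alternative (all names illustrative):

```python
class TwoBitPredictor:
    """Per-PC 2-bit saturating counters (0-1 predict not-taken, 2-3 taken)."""

    def __init__(self, train_in_runahead):
        self.counters = {}
        self.train_in_runahead = train_in_runahead

    def predict(self, pc):
        return self.counters.get(pc, 1) >= 2

    def update(self, pc, taken, runahead):
        if runahead and not self.train_in_runahead:
            return  # the "never train during runahead" policy
        c = self.counters.get(pc, 1)
        self.counters[pc] = min(3, c + 1) if taken else max(0, c - 1)


# The same dynamic branch is seen once in runahead mode and again after
# the restart; with training enabled in runahead it counts twice.
biased = TwoBitPredictor(train_in_runahead=True)
biased.update(0x40, taken=True, runahead=True)
biased.update(0x40, taken=True, runahead=False)

clean = TwoBitPredictor(train_in_runahead=False)
clean.update(0x40, taken=True, runahead=True)
clean.update(0x40, taken=True, runahead=False)
```

One dynamic branch moves the biased counter two steps but the clean one only one step, which is the over-training the slide's alternatives try to avoid.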

15
Runahead Cache
  • Runahead execution disregards stores
  • They can't produce externally observable results
  • However, store data is still needed for
    store-to-load communication within runahead
  • Solution: the runahead cache
  • Loop:
  • store r1, r2
  • add r1, r3, r1
  • store r1, r4
  • load r1, r2
  • bne r1, r5, Loop

16
Stores and Loads in Runahead
  • Loads
  • If the address is INV, the data is automatically INV
  • Otherwise, look in order in:
  • The store buffer
  • The runahead cache
  • Finally, go to memory (the caches)
  • If it hits in the cache, treat the data as valid
  • If it misses, treat it as INV; don't stall
  • Stores
  • Use the store buffer as usual
  • On commit:
  • If the address is INV, drop the store
  • Otherwise, write the data to the runahead cache
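The lookup order above can be sketched directly (a toy model assuming a memory hit returns data and a miss returns INV without stalling; names illustrative):

```python
INV = object()  # sentinel for invalid runahead values/addresses

def runahead_load(addr, store_buffer, runahead_cache, memory):
    # Load path: INV address -> INV data; otherwise check the store buffer
    # first, then the runahead cache, and finally memory (a miss yields
    # INV, never a stall).
    if addr is INV:
        return INV
    if addr in store_buffer:
        return store_buffer[addr]
    if addr in runahead_cache:
        return runahead_cache[addr]
    return memory[addr] if addr in memory else INV

def runahead_store_commit(addr, data, runahead_cache):
    # Store path at commit: INV address -> drop the store; otherwise write
    # only to the runahead cache, never to real memory.
    if addr is not INV:
        runahead_cache[addr] = data

# Example state: one line in the cache hierarchy, one runahead store.
store_buffer, runahead_cache, memory = {}, {}, {0x10: 42}
runahead_store_commit(0x20, 7, runahead_cache)
```

A later runahead load to 0x20 is forwarded from the runahead cache, a load to 0x10 hits memory, and anything else comes back INV without blocking.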

17
Run-Ahead Cache Results
  • The authors found that not forwarding data from
    stores to loads resulted in poor performance
  • A significant number of results became INV

18
Details: Architecture
19
Results
20
Results (2)
21
Issues
  • Some wrong assumptions about future machines
  • The future baseline corresponds poorly to modern
    architectures
  • Few details on the architectural requirements of
    the technique
  • Increased hardware size
  • Increased power requirements