Artemis: Practical Runtime Monitoring of Applications for Execution Anomalies

About This Presentation

Title:

Artemis: Practical Runtime Monitoring of Applications for Execution Anomalies

Description:

Artemis: Practical Runtime Monitoring of Applications for Execution Anomalies ... With Artemis, much less data is examined by runtime analysis, reducing overhead ... – PowerPoint PPT presentation

Number of Views:65

Avg rating:3.0/5.0

Slides: 22

Provided by: long5

Category:

more less

Transcript and Presenter's Notes

Title: Artemis: Practical Runtime Monitoring of Applications for Execution Anomalies

1
Artemis Practical Runtime Monitoring of
Applications for Execution Anomalies

Long Fei and Samuel P. Midkiff
School of Electrical and Computer Engineering
Purdue University, West Lafayette
PLDI 2006
Subproject of PROBE

2
Motivation

Bugs are expensive!
Cost in 2002 60 billion dollars, 0.6 GDP
Debugging Approaches
Manual debugging inefficient, impossible
Extensive testing inefficient, path explosion
Static analysis conservative, false alarms
Manual annotation inefficient, not scalable
Runtime debugging high overhead

3
What is Artemis?

Is not a bug detection tool
Makes existing tools more efficient in bug
detection

4
Outline for the Rest of the Talk

Birds eye view of related work
Artemis framework
Experimental results
Conclusions

5
Birds Eye View of Compiler-Aided Debugging
Techniques
static

More efficient design
Problem specific
Usually involves assumptions about OS, compiler,
or hardware

compiler techniques
software debugging

Exploit parallelism
Shadow checking process (Patil SPE97)
Thread-level speculation (Oplinger ASPLOS02)

dynamic
Liblit PLDI03
faster
random
no program information
sampling
runtime overhead
Perform fewer checks
parallel
adaptive
selective
Chilimbi ASPLOS04
use program information
Artemis
6
Artemis Design Goals

General
Work with multiple pre-existing debugging schemes
Pure software approach that works with general
hardware, OS and compiler
Effective
Improve overhead in general
Have low asymptotic overhead in long-running
programs
Adaptive
Adjust coverage of monitoring to system load

7
Key Idea

Because runtime monitoring is expensive
want to monitor only when a bug occurs
Our goal is to approximate this
avoid re-monitoring executions whose outcome has
been previously observed

8
How to Determine Where Bugs are Likely to be Seen

Code region behavior (and bug behavior) is
determined by regions context
Monitor the 1st time a region executes under a
context
If buggy, the bug is monitored
if not buggy, only monitor this region with this
context once
Over time, almost all executions of a region have
a previously seen context yields low asymptotic
monitoring overhead
Hard part efficiently representing, storing and
comparing contexts for a region
The context could be the whole program state!

9
Decision to Monitor
code segment entrance
first entrance ?
context seen before ?
use unmonitored version
N
Y
Y
N
initialize context
update context record, add current context
use monitored version
10
Target Programs

Our prototype targets sequential code regions
Determined by how contexts are defined
Can be used with race-free programs without loss
of precision
Target the sequential regions of these programs
Use with programs with races is ongoing research

11
Implementation Issues

Define code regions
Represent and compare contexts
Interface with existing runtime debugging schemes
Adhere to overhead constraints
Adapt to system load

12
Defining Code Regions

Spatial granularity
Temporal granularity
Context check frequency
Context check efficiency
Ideal case a small context dominates the
behavior of a large piece of code
Our choice
Procedure natural logic boundary

13
Approximating Context for Efficiency

Exact Context
Too large to store and check (might be entire
program state)
Represent approximately tradeoff between
precision and efficiency
Approximated Context
In-scope global variables, method parameters,
in-scope pointers
Values of non-pointer variables are mapped into a
compact form (value invariant as in DIDUCE ICSE
02)
Requires 2 integer fields 2 bitwise operations
for each check 3 bitwise operations for each
update
Pointers tracked by declared (not actual) types
argv approximated by vector length
Correlations between context elements are lost
If a4,b3 and a5,b8 are two contexts of a
region, we track a(4,5), b(3,8)

14
Simulating Monitoring Schemes

We need to measure performance on a wide range of
runtime monitoring schemes
A generic monitoring scheme
Inserts instrumentation into application at
probability p
Calls a dummy monitoring function, which
simulates the overhead of some real monitoring
scheme
Can adjust overhead from zero to arbitrarily
large
Disable dummy monitoring to reveal the asymptotic
overhead of Artemis
Only performs the context checks associated with
the cost of monitoring, but not the monitoring
Allows measuring context checking overhead only

15
Experiment Asymptotic Overhead Measured by
Simulation
16
Two Findings

Performance floor
As monitoring scheme overhead approaches zero,
Artemis overhead is 5.57 of unmonitored program
execution time
When can we use Artemis to improve overhead ?
Break even baseline monitoring overhead
Monitoring overhead gt 5.6, Artemis helps
By solving x0.0045x0.0557, x 5.60
This covers most of the monitoring techniques

17
An Optimization Reuse Context Sets Across Runs

Eliminates the initial building of sets of
observed contexts
Converges faster to the asymptotic overhead
Invariant profile
Dump the context invariants into a file at
program exit
Load dumped invariants at the next run
Invariant profile size is 0.4 4.7 of program
binary size (average 1.7, std 0.95)

18
Using Artemis (with invariant profile)
training Artemis
baseline
Artemis
invariant profile
source
instrumentation
build
production run
bug report
19
Convergence to Asymptotic Overhead e.g. bzip2
from SPECint
7.3

Asymptotic overhead reduced to lt 7.5 (from 280)

20
Experiments with Real Monitoring Schemes

Measuring how well does (monitoring scheme guided
by Artemis) approximates the capabilities of
original monitoring scheme
Artemis with hardware-based monitoring (AccMon)
detected 3/3 bugs, 2.67 times improvement, in
very short-running programs
Artemis with value invariant detection and
checking (C-DIDUCE) Source-level
instrumentation covered 75 of violations, 4.6
times improvement, in short-running programs
Full results and details are in the paper

21
Conclusions