Model Checking Java Programs - PowerPoint PPT Presentation

About This Presentation

Title:

Model Checking Java Programs

Description:

Translate statements to SAL guarded commands ... JVM run-time in SAL. Heap, stack, are implemented as ... Similar goals, different optimizations, no SAL, no C ... – PowerPoint PPT presentation

Number of Views:71

Avg rating:3.0/5.0

Slides: 41

Provided by: ulrich6

Learn more at: https://faculty.cc.gatech.edu

Category:

more less

Transcript and Presenter's Notes

Title: Model Checking Java Programs

1
Model Checking Java Programs

David Park, Ulrich Stern, Jens Skakkebaek, and
David L. Dill
Stanford University

2
Introduction

Can model checking be usefully applied to
programs?
Model checking background
Model checking software
A Java model checker
Research directions

3
Background on model checking
4
Model Checking

Model checking analyzes the reachable state
space of a system for certain properties.
Analysis may
enumerate states and
may also look for paths (e.g., unfair cycles)
State set representation can be
explicit (e.g., hash table of states) E.g.,
SPIN, Mur?
symbolic (e.g., a Boolean function represented as
a Boolean decision diagram BDD) SMV, nuSMV,
VIS

5
Explicit-state vs. symbolic

Explicit state model checking has several
advantages
More predictable (hard to diagnose reasons for
BDD blowup).
Avoid difficulties of translating everything to
Boolean functions.
Easier to deal with dynamic features of software
(e.g. heap-allocated objects).

6
Trivial Mur? example

One state variable, two rules
count gt 0 --gt count count - 1
count lt 10 --gt count count 1

0
. . .
1
2
3
10
7
Basic explicit model checking
On-the-fly search procedure Initialize Queue
is empty Table is empty push s0 onto
Queue Loop while Queue not empty do remove
s from Queue if s is NOT already in Table
then enter s in Table for all
enabled rules r at s do s succ(s,r)
insert s into Queue
8
Sweet spot for model checking

Designs that are control-dominated and
nondeterministic
(Nondeterminism stems from concurrency and
environmental behavior)
Designers cant foresee all cases and
interactions
Directed or random testing gets poor coverage
(there arent even good coverage metrics)
Simple static analysis methods either miss too
many bugs or report too many false errors.
Even running a prototype results in infrequent,
unrepeatable, hard-to-diagnose failures.

9
Perspectives on model checking

As a testing method
Model checking is more expensive and more
thorough
Tests generated automatically
States are saved to avoid redundant work
As a static analysis method
Model checking enumerates paths more precisely
than traditional static analysis
Hence, it is more accurate and more costly.

10
The three big problems

Computational complexity (e.g., the state
explosion problem)
Finding properties to check
Describing/constraining environment

11
Model checking of software
12
The Key Requirement

A new verification technique will not be adopted
unless the benefits outweigh the costs.

13
Special problems of software

Bugs are less expensive in software than hardware
or protocols
Field upgrades are often relatively inexpensive
Dynamic structures
Heap
Recursion
Dynamically allocated threads
Large state spaces
Complex environment
OS
Hardware
User

14
Bugs are inexpensive

Cost of verification must be minimized
Verifying code instead of high-level specs
reduces specification burden
Checking implicit properties (e.g., deadlock)
reduces cost
Other costs must be displaced (e.g., manual
testing)
Target applications that have relatively high
cost
Safety critical
Embedded real-time systems
Other cases security?

15
Dynamic stuff

Heap
Recursion
Dynamically allocated Threads
Eliminate artificial limitations of existing
model checkers
Allow dynamic arrays
No a priori limit on state space, but thats ok.

16
Large state spaces

Large state spaces
Software complexity is not highly constrained by
physical resources
Target embedded applications, which are somewhat
resource-constrained (but less so each day).
Use available model checking optimizations

17
Complex environment

OS
Hardware
User
Hope that detailed constraints arent needed
Bite the bullet and write specifications

18
A Java model checker
19
Java Model checking

Why Java?
Lots of interest
Well-defined thread model
Possibly to be used for embedded real-time
applications in future.

20
Value of model checking for Java

Concurrency problems still very hard to test and
debug
Nondeterminism from scheduling
Seemingly reliable applications may break when on
new hardware, JVM, or under different scheduling
load.
Unpredictable, non-repeatable failures.
Other sources of nondeterminism
Interactions with user, system calls

21
Status of project

Mostly an integration of existing ideas
Prototype is implemented
Implements a large subset of Java including most
advanced features of Java.
Inheritance, overriding, overloading, exception
handling
Can deal with small programs written by others
Cant deal with native code in libraries, etc.

22
Properties checked

Goal Keep specification simple
Check properties that dont have to be explicitly
specified
Programmers are comfortable with in-line
assertions.
Checker looks for
deadlock
assertion failures
selected exceptions array bounds, run-time type
errors.
. . . more coming soon.

23
Translation strategy

Translate statements to SAL guarded commands
Nondeterminism of guarded commands used to model
scheduler, possible results of API calls.
Implement JVM run-time in SAL
Heap, stack, are implemented as dynamic arrays
Classes, stack frames implemented as records

24
Processing steps
Java Program
Java Byte-code
Jimple
SAL Level 1
SAL Level 0
C
Model Checker
Error Trace
25
Java to SAL translation

Jimple statements SAL guarded commands
Example

i0 1is translated into(PCTID label_0)
--gt next(Stack)TIDSP.localVariables.i0
1 next(PC)TID label_1where PC
program counterTID thread identifier of
current threadlabel_0 SAL label of the
statement i0 1SP stack pointer
26
Optimization Atomic Blocks

Idea (Bruening, 1999) execute large blocks of
code without interleaving at the statement level.
Dont need to save or copy intermediate states
(just save state at end of block).
Avoid state explosion from fine-grained
interleaving.

vs.
27
Atomic Blocks

Assumption all accesses to shared variables are
locked.
This can and should be checked during
verification using same method as in Eraser.
Blocks are broken immediately after unlock
events.
Not necessary to break at lock. There may be
multiple locks and lots of other statements in
the block.
This would miss deadlocks,
. . . but there is a more sophisticated deadlock
check based on circular wait conditions that
catches all of them.
If a block fails to acquire a lock, it is aborted

28
Atomic Blocks

Model checker does the optimization on-the-fly
execution continues until unlock. Then state is
saved and other threads can be executed.
This is a special form of persistent set
reduction (Wolper and Godefroid).

29
Savings from atomic blocks
30
Optimization Hash Compaction

Idea Instead of saving (large) states in state
table, store (small) signatures (WolperLeroy,
SternDill).
Tradeoff May result in missed errors because
state search falsely thinks it has seen a state
before.
Probability of missed error can be bounded
5-byte signatures, 80 million states P(omission)
lt 0.13.
Outcomes
Error found (guaranteed correct)
Ran out of space, no errors (inconclusive)
Searched all states, no errors (almost guaranteed
correct).

31
Related work

Eraser Savage et al., 1997
Checks unlocked variables, but doesnt replace
test generation.
Verisoft Godefroid, 1996, Rivet Bruening,
1999, Stoller 2000
Systematically exercises design, but doesnt
check previously visited states (may do redundant
work)
Java PathFinder (NASA)
Similar goals, different optimizations, no SAL,
no C
dSPIN - dynamic data structures in SPIN (but no
special optimizations)

32
The Future
33
The three big problems (again)

Computational complexity (e.g., the state
explosion problem)
Finding properties to check
Describing/constraining environment
More research is needed on all of these problems,
in addition to integrating existing techniques.

34
Computational complexity

This problem requires an assault from many
directions.
Reduce the problem before model checking
Slicing based on property being checked.
Data and control abstraction.
E.g., Bandera system.
Additional model checker optimizations
Better persistent set reductions
Heap-based optimization
symmetry
early garbage collection
Partial verification
Provide guidance to most interesting parts of
state space

35
Finding properties to check

Race detection
Eraser model - but requiring Java locks on all
shared variables is neither necessary nor
sufficient
Misses higher-level locking constructs
Misses higher-level atomicity requirements
Possibilities of new models lock ordering,
happens before
Check wider range of exceptions
Specify and check requirements of standard
libraries

36
Environmental specification

Write more robust programs
Detect and report bad environment behavior
Code to do this can be used by verifier to
exclude false errors
Specify reusable constraints for common cases
(e.g., standard libraries)
Slicing and abstraction can immunize verification
from irrelevant environment problems.
Better solutions are needed.

37
Model checking static analysis