SAPA: A Domain-independent Heuristic Temporal Planner - PowerPoint PPT Presentation

About This Presentation
Title:

SAPA: A Domain-independent Heuristic Temporal Planner

Description:

3/25: Leaving STRIPS Planning and going to Sapa – PowerPoint PPT presentation

Number of Views:37
Avg rating:3.0/5.0
Slides: 19
Provided by: Minh82
Category:

less

Transcript and Presenter's Notes

Title: SAPA: A Domain-independent Heuristic Temporal Planner


1
3/25 Leaving STRIPS Planning and going
to Sapa
2
Administrivia 3/25
  • Homework 4 due next class
  • Midterm soon after that
  • Will be take home
  • Will have a Shock and Awe flavor
  • You can be an embedded exam taker by
    suggesting problems
  • Today
  • Metric/Temporal planning (MTP)
  • Representation issues
  • Modeling MTP in Progression and Regression
  • And Graphplan and PO planning etc. etc.

3
Metric Temporal Planning
  • Time Durative actions Temporal constraints on
    goals (deadlines inter-goal constraints eg.
    Make sure to be in airport 2 hours before you are
    in the plane) Exogenous events
  • Durations may be static or dynamic
  • duration depends on the contexteg. Time to fill
    your gas tank depends on how empty the tank is to
    begin with
  • Advanced issues Uncertain durations
  • Modeling issues When are preconditions needed?
    How long will they persist? When are effects
    given?
  • A default assumption is to say that all
    preconditions are needed at the beginning and
    must hold during the entire actions duration.
    And that all effects will be available at the end
    of the action
  • E.g Consider Grading homeworks actionwhen are
    the homeworks needed? When are the grades
    available? What does your teacher tell you?
  • Planning issues How to support concurrency?(see
    next slide) How to support multi-objective
    (cost/make-span/robustness) optimization
  • Resources Actions may consume/produce
    (continuous quantity) resources
  • Modeling issues How to model resource
    availability (especially over the duration of an
    action)
  • Planning issues How to efficiently reason with
    continuous quantities during planning

Special cases TP Temporal planning RP
Resource Planning
4
Concurrency
  • Suppose I tell you that a plan P contains actions
    A1 A10, each with duration d1d10, then what is
    the makespan (execution duration) of P?
  • Makespan(P) gt max(d1d10)
  • If Makespan(P) Sum(d1d10), then it is a
    strictly serial plan
  • If Makespan(P) gt Sum(d1..d10), then there is
    idle-time in the plan
  • Actions dont need to start right after the
    preceding action
  • Think of the bank teller gossiping with his
    colleague in between servicing each customer
  • Planned idle/slack time may not always be a bad
    thingit can sometimes improve the robustness of
    the plan
  • Think of three travel plans involving connections
    in Minneapolis
  • Plan 1 schedules 5 min for connection time
    plan 2 schedules 1 hour plan 3 schedules 2 days.
    Which one is better (all else being equal).

5
Some Brand Names
  • Planners that can handle similar types of
    temporal and resource constraints
  • TLPlan, HSTS, IxTexT, Zeno, SAPA
  • TlPlan, SAPA are progression-based planners
  • HSTS,IxTET,Zeno are partial-order-based planners
  • TlPlan,HSTS are domain-customized planners the
    rest are domain independent
  • Planners that can handle a subset of constraints
  • Only temporal TGP, TPG, LPGP
  • Only resources LPSAT, GRT-R, Kautz-Walser
  • Subset of temporal and resource constraints TP4,
    Resource-IPP
  • LPGP and LPSAT are loosely-coupled systems.
    LPSAT connects SAT and LP solvers LPGP connects
    Graphplan and LPsolver
  • Issues of how tight is the loose-connection.
  • TGP,TPG,LPGP are Graphplan-based
  • LPSAT is based on SAT encodings being sent to LP
    solvers
  • Kautz-Walser is based solely on LP encodings

6
Approaches for MTP
  • In theory, pretty much every one of the
    approaches we saw for classical planning can be
    (and have been) extended to MTP (with varying
    degrees of scalability)
  • There are some interesting tradeoffs
  • PO planners are easiest to extend to support the
    concurrency needed for durative actions
  • Have harder time handling resources (because
    resource consumption depends on exactly what
    actions occurred before this time point)
  • Progression planners easiest to extend to support
    resource consuming actions
  • But harder time handling concurrency (need to
    consider advancing clock as a separate option
    in addition to applying one of the actions)

7
3/27 Our Road Map
  • Will focus on conjunctive planning
    approacheswith special attention to Sapa
  • action models
  • Using PDDL2.1 standard
  • how to model the search
  • Progression Regression PO planning
  • how to extract good heuristics

8
Action Representation
  • Durative with EA SA DA
  • Instantaneous effects e at time
  • te SA d, 0 ? d ? DA
  • Preconditions need to be true at the starting
    point, and protected during a period of time d, 0
    ? d ? DA
  • Action can consume or produce continuous amount
    of some resource

9
PDDL 2.1 (Level 2)Pure Durative Actions
(durative-action burn_match parameters
() duration ( ?duration 15) condition (and
(at start have_match) (at start
have_strikepad)) effect (and (at start
have_light) (at end (not
have_light)) ) )
(durative-action cross_cellar parameters
() duration ( ?duration 10) condition (and (at
start have_light) (over all
have_light) (at start at_steps)) effect
(and (at start (not at_steps)) (at
start crossing)(at end at_fuse_box) )
have_match, have strikepad
(dur 15)
have_light
have_light
have_light, at_steps
have_light
(dur 10)
at_steps, crossing
at_fuse_box
10
PDDL 2.1 Level 3Durative actions and numeric
quantities(but discrete effects)
The entire energy to be consumed is encumbered
at the very beginning (even though it gets
consumed Slowly over the full duration.
11
PDDL 2.1 Level 4Durative actions and numeric
quantities(with continuous effects )
12
Issues in modeling continuous change by discrete
vs. continuous effects
  • Consider the action of boiling a pan of water
  • The quantity temperature of water changes
    continuously over the duration of the action
  • We can ignore continuous effects by specifying
    that temperature is 1000 C at the end
  • Easy to handle can only access the temperature
    at the end of the action Reduces concurrency
    (what if we also put a blow torch to the pan to
    hasten the process?)
  • Or we can specify that the temperature of the
    water raises at a linear rate until it becomes
    100
  • Harder to handle but allows more concurrency
    (the total rate of increase is summation of all
    the individual rates of increase)

13
PDDL 2.1 StandardSummary
  • Durations
  • Static and dynamic durations allowed
  • Also allows duration inequalities
  • Preconditions
  • Can be at start or over all (throughout the
    duration)
  • Doesnt model preconditions being needed for
    arbitrary durations in the middle
  • Effects
  • Can be at start or at end
  • This makes effects discrete
  • Numeric quantities
  • Can be present in the preconditions or effects
  • Presence in the effects can be discrete (at
    start/at end) or continuous
  • Continuous change specified by giving a rate
    at which the quantity changes
  • Non-linear rate harder

14
State of the Art (as of IPC2002)
  • At IPC 2002 PDDL 2.1 standard had three levels
  • Level 1 STRIPS/ADL
  • Level 2 Durative Actions
  • FF, LPG, SAPA.
  • Level 3 Numeric quantities
  • discrete change
  • Sapa, LPG
  • Level 4 Continuous change
  • None at IPC
  • Some planners can handle it in theory but none
    are scalable

15
Problem Representation
  • Achievement Goals are specified as a list ltpi,tigt
    where pi needs to hold by time ti
  • ti is the deadline by which G must hold. It can
    be metric time (e.g. make clear(b) true by 2pm.)
  • If ti is omitted we will assume that G is a
    non-deadline goal (must be true by the time the
    plan is done.
  • Persist Goals are specified as a condition and
    an interval over which it must hold
  • A persist goal may be supported by different
    actions for the different parts of the duration (
    goal reduction a la ZENO)
  • E.g. striking multiple matches to have light over
    a duration

16
Plan representation
An executable plan must provide -- the actions
that need to be executed -- the start times
for each of the actions ? Or a set of
simple temporal constraints on the
set of actions (S.T.C. are generalization of
partial orders) E.g.
A14,5?A2 (means 4 lt
ST(A2) ST(A1) lt 5 )
Plan views Pert and Gantt charts GANTT Chart
is what is shown on the right PERT shows the
Causal links
17
(No Transcript)
18
Plan Quality Measures
  • Makespan Clock time for the execution of the
    plan (more concurrency ? lower makespan)
  • Slack The difference between the deadline for a
    goal and the time by which the plan achieves it
  • Tardiness is negative slack
  • Optimize max/min/average slack/tardiness measures
  • Cost Sum of costs of all the actions
  • Can be split into multiple dimensions, one
    corresponding to each resource

Can two plans with same make-span have different
slack measures?
Write a Comment
User Comments (0)
About PowerShow.com