Learning Hierarchical Task Networks - PowerPoint PPT Presentation

About This Presentation
Title:

Learning Hierarchical Task Networks

Description:

A set of domain operators with known effects. A worked out problem solution that consists of ... Extend method to learn skills with negated goals and subgoals ... – PowerPoint PPT presentation

Number of Views:74
Avg rating:3.0/5.0
Slides: 21
Provided by: burs6
Learn more at: http://www.isle.org
Category:

less

Transcript and Presenter's Notes

Title: Learning Hierarchical Task Networks


1
Learning Hierarchical Task Networks by Analyzing
Expert Traces Pat Langley Tolga Konik Negin
Nejati Institute for the Study of Learning and
Expertise Palo Alto, California
2
Formulation of the Learning Task
  • Given
  • A set of domain operators with known effects
  • A worked out problem solution that consists of
  • The goal to be achieved in the problem
  • A sequence of operator instances that achieves
    the goal
  • A related sequence of intermediate problem states
  • Find A hierarchical task network that
  • Reproduces the solution to the training problem
  • Generalizes well to related problems in the
    domain

3
The ICARUS Architecture
Perceptual Buffer
Conceptual Memory
Belief Memory
Conceptual Inference
Perception
Environment
Skill Retrieval and Selection
Goal/Intention Memory
Skill Memory
Skill Execution
Skill Learning
Motor Buffer
4
Representing Long-Term Structures
ICARUS encodes two forms of general long-term
knowledge
  • Conceptual clauses A set of relational inference
    rules with perceived objects or defined concepts
    in their antecedents
  • Skill clauses A set of executable skills that
    specify
  • a head that indicates a goal the skill achieves
  • a single (typically defined) precondition
  • a set of ordered subgoals or actions for
    achieving the goal.

These define a specialized class of hierarchical
task networks in a syntax very similar to Nau et
al.s SHOP2 formalism. Beliefs, goals, and
intentions are instances of these structures.
5
Representing Concepts (Axioms)
((in-rightmost-lane ?self ?clane) percepts
((self ?self) (segment ?seg) (line ?clane
segment ?seg)) relations ((driving-well-in-se
gment ?self ?seg ?clane) (last-lane ?clane)
(not (lane-to-right ?clane ?anylane)))
) ((driving-well-in-segment ?self ?seg ?lane)
percepts ((self ?self) (segment ?seg) (line
?lane segment ?seg)) relations ((in-segment
?self ?seg) (in-lane ?self ?lane)
(aligned-with-lane-in-segment ?self ?seg ?lane)
(centered-in-lane ?self ?seg ?lane)
(steering-wheel-straight ?self)) ) ((in-lane
?self ?lane) percepts ((self ?self segment
?seg) (line ?lane segment ?seg dist ?dist))
tests ((gt ?dist -10) (lt ?dist 0)) )
Nonprimitive Concepts
Primitive Concepts
6
Representing Skills (Methods)
((in-rightmost-lane ?self ?line) percepts
((self ?self) (line ?line)) start ((last-lane
?line)) subgoals ((driving-well-in-segment
?self ?seg ?line)) ) ((driving-well-in-segment
?self ?seg ?line) percepts ((segment ?seg)
(line ?line) (self ?self)) start
((steering-wheel-straight ?self)) subgoals
((in-segment ?self ?seg) (centered-in-lane
?self ?seg ?line) (aligned-with-lane-in-segment
?self ?seg ?line) (steering-wheel-straight
?self)) ) ((in-segment ?self ?endsg)
percepts ((self ?self speed ?speed)
(intersection ?int cross ?cross) (segment
?endsg street ?cross angle ?angle)) start
((in-intersection-for-right-turn ?self ?int))
actions ((?steer 1)) )
Nonprimitive Skill Clauses
Primitive Skill Clauses
7
Hierarchical Structure of Memory
ICARUS organizes both concepts and skills in a
hierarchical manner.
Each concept is defined in terms of other
concepts and/or percepts. Each skill is defined
in terms of other skills, concepts, and percepts.
8
Hierarchical Structure of Memory
ICARUS interleaves its long-term memories for
concepts and skills.
For example, the skill highlighted here refers
directly to the highlighted concepts.
9
Basic ICARUS Processes
ICARUS matches patterns to recognize concepts and
select skills.
concepts
Concepts are matched bottom up, starting from
percepts. Skill paths are matched top down,
starting from intentions.
skills
10
Impasse-Driven Analytical Learning
Skill Hierarchy
Problem
Reactive Execution
?
Initial State
Goal
If Impasse
Effects of Primitive skills
Experts Primitive Skill Sequence

Analytical Learning
11
Learning HTNs by Trace Analysis
concepts
primitive skills
12
Learning HTNs by Trace Analysis
Skill Chaining
concepts
primitive skills
13
Learning HTNs by Trace Analysis
Concept Chaining
concepts
primitive skills
14
Constructing an Explanation
clear A
unstack B A
C
C
B
B
B
B
A
A
C
A
C
A
unstackable B A
hand-empty
clear B
on B A
unstack C B
putdown C
concepts
primitive skills
unstackable C B
putdownable C
15
From an Explanation to an HTN
clear A
unstack B A
C
B
B
B
A
A
C
A
C
unstackable B A
hand-empty
clear B
on B A
unstack C B
putdown C
concepts
primitive skills
unstackable C B
putdownable C
16
From an Explanation to an HTN
clear ?x
unstackable ?y ?x
unstack ?y ?x
hand-empty
clear ?y
on ?y ?x
putdownable ?z
unstackable ?z ?y
putdown ?z
unstack ?z ?y
concepts
primitive skills
17
Key Ideas of the Approach
  • Constrained form of hierarchical task networks
  • Each skill clause/method has a goal as its head
  • Each method has one (possibly defined)
    precondition
  • The resulting semi-lattice makes learning
    tractable
  • Learning involves analyzing the expert trace
  • Explanation draws on a form of goal regression
  • Each step in the explanation becomes an HTN
    method
  • Similar to explanation-based learning for
    planning but retains the explanation structure

18
Related Research
  • Nonincremental, knowledge-lean approaches
  • Behavioral cloning (Sammut, 1996 Urbancic
    Bratko 1994)
  • Relational induction from traces (e.g., Reddy
    Tadepalli, 1997)
  • Incremental, knowledge-intensive approaches
  • Explanation-based learning (e.g., Shavlik, 1989
    Mooney, 1990)
  • Derivational analogy (e.g., Veloso Carbonell,
    1993)
  • Programming by demonstration (e.g., Lau et al.,
    2003)

19
Plans for Future Research
  • Extend framework to use and learn partial-order
    skills
  • Augment approach to use known subtasks during
    learning
  • Extend method to learn skills with negated goals
    and subgoals
  • Modify approach to handle partially observable
    traces
  • Extend system to learn skills with uncertain
    outcomes

20
End of Presentation
Write a Comment
User Comments (0)
About PowerShow.com