Systmes MultiAgents - PowerPoint PPT Presentation

1 / 143
About This Presentation
Title:

Systmes MultiAgents

Description:

se rapprocher de la r alit d cisionnelle: ... a way that the expression or prevision of all possible cases becomes prohibitive. ... – PowerPoint PPT presentation

Number of Views:112
Avg rating:3.0/5.0
Slides: 144
Provided by: mattsu5
Category:

less

Transcript and Presenter's Notes

Title: Systmes MultiAgents


1
Systèmes Multi-Agents
  • Conception
  • Applications
  • J. Ferber, "Les systèmes multi agents",
    InterEditions, 1995
  • http//www-poleia.lip6.fr/drogoul/cours/links.htm
    l

2
Part I - Conception
  • 1. Motivations et origines
  • 2. Problèmes et definitions
  • 3. Le principe dinteraction
  • 4. Larchitecture blackboard

3
1. - Motivations et origines
  • Systèmes actuels unicité de l'expert trop
    souvent considérée
  • se rapprocher de la réalité décisionnelle
  • faire apparaître la multiplicité des experts et
    la multiplicité des relations entre experts
    (coopération, compétition, négotiation,)
  • du décideur individuel aux réseaux de décideurs
  • population d'agents autonomes en interaction
  • métaphore des organisations
  • on met l'accent sur l'interaction

4
1. - Motivations et origines
  • Première définition
  • SMA un système dans lesquels des agents
    artificiels opèrent collectivement et de façon
    décentralisée pour accomplir une tâche.
  • Ces entités peuvent être implantées sur un
    support physique ou logique (entités matérielles
    ou immatérielles)

5
1. - Motivations and origins
  • 1.1. Evolution of the theory of mind
  • 1.2. Limitation of classical AI
  • 1.3. Evolution of the computer programming
    paradigm

6
1.1. - Evolution of the theory of mind
  • Development in the 70th of the theory of mind
    which postulate that
  • Intelligence is relying on individual competences
    ability to interact with a physical and social
    environment (eg perceive and communicate)
  • Reasoning does not resume to applying an a priori
    fixed sequence of expert rules but rather imply a
    collection of concurrent, heterogeneous and
    dynamically evolving processes
  • Two simultaneous and complementary trends
    Minsky - The Society of Mind / Vygotsky - The
    Mind in Society

7
Minsky - The Society of Mind
  • A useful metaphor to think of intelligence is to
    consider a large system of experts or agencies
    that can be assembled together in various
    configurations to get things done
  • Minsky said,  ...each brain contains hundreds of
    different types of machines, interconnected in
    specific ways which predestine that brain to
    become a large, diverse society of partially
    specialized agencies 
  • Cognition is a distributed phenomenon
  • Minsky 85

8
Vygotsky - The Mind in Society
  • The mind in society the origins of individual
    psychological functions are social
  • Every high-level cognitive function appears
    twice first as an inter-psychological process
    and only later as an intra-psychological process
  • The new functional system inside the child is
    brought into existence in the interaction of the
    child with others (typically adults) and with
    artifacts
  • As a consequence of the experience of
    interactions with others, the child eventually
    may become able to create the functional system
    in the absence of the others
  • Vygotsky 78

9
The distributed cognition paradigm
  • Cognition is no more envisaged as a purely local
    and isolated information processing but rather
    considered as
  • Context-dependent
  • Temporally distributed past reasoning may
    influence current processings
  • Involving cooperation and communication with the
    physical and social environment
  • Dynamically evolving as the result of its
    processings and interactions
  • Hutchin 95

10
1.2. - Limitation of classical AI
  • Considering problems of increasing complexity
  • Problems that are physically and functionally
    distributed
  • Problems that involve heterogeneous data and
    expertise 
  • Problems in which data, information and knowledge
    is uncertain, incomplete and dynamically evolving
  • Problems that can not be tackled by global
    problem solving methods

11
Physical distribution
From Miksch 96
12
Physical distribution
13
Functional distribution
  • Task example patient monitoring
  • Sensing, interpretation and summarization of
    patient data 
  • Detection, diagnosis, and correction of critical
    situations
  • Construction, refinement and revision of
    short-term and long-term therapy plans
  • Control and supervision of monitoring devices
  • Explanation of observations, diagnoses,
    predictions, and therapies based on the
    underlying anatomy and physiology

14
(No Transcript)
15
Heteregoneus knowledge and expertise
  • Various type of knowledge
  • Clinical Knowledge of common problems, symptoms
    and treatments
  • Biological knowledge of anatomy, physiology and
    pathophysiology
  • Knowledge of fundamental physical models and
    fault conditions
  • Various types of expertise
  • Patient monitoring a team work involving
    members with complementary tasks and skills,
  • which is most often staffed with new or
    inexperienced physicians and nurses

16
Complexity requires a local view
  • Complex system behaviour often emerge as the
    dynamic interaction between
  • The system components
  • The system as a whole with the environment
  • The environment with the individual components
  • The resulting dynamics, at the system level, may
    influence the environment which in turn will
    influence the component dynamics
  • Even when a clean formulation is possible,
    analytical approaches often involves concurrent
    expansion of recursive functions

17
Complexity as dynamicity of interactions
System
Environment
Comp.2
Comp.1
Comp.N
18
Decentralization as an alternative view
  • An alternative to the classical approach based on
    a single monolithic system is the divide and
    conquer principle where a phenomenon is viewed as
    composed of a set of related and interacting
    sub-phenomena
  • The whole phenomenon is then described by several
    (hererogeneous) models accounting for its
    component behaviours, together with several
    (heterogeneous) models accounting for their
    interactions
  • Instead of designing a single  heavy 
    all-purpose system, this approach creates
     light , case-based, narrow-minded units that
    have clearly identified objectives and background
    information necessary to successfully achieve
    their objectives 

19
Decentralization as an alternative view
  • While in the first case the model of the whole
    phenomenon to be regulated is contained in a
    single unit, in the second case a number of
    partial models of the phenomenon are contained in
    several units
  • Each of these units can regulate just a single
    part of the entire phenomenon
  • A global view for the whole phenomenon simply
    emerges from the structured interaction of the
    partial units

20
Complexity to deal with complexity
  • Main advantages
  • A complex global model usually depends on several
    parameters that are difficult to identify and to
    measure 
  • Models with higher degree of approximation with
    respect to the real phenomenon may be derived,
    because the decomposition allows to develop
    sub-models for very specific contexts
  • Alternative sub-models may be employed for
    describing the same phenomenon (competitive
    models)
  • Since the sub-parts of the phenomenon may
    overlap, the actions that each unit undertake to
    regulate these sub-parts may conflict fusion
    and/or negotiation mechanisms are then required

21
1.3. - Evolution of the computer programming
paradigm
  • Toward more effective design and re-use
  • Looking for high specification levels
  • Looking for fault tolerant design
  • Looking for more expressive representation, more
    accurate operative perspective
  • Toward increased man-machine communication
    capabilities

22
(No Transcript)
23
Towards autonomous systems
  • Complexity increases in such a way that the
    expression or prevision of all possible cases
    becomes prohibitive.
  • There is a shift, from a compositionality
    hypothesis to an autonomy hypothesis of the
    system components
  • This suggests to design entities fitted with own
    laws, to augment their capacity of internal
    adaptation, and thus of autonomy and
    autoorganisation
  • Courant 94

24
2. - Issues and definitions
  • An agent is a computer system situated in some
    environment, that is capable of autonomous action
    in this environment in order to meet its design
    objectives
  • Autonomy the agent should be able to act
    without the direct intervention of humans (or
    other agents), and should have control over its
    own actions and internal state
  • Multi-agent system a set of agents interacting
    in the exploitation of a common environment,
    toward a common global goal

25
By definition
  • Multi-Agent Systems are such that
  • Each agent has incomplete information or
    capabilities to solve a problem
  • There is no global system control, nor any global
    view of the system given to any single agent
    (except the human one)
  • Computation is asynchronous
  • In addition, mobile agents may be designed, that
    have the ability to traverse a computer network
    accumulating information from several sites (eg
    online monitors, nurses reporting stations,
    patient records, doctors at remote locations)

26
Designing styles
  • Deux aspects à traiter
  • Aspects microscopiques (orientés agent)
  • comment construire un agent capable d'agir de
    manière autonome,
  • quelles sont ses représentations et ses
    comportements
  • Aspects macroscopiques (orientés système)
  • comment construire une organisation capable
    d'agir de manière coopérative
  • quels sont ses moyens de communication et de
    coordination

27
Designing styles
  • A multi-agent system may be
  • Open the set of agents is not predefined, new
    agents may be created on demand
  • Closed the set of agents is fixed in advance
  • Homogeneous all agents obey the same model
  • Heterogeneous agents fitted with different
    models, operating at various levels of grain, may
    co-exit
  • Hybrid human and non-human agents may
    collaborate  anonymously  to perform the task
    at hand

28
Agent models
Knowledge base
Control unit
cooperative planning layer
social models
local planning layer
mental models
Knowledge Abstraction
behaviour- based layer
world models
Perception
Action
Environnement
29
Agent Models Cognitive
  • 1. Contrôle
  • (buts, plans, tâches)
  • 2. Expertise du domaine
  • 3. Connaissances
  • sur soi-même
  • et sur les autres (croyances)
  • 4. Communications

30
Agent Models Réactive
  • 1. Contrôle
  • 2. Comportements
  • 3. Perception
  • 4. Reproduction

31
Agent behaviour
  • Do forever
  • Receive observation (percept)
  • Update internal model (beliefs)
  • Deliberate to form intentions
  • Use intentions to plan actions (means-end
    reasoning)
  • Execute plan
  • Two essential points
  • The agents have bounded resources (including
    time)
  • The world changes while deliberating, planning
    and executing and this can result in intentions
    and plans being invalidated

32
Agents as intentional systems
  • Predominant approach treat agents as intentional
    systems that may be understood by attributing to
    them mental states such as
  • The beliefs that agents have
  • The goals that agents will try to achieve
  • The actions that agents perform
  • The ongoing interaction

33
3. - The interaction principle
  • Interaction
  • Communication
  • Task allocation
  • Cooperation
  • Coordination of actions
  • Resolution of conflict

34
Modes de Communication
  • Communications directes (ou explicites)
  • l'échange direct est réalisé volontairement en
    direction d'un individu ou groupe d'individus
  • communication par partage d'informations
  • les agents lisent et déposent une information sur
    une zone de données commune (eg tableau noir)
  • communication par envoi de messages (notion de
    protocole)
  • communication point à point (téléphone)
  • communication par diffusion (broadcast)
  • Communications indirectes (ou implicites)
  • les agents laissent des traces (signaux) de leur
    présence ou de leur action qui sont perçues par
    d'autres agents
  • lenvironnement propage (et éventuellement
    déforme) les signaux déclenchés par la
    réalisation dune action cela entraîne des types
    d'échanges limités et permet de ne pas avoir à
    déterminer précisément le rôle de chaque individu
    dans le traitement collectif (ex les objets dans
    l'environnement émettent des signaux ou des
    champs de potentiels guidant les agents)

35
Commmunication types
Message passing
Message
Information sharing
Infor- mation
36
Messages et Acteurs
  • Le modèle acteur centré sur le principe du
    message
  • les acteurs sont réactifs, ils mettent en oeuvre
    un traitement en réponse à un message reçu d'un
    autre acteur, et sont capables d'envoyer des
    messages à d'autres acteurs.
  • Comportement
  • exécuter une action
  • envoyer un message à lui-même ou à d'autres
    acteurs
  • créer d'autres acteurs
  • spécifier un comportement de remplacement
  • Fonctionnement
  • à réception d'un message, vérifie si le message
    matche le comportement de l'acteur
  • si OK, exécute l'action correspondante
  • principe de continuation désigne l'acteur auquel
    envoyer le résultat du message
  • peut éventuellement déléguer à un autre (proxy)

37
Agent situé ou communiquant
  • Agent purement situé
  • l'environnement possède une métrique,
  • les agents sont situés à une position dans
    l'environnement qui détermine ce qu'ils
    perçoivent
  • ils peuvent se déplacer
  • il n'y a pas communications directes entre
    agents, elle se font via l'environnement
  • Agent purement communiquant
  • il n'y a pas d'environnement au sens physique du
    terme,
  • les agents n'ont pas d'ancrage physique,
  • ils communiquent via des informations qui
    circulent entre les agents

38
Situé ou Communiquant
  • Société de Fourmis
  • La résolution du problème s'inscrit dans
    l'environnement physique et dans l'organisation
    physique trouvée par les agents
  • Réseau de décideurs
  • la résolution du problème s'inscrit dans une
    structure conceptuelle et dans les modes de
    coopération enre agents

39
Agents Réactifs Situés (exemple)
  • Problème un ensemble de robots doivent trouver
    du minerai et le rapporter à la base

40
Agents Réactifs Situés (exemple)
  • Règle Explorer
  • si je ne porte rien et je ne perçois aucun
    minerai et je ne perçois aucune marque
  • alors j'explore de manière aléatoire
  • Règle SuivreMarque
  • si je ne porte rien et je ne perçois aucun
    minerai et je perçois une marque
  • alors je me dirige vers cette marque
  • Règle Trouver
  • si je ne porte rien et je perçois du minerai
  • alors je prends un échantillon de minerai
  • Règle Rapporter
  • si je porte du minerai et je ne suis pas à la
    base
  • alors retourner à la base et déposer une marque
  • Règle Déposer
  • si je porte du minerai et je suis à la base
  • alors déposer le minerai

41
Task allocation
  • Objectives
  • Decompose the problem into sub-problems
  • Allocate the tasks to agents, according to their
    competences and specialities
  • Re-organize during execution if necessary
  • Approach
  • Static the allocation is performed a priori by
    the system designer 
  • Dynamic the allocation is performed by the
    agents themselves (eg contract net)
  • Hybrid the initial allocation my be revised to
    account for changes in the environment (case of
    an open architecture in particular)

42
The Contract Net
  • Objective given a task to perform, allocate it
    to the  best  agent, knowing the task
    characteristics, its eventual realization
    constraints, and considering the agent potential
    and effective capabilities to succeed 
  • 3 main steps
  • Sending of a call for a task / reception of the
    proposals by the contacted agents
  • Selection of the best proposals / establishment
    of the contract(s) / reception of the result(s)
  • Selection / construction of final result

43
The Contract Net
44
Cooperation styles
  • Three cooperation styles may be distinguished
    Hoc 96
  • Confrontative cooperation a task is performed
    by agents with heteregoneous competencies or
    viewpoints, operating on the same data set the
    result is obtained by fusion the emphasis is on
    competence distribution
  • Augmentative cooperation a task is performed by
    agents with similar competencies or viewpoints,
    operating on disjoint subsets of data the
    result is obtained as a collection of partial
    results the emphasis is on data distribution
  • Integrative cooperation a task is decomposed
    into sub-tasks performed by agents operating in a
    coordinated way  the result is obtained upon
    execution completion the emphasis is on goal
    distribution

45
Confrontative cooperation
46
Augmentative cooperation
Agent 1
47
Integrative cooperation
48
Coordination of actions
  • How to plan and coordinate the actions of several
    agents in order to reach a common goal?
  • Two main modes
  • Planning (centralized or distributed)
  • Opportunistic problem solving

49
Planning
  • Centralized planning
  • A centralized manager distributes the plans to
    every agent, having the knowledge of their
    competences competencies in task decomposition
  • Easiest way to maintain consistency of problem
    solving but not too far from classical planning
  • Distributed planning
  • Each agent produces partial plans and communicate
    them to the other agents or to a mediator
  • Issues fuse/synchronize the plans in a
    consistent way avoid duplication of efforts
    conflicts dynamic planning?
  • Heavy communication load, high complexity

50
Opportunistic problem solving
  • The system  simply  chooses a next action at
    each step, as the one that will allow the best
    progress toward the solution, given the curent
    situation (ie the available data and the
    intermediate state of problem solvng)
  • Strongly data-directed, allow rapid refocusing
    (at each control cycle)
  • Implies some knowledge of action cost and utility

51
Resolution of conflicts
  • Several solutions
  • Authoritary a supervising agent has the
    authority and knowledge to take a decision
  • Mediation a mediator agent knows the various
    viewpoints and tries to solve the conflict
  • Negotiation the conflicting agents try to find
    a solution through several negotiation steps

52
The negotiation process
  • Main negotiation steps
  • 1. A makes a proposal
  • 2. B evaluates this proposal, determines the
    resulting satisfaction according to his own goals
  • 3. if B is satisfied, then STOP
  • otherwise B elaborates a counter-proposal based
    on his own goals and constraints
  • 4. Go to step 2 with A  and B roles exchanged

53
4. - The blackboard architecture
  • A group of human experts is working cooperatively
    to solve a problem, using a blackboard as the
    workplace to develop the solution
  • Problem solving starts when the problem and
    initial data are written on the blackboard
  • The experts watch the blackboard, looking for an
    opportunity to apply their expertise to the
    developing solution
  • When an expert finds sufficient information to
    make a contribution, he records the contribution
    on the blackboard, hopefully enabling other
    experts to apply their expertise
  • This process continues until the problem has been
    solved

54
The blackboard architecture
Level N
Solution
Hypotheses
Level 2
Level 1
Data
Blackboard
55
Knowledge sources


56
KS Knowledge Sources / Specialists
  • Each KS is a specialist at solving certain
    aspects of the overall problem the KSs are all
    independent once a KS finds the information it
    needs on the blackboard, it can proceed without
    any assistance from others
  • Additional KSs can be added, poorer performing
    KSs can be enhanced, and inappropriate KSs can be
    removed, without changing any other KSs
  • It does not matter whether a KS implements
    rule-based inferencing, a neural network,
    linear-programming, or a procedural simulation
    program. Each of these diverse approaches can
    make its contributions within the blackboard
    framework each KS is hidden from direct view,
    and seen as a black box from the outside

57
Organizing the BB
  • When the problem at hand is complex, there is a
    growing number of contributions made on the
    blackboard, so that quickly locating pertinent
    information may become a problem 
  • A common solution is to subdivide the blackboard
    into regions, each corresponding to a particular
    kind or level of information
  • Other criteria like information relevance,
    criticality or recency can be used

58
Event-based activation
  • The KS do not interact directly they  watch 
    the blackboard, looking for an opportunity to
    contribute to the solution
  • Such opportunities arise when an event occurs (a
    change is made to the blackboard) that match the
    KS condition part some specialists may also
    respond to external events, such as the ones
    produced by perceptual units
  • In practice, rather than having each KS scan the
    blackboard, each KS informs the system about the
    kind of events in which it is interested the
    system records this information and directly
    considers the KS for activation whenever that
    kind of event occurs

59
Incremental / opportunistic problem solving
  • Blackboard systems operate incrementally KSs
    contribute to the solution as appropriate,
    sometimes refining, sometimes contradicting, and
    sometimes initiating a new line of reasoning 
  • Blackboard systems are particularly effective
    when there are many steps toward the solution and
    many potential paths involving those steps
  • By opportunistically exploring the paths that are
    most effective in solving the particular problem,
    a blackboard system can significantly outperform
    a problem solver that uses a predetermined
    approach to generating a solution

60
Control
  • A control component that is separate from the
    individual KSs is responsible for managing the
    course of problem solving
  • The control component can be viewed as a
    specialist in directing problem solving, by
    considering the overall benefit of the
    contributions that would be made by triggered KSs
  • When the currently executing KS activation
    completes, the control component selects the most
    appropriate pending KS activation for execution

61
The agenda-based control mechanism
  • Every time a KS action is executed, the changes
    to the BB are described in terms of BB event
    types these event descriptions are passed to
    the BB monitor, which identifies the KSs that
    should be trigggered (the ones that declared
    interested in this type of event)
  • If a KS precondition is found to be satisfied,
    the KS is said to be activated and its action
    component placed in the agenda
  • All possible actions are placed onto the agenda
    on each cycle the actions are rated and the most
    highly rated is chosen for execution
  • In addition, focus decisions may be used to rate
    and schedule KS activation 

62
Focus of attention
  • In the simple blackboard model, the scheduler
    chooses a KS and then the KS executes using the
    context (BB elements) appropriate for it
  • In some systems, instead of directly choosing a
    KS, the scheduler chooses first a context
    (location in the BB) only then the KSs for
    which that context is appropriate are considered
    enabled and are executed
  • Control decisions thus operate on the condition
    and action parts of the KSs
  • Typically, the focus of attention will be an
    event chosen from the event list

63
BB control at a glance
Knowledge Sources

Events
KSIs
64
(No Transcript)
65
Adding a control blackboard
  • The BB1 blackboard framework introduced the
    notion of control blackboard. The key idea was to
    store the control strategy on the blackboard
    itself and to build KSs capable of modifying it.
    In this way the system could adapt its activity
    selection to better suit the current situation
  • Example control plan
  • Prefer KS whose actions occur at successive
    domain levels
  • Start with KS whose action occur at a given
    outcome level
  • Prefer KS triggered on recent problem-solving
    cycles
  • At this step, prefer this KS

66
Adding a control blackboard
  • The purpose is to build a control plan on the
    control BB the solution elements for this
    control problem are decisions about what actions
    are desirable, feasible, and actually performed
  • To this end, control KS exploit, generate and
    modify the solution elements placed on the
    control blackboard, under control of a scheduling
    mechanism
  • The potential activities of every domain and
    control KS are recorded on the same agenda, so
    that the most prioritary activity can be chosen
    by the scheduler

67
Systèmes Multi-Agents
  • Conception
  • Application

68
Part II - Application
  • Patient Monitoring
  • 1. Monitoring as a physically distributed problem
  • 2. Monitoring as a (distributed) cognition issue
  • 3. Monitoring as a negotiation problem

69
1. - Monitoring as a physically distributed
problem
  • Dimitrios G. Katehakis et al. A Distributed,
    Agent-Based Architecture for the Acquisition,
    Management, Archiving and Display of Real-Time
    Monitoring Data in the Intensive Care Unit,
    Technical Report FORTH-ICS / TR-261
  • http//www.ics.forth.gr/ICS/acti/cmi_hta/publicati
    ons/technical_reports/tr261/ICU.html

70
Intensive Care Unitsa physically distributed
environment
71
Intensive Care Unitsa physically distributed
environment
  • Many variables
  • Continuous measurements of electrocardiogram,
    central venous pressure, systemic arterial
    pressures, cardiac output, urine output,
    pulmonary arterial pressures, blood gases, and
    mixed venous saturation
  • Measurements made by the ventilator itself
    respiratory rate, tidal volume, peak inspiratory
    pressure, average airway pressure, spontaneous
    minute volume, lung mechanics, oxygen
    consumption, and metabolic rate
  • Many interaction / monitoring needs

72
(No Transcript)
73
(No Transcript)
74
System Architecture
  • Two types of agents the acquisition agent and
    the monitoring agent. Acquisition agents perform
    data acquisition and feed data to monitoring
    agents, who facilitate data visualization and
    storage

75
Agent s role
  • The acquisition agent collects data either from
    patient connected sensors or from clinical
    information systems
  • Acquired data are kept temporarily on a local
    data store, until they are transmitted to the
    appropriate monitoring agents
  • An acquisition agent may have a number of input
    and output channels, each of which can be
    dedicated to a different monitoring agent. The
    acquisition agent is therefore communicating with
    several monitoring agents simultaneously
  • The monitoring agents receive data, which are
    stored temporarily in a data repository and are
    visualized through a Graphical User Interface
    (GUI)

76
Introducing cognition agents
77
2. - Monitoring as a (distributed) cognitive issue
  • GUARDIAN -- A prototype intelligent agent for
    monitoring and therapeutics in intensive care
  • Barbara Hayes-Roth
  • Knowledge System, Laboratory at Stanford
    University
  • http//ai.eecs.umich.edu/cogarch2/authors/bhayes-r
    oth.html

78
Monitoring as a (distributed) cognitive issue
  • The perception - cognition - action problem
  • A compromise to be reached between the quality
    and rapidity of reasoning
  • Sacrifice the quality of a solution for one that
    meets the deadline
  • A quick action of less quality will push off the
    deadline far enough so that a quality solution
    can be found
  • Interleave reactive and cognitive behaviours

79
Architecture
  • Three independent sub-systems cognition,
    perception and action, which
  • Operate concurrently and asynchronously
  • Communicate through a globally accessable
    communication interface which asynchronously
    relays data among limited size I/O buffers
  • the system interact concurrently with subsets of
    the environment,
  • thereby increasing performance,
  • and reducing the overall complexity each
    subsystem must be able to deal with

80
Cognitive system
Perceptual input / Cognitive events
Action parameter/filters
Perceptual input/filters
Fast reflex arcs
Perception agents
Action agents
Perceptual input
Action parameter
Interactive displays
Sensors
Actuators
I/O
Patient, ventilator, human users,
81
Perception agent s role interpretation
  • The sensor agent s role is to acquire a given
    type of signals, transduce it into an internal
    representation, and holds the results in its I/O
    buffer
  • The perception agent s role is to retrieve the
    sensor agent information, to analyze and
    interpret it and transmit the results to the
    Communication Interface
  • Example peak inspiratory pressure
  • value  high 
  • trend  rising 
  • relevance to ongoing reasoning tasks  not
    relevant  
  • priority  high 

82
Perception agent s role focus of attention
  • The perception agent s retrieves the sensor
    agent information at some given rate, according
    to given filters
  • Rate and filter information is transmitted by the
    cognitive system, according to the current
    reasoning state
  • The system is therefore provided with focus of
    attention capabilities
  • The more important the data, the more often the
    system will see it
  • Conversely, as the buffer size is limited, if the
    cognitive system does not look at the buffer
    often enough, perceptual information may be lost

83
Action agent s role action
  • The action agent s role is to
  • Monitor its input buffer, retrieve intended
    actions, and translate them into executable
    programs of actuator commands
  • Control the execution of these programs by
    sending successive commands to its actuator at
    appropriate times 
  • Example action dynamically adjust the
    ventilator s settings
  • The action agents relieve the reasoning system of
    the computational burden of managing the
    low-level details of action execution

84
Action agent s role user interaction
  • The action agent s role is also to communicate
    with the external users, in order to
  • Recommend other interventions to correct
    diagnosed problems or avoid predicted problems
  • Give explanations about the system s current
    monitoring strategy, its reasoning about some
    particular problem, and the biological and
    physical phenomena underlying the patient s
    condition.

85
Coordination between perception, reasoning and
action
  • The perception, reasoning, and action systems
    work concurrently, in a continuous way
  • Information from the environment is perceived
    continuously, even while the system is engaged in
    computationally expensive reasoning tasks
  • The system is guaranteed to perceive any critical
    events that occur
  • Conversely, the system reasons continuously,
    regardless of the rate of incoming events. Thus,
    it is guaranteed to complete a critical reasoning
    task without interruption, unless it decides to
    attend to a more critical new event

86
Coordination between perception/reasoning and
action
  • Fast reflex reactions occur across
    perception-action arcs and allow perception to
    drive action directly
  • For example, Guardian might automatically sound
    an alarm and deliver a simple explanation
    whenever perceived values of key physiological
    parameters enter critical ranges
  • Comparatively slow cognitive reactions involve
    all three systems, with cognition mediating
    actions in response to perception

87
Cognitive system s role
  • The role of the cognitive system is twofold
  • To interpret perceived information from the
    environment, perform the needed reasoning tasks
    and decide what actions to perform (several
    reasoning agents)
  • To construct and modify dynamic control plans to
    coordinate the perception, reasoning, and action
    tasks (one control agent)
  • These tasks are performed by agents operating
    according to the blackboard model of control

88
Cognitive system s Architecture
89
Control agent
  • The control agent comprises 3 components that run
    sequentially
  • The agenda manager uses current perceptual and
    cognitive events and the current control plan to
    identify and rate possible reasoning operations
    it records them on the agenda buffer
  • The scheduler takes information from the agenda
    and uses the control plan to select the operation
    that best matches the current plan it records
    that operation on the next operation buffer it
    may also decide to interrupt the agenda
    management to give priority to critical
    operations
  • The executor executes the chosen operation,
    producing changes to the global memory new
    perceptual preprocessing parameters or intended
    actions in output buffers new reasoning results
    for ongoing tasks or new control decisions

90
Controller agent cycle
Scheduler
Agenda Manager
Executor
Reasoning Agents
Perception/Action Agents
91
Cognitive state
  • Holds the control information necessary to drive
    control it is comprised of three buffers
  • The event buffer holds current asynchronously
    arriving perceptual inputs and cognitive events
    produced by reasoning
  • The agenda holds currently executable reasoning
    operations - those whose trigger conditions are
    satisfied
  • The next operation holds the reasoning operation
    to be executed next

92
Cognitive state
  • All of these buffers have limited capacity, and
    are exploited according to two criteria
  • Best-first retrieval (items that score higher are
    retrieved earlier)
  • Worst-first overflow (items that score lower
    overflow earlier)
  • defined in terms of four orthogonal attributes
  • Relevance to Guardian's current reasoning
    activities
  • Importance with respect to Guardian's global
    objectives
  • Recency of entering the buffer
  • Urgency of processing the item in order to have
    the intended effect (e.g., meet deadlines)
  • If too many critical events occur simultaneously,
    they will overflow the buffers

93
The control plan
  • A control plan is a temporal pattern of control
    decisions, each describing a class of operations
    to be performed, under specified constraints,
    during some time period
  • It is used to focus the reasoning cycle given a
    strategy to complete the task at hand
  • This includes determining which actions have
    priority on the agenda, when the scheduler should
    interrupt, and what the perceptual filters should
    contain
  • The only changes to the control plan are those
    made by the control KS, and hence were determined
    necessary by the control plan and environment at
    that time

94
Cognitive systems Architecture
Control KS
Reasoning KS
Control agent
95
Example Control Plan
  • Example plan
  • Respond to critical events
  • Monitor all parameters for changes
  • D 10D 2D
  • Time
  • According to the first decision, Guardian decides
    to respond to critical events. With the second
    decision, it decides that the perceptual
    preprocessor should send new values for patient
    parameters only when their values change by a
    threshold percentage. These decisions remain in
    effect (with some changes in preprocessing
    threshold) throughout the given period of time

96
Example Control KS
  • Name Urgent-Reaction
  • Trigger Critical observation, O
  • Action Record control decision with
  • Prescription Quickly react to O
  • Criticality Criticality of O
  • Goal Diagnosed problems related to O are
    corrected
  • This operation is triggered and its parameter, O,
    is instantiated whenever the perception system
    delivers an observation with high criticality
    (such as high PIP - Peak Inspiratory Pressure)
  • When executed, it generates a control decision
    favoring  quick  reasoning operations that
     react to  O, and gives it the same criticality
    as O
  • The decision is deactivated when its goal is
    achieved, namely that all diagnosed problems
    related to O have been corrected.

97
Resulting control plan
  • Modified plan
  • Respond to critical events
  • Monitor all parameters for changes
  • D 10D 2D
  • Quickly react to high PIP
  • Time

98
Reasoning knowledge a multispecialist approach
  • Reasoning knowledge is distributed among several
    task-dependent specialists
  • Diagnosis of observed signs and symptoms
  • Prediction of patient condition
  • Causal inference of precursors and consequences
    of observations, problems, etc.
  • Explanation of underlying causal phenomena
  • Each of these tasks may be performed using
    associative or model-based reasoning methods

99
Associative methods
  • Associative methods use clinical knowledge, apply
    to familiar problems, and give simple
     answers , with minimal explanation
  • For example, Guardian responds to an observed
    rise in PIP by quickly diagnosing a
    hypoventilation problem and increasing the
    patient's ventilation
  • Having relieved the symptoms and extended the
    hard deadline, it acquires additional data to
    diagnose and correct the specific underlying
    problem (e.g., pneumothorax)

100
Model-based methods
  • Model-based methods use biological and
    first-principles knowledge, apply to familiar and
    unfamiliar problems, and give detailed
     answers  with informative explanations
  • For example, Guardian can give a
    pathophysiological explanation of its prediction
    that normal minute ventilation of a cold
    post-operative patient will result in low
    arterial partial pressure of CO2
  • The patient's low temperature leads to decreased
    metabolic activity in the cells, this results in
    decreased O2 consumption and decreased CO2
    production in the tissue compartment.

101
Model-based methods
  • Another example
  • Name Find-Generic-Causes
  • Trigger Observe condition C
  • where C exemplifies Generic-fault F
  • Action Find Generic-fault that can-cause F
  • Find-Generic-Causes is triggered when C is
    observed
  • Upon execution, the action is to look for
    generic-faults that  can-cause  F
  • By recording each such cause in the global
    memory, this operation creates internal events
    that trigger other reasoning operations

102
An illustrative scenario
  • A scenario illustrating the system capacity to
  • Manage moderately important, slowly evolving
    problems (e.g., low temperature and its
    consequences)
  • Manage time-critical problems (e.g., high PIP and
    the underlying pneumothorax)

103
A strategy to investigate a patient s low
temperature
  • The system is monitoring all patient parameters
    for value changes of a threshold percentage
  • It notices the patient s low temperature, a
    non-critical problem but worth investigating
  • It makes control decisions that instantiate an
    abstract strategy for investigating this type of
    problem
  • a) Diagnose the low temperature
  • b) Infer and correct immediate consequences
  • c) Predict changes
  • d) Infer and act to avoid expected consequences

104
  • a) Diagnose the low temperature
  • Attribute the low temperature to the patient s
    immediate post-operative status
  • b) Infer and correct immediate consequences
  • Infer that the patient's PaCO2 is currently low,
    due to the interaction between low temperature
    and normal breathing rate
  • c) Predict changes
  • Predict that the temperature will rise to high
    and then fall to normal over several hours
  • Predict that the PaCO2 will rise to high and fall
    to normal with temperature
  • d) Infer and act to avoid expected consequences
  • Decide to lower the breathing rate to correct the
    PaCO2
  • Plan a series of rate changes correlated with
    temperature to maintain the PaCO2 within an
    acceptable range

105
An unexpected event
  • In the course of this strategy, the system
    observes high, rising PIP, indicating a
    potentially life-threatening condition with a
    deadline for corrective action on the order of
    minutes
  • A control decision is made that instantiates an
    abstract strategy for correcting critical
    conditions as quickly as possible
  • Fast associative reasoning is favoured to
    diagnose and correct the problem

106
A strategy to correct critical conditions
  • a) Consider other patient data to diagnose the
    problem class, hypoventilation problem
  • b) Advise increasing ventilation so the patient
    will get enough oxygen
  • c) Request diagnostic actions auscultation of
    the chest for asymmetric breathing sounds and
    inspection of chest xrays
  • d) Diagnose the underlying problem, a
    pneumothorax
  • e) Advise insertion of a chest tube to relieve
    the pressure of accumulated air in the chest
    cavity
  • f) Predict and confirm the resulting drop in PIP
  • g) Advise reduction of the breathing rate as
    increased ventilation is no longer necessary
  • h) Request a lab test in twenty minutes to
    confirm that blood gases are normal

107
Discussion
  • Knowledge representation is complex, even for
    simple and well-kown situations it is difficult
    to ensure the order and time of execution of the
    system modules
  • There is a number of coefficients and variables
    to adjust
  • The basic functions of sensing, reasoning and
    acting are distributed among local agents
    sensing and acting may be engaged in a pure
    reactive way but as well be influenced by the
    reasoning process under development
  • The ratio of intra-agent computation to
    inter-agent com-munication is relatively high
  • Consistency is ensured by a global control plan
    influencing what the agents tackle and
    constraining their internal decisions

108
3. - Monitoring as a negotiation problem
  • Anthropic agency a multiagent system for
    physiological processes
  • Francesco Amigoni, Marco Dini, Nicola Gatti, and
    Marco Somalvico
  • Artificial Intelligence in Medicine Journal,
    Vol. 27, n3, 2003
  • Special issue  Software agents in health care 

109
The anthropic agency
  • Agency a multiagent system as a single machine
    composed of complex components the agents
  • Anthropic from the Greek anthropos, namely man
    the agency is employed to model the
    physiological processes of the human being
  • An example application the regulation of the
    glucose-insulin metabolism in diabetic patients,
    a process where partially overlapping models of
    glucose level regulation coexist

110
Diabetic pathology
  • Glucose is one of the bodys main sources of
    energy
  • The body regulates the processes that control the
    production and storage of glucose by secreting
    the endocrine hormone, insulin, from the
    pancreatic B-cells
  • Type 1 diabetes is characterized by a loss of
    pancreatic beta-cell (B-cell) function and an
    absolute insulin deficiency
  • Since insulin is the primary anabolic hormone
    that regulates blood glucose level, this results
    in the inability to maintain blood glucose
    concentrations within physiological limits

111
Diabetic pathology
  • A long time exposition to very high values of
    blood glucose concentration causes serious
    complications to other body organs
    (cardiovascular and renal system, retina)
  • Type 1 diabetics require a continuous supply of
    insulin for survival (multiple daily injections
    or a continuous subcutaneous insulin infusion
    guided by daily blood glucose measurements) in
    order to try and keep the glucose concentration
    under control
  • Many factors have to be considered to choose the
    current dose of insulin to inject amount of
    food, current glucose concentration value,
    general physical state

112
Diabetic pathology
  • In diabetes, there is an uncoupling of blood
    glucose levels and the concentration of insulin
    that prevents the proper regulation of glycemia.
    Instead of a narrow glycemic range, blood glucose
    deviations can extend from hypoglycemia into
    hyperglycemia

113
Diabetic pathology
  • The main problem in the diabetic pathology is the
    insulin response when the person eats because it
    is when the glucose concentration reaches the
    maximum value
  • Another issue is the effect of physical activity
    on the insulin level
  • There is a need to keep constant the glucose
    level to sustain the physical activity
  • Conversely, physical activity helps regulating
    the glucose level and keeping more sensitive to
    insulin, therefore being able to function with
    less insulin

114
Variation of the glucose level when eating
115
Purpose of a monitoring system
  • To constantly monitor the patient, eg analyze its
    current physiological state
  • To inject isulin when needed
  • To adjust the insulin amount in order to keep the
    glucose and insulin concentrations as close as
    possible to the concentrations of a normal person

116
System architecture
  • Three groups of agents working in an asynchronous
    way knowledge extraction, decision making, and
    plan generation
  • Several types of decisional agents with only
    partial views of the phenomenon to be controlled
    and different viewpoints (eg physiological
    models)
  • Presence of overlapping decisional models the
    input parameters as well as the output proposed
    decisions may intersect
  • A negotiation mechanism to fuse the corresponding
    decisions 

117
Agent s role
  • Knowledge extraction agent extract high-level
    information (parameter values) from low-level
    data received from sensors 
  • Decisional agents generate a set of decisions
    in terms of desirable new states ( corrected 
    values for the parameters to be monitored)
  • Actuator agents generate the sequence of
    actions to perform to reach the desired states
  • The agents communication is mediated by two
    dedicated blackboards the parameter blackboard
    and the knowledge blackboard

118
System architecture
Decision Making
Knowledge Extraction
Plan Generation



119
Extractor agents
  • An extractor agent
  • is connected to sensors,
  • from which it acquires signal information,
  • that it filters and processes,
  • to generate the values of a set of parameters
  • The parameters values generated by all the
    extractor agents are placed in the parameters
    blackboard.

120
Implemented extractor agent
  • The implemented extractor agent puts in the
    parameters blackboard a vector of parameter
    describing
  • The current level of insulin
  • The current level of glucose
  • The current variation of the glucose
  • The current level of the physical activity (as
    provided by piezoelectric crystal sensor for
    example)

121
Decisional agents
  • The decisional agents
  • Read the parameter information from the
    parameters blackboard 
  • Computes a  decision  as a pair (desired target
    value for a parameter, weight)
  • The computation of the desirable target value is
    based on the agent internal model, on the current
    values of parameters, and on the effects of past
    decisions
  • The weight is a measure of how much the current
    parameter value is away from optimum and, thus,
    of how much the decisional agent  wants  to
    reach the proposed target value for that
    parameter
  • Put the result (p,w) in the knowledge blackboard

122
Decisional agent s model
  • Each decisional agent embeds the model of a
    particular physiological process aspect
  • This model must provide a measure of how far the
    patient s state is from the optimum
  • The model must also account for the
    interdependencies between the patient s
    physiological state, pathological state and
    activity
  • Given a model, a set of desirable target states
    that minimize the distance to the optimum
    ( potential  values) is computed by means of an
    heuristic gradient descent technique

123
Decisional agent s model
  • Samples of the evolution over time of the levels
    of insulin, glucose and glucose variations are
    collected, given a pathology level (in terms of
    the insulin basal secretion level and glucose
    variation sensibility), a food absorbtion curve
    and a physical activity level
  • These curves are then sampled, thus providing the
    parameter values corresponding to different
    pathological states, in different life conditions
    these values are the indexes of the matrix
  • The matrix values (  potential  values), are
    computed for each set of indexes as the distance
    between the corresponding pathological parameter
    and the one of a normal person smaller values
    correspond to more desirable states

124
(No Transcript)
125
Decisional agents
  • Every parameter value of a proposed target point
    has an associated weight, which is the product of
    two factors
  • the difference between the potential of the
    current value and the potential of the proposed
    value
  • a weight measuring the  importance  of the
    physiological function controlled by the
    decisional agent
  • for example by setting the importance weights it
    is possible to give higher priority to the
    control of vital functions than to the control of
    peripheral functions

126
The negotiation mechanism at a glance
Decision Agent 1
Physical cost
Social cost
Agent decision
Actuator Agent 1
Equalizer
Target state
Agent decision
Social cost
Physical cost
Decision Agent 2
127
The implemented decisional agents (1)
  • The first decisional agent embeds a simplified
    control model of the glucose-insulin metabolism
    related to food adsorption
  • Its role is to compute desired values and weights
    for the level of insulin, level of glucose and
    variation of the glucose, based on their current
    values
  • This first decisional agent tries to reduce the
    glucose concentration during food adsorption

128
The implemented decisional agents (2)
  • The second decisional agent embeds a simplified
    control model of the glucose-insulin metabolism
    related to physical activity
  • Its role is to compute desired values and weights
    for the level of insulin, level of glucose and
    variation of the glucose based on their current
    values and the level of activity
  • This second decisional agent tries to keep
    constant the glucose level by limiting the
    exogenous insulin introduction when the physical
    activity is intense.

129
Agent s importance
  • The value of the importance is variable according
    to the current state of the system
  • The importance of the first decisional agent,
    which is related to the food absorbtion, is
    higher than that of the second decisional agent,
    which is related to physical activity
  • This reflects the fact that the main problem of a
    diabetic patient is the insulin response when the
    patient eats

130
The negotiation mechanism
  • Different decisional agents can propose different
    variations for the same parameters, generating
    conflicts
  • For example, the first decisional agent may
    propose to decrease the glucose level after a
    meal whilst the second may propose to keep
    constant the glucose level because the patient is
    undergoing an intense physical activity
  • In the proposed approach, the relations between
    agent s models are not explicitely considered
    and the decisional agents are not aware of the
    presence of the others
  • Necessity of an external negotiation mechanism
    (the equalizer, situated in the knowledge
    blackboard)

131
The negotiation mechanism
  • To make a final decision, the decisional agent
    has to select a single target state
  • For this purpose, it calculates the cost of the
    variation of each parameter from the current
    state to a target state, as the weighted sum of
    the measure variations
  • The weights are computed as the sum of two
    elementary costs 
  • The actuation cost ( physical  cost)
  • The negotiation cost ( social  cost)

132
The negotiation mechanism
  • The unitary cost of the parameter is the sum of
    two elements
  • The actuation cost is determined by the actuator
    agent that acts on the parameter it measures
    the cost of the physical variation of the
    parameter
  • The negotiation cost is determined by the
    equalizer component during the negotiation
    process it measures the difficul
Write a Comment
User Comments (0)
About PowerShow.com