Title: On%20the%20road%20to%20the%20creation%20of%20situation-adaptive%20dialogue%20managers
1On the road to the creation of situation-adaptive
dialogue managers
- Ajay Juneja
- akj_at_andrew.cmu.edu
- 11-716 Dialog Seminar
2Papers a diverse selection
- Design of the VICO Spoken Dialogue System
Evaluation of user Expectations by Wizard of Oz
Experiments (Petra Geutner, Frank Steffens, and
Dietrich Manstetten, LREC 2002) - An Automobile-Integrated System for Assessing and
Reacting to Driver Cognitive Load (Pompei,
Sharon, Buckley, and Kemp, 2002) - We are not amused but how do you know? User
States in a multi-modal dialogue system
(Batliner, ZeiBler, Frank, Adelhardt, Shi, Noth,
Eurospeech 2003)
3VICO overview
- Goal to evaluate the use of a Natural-Language
dialogue system in an automotive driving
simulation - How do drivers interact with the NLP system?
- What are the users reactions to such a system?
- What are the distraction effects on the driving
behavior? - VICO operates in a similar manner to Ariadne.
Petra Geutner used to be part of the Interactive
Systems Lab.
4VICO Driver Interactions
- Some interactions were initiated by the dialogue
manager, some were initiated by the user. - Dialogues ranged from 20-200 seconds in duration.
- Were the pre-defined system prompts enough?
According the results, they experienced much less
variation in what people would say than they
expected. As a result, they seemed to have very
promising results. - This is contrary to what General Motors feels
will be the case in NLP dialogue system design.
5VICO User Reactions
- 9/10 people experienced an overall pleasant
reaction to VICO. - Problems experienced in understanding the speech
output from VICO (Speech synthesis) - VICO overloaded some drivers with too much
information at once, one person felt VICO talked
to fast.
6VICO distraction effects
- People tended to slow down when using VICO as the
most common side effect. - Drifting lanes was not common.
- No accidents happened.
7Pompei SharonReaction to Driver Cognitive Load
- Utilized the following sensors
- Cameras
- GPS
- accelerometers
- grip sensors
- foot-position sensors
- ultrasonic sensors on the bumpers
- microphones
- seat sensors
- cup holder sensors
8Pompei SharonReaction to Driver Cognitive Load
- Also used Blue Eyes gaze tracking system (from
IBM Almaden Research Center) - They DID NOT use a telematics or navigation
system in the car, as they wanted to test the
complexity of the typically owned vehicles as a
baseline
9Pompei SharonReaction to Driver Cognitive Load
- Goals
- Monitor driver stress levels Anger is
associated with crashes. - Where the drivers gaze and attention are. Will
be utilized to determine if the driver is looking
at the road or not. - Force the drivers attention to a particular
device with the use of LEDs. - Improve someones driving habits.
- Limit the audibility of a cell phone or
telematics system messages to just the driver. - Warn the driver when appropriate
- Busy button to let the user tell the system
that they do not want to be disturbed by the cell
phone, telematics system, etc.
10Pompei SharonReaction to Driver Cognitive Load
- They havent yet done user studies or examined
all of the data yet.
11We are not amused but how do you know?
- Goal to examine emotional states in the context
of dialogue systems. - Used SmartKom dialogue system with gesture and
facial expression recognition. - What prosodic features are relevant to
classifying user emotional state?
12We are not amused but how do you know?
- Checked for word boundaries by using fixed
alignment. - Studied both holistic user states (Speech,
Gestures, Facial Expression) and just facial
expressions. - Marked significant deviations from neutral.
13(No Transcript)
14We are not amused but how do you know?
- Results
- Prosodic classification doesnt work so well, and
parts of speech dont help so much. - Much confusion between user states of angry and
helplessness. - They havent classified the facial data yet.
15We are not amused but how do you know?
- Characterizations of User States (audio only)
- Joyful is characterized by lower energy level and
less (duration/F0) variation - Helpless has more pauses and longer durations
- Angry has a higher energy level and less energy
variation
16Diverse topics, Where do we go from here?
- VICO shows that NLP does have a place within the
car over a command and control dialog.
Distraction caused by an NLP dialogue system
appears to be minor in their opinion - Pompei and Sharon show us a very interesting
control system set up within a car to monitor a
drivers behavior and have a great framework for
distraction studies. - Batliner, et. al show us that it is extremely
hard to gather information on a users state from
a dialogue manager alone.
17Where do we go from here?
- Within an automotive setting, utilize control
systems akin to what Pompei and Sharon have done,
and integrate them into a dialogue manager. - Have the dialogue manager adapt to different user
states as monitored by outside data, not just
emotional state as determine by tonal
characteristics in ones speaking behavior. - Toyota in another paper suggested that throttle
control was the best measure of someones
distraction level.