SIMONE: Spoken Interaction for Mobile Networked Ecosystems - PowerPoint PPT Presentation

1 / 15
About This Presentation
Title:

SIMONE: Spoken Interaction for Mobile Networked Ecosystems

Description:

Find the pictures I took at Michelle's wedding ... Data collection platform. MIT Computer Science and Artificial Intelligence Laboratory ... – PowerPoint PPT presentation

Number of Views:48
Avg rating:3.0/5.0
Slides: 16
Provided by: JimG50
Category:

less

Transcript and Presenter's Notes

Title: SIMONE: Spoken Interaction for Mobile Networked Ecosystems


1
SIMONE Spoken Interaction for Mobile Networked
Ecosystems

NRC Cambridge MIT CSAIL Spoken Language
Systems October 2, 2007
2
The Premise
  • Small devices need speech
  • Current interfaces are challenged
  • Spoken language is natural and efficient

Cancel my Thursday meeting with Tom
  • Dialogue is the crucial element
  • Interaction is more than recognition
  • Understanding, dialogue and generation must be
    incorporated

Play another song by that group
Find the pictures I took at Michelles wedding
3
Project Summary
  • Spoken dialogue to simplify the mobile device
    interface
  • To structured information (e.g., calendar)
  • To loosely structured data (e.g., photos)
  • Technology requirements
  • Portability (e.g., applications, platforms)
  • Personalization (e.g., adapting to the user)
  • Flexibility (e.g., open-ended input/retrieval)
  • Multilinguality

Language Generation
Dialogue Planning
Speech Synthesis
Speech Recognition
Context Resolution
Language Understanding
4
Outline
Spoken Access to Applications
Content Annotation and Retrieval
Small Platforms
5
Spoken Access to Applications
  • Personalized vocabularies
  • Data collection platform
  • Portability developments

6
Personalized Vocabularies
Events
Dynamic Classes
Recognition
Contacts
Understanding
7
Example Dialogue
May 23
May 26

May 24
11-12
1-2
2-3
2-4
  • Spoken language technology capabilities
  • Speaker-independent speech understanding
  • Speech generation to support display
  • Dialogue support for complex queries
  • Confirmation sub-dialogues
  • Negotiation for conflict resolution
  • Support for anaphoric references (e.g., this
    meeting)

8
Content Annotation and Retrieval
  • Flexible understanding
  • Data collection platform

9
Speech-based Photo Tagging Retrieval
Julia with Pluto at Disney World
Creating
Finding
Show me the photo of Julia and Pluto at Disney
World from December of 2006.
10
Photo Tagger/Browser Architecture
Verbal Annotation
Speech Hypotheses
AnnotationRecognizer
Annotation Indexer
Photo plus meta-data (date taken, owner, etc.)
Term Index
Meta Data
List of Photos
Annotation Terms
Meta-Data Terms
Query Recognizer
Spoken Query
11
Small Platforms
  • Speech recognition
  • N800 infrastructure
  • Future plans

12
Small Platform Development
  • We are migrating our Galaxy spoken dialog
    components from x86 workstations to small devices
    such as the N800

Recognition
Understanding
Generation
Synthesis
Audio
Dialogue
XML-RPC
Galaxy Proxy
Galaxy Proxy
  • Current progress
  • Proxies on workstation and N800 support hybrid
    dialogue systems
  • Access to streamed audio for recording and
    playback on N800
  • Integrated small-platform speech recognizer
  • Other Galaxy messages accessible via event-based
    interface
  • Debian packages and Python wrappers support
    application development
  • Prototype Weather forecasts with local speech
    recognition

Demo
13
Small Platform Development Next Steps
  • Plan to port understanding and generation
    components
  • Leverage Nokia speech synthesis development
    effort if possible

Understanding
Generation
Recognition
Synthesis
Audio
Dialogue
XML-RPC
Galaxy Proxy
Galaxy Proxy
  • Understanding involves parsing and semantic frame
    creation

14
The Next Steps
  • Spoken dialogue is a viable modality for mobile
    devices
  • A natural and efficient means of communication
    for small devices
  • Many possible applications on either the local
    device or via network
  • Many resources needed to transfer technology
  • Technology development and multilingual support
  • User interface developers and application
    integrators

15
  • Thank You
Write a Comment
User Comments (0)
About PowerShow.com