The Future of Mobile - PowerPoint PPT Presentation

1 / 44
About This Presentation
Title:

The Future of Mobile

Description:

... ever try to find a neighborhood restaurant using a mobile browser... Activity-based design creates chains of services (BigTribe) or menus of related actions. ... – PowerPoint PPT presentation

Number of Views:296
Avg rating:3.0/5.0
Slides: 45
Provided by: JFC1
Category:

less

Transcript and Presenter's Notes

Title: The Future of Mobile


1
The Future of Mobile Applications John Canny
UCB EECS Marc Davis UCB School of Information
Yahoo! Research Berkeley
2
The Business
  • There are 6.5 billion people on earth - only
    about 1.2 billion in developed countries
  • They will buy 800 million mobile phones this
    year - one person in eight on the planet
  • Thats 4x PC or TV unit sales
  • Fraction of smartphones should reach 40 by
    2009 - most common computer

3
What kind of computer is it?
  • This years Smartphone (free with service
    contract)
  • 150-200 MHz ARM processor
  • 32 MB ram
  • 2 GB flash (not included)
  • Windows-98 PC that boots quickly!
  • Plus
  • Camera
  • AGPS (Qualcomm/Snaptrack)
  • DSP cores, OpenGL GPU
  • EV-DO (300 kb/s), Bluetooth

4
Whats Coming
  • In the past, the platform was driven by
    voicemessaging
  • Now the high end is driven by video, gaming,
    location,
  • The result is diversification of the platform,
    and sudden jumps in performance, e.g. Qualcomm
    has 4 platforms
  • Value platform (voice only)
  • 4. Convergence platform (MP3 player, gamer,
    camera,) several timesthe performance of
    todays high-end

PC
5
The Inevitable
  • In response to MITs 100 laptop, Microsoft last
    month proposed the cell phone computer for
    developing countries

Bollywoodon demand click here
6
Back to the future, which is
  • Using Context
  • Location, time, BT neighborhood,
  • Community
  • User History
  • Harnessing Content
  • Text, Images, Video Metadata
  • Speech Recognition
  • Computer Vision

7
Whats wrong today
  • Did you ever try to find a neighborhood
    restaurant using a mobile browser
  • and find it while you were in the same
    neighborhood?
  • In a car you might end up in the next county
  • Luckily a house stopped thisdriver before they
    got into serioustrouble.

8
Context-Awareness
  • Context-awareness is the holy grail for next
    generation mobile applications
  • Location (e.g., video store) heavily shapes
    the users likely actions.
  • The system can present streamlined choices
    here are your top-10 video suggestions with
    clickable previews.
  • For users this is very convenient.
  • For vendors,

9
Context-Awareness and Pro-Activity
  • Knowledge of user background and context provide
    great opportunities for pro-active services
  • Its 7pm and youre in San Francisco, would you
    like me to find a nearby restaurant?

10
Context-Awareness and Pro-Activity
  • Knowledge of user background and context provide
    great opportunities for pro-active services
  • Its 7pm and youre in San Francisco, there is
    a table available two blocks away at Aqua.
    Would you like me to book it?

11
Context-Awareness and Pro-Activity
  • Knowledge of user background and context provide
    great opportunities for pro-active services
  • Its 7pm and youre in San Francisco, there is
    a table available two blocks away at Aqua, and
    they have a special on Salmon in parchment for
    28. Would you like me to book a table, and
    order the special?

12
Context-Awareness and Recognition
  • Consider now a speech recognizing version of this
    application
  • Its 7pm and youre in San Francisco, there is
    a table available two blocks away at Aqua, and
    they have a special on Salmon in parchment for
    28. Would you like me to book a table, and
    order the special?
  • User Yes or No

13
Context-Awareness and Activity
  • Peoples actions are part of larger wholes called
    activities.
  • When you plan an evening out it may include
  • Going for coffee
  • Seeing a movie
  • Eating dinner
  • - planned and coordinated by the phone
  • Sharing photos on your cameraphone
  • Scoring your date in real-time
  • Its a social platform!

14
Activity-Based Design
  • Activity-based design creates chains of services
    (BigTribe) or menus of related actions. More
    examples
  • Planning a trip hotel, car, events
  • Going back to school housing, books etc.
  • Shopping for holiday gifts
  • Moving house
  • Hobbies Needlepoint,Monster truck racing
  • Some of these services exist, but activity
    analysis supports automatic discovery and
    customization of them.

15
Sociotechnical Systems
  • The problems of the internet are not purely
    technological
  • Need to conduct sociotechnical analysis and
    design of large scale internet systems and
    applications at the intersection of media,
    technology, and people
  • Leverage media metadata created by context-aware
    devices, content analysis, and communities

16
Signal-to-Symbol Problems
  • Semantic Gap
  • Gap between low-level signal analysis and
    high-level semantic descriptions
  • Vertical off-white rectangular blob on blue
    background does not equal Campanile at UC
    Berkeley

17
Signal-to-Symbol Problems
  • Sensory Gap
  • Gap between how an object appears and what it is
  • Different images of same object can appear
    dissimilar
  • Images of different objects can appear similar

18
Computer Vision and Context
  • You go out drinking with your friends
  • You get drunk
  • Really drunk
  • You get hit over the head and pass out
  • You are flown to a city in a country youve never
    been to with a language you dont understand and
    an alphabet you cant read
  • You wake up face down in a gutter with a terrible
    hangover
  • You have no idea where you are or how you got
    there
  • This is what its like to be most computer vision
    systemsthey have no context
  • Context is what enables us to understand what we
    see

19
Campanile Inspiration
20
MMM Mobile Media Metadata Idea
  • Leverage the spatio-temporal context and social
    community of media capture in mobile devices
  • Gather all automatically available information at
    the point of capture (time of capture, spatial
    location, collocated phone users, etc.)
  • Analyze contextual metadata and media to find
    similar media that has been captured before
  • Use patterns in previously captured
    media/metadata to infer the content, context, and
    community of newly captured media
  • Interact with users to augment system-supplied
    metadata for captured media

21
MMM Mobile Media Metadata Projects
  • Mobile Media Metadata
  • Davis, Canny, et al.
  • UC Berkeley

22
Context-Aware Face Recognition
23
Context-Aware Face Recognition
  • Face recognition alone - 43
    accurate(state of the art computer vision)
  • Context analysis alone - 50
    accurate(Face prediction from contextual data
    on the phone)
  • ContextContent analysis - 60 accurate




Figure 1. (Top) Subjects with frontal pose,
(Bottom) Same
24
Context-Aware Place Recognition
  • Image analysis alone - 30
    accurate
  • Context analysis alone - 55
    accurate
  • ContextContent analysis - 67 accurate

25
MMM2 Context to Community
26
Photo Share Guesser
27
Photo Level of Interest (LOI) Browser
28
PhotoCat Context-Aware Photo Browser
29
Technologies
  • We have been developing core technologies for
    context and content mining for the last 5 years
  • Accurate, scalable personalization (used in
    MMM2)
  • Algorithms for integration of personal and
    context information, and for activity
    discovery
  • Methods to preserve privacy while mining user
    location history and online behavior
  • Weve also worked with a company (BigTribe)
    through 3 funded NSF SBIRs, to migrate these
    ideas into products.

30
Harnessing Large, Mixed Content Sources
  • Early access to an XML Content-Base (Mark Logic
    CIS)
  • We built an efficient locationmetadata server
    from diverse well- and poorly-structured data
    sources.
  • Street data comes from XML Census data
    (Tiger/GML).
  • Restaurant data is from the Open Directory,
    partly structured.
  • Addresses converted to LAT/LONG by the database.
  • The map you see is producedentirely using XQuery
    (SVG).

31
Location Content-Base
  • A native XML engine supports efficient tree
    traversal.
  • The location C-B uses R-treeorganization as its
    XML schema
  • The result is that our software spatial
    database has the same efficiency as custom
    spatial databases.
  • i.e. not only are data types extensible,but also
    the types of query that areefficiently supported.

32
Context-Aware Design Glaze
  • Designing new context-aware apps works best in
    the wild. We are doing a participatory design
    experiment with 20 AGPS phones this spring.
  • Users carry the phones with them everywhere, be
    able to use some seed applications, and otherwise
    create their own micro-apps through noun-verb
    composition.

33
Perceptual Interfaces - Vision
  • We needed continuous mouse input for map
    browsing, so we developed TinyMotion, a software
    mouse for cameraphones.
  • By moving the camera against any background,
    real-time image motion estimation provides mouse
    coordinates.Also great for games demo in BID
    lab

34
Perceptual Interfaces - Vision
  • Cameraphones are capable of much more. Right now,
    the vision algorithms available include
  • Motion
  • Barcodes
  • OCR text (business cards etc.)
  • Coming soon
  • Face recognition
  • Building or streetscape recognition

35
Perceptual Interfaces - Speech
  • Speech recognition technology has improved
    steadily in the last ten years, particularly in
    noisy environments.
  • Speech was never a good match for office
    environments.
  • But the mobile playing field is completely
    different.
  • Mobile users often need their eyes and hands
    free, and the phone will always have a voice
    channel for telephony.

36
Speech on Mobile Phones
  • Restricted speech recognition is available on
    many phones.
  • Large-vocabulary recognition just appeared on
    cell phones last year (Samsung P207). Its a huge
    step. It enables the next generation of mobile
    speech-based apps
  • Message dictation
  • Web search
  • Address/business lookup
  • Natural command forms(no need to learn them)
  • Most of this technology was developed in the US
    by VoiceSignal Technologies.

37
Research in Mobile Speech
  • We are developing a state-of-the-art (continuous
    acoustic model) recognizer for SmartPhones.
  • The goals are
  • To provide an open platform for next-generation,
    speech-based interfaces on mobile devices.
  • To support integration of contextual knowledge in
    the recognizer.
  • To allow efficient exploration of the higher
    levels of dialog-based interfaces.

38
Speech for Developing Regions
  • Speech is an even more important tool in
    developing regions.
  • Literacy is low, and iconic (GUI) interfaces can
    be hard to use.
  • Unfortunately, IT cannot help most of these
    people because they lack even more basic skills
    fluency in a widely-spoken language like English
    or Mandarin.
  • This project focuses on teaching English in an
    ecologically appropriate way.
  • Speech-based phones are ideal for this.

39
Speech for Developing Regions
  • Speech (with headset) allows students to learn
    while working.
  • It leaves their eyes and hands free, and engages
    their minds during tedious, manual work.
  • Some game motifs
  • Safari hear sound say the name in English
  • Karoake in English
  • Listen and summarize BBC, cricket etc.
  • Treasure hunt leave LB clues in English
  • Adventure games dialog-driven scenarios

40
  • In Summary,The Future of Mobile is
  • Using Context
  • Harnessing Content
  • Context ? Proactivity
  • C/P ? Content sharing
  • Perceptual Speech/Vis
  • 5 Billion new users

41
Upcoming
  • Special issue of ACM Queue magazine on
    context-aware and perceptual interfaces (summer
    06?) JFC guest Ed.

42
Workshop on Mobile Applications
  • Planning an event on campus later this semester.
  • Send mail to jfc_at_cs.berkeley.edu if interested.

43
Class Presentations on Mobile Applications
  • Our User Interface class is developing mobile
    applications on Microsoft Smart Phones (thanks,
    MS!), demos in May.
  • Projects include
  • Craiglist/Friendster in your vicinity
  • Video store wizard
  • Bluetooth E-tickets for BART, parking,
  • Diet/Nutrition assistant
  • Send mail to jfc_at_cs.berkeley.edu if interested.

44
Demos and Posters Today
  • You can see the projects discussed here in the
    BID lab (Berkeley Institute of Design) open
    house, 2-4pm
  • Tinymotion camera mouse
  • Glaze location service design
  • Speech recognition for cell phones
  • English-Language learning with cell phones
  • a dozen other projects
  • The lab is in 354-360 Hearst Mining Bldg.

45
Acknowledgements
  • Thanks to the sponsors of this work
  • and..
Write a Comment
User Comments (0)
About PowerShow.com