Emotion Recognition - PowerPoint PPT Presentation

1 / 18
About This Presentation
Title:

Emotion Recognition

Description:

Formants & their Bandwidths. MFCC (Mel Frequency Cepstral Coefficients) Classifiers ... Max Formant. Happy. Min MFCC. Neutral. Excited. Decision tree( cont' ... – PowerPoint PPT presentation

Number of Views:5997
Avg rating:5.0/5.0
Slides: 19
Provided by: AIE
Category:

less

Transcript and Presenter's Notes

Title: Emotion Recognition


1
Emotion Recognition
  • Iris Bass
  • Thao Nguyen

Supervised by Dr. Ishwar K. Sethi
2
(No Transcript)
3
Objective
  • Human computer interaction
  • Voice Stream
  • Discriminate between emotions

4
Why do we need emotion recognition?
  • Interactive Voice Response (IVR)
  • Robotics
  • Computer/video games

5
Steps
6
Features
  • Pitch
  • Energy
  • Speaking Rate
  • Formants their Bandwidths
  • MFCC (Mel Frequency Cepstral Coefficients)

7
Classifiers
  • SVM
  • Decision Tree
  • Other Methods
  • - HMM
  • - KNN

8
Support Vector Machine
  • 1) Problem statement
  • Given l examples ( x1, y1),, ( xl, yl )
  • where xi is a feature vector of length d
  • and yi 1 , -1
  • Find a classifier with the decision function f(x
    ) such that y f( x )
  • where y is the class label for x

9
Support Vector Machine( cont)
  • 2) Separable case
  • Separating hyperplane
  • w.x b 0
  • Constraints
  • w.xi b 1
  • w.xi b -1
  • Margin
  • 2/ w

10
Support Vector Machine( cont)
  • 3) Nonlinear-separable case
  • Map the training data from RN to some higher
    dimensional space H
  • Use Kernel function
  • K( x, y ) ( x.y 1 )d
  • K( x, y ) exp( -((x y)2 )/ ( 2?2) )
  • K( x, y ) tanh( x.y - ? )

11
Decision Tree( ID3 )
12
Decision tree( cont )
  • Use entropy and information gain
  • Entropy( S ) ?-pilog2pi
  • Gain( S, A) Entropy(S)
  • ?(Sv/S).Entropy(Sv)


13
Wav Samples
Found some samples in the following websites
http//www.computing.dundee.ac.uk/staff/i
rmurray/hamlet.asp Phrase 1 - "This is not what I
expected Unemotion ( file hamlet01 ) Anger
( file hamlet02 ) Happiness
( file hamlet03 ) Sadness
( file hamlet04 ) Fear
( file hamlet05 ) Disgust
( file hamlet06 ) Phrase 2 - "You have asked me
that question so many times Unemotion
( file hamlet07 ) Anger (
file hamlet08 ) Happiness ( file
hamlet09 ) Sadness ( file
hamlet10 ) Fear ( file
hamlet11 ) Disgust ( file
hamlet12 )
14
Spider-Man( 2002)
DEMO
15
Future Work
  • Accuracy
  • Classification
  • Database
  • Gender

16
References
  • R. Kent, and C. Read, Acoustic Analysis of
    Speech Second Edition, Thomas Learning, 2002
  • Rabiner, L. and Schafer, R. Digital Processing of
    Speech Signals, Prentice-Hall, 1978
  • J. Deller, J. Proakis, and J.Hansen,
    Discrete-Time Processing of Speech Signals,
    Macmillian, 1993
  • T. Mitchell, Machine Learning, McGraw-Hill 1997
  • M. Schröder (2003). Emotional speech synthesis
    for emotionally-rich virtual worlds. Proc. of
    Workshop on emotionally rich virtual worlds with
    emotion synthesis at the 8th International
    Conference on 3D Web Technology (Web3D), 10.
    March 2003, St.Malo, France.
  • O.-W. Kwon, K. Chan, J. Hao, and T.-W. Lee,
    Emotion Recognition by speech Signals," Proc.
    EUROSPEECH 2003, Geneva, Switzerland, Sept. 2003.
  • Mingkun Li, Support Vector Machine, Intelligent
    Information Engineering Lab, Oakland University,
    2003
  • Dr. Ishwar K. Sethi, Language Identification for
    a Voice Stream Portizoration System, Intelligent
    Information Engineering Lab, Oakland University,
    2003
  • http//www.fbckenner.org/.../ images/future.gif
    future

17
References Continued
  • A. Nogueriras, A. Moren, A. Bonafonte, and J.
    Marino Speech Emotion Recognition Using Hidden
    Markov Models Proc. EUROSPEECH, Scandinavia,
    2001
  • Feng Yu,Eric Chang,Ying-Qing Xu, Heung-Yeung Shum
    . Emotion Detection from Speech to Enrich
    Multimedia Content. The Second IEEE Pacific-Rim
    Conference on Multimedia, Beijing,October 24-26,
    2001
  • S. Yacoub, S. Simske, X. Lin, J. Burns
    HPL-2003-136 Recognition of Emotions in
    Interactive Voice Response Systems -2003
  • http//www.asel.udel.edu/speech/tutorials/acoustic
    s/freq_domain.html Sound in the Frequency
    Domain 1996
  • http//www.asel.udel.edu/speech/tutorials/acoustic
    s/time_domain.html Sound in the Time Domain
    1996
  • http//mi.eng.cam.ac.ukkkc21/thesis_main.html
    The Formulation of Support Vector Machine 1998
  • www.robotoys.com/.../ st_prod.html?p_prodid151
    I Cybie

18
(No Transcript)
Write a Comment
User Comments (0)
About PowerShow.com