Newton Method for the ICA Mixture Model - PowerPoint PPT Presentation

About This Presentation

Title:

Newton Method for the ICA Mixture Model

Description:

Basic Newton method ... Newton method converges 200 iterations, natural gradient fails to converge, ... Basic ICA Newton Method ... – PowerPoint PPT presentation

Number of Views:119

Avg rating:3.0/5.0

Slides: 22

Provided by: sccn

Learn more at: https://sccn.ucsd.edu

Category:

more less

Transcript and Presenter's Notes

Title: Newton Method for the ICA Mixture Model

1
Newton Method for theICA Mixture Model

Jason A. Palmer1 Scott Makeig1Ken
Kreutz-Delgado2 Bhaskar D. Rao2
1 Swartz Center for Computational Neuroscience2
Dept of Electrical and Computer
EngineeringUniversity of California San Diego,
La Jolla, CA

2
Introduction

Want to model sensor array data with multiple
independent sources ICA
Non-stationary source activity mixture model
Want the adaptation to be computationally
efficient Newton method

3
Outline

ICA mixture model
Basic Newton method
Positive definiteness of Hessian when model
source densities are true source densities
Newton for ICA mixture model
Example applications to analysis of EEG

4
ICA Mixture Modeltoy example

3 models in two dimensions, 500 points per model
Newton method converges lt 200 iterations, natural
gradient fails to converge, has difficulty on
poorly conditioned models

5
ICA Mixture Model

Want to model observations x(t), t 1,,N,
different models active at different times
Bayesian linear mixture model, h 1, . . . , M
Conditionally linear given the model,
Samples are modeled as independent in time

6
Source Density Mixture Model

Each source density mixture component has unknown
location, scale, and shape
Generalizes Gaussian mixture model,
more peaked, heavier tails

7
ICA Mixture ModelInvariances

The complete set of parameters to be estimated
is
h 1, . . ., M, i 1, . . ., n, j 1, . .
., m
Invariances W row norm/source density scale and
model centers/source density locations

8
Basic ICA Newton Method

Transform gradient (1st derivative) of cost
function using inverse Hessian (2nd derivative)
Cost function is data log likelihood
Gradient
Natural gradient (positive definite transform)

9
Newton Method Hessian

Take derivative of (i,j)th element of gradient
with respect to (k,l)th element of W
This defines a linear transform
In matrix form, this is

10
Newton Method Hessian

To invert rewrite the Hessian transformation
in terms of the source estimates
Define , ,
Want to solve linear equation

11
Newton Method Hessian

The Hessian transformation can be simplified
using source independence and zero mean
This leads to 2x2 block diagonal form

12
Newton Direction

Invert Hessian transformation, evaluate at
gradient
Leads to the following equations
Calculate the Newton direction

13
Positive Definiteness of Hessian

Conditions for positive definiteness
Always true for true when model source densities
match true densities
1)
2)
3)

14
Newton for ICA Mixture Model

Similar derivation applies to ICA mixture model

15
Convergence Rates

Convergence is really much faster than natural
gradient. Works with step size 1!
Need correct source density model

log likelihood
iteration
iteration
16
Segmentation of EEG experiment trials
3 models
4 models

trial
trial
time
time
log likelihood
log likelihood
iteration
iteration
17
Applications to EEGEpilepsy
1 model
5 models

log likelihood
time
time
log likelihood difference from single model
time
18
Conclusion

We applied method of Amari, Cardoso and Laheld,
to formulate a Newton method for the ICA mixture
model
Arbitrary source densities modeled with
non-gaussian source mixture model
Non-stationarity modeled with ICA mixture model
(multiple mixing matrices learned)
It works! Newton method is substantially faster
(superlinear). Also Newton can converge when
Natural Gradient fails

19
Code

There is Matlab code available!!
Generate toy mixture model data for testing
Full method implemented mixture sources, mixture
ICA, Newton
Extended version of paper in preparation, with
derivation of mixture model Newton updates
Download from
http//sccn.ucsd.edu/jason

20
Acknowledgements

Thanks to Scott Makeig, Howard Poizner, Julie
Onton, Ruey-Song Hwang, Rey Ramirez, Diane
Whitmer, and Allen Gruber for collecting and
consulting on EEG data
Thanks to Jerry Swartz for founding and providing
ongoing support the Swartz Center for
Computational Neuroscience
Thanks for your attention!

21
Newton for ICA Mixture Model

Write a Comment

User Comments (0)