1
Local and Global Structures Preserving Projection
  • Author Hao Cheng, Kien A Hua, and Khanh Vu
  • University of Central Florida
  • ICTAI 07

2
Overview
  • Introduction
  • Proposed Algorithm
  • Experiments
  • Conclusions

3
Introduction
  • Data usually reside in a high-dimensional space.
  • The intrinsic dimensionality of the data is much
    lower.
  • Manifold learning
  • finds a low-dimensional embedding of the raw
    data that preserves the intrinsic structures of
    the data well.
  • has recently become a popular research topic.

4
Related Work
  • Principal Component Analysis (PCA)
  • Locality Preserving Projection (LPP)
  • Many others

5
PCA
  • Principal Component Analysis (PCA)
  • PCA projects the data along the axes that
    exhibit the greatest variance.
  • PCA minimizes the distortion of all the pairwise
    distances of the data after the reduction.
  • PCA preserves the global structure of the data
    well.
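The projection described above can be sketched in a few lines of NumPy; the function name and variables are illustrative, not from the slides.

```python
import numpy as np

def pca(X, d):
    """Project the rows of X onto the d axes of greatest variance."""
    Xc = X - X.mean(axis=0)              # center the data
    cov = Xc.T @ Xc / (len(X) - 1)       # sample covariance matrix
    vals, vecs = np.linalg.eigh(cov)     # eigenvalues in ascending order
    axes = vecs[:, ::-1][:, :d]          # top-d principal axes
    return Xc @ axes                     # low-dimensional embedding
```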

6
LPP
  • Locality Preserving Projection (LPP)
  • LPP constructs a similarity matrix W.
  • If point i is among the K nearest neighbors of
    point j, then W(i,j) = W(j,i) = 1. Otherwise
    W(i,j) = 0.
  • W encodes local neighborhood information.
  • LPP finds a set of axes that minimize the
    weighted pairwise distances of the data (with
    weights given by W).
  • LPP preserves local neighborhoods well.
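The construction of W can be sketched as follows; a minimal NumPy version, where symmetrizing via the elementwise maximum is one common convention (an assumption here, not stated on the slide).

```python
import numpy as np

def knn_similarity(X, K):
    """Symmetric 0/1 similarity matrix: W[i, j] = 1 iff point i is among
    the K nearest neighbors of point j, or vice versa."""
    n = len(X)
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    W = np.zeros((n, n))
    for j in range(n):
        nbrs = np.argsort(dist[j])[1:K + 1]  # skip j itself at index 0
        W[nbrs, j] = 1
    return np.maximum(W, W.T)                # enforce W(i,j) = W(j,i)
```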

7
Nonlinear Methods
  • Both PCA and LPP are linear methods.
  • Nonlinear methods
  • ISOMAP, Locally Linear Embedding (LLE), Hessian
    LLE (HLLE), Local Tangent Space Alignment (LTSA),
    Diffusion Maps (DM).
  • Problems
  • Computationally intensive.
  • Do not scale well.
  • Performance is not very robust.

8
Motivation
  • PCA: global structure
  • LPP: local structure
  • Both global and local structures are important,
    and should be properly preserved!
  • Look at the toy examples.

9
Toy Example 1
  • Two classes of data

10
Toy Example 2
  • Two classes of data

Neither of them does well!
11
LGSPP
  • Local and Global Structures Preserving Projection
    (LGSPP)
  • Extracts local and global structures.
  • Derives an embedding that preserves these
    structures and minimizes distortion.

12
Local Structure
  • For each data point x,
  • S(x) is the set of points consisting of x itself
    and its Ks nearest neighbors (Ks is a system
    parameter).
  • S(x) is the local neighborhood around the point x.
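Building S(x) is a plain K-nearest-neighbor query; a minimal sketch, with illustrative names not taken from the slides:

```python
import numpy as np

def local_set(X, i, Ks):
    """S(x): indices of point i itself and its Ks nearest neighbors."""
    d = np.linalg.norm(X - X[i], axis=1)
    return np.argsort(d)[:Ks + 1]   # index 0 is i itself (distance 0)
```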

13
Global Structure
  • For each data point x,
  • D(x) is a set of Kd points that are far from
    point x and also far from each other (Kd is
    another system parameter).
  • For example

Blue dot: point x. Red/green dots: points in D(x).
Points in D(x) and point x come from different
dense regions.
14
Extraction Algorithm
  • Select a random sample set.
  • Pick the sample point farthest from point x;
    denote it d1.
  • Pick the sample point farthest from both x and
    d1; denote it d2.
  • Continue until Kd points have been found.
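The greedy farthest-point selection above can be sketched as follows; the sample size and all names are illustrative assumptions, and each pick maximizes its minimum distance to x and to the points chosen so far.

```python
import numpy as np

def extract_far_set(X, x, Kd, sample_size=50, seed=0):
    """Greedy farthest-point selection for D(x) from a random sample."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(X), size=min(sample_size, len(X)), replace=False)
    candidates = X[idx]
    chosen = [x]                    # distances are measured from x as well
    picks = []
    for _ in range(Kd):
        # minimum distance of each candidate to everything chosen so far
        d = np.min(
            [np.linalg.norm(candidates - c, axis=1) for c in chosen], axis=0)
        far = int(np.argmax(d))     # candidate farthest from all chosen
        picks.append(candidates[far])
        chosen.append(candidates[far])
    return np.array(picks)
```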

15
S(x) and D(x)
  • S(x): the local neighborhood of x.
  • D(x): point x and the points in D(x) are highly
    likely to come from different dense regions of
    the dataset.
  • Local and global structures
  • S(x) and D(x) for each point x.

16
Embedding
  • Goals of embedding
  • Keep the points in S(x) close to each other in
    the reduced space: minimize the pairwise
    distances within S(x).
  • Keep the points in D(x) far from those in S(x) in
    the reduced space: maximize the pairwise
    distances between S(x) and D(x).

17
Optimization
  • Find a set of projection axes p_i.
  • Equivalent to
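The slide's formulas were images and are not preserved in this transcript; a plausible form of the objective, consistent with the embedding goals stated earlier (the projection axis p and the summation structure are assumptions), is:

```latex
% Minimize distances within each S(x):
\min_{p}\ \sum_{x}\ \sum_{x_i,\,x_j \in S(x)}
    \bigl( p^{\top}x_i - p^{\top}x_j \bigr)^2
% while maximizing distances between S(x) and D(x):
\max_{p}\ \sum_{x}\ \sum_{x_i \in S(x),\; x_j \in D(x)}
    \bigl( p^{\top}x_i - p^{\top}x_j \bigr)^2
% Combined as a single ratio to maximize:
\max_{p}\
\frac{\sum_{x}\sum_{x_i \in S(x),\, x_j \in D(x)}
        (p^{\top}x_i - p^{\top}x_j)^2}
     {\sum_{x}\sum_{x_i,\,x_j \in S(x)}
        (p^{\top}x_i - p^{\top}x_j)^2}
```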

18
Rewrite
  • Equivalent to
  • Generalized Eigenvalue Problem.
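A generalized eigenvalue problem of this kind can be solved directly with SciPy; below is a minimal sketch in which the two scatter matrices A (between-structure) and B (within-structure) are random stand-ins, since the slide's own matrix definitions are not preserved in this transcript.

```python
import numpy as np
from scipy.linalg import eigh

# Illustrative symmetric stand-ins for the scatter matrices built from
# S(x) and D(x); A is positive semidefinite, B is positive definite.
rng = np.random.default_rng(1)
M = rng.normal(size=(5, 5))
A = M @ M.T                 # numerator scatter
B = M.T @ M + np.eye(5)     # denominator scatter

# Solve A p = lambda B p.  eigh returns eigenvalues in ascending order,
# so the last d columns are the axes maximizing the ratio.
vals, vecs = eigh(A, B)
P = vecs[:, -2:]            # top-2 projection axes
```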

19
Toy Examples Revisited
  • LGSPP

20
Synthetic datasets
  • 2-dimensional data.
  • A free variable ranges from -1 to 1.
  • 1st dimension
  • 2nd dimension

21
More datasets
  • LGSPP

22
Conclusions
  • LGSPP
  • Extracts local and global structures.
  • Computes a salient embedding.
  • LGSPP
  • Addresses the limitations of PCA and LPP.
  • Linear, fast, and robust.
  • Works well on both synthetic and real-world
    examples.

23
Questions?