Spectral LPM: An Optimal Locality-Preserving Mapping using the Spectral (not Fractal) Order - PowerPoint PPT Presentation

About This Presentation

Title:

Spectral LPM: An Optimal Locality-Preserving Mapping using the Spectral (not Fractal) Order

Description:

Model the set of multi-dimensional points P as a graph G(V,E) such that each ... For each i=1 n, assign the value xi to vi, and hence to Pi. ... – PowerPoint PPT presentation

Number of Views:46

Avg rating:3.0/5.0

Slides: 26

Provided by: mok94

Learn more at: https://raidlab.cs.purdue.edu

Category:

more less

Transcript and Presenter's Notes

Title: Spectral LPM: An Optimal Locality-Preserving Mapping using the Spectral (not Fractal) Order

1
Spectral LPM An Optimal Locality-Preserving
Mapping using the Spectral (not Fractal) Order

Mohamed F. Mokbel
Walid G. Aref
Ananth Grama

2
Locality-Preserving Mapping

Multi-dimensional data is more difficult to
process than one-dimensional data.
A mapping function f is required to map the
multi-dimensional space into the one dimensional
space.
Locality-Preservation is a desirable property for
the mapping function f.
Mapping data from the multi-dimensional space
into the one-dimensional space is considered
locality-preserving if the points that are nearby
in the multi-dimensional space are nearby in the
one-dimensional space.

3
Applications of Locality-Preserving Mapping

Range Queries, nearest-neighbor queries
Multi-dimensional spatial join.
Multi-dimensional indexing.
R-Tree Packing
Spatial Access Method.
Declustering
Memory Management
GIS
Disk Scheduling
Traveling Salesman problem
Parallel processing.

4
Distance-Preserving or Locality-Preserving

An optimal Distance-Preserving mapping algorithm
maps the multi-dimensional space into the
one-dimensional space such that the distance
between each pair of points in the
multi-dimensional space is preserved in the
one-dimensional space.
A point P in the D-dimensional space has 2D
neighbors with Manhattan distance M1. Mapping P
and its neighbors into the one-dimensional space
allows only two neighbors to have M1. Thus, the
distance between 2(D-1) of the points cannot be
preserved.
An optimal Distance-Preserving mapping is
infeasible, what about Locality-Preserving
mapping ?

5
The Good Mapping

Divide the 2D neighbors into two equal groups.
Map the first group to the left of P, and the
second group to the right of P.
The same algorithm is applied for points with
Manhattan distance M gt1
An optimal Locality-Preserving mapping with
respect to P.
What about Q and R..?

6
The Bad Mapping (Fractals)

Fractals divide the space into a number of
fragments.
Once a fractal starts to visit points from a
certain fragment, no other fragment is visited
until the current one is completely exhausted.
Fractals perform a local optimization based on
the current fragment.
Boundary Effect problem. Two points Pi and Pj lie
on the boundaries of two different fragments and
Pi-Pj1. However, Pi and Pj will be very far
from each other in the one-dimensional space.

7
How bad is the Fractals ?
Peano Gray Hilbert
P1-P2 6 5 11
P3-P4 22 47 43

Things become even worse with the increase of the
grid size

8
Can we avoid the Boundary Effect in Fractals?

NO, the boundary problem is a property of the
fractals Man77.
Only two attempts to avoid the boundary effect
SZM98 uses different space-filling curves for
the same data. If two points lie on the boundary
of one space-filling curve, they will not be in
the boundary of the other space-filling curve.
LLL01 uses multiple shifted copies of the
Hilbert SFC. If two points are in the boundary of
one copy of the Hilbert SFC, they will not be in
the boundary of another copy.
The main idea is
By using more than one SFC, we can hopefully get
better results

9
The Optimal Mapping (Spectral LPM)

Unlike Fractals, Spectral LPM achieves global
optimization where all multi-dimensional points
are taken into account when performing the
mapping.
Spectral LPM does not favor any set of points
over the others.
Spectral LPM is proved to be globally optimal
with respect to all multi-dimensional points.

10
Overview of the Spectral Algorithms

Spectral algorithms are based on the spectral
theory which relates matrix to its eigenvalues
and eigenvectors.
A milestone in Spectral algorithms is due to
Fiedler Fie75 who proposed using the
eigenvalues and eigenvectors of the Laplacian
matrix L(G) instead of the Adjacency matrix A(G).
Spectral Algorithms have been widely used in
Graph Partitioning
Data Clustering
Linear labeling of a graph.
Up to the authors knowledge, the use of spectral
mapping to support similarity search queries is a
novel application

11
The Spectral LPM Algorithm

Algorithm Spectral Locality-Preserving Mapping
(Spectral LPM)
Input P, a set of multi-dimensional points.
Output S, a linear order of the set P.
Model the set of multi-dimensional points P as a
graph G(V,E) such that each point Pi?P is
represented by a vertex vi?V, and there is an
edge (vi,vj) ?E iff Pi-Pj1.
Compute the graph Laplacian matrix L(G)
D(G)-A(G).
Compute the second smallest eigenvalue ?2 and its
corresponding eigenvector X2 of L(G).
For each i1?n, assign the value xi to vi, and
hence to Pi.
The linear order S of P is the order of the
assigned values of Pis.
Return S
End

12
Example of The Spectral Mapping
13
The Optimality of the Spectral Mapping

Definition
A vector X (x1,x2,,xn) that represents the n
one-dimensional values of n multi-dimensional
points represented as a graph G(V,E) is
considered to provide the global optimal
locality-preserving mapping from the
multi-dimensional space into the one-dimensional
space if X satisfies the following optimization
problem

14
The Optimality of the Spectral Mapping

The optimization problem in the optimality
definition is equivalent to

15
The Optimality of the Spectral Mapping

Theorem Fiedler, 1973
The solution of the optimization problem
is the second smallest eigenvalue ?2 and its
corresponding eigenvector X2

16
Extensibility of the Spectral LPM

Spectral LPM can change the way of constructing
the graph G.
Spectral LPM can incorporate any number of
additional constraints

17
Extensibility of the Spectral LPM

Spectral LPM can model the multi-dimensional
points as a weighted graph. The weight w of an
edge e(v1,v2) represents the priority of mapping
v1 and v2 to nearby locations in the
one-dimensional space. In this case the objective
function will be
The proof of optimality of the Spectral LPM is
valid regardless of the graph type. Spectral LPM
is optimal for the chosen graph type.

18
Experimental Results

If the Manhattan distance between any two points
Pi, Pj in the multi-dimensional space is MD, then
what is the Manhattan distance M1 between the
same two points in the one-dimensional space? The
lower M1 the better the locality-preserving
mapping.
For any multi-dimensional range query, what is
the difference between the minimum and the
maximum one-dimensional values of the points that
lie inside the range query? The smaller the
difference the better the locality-preserving
mapping.
The answers of these questions should have
Bounded worst-case
Dimension Independence
Location Independence

19
Bounded Worst-Case (Nearest-Neighbor Queries)
20
Bounded Worst-Case (Range Queries)
21
Dimension Independence
22
Dimension Independence (Range Queries)
23
Location Independence
24
Summary

The need of a locality-preserving mapping from
the multi-dimension space into the
one-dimensional space is needed for a variety of
applications.
We argue against the use of fractals as a basis
for locality-preserving mapping.
The Spectral LPM algorithm is proposed as an
optimal locality-preserving mapping that depends
on the spectral properties of the
multi-dimensional points.
The optimality proof of the Spectral LPM
algorithm is provided.
Unlike Fractals, the Spectral LPM algorithm can
be extended in several ways.
We provide experimental evidence that Spectral
LPM is superior to the fractal-based algorithms

25
References

Man77 B. Mandelbrot. Fractal Geometry of
Nature. W. H. Freeman, NY, 1977.
LLL01 S. Liao, M. Lopez, and S. Leutenegger.
High Dimensional Similarity search with
space-filling curves., ICDE 2001.
SZM98 J. Shepherd, X. Zhu, and N. Megiddo. A
fast indexing method for multidimensional nearest
neighbor search. Proc.of SPIE, Storage and
Retrieval for Image and Video Databases, 1998.
Ste73 L.A. Steen. Highlights in the history of
spectral theory. American Mathematically Monthly,
80(4), 1973.
Fie73 M. Fiedler. Algebraic Connectivity of
Graphs. Czechoslovak Mathematical Journal 23(98),
1973.
Fie75 M. Fiedler. A property of eigenvectors of
nonnegative symmetric matrices and its
application to graph theory. Czechoslovak
Mathematical Journal 25(100), 1975.