Nonlinear Dimensionality Reduction with Fuzzy Integral and Applications - PowerPoint PPT Presentation

1 / 19
About This Presentation
Title:

Nonlinear Dimensionality Reduction with Fuzzy Integral and Applications

Description:

... Dimensionality Reduction with Fuzzy Integral and Applications. Speaker: Wang ... The Choquet integral is based on linear operators to deal ... integral to ... – PowerPoint PPT presentation

Number of Views:127
Avg rating:3.0/5.0
Slides: 20
Provided by: wangji
Category:

less

Transcript and Presenter's Notes

Title: Nonlinear Dimensionality Reduction with Fuzzy Integral and Applications


1
Nonlinear Dimensionality Reduction with Fuzzy
Integral and Applications
Part B Research Proposal
  • Speaker Wang JinFeng
  • Supervisors Prof. Leung KwongSak, Prof. Lee
    KinHong
  • Date 2006.12.07

2
Outline
  • Introduction about dimensionality reduction
  • Basic Concepts
  • Fuzzy measure
  • Fuzzy integral
  • Condition matrix
  • Main algorithms and methods
  • Nonlinear regression with FI
  • Condition matrix to learn parameters in FI
  • Projection by FI to reduce dimension
  • Applications

3
Introduction about dimensionality reduction
  • Dimensionality reduction with minimal information
    loss is important for feature extraction, compact
    coding and computational efficiency.
  • Classical methods
  • Principle component analysis (PCA)
  • ---to identfy the dependence structure behind a
    multivariate stochastic observation in order to
    obtain a compact description of it.
  • Multidimensional scaling (MDS)
  • ---to detect meaningful underlying dimensions
    that allow the researcher to explain observed
    similarities or dissimilarities (distances)
    between the investigated objects.
  • Locally linear embedding (LLE)
  • ---to compute low-dimensional,
    neighborhood-preserving embeddings of
    highdimensional inputs
  • ---be able to learn the global structure of
    nonlinear manifolds, such as imagines of faces or
    documents of text

4
Basic Concepts
  • 1. Fuzzy measure
  • Generalization of probability measures replacing
    the additive property by the monotonic property
    with respect to set inclusion.
  • Let X be a non-empty finite set and P(X) the
    power set of X.
  • Definition 1. A set function µ P(X)? 0,1 is a
    fuzzy measure if
  • (1)

(2)
(3)
Probability, possibility, belief and plausibility
measures are all special cases of fuzzy measure.
5
Basic Concepts-contd
  • 2. Fuzzy integral
  • ?Sugeno integral
  • Definition 2. Let µ be a fuzzy measure on X. The
    Sugeno integral of a function f X?0,1, with
    respect to µ is defined by

6
Basic Concepts-contd
  • 2. Fuzzy integral
  • ? Choquet integral
  • Definition 1. Let be a fuzzy measure on A.
    The Choquet integral of a function
    with respect to is defined by

indicates that the indices have been permuted so
that
where
The Choquet integral is based on linear operators
to deal with nonlinear space. It is different
with The Sugeno integral which is based on
nonlinear operators.
7
Basic Concepts-contd
  • 3. Condition matrix based Genetic Programming
  • --- An extension of the instruction matrix for
    generating rule base from datasets.
  • --- CM keeps some of characteristics of IM and
    incorporates the information about attributes of
    dataset. In the evolving process,
  • --- Adopt an elitist idea to keep the better
    individuals alive to the end.
  • ---This algorithm has been applied to Rule
    Learning successfully 8.

8
Main algorithms and methods A new nonlinear
multi-regression model
  • Example Let (x1, x2, , xn) be predictive
    attributes and y be the objective attribute.
    Denote Xx1, x2, , xn. The data have a form as
    following.
  • x1 x2 xn y
  • f11 f12 f1n y1
  • f21 f22 f2n y2
  • fm1 fm2 fmn ym
  • fij the observation of attributes
  • yi the value of classes
  • m the size of the data

9
Main algorithms and methods-contd A new
nonlinear multi-regression model
  • The new nonlinear multi-regression model is
    constructed by
  • For the general Choquet integral, this model can
    be transferred to
  • The regression residual error can be calculated
    by
  • This model will determine the coefficients
    minimizing the square error.

10
Main algorithms and methods-contd Condition
Matrix to learn the parameters in FI
The brief process of algorithm is listed as
follows. Step 1. Initialize condition
matrix Step 2. Select fuzzy measure from CM row
by row according to value of each fuzzy measure
as fitness Step3. Call fuzzy integral to
evaluate. If least square error satisfies one
threshold, stop it else go back to step 2.
11
Main algorithms and methods -contdCondition
Matrix to learn the parameters in FI
u(A(1)) u(A(.)) u(A(.)) u(A(.)) Cu(f(x))
u(A(.)) u(A(.)) u(A(.))
12
Main algorithms and methods-contd Condition
Matrix to learn the parameters in FI
  • Initially, all fitness is designed randomly as
  • One observation of dataset has three values
  • Supposed that the following subsets are selected,
    x2, x1x2, x2x3, x1x3, x2, x1x2x3, x1x2

13
Main algorithms and methods -contd Projection
by fuzzy integral from h-D to l-D
  • Projection to 1-D space--the real axis
  • Model Projecting the points in the feature
    space onto a real axis through a nonlinear
    fuzzy integral
  • M Rn R
  • R 1-D space, i.e. a real axis
  • Rn the feature space x1,x2,,xn

14
Main algorithms and methods -contd Projection
by fuzzy integral from h-D to l-D
  • ????
    u
  • But intersection situation may exist in data
    projected by fuzzy integral
  • ? ??
    u

15
Main algorithms and methods-contd
Projection by fuzzy integral from h-D to l-D
  • Projection onto 2-D space with fuzzy integral

16
Main algorithms and methods -contd Projection
by fuzzy integral from h-D to l-D
  • Double fuzzy integral

17
Main algorithms and methods-contd Projection
by fuzzy integral from h-D to l-D
  • General case of Projection from h-D to l-D

18
Main algorithms and methods -contd Projection
by fuzzy integral from h-D to l-D
  • For a training dataset, we design the square
    error of nonlinear regression with multiple fuzzy
    integral as the evaluation criteria of stopping
    projection.
    .
  • pseudo-code
  • n the degree of dimension to be projected onto
  • m the size of data set
  • let n1
  • For j1 to m
  • Compute fuzzy integral value for each data to
    get
  • get the residual
  • If , stop and return n as the final degree of
    dimension
  • else
  • let nn1 and go back to For

19
Application
  • Bioinformatics
  • Predicting Protein Cellular Localization---Richard
    Mott etc.
  • Overlap of Protein structures---David Pelta etc.
  • Finance
  • Predicting stock market---Hellström, Holmström
    (1998)
Write a Comment
User Comments (0)
About PowerShow.com