Title: Outline
1. Outline
- Parameter estimation continued
- Non-parametric methods
2. Maximum-Likelihood Estimation
- Assumptions
- We separate a collection of samples according to class: D1, D2, ..., Dc
- Samples in Dj are drawn independently according to the probability p(x|ωj)
- We assume that p(x|ωj) has a known parametric form and is uniquely determined by the value of a parameter vector θj
- To simplify further, we assume that samples in Di give no information about θj if i ≠ j
3. Maximum-Likelihood Estimation cont.
- Suppose that D contains n samples x1, ..., xn
- Because the samples were drawn independently, we have p(D|θ) = Πk=1..n p(xk|θ)
- The maximum-likelihood estimate of θ is the value of θ that maximizes p(D|θ)
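For example, when p(x|θ) is a univariate Gaussian with θ = (μ, σ²), maximizing p(D|θ) has a closed-form solution: the sample mean and the (biased) sample variance. A minimal sketch in Python/NumPy, with made-up data:

    import numpy as np

    rng = np.random.default_rng(0)
    D = rng.normal(loc=2.0, scale=1.5, size=1000)   # hypothetical samples

    # ML estimates: the theta = (mu, sigma^2) maximizing p(D|theta)
    mu_hat = D.mean()                               # sample mean
    sigma2_hat = ((D - mu_hat) ** 2).mean()         # biased sample variance

    print(mu_hat, sigma2_hat)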
4. Bayesian Estimation
- Assumptions
- The form of the density p(x|θ) is assumed to be known, but the value of the parameter vector θ is not known exactly
- Our initial knowledge about θ is assumed to be contained in a known prior density p(θ)
- The rest of our knowledge about θ is contained in a set D of n samples x1, ..., xn drawn independently according to the unknown probability density p(x)
5. Bayesian Estimation cont.
- General theory
- The basic problem is to compute the posterior density p(θ|D)
- By Bayes' formula we have p(θ|D) = p(D|θ) p(θ) / ∫ p(D|θ) p(θ) dθ
- By the independence assumption, p(D|θ) = Πk=1..n p(xk|θ)
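When p(θ|D) has no convenient closed form, Bayes' formula can be applied numerically on a grid of θ values. A sketch for a one-dimensional θ, taken here (as an assumption) to be the mean of a unit-variance Gaussian, with an N(0, 1) prior:

    import numpy as np

    rng = np.random.default_rng(1)
    D = rng.normal(loc=1.0, scale=1.0, size=20)      # hypothetical samples
    theta = np.linspace(-4.0, 4.0, 2001)             # grid of candidate means

    prior = np.exp(-0.5 * theta**2)                  # p(theta): N(0, 1), unnormalized
    # p(D|theta) = prod_k p(x_k|theta); accumulate in log space for stability
    loglik = -0.5 * ((D[:, None] - theta[None, :]) ** 2).sum(axis=0)

    post = prior * np.exp(loglik - loglik.max())     # p(theta|D) up to a constant
    post /= np.trapz(post, theta)                    # normalize to integrate to 1

    print(theta[np.argmax(post)])                    # posterior mode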
6. Bayesian Estimation cont.
- Gaussian case
- The univariate case: p(μ|D)
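For p(x|μ) ~ N(μ, σ²) with σ² known and a Gaussian prior p(μ) ~ N(μ0, σ0²), the posterior p(μ|D) is again Gaussian, N(μn, σn²). A sketch computing the posterior parameters (the prior and the data here are illustrative assumptions):

    import numpy as np

    def gaussian_mean_posterior(D, sigma2, mu0, sigma0_2):
        """p(mu|D) = N(mu_n, sigma_n^2) for Gaussian data with known variance."""
        n = len(D)
        m_n = np.mean(D)                                               # sample mean
        mu_n = (n * sigma0_2 * m_n + sigma2 * mu0) / (n * sigma0_2 + sigma2)
        sigma_n2 = (sigma0_2 * sigma2) / (n * sigma0_2 + sigma2)
        return mu_n, sigma_n2

    D = np.random.default_rng(2).normal(1.0, 1.0, size=50)             # hypothetical data
    print(gaussian_mean_posterior(D, sigma2=1.0, mu0=0.0, sigma0_2=1.0))

Note that as n grows, μn approaches the sample mean and σn² goes to zero, so the Bayesian and maximum-likelihood answers agree in the limit.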
7. Bayesian Estimation cont.
- Gaussian case continued
- The univariate case: p(x|D)
- The multivariate case
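Integrating out the unknown mean gives the class-conditional density p(x|D) = ∫ p(x|μ) p(μ|D) dμ, which in the univariate case is again Gaussian, N(μn, σ² + σn²); the remaining uncertainty about μ simply adds to the data variance. A small sketch evaluating it (the parameter values are illustrative; μn and σn² would come from the posterior computation above):

    import numpy as np

    def predictive_pdf(x, mu_n, sigma2, sigma_n2):
        """p(x|D) = N(mu_n, sigma^2 + sigma_n^2)."""
        var = sigma2 + sigma_n2                       # posterior uncertainty adds on
        return np.exp(-0.5 * (x - mu_n) ** 2 / var) / np.sqrt(2 * np.pi * var)

    print(predictive_pdf(0.5, mu_n=0.9, sigma2=1.0, sigma_n2=0.02))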
8. Non-parametric Methods
- In maximum-likelihood and Bayesian estimation
- The forms of the probability densities are assumed to be known
- However, the assumed forms rarely fit the densities encountered in practice
- In particular, all of the classical parametric densities are unimodal, whereas many practical problems involve multimodal densities
9. A Multimodal Density
10. Solutions
- More complicated parametric models
- Mixture of Gaussians (see the sketch after this list)
- More generally, a set of basis functions to describe a probability density
- Learning is intrinsically more difficult when we have more parameters
- Non-parametric methods
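A mixture of Gaussians represents a multimodal density as a weighted sum of unimodal components. A minimal sketch (the weights and component parameters below are illustrative assumptions; in practice they would be learned, e.g. with EM):

    import numpy as np

    def gauss(x, mu, sigma2):
        return np.exp(-0.5 * (x - mu) ** 2 / sigma2) / np.sqrt(2 * np.pi * sigma2)

    def mixture_pdf(x, weights, mus, sigma2s):
        """p(x) = sum_j w_j N(x; mu_j, sigma_j^2), with the weights summing to 1."""
        return sum(w * gauss(x, m, s2) for w, m, s2 in zip(weights, mus, sigma2s))

    x = np.linspace(-5.0, 8.0, 400)
    p = mixture_pdf(x, weights=[0.4, 0.6], mus=[-1.0, 3.0], sigma2s=[0.5, 1.2])  # two modes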
11. Non-parametric Methods
- Most of the non-parametric density estimation methods are based on the following fact
- The probability P that a vector x will fall in a region R is given by P = ∫R p(x') dx'
12. Non-parametric Methods cont.
- For n samples x1, ..., xn that are drawn independently according to p(x), the probability that exactly k of the n fall in R is given by the binomial law Pk = C(n, k) P^k (1 − P)^(n−k)
- Since E[k] = nP, the ratio k/n estimates P; if R is small enough that p(x) is nearly constant over it, P ≅ p(x) V, so p(x) ≅ (k/n) / V, where V is the volume of R
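A sketch of the basic estimate p(x) ≅ (k/n)/V in one dimension, counting the samples that fall in a small interval centered at x (the data and the interval width are illustrative assumptions):

    import numpy as np

    rng = np.random.default_rng(3)
    D = rng.normal(0.0, 1.0, size=10000)        # hypothetical samples from p(x)

    def density_estimate(x, D, h):
        """p(x) ~= (k/n)/V with R = [x - h/2, x + h/2], so V = h."""
        n = len(D)
        k = np.sum(np.abs(D - x) <= h / 2)      # number of samples falling in R
        return (k / n) / h

    print(density_estimate(0.0, D, h=0.2))      # ~0.40 for a standard normal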
13. Non-parametric Methods cont.
14. Non-parametric Methods cont.
- Problems to be addressed
- If we fix the volume V and take more samples, the ratio k/n will converge as desired, but it gives only a space-averaged version of p(x)
- How to estimate p(x) itself?
- Let V approach zero? With a finite number of samples, a vanishing region will eventually contain no samples at all (estimate 0) or only a few (estimate diverges), so V must instead shrink at a controlled rate as n grows
15. Parzen Windows
- Parzen windows
- We use a window function φ for interpolation, each sample contributing to the estimate in accordance with its distance from x: pn(x) = (1/n) Σk=1..n (1/Vn) φ((x − xk)/hn), with Vn = hn^d
- Here hn, the window width, is a parameter (see the sketch after this list)
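A sketch of the Parzen-window estimate in one dimension with a Gaussian window φ (the data and the choice of hn are illustrative assumptions):

    import numpy as np

    def parzen_estimate(x, D, h):
        """pn(x) = (1/n) sum_k (1/h) phi((x - x_k)/h) with a Gaussian phi, d = 1."""
        u = (x - D[:, None]) / h                          # scaled distances to each sample
        phi = np.exp(-0.5 * u ** 2) / np.sqrt(2 * np.pi)  # window function
        return phi.mean(axis=0) / h

    rng = np.random.default_rng(4)
    D = rng.normal(0.0, 1.0, size=500)                    # hypothetical samples
    xs = np.linspace(-4.0, 4.0, 200)
    p_hat = parzen_estimate(xs, D, h=0.3)                 # try h = 0.05 or 2.0 to see the effect

Rerunning the sketch with a much smaller or larger h illustrates the trade-off discussed on the next slide.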
16. Parzen Windows cont.
- Choice of hn
- If hn is too large, the estimate is over-smoothed and the spatial resolution is low
- If hn is too small, the estimate will have a large variance
17. Parzen Windows cont.
- Properties
- Convergence of the mean
- As n approaches infinity, the mean of the estimate approaches p(x) wherever p(x) is continuous, provided Vn → 0; for this bias, a smaller Vn is better
- Convergence of the variance
- A small variance needs a large Vn (more precisely, nVn → ∞); both requirements can be met by letting Vn shrink slowly with n, e.g. Vn = V1/√n
18. Parzen Windows cont.
19. Parzen Windows cont.
20. Parzen Windows cont.
21. Parzen Windows cont.
22. kn-Nearest-Neighbor Estimation
- Let the cell volume be a function of the training data
- To estimate p(x) from n samples, we can center a cell about x and let it grow until it captures kn samples; the estimate is then pn(x) = (kn/n)/Vn, where Vn is the volume of the resulting cell (a common choice is kn = √n)
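A sketch of the kn-nearest-neighbor estimate in one dimension, using kn = √n as the growth schedule (an assumption, as are the data):

    import numpy as np

    def knn_density(x, D, k):
        """pn(x) = (k/n)/Vn, where Vn is the smallest interval centered at x
        that captures the k nearest samples."""
        n = len(D)
        r = np.sort(np.abs(D - x))[k - 1]   # distance to the k-th nearest sample
        V = 2.0 * r                         # interval [x - r, x + r], d = 1
        return (k / n) / V

    rng = np.random.default_rng(5)
    D = rng.normal(0.0, 1.0, size=1000)     # hypothetical samples
    k = int(np.sqrt(len(D)))                # kn = sqrt(n)
    print(knn_density(0.0, D, k))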
23. kn-Nearest-Neighbor Estimation cont.
24. kn-Nearest-Neighbor Estimation cont.
25. kn-Nearest-Neighbor Estimation cont.
26. The Nearest-Neighbor Rule
- The nearest-neighbor rule
- Let Dn = {x1, ..., xn} denote a set of n labeled prototypes
- Let x' be the prototype nearest to a test point x
- We classify x into the class associated with x'
27. The Nearest-Neighbor Rule cont.