Definition and overview of chemometrics - PowerPoint PPT Presentation

About This Presentation
Title:

Definition and overview of chemometrics

Description:

Hospitals. Agriculture. Food industry. More. Nomenclature. Samples are objects ... Vectors (physics) x = [ x1, x2, x3 ] || x || = ( x12 x22 x32 ) 1/2. Geometry. a ... – PowerPoint PPT presentation

Number of Views:51
Avg rating:3.0/5.0
Slides: 47
Provided by: Pau1137
Category:

less

Transcript and Presenter's Notes

Title: Definition and overview of chemometrics


1
Definition and overview of chemometrics
2
Paul Geladi
Head of Research NIRCE Chairperson NIR Nord Unit
of Biomass Technology and Chemistry Swedish
University of Agricultural Sciences Umeå Technobot
hnia Vasa paul.geladi _at_ btk.slu.se paul.geladi
_at_ syh.fi
3
(No Transcript)
4
(No Transcript)
5
Project geography
6
Chemometrics
  • Mathematics
  • Statistics
  • Computer Science
  • In Chemistry

7
Similar fields
  • Biometrics 1900
  • Psychometrics 1930
  • Econometrics 1950
  • Technometrics 1960

8
Chemometrics
  • Design of Experiments (DOE)
  • Exploratory Data Analysis
  • Classification
  • Regression and Calibration

9
Design of Experiments
  • Most important where possible
  • Uses
  • ANOVA
  • F-test
  • t-test
  • Plots
  • Response Surfaces

10
Design of Experiments
  • y b0 b1x1 b2x2 ...bKxK b11x12
  • b22x22 ... bKKxK2 b12x1x2 ... e
  • Factors x1, x2,...xK changed systematically
  • Response y measured and modeled

11
Exploratory Data Analysis
  • Design not possible
  • Sampling situations
  • Find structure
  • Find groupings
  • Find outliers

12
Classification
  • Check for groupings UNSUPERVISED
  • Existing groupings SUPERVISED
  • Visualize groupings
  • Classify
  • Test

13
Regression / Calibration
  • Two types of variables X / y
  • Relationship linear / nonlinear
  • Model
  • Diagnostics
  • Residual

14
y
x
15
Multivariate Data Analysis
16
Multivariate Data Analysis
  • Sampled data and design with too many reponses
  • Mining
  • Hospitals
  • Agriculture
  • Food industry
  • More

17
Nomenclature
  • Samples are objects
  • What is measured on the object is a variable

18
34.92
Spectrum
K
1
1
Samples
Vectors
I
19
A vector is a collection of numbers. It is
always a column vector.
12 3.6 11.1 5.9 34 0.5 1.4 17
20
12 3.6 11.1 5.9 34 0.5 1.4 17
The transpose of a vector is a row
vector. Symbols for transpose are and T. a or
aT.
21
Particle size, 1 sample
22
Small particles, 35 samples
23
The Data Matrix
K
A data matrix is a vector of vectors
I
24
Size histograms, all samples
Particle area
25
Times in batch reaction
NIR wavelengths
26
Geometry of multivariate space
27
Problem
  • I and K can be large
  • Correlation
  • Univariate statistics does not apply

28
3 variables blood oxygen, iron, hemoglobin
I patients
29
Hb
Fe
O2
30
Hb
Fe
O2
31
Hb
Fe
O2
32
Hb
Fe
O2
33
Hb
Fe
O2
34
Hb
Fe
O2
35
Hb
Fe
O2
36
Hb
Fe
O2
37
Hb
Fe
O2
38
Properties of multivariate space
  • Rotation
  • vectors unchanged / distance unchanged
  • Translation
  • vectors changed / distance unchanged
  • Rescaling / change units
  • all changes

39
Consequences
  • We can move the coordinate sytem around
  • The relative distances between objects do not
    change
  • We can rotate the coordinate system
  • Scale changes are important
  • Move coordinate system to center of data
  • Scale properly

40
Vectors (physics)
x x1, x2, x3
x ( x12 x22 x32 ) 1/2
41
Geometry
c2 a2 b2
c
a
b
42
Vectors (K dimensions)
x x1, x2,..., xK
x ( x12 x22 ... xK2 ) 1/2
43
Problem
  • We can not see in more than 3 dimensions
  • Paper, computer screen 2-2.5 dimensions

44
Hb
Fe
O2
45
Hb
Fe
O2
46
Projection
  • 2D plane (screen, paper)
  • Many projections possible
  • Find a good one
  • Find a few good ones
  • What is good?
Write a Comment
User Comments (0)
About PowerShow.com