Accuracy and power of randomization tests in multivariate analysis of variance with vegetation data - PowerPoint PPT Presentation

1 / 23
About This Presentation
Title:

Accuracy and power of randomization tests in multivariate analysis of variance with vegetation data

Description:

Universidade Federal do Rio Grande do Sul. Porto Alegre, Brazil. vpillar_at_ecologia.ufrgs.br ... Applicable to most cases analyzed by classical methods. ... – PowerPoint PPT presentation

Number of Views:70
Avg rating:3.0/5.0
Slides: 24
Provided by: bio2
Category:

less

Transcript and Presenter's Notes

Title: Accuracy and power of randomization tests in multivariate analysis of variance with vegetation data


1
Accuracy and power of randomization tests in
multivariate analysis of variance with vegetation
data
  • Valério De Patta Pillar
  • Departamento de Ecologia
  • Universidade Federal do Rio Grande do Sul
  • Porto Alegre, Brazil
  • vpillar_at_ecologia.ufrgs.br
  • http//ecoqua.ecologia.ufrgs.br

2
  • Randomization testing
  • Became practical with fast microcomputers.
  • Applicable to most cases analyzed by classical
    methods.
  • Applicable to cases not covered by classical
    methods.

3
How good is randomization testing?
  • Is it accurate?
  • Is it powerful enough?

4
Group comparison by randomization testing
Choose a test criterion (?) to compare the groups
Permute the data according to the conditions
stated by the null hypothesis (Ho) that the
groups do not differ
Calculate the test criterion?in the random data
and compare it to the value found in the observed
data.
After many iterations, the probability P(?o ?)
will be the number of iterations with ?o ?
divided by the total number of iterations.
Reject Ho if P(?o ?) is smaller than a
threshold (?)
Manly, B. F. J. 1997. Randomization, Bootstrap
and Monte Carlo Methods in Biology. 2 ed. Chapman
and Hall.
5
Randomization test criteria for multivariate
comparisons of any number of groups
Pillar, V. D. Orlóci, L. 1996. J. Veg. Sci.
7585-592.
6
An example
Is there a significant effect of N on vegetation
composition as defined by these two PFTs?
SQ between groups(Qb) 60.088 - 10.02 50.068
How common is a Qb 50.068 if Ho were true (that
the composition is unrelated to group)?
7
Reference set under Ho
If Ho true, the observation vector in a given
sampling unit is independent from the group to
which the unit belongs.
8
A random permutation and corresponding statistics
SQ between groups(Qbo) 60.088 - 28.35
31.738 Since, 31.738 lt 50.068 (Qbo lt Qb), this
iteration adds zero to the frequency of cases in
which Qbo Qb.
9
After 10000 random permutations
10
Two-factor designs
  • Test criterion
  • Qb Qt - Qw is based on the groups defined by
    the joint states of the factors.
  • Qb is partitioned as
  • Qb QbA QbB QbAB
  • where
  • QbA sum of squares between la groups according
    to factor A disregarding factor B
  • QbB sum of squares between lb groups according
    to factor B disregarding factor A
  • QbAB sum of squares of the interaction AB,
    obtained by difference.
  • F-ratio Qb/Qw

11
Unrestricted permutation in two-factor design
12
Two-factor Multivariate Analysis of Variance
One random permutation
Observed
Data Species (57) composition in 8 vegetation
units surveyed in two landscape positions (factor
A) and two grazing levels (factor B).
13
After 10000 random permutations
Data Species (57) composition in 8 vegetation
units surveyed in two landscape positions (factor
A) and two grazing levels (factor B).
Unrestricted random permutations. Test criterion
F-ratio Qb/Qw.
14
Restricted permutations
  • In two-factor (not nested) designs, for testing
    one factor, permutations may be restricted to
    occur within the levels of the other factor
    (Edgington 1987).
  • Restricted permutation within the levels of
    factor A (for testing factor B)

Edgington, E. S. 1987. Randomization Tests.
Marcel Dekker, New York.
15
Permutations of residuals instead of raw data
16
Two-factor multivariate analysis of variance by
randomization testing for the effects of
landscape position and grazing level in natural
grassland, southern Brazil (data from Pillar
1986). The data set contains 16 polled community
stands by 60 species. Restricted random
permutations for testing factors landscape and
grazing. Permutation of residuals removing both
factors for testing the interaction.
17
How good is randomization testing in two-factor
multivariate analysis of variance?
18
Simulation of interaction
  • For each case, 1000 data sets were generated,
    with distribution properties of real vegetation
    data and subject to multivariate analysis of
    variance with randomization testing.
  • When factor or interaction effect is set to zero,
    the proportion of Ho rejection under a given a
    threshold estimates Type I Error, the probability
    of wrongly rejecting Ho when it is true.
  • If Type I Error is equal to a, the test is exact.
  • When factor or interaction effect gt 0, the
    proportion of Ho rejection estimates the power of
    the test, which is the one-complement of Type II
    Error, the probability of not rejecting Ho when
    it is false.

19
Simulated data generated with distributional
properties of real data
  • Data set 16 grassland units described by cover
    of 60 species.
  • Two factors landscape position (top-convex,
    concave-lowland) and grazing levels (grazed,
    ungrazed).
  • Procedure described by Peres-Neto Olden (2000)
  • Calculate the mean (???) and the standard
    deviation (?ij) for each species vector i within
    each group j defined by the four factor level
    combinations
  • Standardize these vectors for mean equal 0 and
    standard deviation equal 1, thij(xhij-???)/???
  • Randomly permute whole stand vectors across
    groups
  • Restore the original dispersion within each group
    by computing new observations shij thij???,
    defining in this way a data set with the
    conditions specified by Ho
  • Apply to the species vectors the corresponding
    group differences for factor and interaction
    effects
  • Perform the randomization tests using 1000 random
    permutations
  • Repeat the steps (3) to (6) 1000 times, recording
    the proportion of Ho rejection.

Peres-Neto, P.R. Olden, J.D. 2000. Animal
Behaviour 61 79-86.
20
Results of power evaluation by data simulation in
two-factor MANOVA. The proportion of Ho rejection
at a 0.05 was obtained for 1000 simulated data
sets generated on the basis of plant community
data with 16 units and 60 species, with
increasing difference between the two groups for
factor 1, with no interaction. Each factor
combination had equal number of units. For each
data set a randomization test was run with 1000
iterations.
21
As the effects of both factors increase, type I
error for the interaction is underestimated with
unrestricted permutations with Qb and ?-ratio,
but not with residuals.
22
As the effect of interaction increases, type I
error for both factors is underestimated with Qb
and ?-ratio, un- and restricted permutations.
But, main factors should not be considered at all
when interaction is present!
23
(No Transcript)
24
Results of power evaluation by data simulation in
two-way designs. The proportion of Ho rejection
at a 0.05 was obtained for 1000 simulated data
sets generated on the basis of plant community
data with 60 units and 60 species, with
increasing relative difference between the four-
group factor combinations. Factor combinations
had unequal number of units (11 31, 12 9, 21
9, 22 11). For each data set a randomization
test was run with 1000 iterations.
25
References
  • Anderson, M.J. and ter Braak, C. 2003,
    Permutation tests for multi-factorial analysis of
    variance. Journal of Statistical Computations and
    Simulations 7385-113.
  • Edgington, E. S. 1987. Randomization Tests.
    Marcel Dekker, New York.
  • Manly, B. F. J. 1997. Randomization, Bootstrap
    and Monte Carlo Methods in Biology. 2 ed. Chapman
    and Hall.
  • Peres-Neto, P.R. Olden, J.D. 2000. Assessing
    the robustness of randomization tests examples
    from behavioural studies. Animal Behaviour 61
    79-86.
  • Pillar, V. D. Orlóci, L. 1996. On randomization
    testing in vegetation science multifactor
    comparisons of relevé groups. Journal of
    Vegetation Science 7585-592.
  • Pillar, V. D. 1994-2004. MULTIV Software for
    multivariate analysis, randomization tests and
    bootstrapping. Available (minor version, manual
    included) at http//ecoqua.ecologia.ufrgs.br
Write a Comment
User Comments (0)
About PowerShow.com