Circular analysis in systems neuroscience with particular attention to crosssubject correlation mapp - PowerPoint PPT Presentation

1 / 62
About This Presentation
Title:

Circular analysis in systems neuroscience with particular attention to crosssubject correlation mapp

Description:

General introduction to circular analysis in systems neuroscience ... allows smooth blending of blind faith and empiricism. Cons. Circular analysis. Pros ... – PowerPoint PPT presentation

Number of Views:64
Avg rating:3.0/5.0

less

Transcript and Presenter's Notes

Title: Circular analysis in systems neuroscience with particular attention to crosssubject correlation mapp


1
Circular analysis in systems neuroscience with
particular attention to cross-subject correlation
mapping
  • Nikolaus Kriegeskorte
  • Laboratory of Brain and Cognition, National
    Institute of Mental Health

2
Collaborators
  • Chris I Baker
  • W Kyle Simmons
  • Patrick SF Bellgowan
  • Peter Bandettini

3
Overview
  • Part 1General introduction to circular analysis
    in systems neuroscience(synopsis of Kriegeskorte
    et al. 2009)
  • Part 2Specific issue selection bias
    incross-subject correlation mapping(following
    up on Vul et al. 2009)

4
(No Transcript)
5
(No Transcript)
6
(No Transcript)
7
(No Transcript)
8
assumptions
9
assumptions
data
results
10
Circular inference
assumptions
11
Circular inference
assumptions
12
How do assumptions tinge results?
Through variants of selection!
13
Elimination (binary selection)
assumptions selection criteria
14
Example 1Pattern-information analysis
15
Experimental design
TASK (property judgment)
Simmons et al. 2006
Animate?
Pleasant?
STIMULUS (object category)
16
Pattern-information analysis
  • define ROI by selecting ventral-temporal voxels
    for which any pairwise condition contrast is
    significant at plt.001 (uncorr.)
  • perform nearest-neighbor classificationbased on
    activity-pattern correlation
  • use odd runs for trainingand even runs for
    testing

17
Results
stimulus (object category)
task (judged property)
decoding accuracy
chance level
18
?
!
fMRI data
data from Gaussian random generator
using all data to select ROI voxels
...but we used cleanly independent training and
test data!
using only training data to select ROI voxels
19
Conclusion for pattern-information analysis
  • The test data must not be used in either...
  • training a classifier or
  • defining the ROI

continuous weighting
binary weighting
20
Data selection is key to many conventional
analyses.
  • Can it entail similar biases in other contexts?

21
Example 2Regional activation analysis
22
ROI definition is affected by noise
independent ROI
overfitted ROI
true region
overestimated effect
ROI-average activation
23
Data sorting
assumptions sorting criteria
24
Set-average tuning curves
...for data sorted by tuning
response
stimulus parameter (e.g. orientation)
noise data
25
Set-average activation profiles
...for data sorted by activation
noise data
26
To avoid selection bias, we can...
  • ...perform a nonselective analysis
  • OR
  • ...make sure that selection and results
    statistics are independent under the null
    hypothesis,
  • because they are either
  • inherently independent
  • or computed on independent data

e.g. whole-brain mapping (no ROI analysis)
e.g. independent contrasts
27
Does selection by an orthogonal contrast vector
ensure unbiased analysis?
  • ROI-definition contrast AB
  • ROI-average analysis contrast A-B

cselection1 1T
ctest1 -1T
orthogonal contrast vectors ?
28
Does selection by an orthogonal contrast vector
ensure unbiased analysis?
contrast vector
No, there can still be bias.
The design and noise dependencies matter.
design
noise dependencies
29
Circular analysis
  • Pros
  • Cons
  • highly sensitive
  • widely accepted (examples in all high-impact
    journals)
  • doesn't require independent data sets
  • grants scientists independencefrom the data
  • allows smooth blending of blind faith and
    empiricism

30
Circular analysis
  • Pros
  • Cons
  • highly sensitive
  • widely accepted (examples in all high-impact
    journals)
  • doesn't require independent data sets
  • grants scientists independencefrom the data
  • allows smooth blending of blind faith and
    empiricism

31
Circular analysis
  • Pros

Pros
  • highly sensitive
  • widely accepted (examples in all high-impact
    journals)
  • doesn't require independent data sets
  • grants scientists independencefrom the data
  • allows smooth blending of blind faith and
    empiricism
  • the error that beautifies results
  • confirms even incorrect hypotheses
  • improves chances ofhigh-impact publication
  • Cons
  • cant think of any right now

32
Part 2Specific issue selection bias
incross-subject correlation mapping(following
up on Vul et al. 2009)
33
Motivation
  • Vul et al. (2009) posed a puzzle
  • Why are the cross-subject correlations found in
    brain mapping so high?
  • Selection bias is one piece of the puzzle.
  • But there are more pieces and we have yet to put
    them all together.

34
Overview
  • List and discuss six pieces of the puzzle.
  • (They don't all point in the same direction!)
  • Suggest some guidelines for good practice.

35
Six pieces synopsis
  • Cross-subject correlation estimates are very
    noisy.
  • Bin or within-subject averaging legitimately
    increases correlations.
  • Selecting among noisy estimates yields large
    biases.
  • False-positive regions are highly likely for a
    whole-brain mapping thresholded at plt.001,
    uncorrected.
  • Reported correlations are high, but not highly
    significant.
  • Studies have low power for finding realistic
    correlations in the brain if multiple testing is
    appropriately accounted for.

36
Vul et al. 2009
,,
noise-free correlation
population
,,
The geometric mean of the reliability is an upper
bound on the population correlation.
The reliabilities provide no bound on the sample
correlation.
37
Sample correlationsacross small numbers of
subjectsare very noisy estimatesof population
correlations.
Piece 1
38
0.65
39
(No Transcript)
40
Cross-subject correlation estimatesare very noisy
95-confidence interval
correlation
10 subjects
41
Cross-subject correlation estimatesare very noisy
42
The more we average(reducing noise but not
signal),the higher correlations become.
Piece 2
43
Bin-averaging inflates correlations
44
Bin-averaging inflates correlations
45
(No Transcript)
46
  • Subjects are like bins...
  • For each subject, all data is averaged to give
    one number.
  • Take-home message
  • Cross-subject correlation estimates are expected
    to be...
  • high (averaging all data for each subject)
  • noisy (low number of subjects)

So what's Ed fussing about?We don't need
selection bias to explain the high correlations,
right?
47
Selecting the maximumamong noisy
estimatesyields large selection biases.
Piece 3
48
Expected maximum correlationselected among null
regions
expected maximum correlation
bias
16 subjects
49
False-positive regions are likely to be found in
whole-brain mappingusing plt.001, uncorrected.
Piece 4
50
Mapping with plt.001, uncorrected
Global null hypothesis is true (population
correlation 0 in all brain locations)
51
Reported correlations are high,but not highly
significant.
Piece 5
52
Reported correlations are high,but not highly
significant
plt0.00001 plt0.001 plt0.01 plt0.05
53
Reported correlations are high,but not highly
significant
plt0.00001 plt0.001 plt0.01 plt0.05
54
Reported correlations are high,but not highly
significant
What correlations would we expect under the
global null hypothesis?
(assuming each study reports the maximum of
500 independent brain locations)
plt0.00001 plt0.001 plt0.01 plt0.05
55
Reported correlations are high,but not highly
significant
What correlations would we expect under the
global null hypothesis?
plt0.00001 plt0.001 plt0.01 plt0.05
(assuming each study reports the max.of 500
independent brain locations)
56
Most of the studies have low powerfor finding
realistic correlationswith whole-brain
mappingif multiple testing is appropriately
accounted for.
Piece 6
see also Yarkoni 2009
57
Numbers of subjectsin studies reviewed by Vul et
al. (2009)
number of correlations estimates
4
8
16
36
60
100
number of subjects
58
In order to find a single region with
across-subject correlation of 0.7 in the brain...
...we would need about 36 subjects
16 subjects
59
In order to find a single region with
across-subject correlation of 0.7 in the brain...
...we would need about 36 subjects
16 subjects
60
  • Take-home message
  • Whole-brain cross-subject correlation mapping
  • with 16 subjects

does
not
work.
Need at least twice as many subjects.
61
Conclusions
  • Unless much larger numbers of subjects are used,
    whole-brain cross-subject correlation mapping
    suffers from either
  • very low power to detect true regions(if we
    carefully to correct for multiple comparisons)
  • very high rates of false-positive
    regions(otherwise)
  • If analysis is circular, selection bias is
    expected to be high here (because selection
    occurs among noisy estimates).

...in other words, it doesn't work.
62
Suggestions
  • Design study to have enough power to detect
    realistic correlations. (Need either anatomical
    restrictions or large numbers of subjects.)
  • Consider studying trial-to-trial rather than
    subject-to-subject effects.
  • Correct for multiple testing to avoid false
    positives.
  • Avoid circularity Use leave-one-subject out
    procedure to estimate regional cross-subject
    correlations.
  • Report correlation estimates with error bars.
Write a Comment
User Comments (0)
About PowerShow.com