CAT (Critically Appraised Topic) (adapted from Sackett, et al. 2000) - PowerPoint PPT Presentation

About This Presentation
Title:

CAT (Critically Appraised Topic) (adapted from Sackett, et al. 2000)

Description:

Title: PowerPoint Presentation Author: Christine Dollaghan Last modified by: Eugene Chubmkov Created Date: 3/16/2001 8:51:05 PM Document presentation format – PowerPoint PPT presentation

Number of Views:80
Avg rating:3.0/5.0
Slides: 40
Provided by: Christin686
Learn more at: http://www.bibalex.org
Category:

less

Transcript and Presenter's Notes

Title: CAT (Critically Appraised Topic) (adapted from Sackett, et al. 2000)


1
CAT (Critically Appraised Topic) (adapted from
Sackett, et al. 2000)
  • 1-page summary of evidence resulting from
    critical appraisal of an article, test, etc.
  • Answers a specific foreground question
  • Compared to no treatment, does
    parent-administered treatment significantly
    improve the language skills of toddlers with
    language delay?

2
First part of CAT identical for tx and dx studies
(see handout pp. 2-3)
  • Clinical bottom line (appears 1st but completed
    last)
  • Clinical question
  • Search terms
  • Appraised by whom, and date
  • Synopsis of key (memorable) information, in a
    concise, maximally useful format (e.g., types of
    subjects, procedures, measures, results, etc.)

3
CAT-egories (appraisal points) for a study of
therapy (Sackett et al., 2000)
  • Prospective, controlled?
  • Random assignment?
  • Comparing gt 2 conditions?
  • Recognizable subjects?
  • Evidence of pre-tx group similarity?
  • Blinding (insofar as possible) of evaluators,
    relevant others?

4
Appraisal points (cont.)
  • Control over nuisance variables?
  • Valid, reliable measures of tx effects?
  • Statistically significant difference (p-value)?
  • Practically significant difference (d-value)?
  • Precision of treatment effects (narrow CI)?
  • Outcomes for all enrolled?
  • Cost-benefit and feasibility analyses?

5
A sample treatment CAT
  • CAT Language of delayed toddlers improves in
    response to parent-administered focused
    stimulation
  • Clinical bottom line Compared to an untreated
    control group, motivated mothers of
    low-vocabulary toddlers significantly decreased
    their speaking rate and language complexity and
    increased their vocabulary inputs in response to
    18 hr of instruction in focused stimulation
    techniques, and their children produced
    significantly more words and early grammatical
    forms.
  • Clinical question Compared to no treatment,
    does parent-administered treatment significantly
    improve the language skills of toddlers with
    language delay?
  • Search terms word learning AND toddlers, PubMed
    clinical query
  • Appraised by Dollaghan

6
Key appraisal points
  • Prospective, controlled Yes
  • Randomized Yes
  • Comparing gt 2 conditions Yes
  • Recognizable Ss Yes
  • Pre-tx similarity Yes
  • Blinding Yes Cn no parent
  • Control over nuisance variables Yes
  • Valid, reliable measures Yes
  • Statistically significant differences Yes
  • Practically significant differences Yes
  • Precision of treatment effects No
  • Outcomes for all enrolled Yes
  • Cost-benefit, feasibility analyses Yes

7
Critical appraisal of evidence on diagnostic
indicators
  • The key variables by which individuals are
    identified as members of a class, ostensibly to
    improve prediction and outcome for them
  • Myriad diagnostic indicators have been proposed
    in communication sciences and disorders
  • Diagnostic indicators in your area of interest?

8
Most diagnostic indicators in CSD are based on
Phase I studies
  • Group mean comparison studies
  • People with, and people without, the condition of
    interest are compared with respect to a proposed
    indicator
  • Correlational studies
  • Association between proposed indicator and
    accepted indicators
  • Such studies cant address the two most crucial
    features of a diagnostic indicator accuracy and
    precision

9
Accuracy and precision
  • Accuracy
  • The ability of an indicator to identify a
    condition of interest, i.e., the amount of
    agreement between the proposed indicator and a
    reference standard
  • Precision
  • Width of confidence intervals (CI) for estimates
    of accuracy

10
Accuracy of a diagnostic indicator
  • The ability of an indicator to identify a
    condition of interest, i.e., the amount of
    agreement between the proposed indicator and a
    reference standard
  • Preferred measures of diagnostic accuracy
    positive and negative likelihood ratios
    (Battaglia et al., 2002)

11
Positive Likelihood Ratio (LR)
  • Reflects the degree of confidence that a person
    who scores in the positive (affected or
    disordered) range on a dx indicator does have the
    disorder
  • Formula sensitivity/1-specificity
  • The higher the LR, the more informative the
    indicator for identifying people who have the
    disorder

12
Interpreting LR values (Sackett et al., 1991)
  • LR gt 20 Very high virtually certain that a
    person with this score has the disorder
  • LR 10 High disorder very likely in a person
  • with this score
  • LR 4 Intermediate the indicator is
  • suggestive of disorder but
    insufficient
  • to diagnose
  • LR 1 Equivocal a person who scores in the
  • disordered range on the measure may
  • or may not have the disorder
    the
  • measure provides no new
    information

13
Negative Likelihood Ratio (LR-)
  • Reflects the degree of confidence that a person
    scoring in the negative (normal) range on the
    diagnostic indicator truly does not have the
    disorder
  • Formula 1-sensitivity/specificity
  • The lower the LR-, the more informative the
    indicator for ruling out the presence of disorder

14
Interpreting LR- values (Sackett et al., 1991)
  • LR- lt 0.10 Very low virtually certain that a
  • person scoring in this range does not
  • have the disorder
  • LR- 0.20 Low disorder very unlikely
  • LR- 0.40 Intermediate the indicator is
    suggestive
  • but insufficient to rule out the disorder
  • LR- 1.0 Equivocal a person scoring in the
  • normal range on this measure may or
  • may not be normal

15
Calculating sensitivity and specificity (nothing
more than LR precursors)
  • Sensitivity the percentage of people with the
    disorder that the new indicator correctly
    classifies as disordered
  • Specificity the percentage of people who dont
    have the disorder that the new indicator
    correctly classifies as not disordered
  • The true status of every individual with regard
    to the disorder is established according to a
    gold (or reference) standard

16
Disorder Status (re Gold Standard)
- Disorder (LN)
Disorder (LI)
a b
c d
Disorder (LI)
New Test Result
-Disorder (LN)
with disorder
without disorder
17
Disorder Status (re Gold Standard)
Disorder (LI)
- Disorder (LN)
Disorder (LI)
New Test Result
-Disorder (LN)
Sensitivitya/ac (the proportion of people
with the disorder that the new test identifies as
having the disorder)
18
Disorder Status (re Gold Standard)
Disorder
- Disorder
True positive a False positive b
c False negative d True negative
Disorder
New Test Result
-Disorder
Specificity d/bd (the proportion of
people without the disorder that the new test
identifies as not having the disorder)
19
Example
  • 100 children diagnosed with language impairments
    (LI) and enrolled in language intervention, and
    100 same-age children with no history of language
    impairment (LN), were administered a new test of
    grammatical morphology.
  • 80 of the children with LI, and 30 of the
    children with LN, scored in the disordered range
    on the new measure.

20
Disorder Status (re Gold Standard)
Disorder (LI)
- Disorder (LN)
80 a 30 b
c (20) d (70)
Disorder (LI)
New Test Result
-Disorder (LN)
100 with disorder Sens a/ac 80/100 .80
100 without disorder Spec d/bd 70/100 .70
21
Why not just use sensitivity and specificity as
measures of accuracy?
  • Its their interrelationship that is most
    important overall
  • Sensitivity and specificity vary substantially
    according sample characteristics, including N,
    base rate (prevalence), severity, confusability
  • Likelihood Ratios are not impervious to sample
    characteristics, but are much less affected than
    are sensitivity and specificity

22
Calculating Likelihood Ratios
  • Sens .80
  • Spec .70
  • LR sens/1-spec .80/.30 2.67
  • LR- 1-sens/spec .20/.770 0.29
  • Several programs, some free on web, are set up
    to allow entry in 2x2 table format
  • In addition to accuracy measures, they also
    provide information on precision

23
Precision of a diagnostic indicator
  • Width of confidence intervals (CI) for
    sensitivity, specificity, and likelihood ratios,
    calculated by adding and subtracting a multiple
    of standard error (e.g., 1.96 SE for a 95 CI)
  • Standard error depends on sample size and
    reliability larger samples and higher
    reliability will result in narrower CIs, all else
    being equal
  • Sackett et al. (2000) appendix shows how to
    calculate CIs by hand, and programs (some free)
    provide CIs given raw numbers in a 2x2 table

24
Sample size and precision 95 CIs for studies
with same LRs but different Ns
  • N 200 N 20
  • Value (95 CI) (95 CI)
  • Sens .80 (0.71-0.87) (0.44-0.98)
  • Spec .70 (0.60-0.79) (0.35-0.93)
  • LR 2.67 (1.98-3.70) (1.12-7.66)
  • LR- 0.29 (0.19-0.42) (0.08-0.87)

25
CAT-ing evidence on a diagnostic indicator
(Sackett et al., 2000 Battaglia et al., 2002)
  • Does the study report a comparison between
    measures, or measure and gold standard?
  • sine qua non for evidence of diagnostic accuracy
  • Was the gold (or reference) standard valid,
    reliable, and/or reasonable?
  • Gold standard and new indicator also must be
    independent to avoid incorporation bias that can
    inflate accuracy measures

26
Criteria for diagnostic indicators (cont.)
  • Were patients enrolled prospectively and
    consecutively (or by random assignment), and
  • Did the sample include a spectrum of patient
    types and severities?
  • These two criteria are important in avoiding
    spectrum bias, in which the sample includes only
    clear-cut or hand-picked cases and thus does not
    represent the diagnostic task

27
Criteria for diagnostic indicators (cont.)
  • Were the new measure and the reference standard
    administered independently, by different
    examiners, and
  • Were the examiners blinded to the subjects
    performance on the other test and to other
    relevant subject information?
  • Were the new measure and the reference standard
    both administered to all subjects and controls?
  • Important to avoid differential verification
    bias, when controls are assumed to be normal
    without testing on gold standard

28
Criteria for diagnostic indicators (cont.)
  • Do likelihood ratios suggest adequate diagnostic
    accuracy?
  • LR gt 4.0 (gt 10 cf. Bayes Library, 2002)
  • LR- lt 0. 40 (lt 0.20, cf Bayes Library, 2002)
  • Precision (narrow confidence intervals)?
  • Feasibility for usual clinical practice?
  • Value (i.e., better than current measure)?

29
Evidence on norm-referenced tests as diagnostic
indicators for early LI
  • Many norm-referenced tests have diagnosis of LI
    as their explicit purpose
  • A growing number of tests meet typical
    psychometric criteria, e.g. N 100 subjects per
    age level reliability gt .90 means, standard
    deviations, and standard errors of measurement
  • But very few provide evidence of diagnostic
    accuracy or precision, and none meet the
    recommended critical appraisal criteria

30
Norm-referenced tests not providing information
on accuracy or precision
  • Test of Language Development (TOLD)
  • Sequenced Inventory of Language Development
    (SICD)
  • Test of Early Language Development (TELD)
  • Reynell Scales
  • MacArthur Communicative Development Inventories
    (CDI)

31
A few tests provide information allowing accuracy
and precision to be calculated
  • Age LI LN LR (95 CI) LR- (95 CI)
  • PLS-4 Total language score lt 85
  • 3 24 24 6.7 (2.6-19.4) 0.19 (.08-.42)
  • 4 23 23 18 (3.6-102) 0.23 (.10-.44)
  • 5 28 28 4.4 (2.1-10.2) 0.26 (.12-.50)
  • 3-5 75 75 6.7 (3.7-12.5) 0.23 (.14-.35)
  • CELF-P Total language score lt 85
  • 3-5 80 80 5.3 (2.9-10.2) 0.45 (.34-.58)
  • CELF-P Total language score lt 77
  • 3-5 80 80 12.7 (4.4-37.8) 0.54 (.43-.66)
  • But note that these studies would fail many of
    the other critical appraisal criteria, their
    accuracy notwithstanding.

32
The situation is no better for other proposed
diagnostic indicators
  • Few compare indicator to a gold standard, so
    accuracy cant be determined
  • Few used blinded examiners, so a high potential
    for context and other biases
  • Small samples, wide CIs (rarely provided)
  • When sensitivity and specificity have been
    reported, they have sometimes been calculated
    incorrectly and/or misinterpreted

33
I choose not to despair
  • Knowing the limitations of our diagnostic tools
    is an important prerequisite to designing better
    diagnostic tools
  • Several possible ways forward, most
    involving clinician-researcher partnerships

34
A way forward to EBP in Speech-language pathology
and Audiology
  • Designing studies to meet the criteria for strong
    evidence
  • e.g., STARD (Bossuyt et al., 2003) statement
  • Large-scale, cooperative studies of diagnostic
    indicators
  • CARE-COAD model (Straus et al. 2002)
  • Dealing with the absence of a gold standard
  • e.g., Demissie et al., 1998 Dunson, 2001
    reliability and outcome studies
  • Diagnostic studies as multivariable, prediction
    research (Moons Grobbee, 2002)

35
Test yourself
  • Critical appraisal of diagnostic test (handout p.
    5)
  • Critical appraisal of treatment study (handout p.
    4)

36
Critical appraisal and CAT enable the remaining
steps to EBP
  • 5. Decide whether the evidence is strong enough
    to influence your clinical practice
  • 6. Integrate the evidence with the intangibles
  • 7. Update!

37
EBP is itself a set of assumptions, not a cult
  • Ultimately, strong evidence will be needed to
    determine whether EBP results in improved
    clinical service.
  • And EBP cant be applied blindly, to all kinds of
    problems...

38
As with many interventions intended to prevent
ill health, the effectiveness of parachutes has
not been subjected to rigorous evaluation by
using randomised controlled trials. Advocates of
evidence based medicine have criticised the
adoption of interventions evaluated by using only
observational data. We think that everyone might
benefit if the most radical protagonists of
evidence based medicine organised and
participated in a double blind, randomised,
placebo controlled, crossover trial of the
parachute.
39
Thanks!
References
Write a Comment
User Comments (0)
About PowerShow.com