P1254325703tchnP - PowerPoint PPT Presentation

1 / 81
About This Presentation
Title:

P1254325703tchnP

Description:

Define and discuss the advantages and disadvantages of objective and ... Adjuvant post operative chemotherapy is currently the recommended treatment for ... – PowerPoint PPT presentation

Number of Views:32
Avg rating:3.0/5.0
Slides: 82
Provided by: Burs3
Category:

less

Transcript and Presenter's Notes

Title: P1254325703tchnP


1
Measurement issues
Jean Bourbeau, MD Respiratory Epidemiology and
Clinical Research Unit McGill University Clinica
l Epidemiology (679) June 19, 2006

2
Objectives
  • Define categorical and continuous variables
  • Define 2 sources of variation biological and
    measurement error (random and bias)
  • Describe the classification measures and their
    focus functional, descriptive and methodological
  • Define and discuss the advantages and
    disadvantages of objective and subjective health
    measures
  • Define the psychometric properties of measurement
    instruments reliability, validity,
    responsiveness
  • Discuss key questions and concerns about each of
    the psychometric properties of an instrument
    reliability, validity and responsiveness
  • Define and discuss minimal clinically important
    difference

3
Reading
  • Fletcher, Chapter 2

4
Outline of Measurement issues
  • 1. Measurements
  • 2. Sources of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

5
Outline of Measurement issues
  • 1. Measurements
  • 2. Sources of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

6
Examples
  • In a 60-year-old patient after right
    hemicolectomy, the DUKE stage is a widely
    accepted, indispensable descriptive tool for
    planning further treatment.
  • Adjuvant post operative chemotherapy is currently
    the recommended treatment for resected Duke C
    colon cancer.

7
Examples
  • In a 20-year-old woman with right lower quadrant
    pain and vomiting, the likely diagnosis is an
    appendicitis or a gynecological infection.
  • After excluding pelvic inflammatory disease, an
    experienced surgeon or gastroenterologist will
    diagnose appendicitis based on history, clinical
    findings and ultrasound.

8
Measurement
We need to assign numbers to certain clinical
phenomena to make them manageable and scientific
9
Measurement
  • Measure
  • A scale or test is an instrument to measure a
    clinical phenomenon a score is a value on the
    scale in a given patient

10
Measurement
  • The attributes or events that are measured in
  • a research study are called  variables 
  • Variables are measured according to 2 types
  • Categorical
  • Continuous

11
Categorical variables
  • Also called discrete variable
  • Dichotomous
  • or Polychotomous (multilevel)
  • - Nominal
  • - Ordinal

12
Dichotomous categorical variables
  • Examples
  • Vital status (alive vs dead)
  • Yes or no (response to a question)
  • Sex (male vs female)

13
Polychotomous categorical variables
  • Nominal
  • Named categories that bear no ordered
    relationship to one another
  • Example
  • Hair colour, race, or country of origin

14
Nominal scale
  • Hierarchy of mathematical adequacy
  • Lowest level (not a measurement but a
    classification)
  • Use numbers as a labels (such as male or female)
  • No inference can be drawn from the relative size
    of the numbers used

15
Polychotomous categorical variables
  • Ordinal
  • Named categories that bear an ordered
    relationship to one another
  • The intervals need not be equal
  • Example
  • Ordinal pain scale that include  pain
    severity  none, mild, moderate, and severe
  • Deep tendon reflex absent, 1,2, 3, or 4

16
Ordinal scale
  • Hierarchy of mathematical adequacy
  • Numbers are again used as a labels for response
    categories
  • Numbers reflect the increasing order of the
    characteristics being measured (mild,
    moderate,severe)
  • The numeric values, and the differences between
    them, hold no intrinsic meaning

17
Continuous variables
  • Also called dimensional, quantitative or interval
    variables
  • Expressed as integers, fractions, or decimals in
    which equal distances exist between successive
    intervals
  • Examples age, blood pressure, temperature

18
Interval scale
  • Hierarchy of mathematical adequacy
  • Numbers are assigned to the response categories
    in such a way that a unit change represents a
    constant change across the range of the scale
    (temperature in degrees Celsius)

19
Ratio scale
  • Hierarchy of mathematical adequacy
  • With a ratio scale, it becomes possible to state
    how many times greater one score is than another
  • This improves on the interval scale by including
    a zero point

20
Scales
Binary Rank order (small to large) Continuous (0
to 8) Ratios
21
Outline of Measurement issues
  • 1. Measurements
  • 2. Sources of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

22
Sources of variation
  • 2 sources of variation
  • Biological variation
  • Measurement error

23
Biological variation
  • Sources
  • Dynamic nature of most biologic entities
    (differences in age, sex, race, or disease
    status)
  • Temporal variation
  • (sometimes predictable, such as the diurnal cycle
    of plasma cortisol)

24
Measurement error
  • 2 different types
  • Random (chance error)
  • Bias (systematic error)

25
(No Transcript)
26
Measurement error
  • Can arise from
  • The method (measuring instrument )
  • Observer (the measurer)

27
Measurement error
  • We can talk about the variability between methods
    of making the measurement or between the
    observers
  • Repeated measurements by the same method or
    observer
  • Intramethod or Intraobserver
  • Between two or more methods or observers
  • Intermethod or Interobserver

28
Consequences of erroneous measurement
  • Individual
  • Makes no difference whether the error is
    systematic or random
  • Group
  • Variability in the absence of bias should not
    change the average group value
  • However, it can have deleterious consequences
    when one is seeking associations or correlations
    between 2 measures (analytic bias)

29
Regression toward the mean
  • Individual measurement is subject to both
    biologic variation and measurement error
  • An extremely high or low value obtained in an
    individual from a group is more likely to be an
    error than is an intermediate value
  • Tendency toward a less extreme value is greater
    than the tendency for an intermediate value to
    become more extreme

30
Outline of Measurement issues
  • 1. Measurements
  • 2. Sources of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

31
Classifications of measures
  • Functional classifications focus on
  • Purpose of application of the measures
  • Descriptive classifications focus on
  • Their scope
  • Methodological classifications focus on
  • Technical aspects

32
Functional classification
  • Measures have discriminative, evaluative or
    predictive properties
  • Choice of measure depends on the purpose(s) for
    which it will be used

33
Functional classification
  • Discriminative instrument
  • Can discriminate between people with different
    levels of a particular attribute or disease
  • For example
  • NYHA scale
  • MRC dyspnea scale

34
MRC Dyspnea Scale
none
  • Grade 1 ? Breathless with strenuous exercise
  • Grade 2 ? Short of breath when hurrying on the
  • level or walking up a slight hill
  • Grade 3 ? Walks slower than people of the same
  • age on the level or stops for breath while
  • walking at own pace on the level
  • Grade 4 ? Stops for breath after walking 100
    yards
  • Grade 5 ? Too breathless to leave the house or
  • breathless when dressing

severe
35
(No Transcript)
36
Functional classification
  • Predictive instrument
  • Can predict the probability of a clinical
    diagnosis (diagnostic test) or the likelihood of
    a future event (prognostic test)

37
5-year survival COPD
Dyspnea MRC scale
FEV1
...according to the level of dyspnea as evaluated
by the MRC Dyspnea Scale
...according to staging as defined by the ATS
Guidelines ( predicted FEV1)
Nishimura K, et al. Chest 2002 121 1434-1440.
38
Functional classification
  • Evaluative instrument
  • Can measure change over time in the same person
  • For example
  • Dyspnea subscale of the Chronic Respiratory
    Questionnaire (CRQ) (COPD disease-specific
    quality of life questionnaire)

39
Descriptive classification
  • Large number of possible categories
  • Can categorize instruments by
  • Content domains of interest (dyspnea, fatigue,
    emotion)
  • Generic or disease-specific

40
COPD
Questionnaires
Disease-Specific
General
  • used in any population
  • cross-condition comparison
  • co-morbid conditions and
  • effects to treatment covered
  • do not focus on HRQL/ COPD
  • irrelevant items
  • insensitive to small changes
  • focus on relevant aspects
  • of HRQL
  • greater sensitivity for
  • disease changes
  • increased responsiveness
  • no comparisons

41
Methodological classification
  • Large number of possible categories
  • Can categorize by
  • Interviewer versus self-administered
  • Objective versus subjective

42
Outline of Measurement issues
  • 1. Measurements
  • 2. Sources of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

43
Health measurements
  • Measurements may be based on
  • Laboratory or diagnostic tests (objective)
  • Indicators in which the patient or the clinician
    makes a judgement (subjective)

44
Health measurements
  • Unfortunately subjective is also used in other
    ways
  • To indicate if the variable is observable or not
  • Examples
  • Objective indicator such as  The ability to
    climb stairs 
  • Subjective indicators such as  pain or feelings 

45
Objective vs Subjective
  • Objective
  • More often continuous (lab data)
  • Few categorical (vital status, sex and race)
  • Subjective
  • Greater potential, for bias or variability on the
    part of
  • the observer
  • Many variables that are most important in caring
    for
  • patients are  soft  and subjective
  • For example pain, mood, dyspnea, ability to
    work, HRQL

46
The example of CABG
  • Why is quality of life important in studies
  • of CABG patients?
  • Survival with surgery gt medical treatment for
    patients with left main and triple vessels
  • Survival similar in patients with less severe
    disease
  • CASS NEJM 1984 European cooperative study Lancet
    1982.

47
As Feinstein has emphasized
The tendency of clinical investigators to focus
on objective rather than subjective
measurements can result in research that is both
dehumanizing and irrelevant
48
Subjective vs Objective measurement
49
Objective vs Subjective
  • Data traditionally considered objective or
    hard can be seen to have feet of softer clay
  • Example
  • X-ray or cytopathologic diagnoses have been shown
    to be subject to considerable intra- and
    interobserver variability

50
Subjective health measurements
  • May be grouped into 3 main categories
  • General feelings of well-being
  • Symptoms of illness
  • Adequacy of a persons functioning

51
Subjective health measurements
  • Advantages
  • Amplify the data obtainable from morbidity and
    mortality statistics
  • Give insights into matters of human concern such
    as pain suffering or depression
  • Offer a systematic way to record the  voice of
    the patient 
  • Do not require expensive or invasive procedures

52
Subjective health measurements
  • Disadvantages
  • Contrast sharply with the inherent reliability of
    mortality rates
  • Seem more susceptible to bias
  • Applying these measures to an entire population
    more difficult or impossible

53
Subjective health measurements
The use of rating methods suitable for
statistical analysis permit subjective health
measurements to rival the quantitative strengths
of the traditional objective indicators 
54
Health measurements
  • Scientific basis
  • Subjective judgements as a valid approach to
    measurement derive from the field of
    psychophysics
  • Psychophysical principles were later incorporated
    into psychometrics from which most of the
    techniques used to develop subjective
    measurements of health have been derived

55
Outline of Measurement issues
  • 1. Measurements
  • 2. Source of variation
  • 3. Classification
  • 4. Health measurements
  • 5. Measurement properties

56
Psychometric properties
  • Definition
  • Psychometrics is the science of using
    standardized tests or scales to measure
    attributes of a person or object

57
Numerical estimates of health
  • Many scaling methods exist for
  • Translating  indicators  into numerical
    estimates of severity
  • When it is done, they may be combined into an
    overall score, termed  health index 

58
Psychometric properties
  • Criteria for a scoring system
  • Reliability
  • Validity
  • Responsiveness
  • Minimal clinically important difference (MCID)

59
Reliability
  • Definition
  • The extent to which the same results are obtained
    when the measurement is repeated
  • It may reflect either (temporal) variation or
    random measurement error

60
Reliability
  • Key Questions
  • Internal consistency
  • Test-retest reliability (reproducibility)
  • Key Concern
  • Error
  • (error attenuates relationships between
    variables, and makes it more difficult to detect
    treatment effects)

61
Validity
  • Definition
  • The extent to which the measurement corresponds
    to the  true  value (some accepted  gold
    standard ), or behaves as expected
  • Validity depends on minimizing measurement error
    caused by bias

62
Type of measurement validity
Content validity Construct validity (convergent,
discriminant) Criterion validity (predictive,
concurrent) Cross-cultural validity Situational
validity
63
Content validity
  • Definition
  • The extent to which the items sampled for
    inclusion in the instrument adequately represent
    the domain of content (particular domain area)
    addressed by the instrument

64
Content validity
  • Key Questions
  • Theoretical foundation of the instrument
  • Instrument development primary sources of
    information, sources of items and scaling
    structure selection
  • Rules applied for content validation patient
    and/or clinician validation scientific review
  • Instrument is appropriate for the study under
    consideration

65
Content validity
  • Key concern
  • Without validity, an instrument has no meaning

66
Construct validity
  • Definition
  • The extent to which the instrument measures an
    abstract concept (construct) or attribute
    evaluated by comparison with instruments
    measuring related constructs
  • Convergent (come together, same concept) or
    discriminant with other instruments (truly
    measures something different from other
    instruments)

67
Criterion validity
  • Definition
  • Extent to which the instrument relates to an
    external criterion (criterion of practical value)
  • Concurrent (able to correlate with a present
    criterion) or predictive (able to correlate with
    a future criterion)

68
Construct validity
It is important to understand that a direct test
of the validity of an abstract concept such as
impaired health due to disease is not possible
69
Construct validity
  • Key Questions
  • Factor structure of the measure consistent with
    expectations
  • Scores from the instrument correlate with those
    of other instruments (measuring the same or
    related constructs)
  • Score from the instrument independent of scores
    from instruments measuring dissimilar constructs
  • Differentiate groups known to differ on the
    attribute being measured, e.g. on HRQL

70
Testing construct validity
  • The most widely method used is the
    multitrait-multimethod matrix
  • It involves testing a series of hypotheses
    concerning relationships between the new
    instrument and a range of reference measures of
    disease activity

71
Construct validity
  • Key concern
  • Without validity, an instrument has no meaning

72
Cross-cultural validity
  • Definition
  • The extent to which an instrument developed and
    tested in one cultural group is appropriate for,
    and behaves similarly in, another

73
Cross-cultural validity
  • Key Questions
  • Items appropriate for the culture under
    consideration
  • Instrument translated culturally and
    linguistically
  • Evidence of reliability and validity

74
Situational validity
  • Definition
  • The extent to which an instrument is appropriate
    for use in any given situation

75
Situational validity
  • Key Questions
  • Instrument should measure an appropriate outcome
    for the trial
  • Instrument should be valid for the specific
    purpose of the trial
  • Sufficiently reliable and responsive for this
    purpose
  • Sample size sufficient to detect change in the
    outcome measure of interest

76
Situational validity
  • Key Issues
  • Validity can be situation specific an instrument
    valid for one situation is not necessarily valid
    for another
  • Failure to detect treatment effects may be a
    function of study design, rather than a
    limitation of the instrument

77
Responsiveness
  • Definition
  • The extent to which scores change with a given
    change in the condition or disease state
  • Key Questions
  • Instrument has been evaluated for responsiveness
  • Effects sizes have been associated with the
    instrument in well designed trials.
  • Key concerns
  • The ability to track changes

78
MCID
  • Definition
  • The smallest difference that clinicians and
    patients would care about
  • Key Questions
  • Has the MCID been established?
  • What was the method used?
  • Key concerns
  • The ability to detect true treatment effects

79
Benefits of Pulmonary Rehabilitation
Functional exercise capacity 6-MWD (N444)
Health status CRQ dyspnea (N519)
Lacasse Y, et al. Cochrane Database Syst Rev
2002 3CD003793.
80
Key messages
  • Some simple criteria
  • The system must address a well defined clinical
    phenomenon
  • The scale has to have a clearly defined ranking
    in a hierarchical order (reasonable clinical or
    mathematical criteria)
  • The different stages or categories have to be
    mutually exclusive
  • The scale has to be adapted to the area of
    measurement where it will be applied
  • Creating complex or composite scores such as
    quality of life requires one to address issues
    concerning the inner structure of a score

81
Key messages
  • Quote from McDowell and Newell
  • Ultimately the selection of a measurement
    contains an element of art and perhaps even luck
    it is often prudent to apply more than one
    measurement whenever possible.
  • This has the advantage of reinforcing the
    conclusions of the study when the results from
    ostensibly similar methods are in agreement, and
    it also serves to increase our general
    understanding of the comparability of the
    measurements we use.
Write a Comment
User Comments (0)
About PowerShow.com