Confounding adjustment: Ideas in Action a case study - PowerPoint PPT Presentation

About This Presentation
Title:

Confounding adjustment: Ideas in Action a case study

Description:

underwent an initial Percutaneous Coronary Intervention (PCI) were treated with 'usual care' alone or usual care plus a relatively expensive ... – PowerPoint PPT presentation

Number of Views:77
Avg rating:3.0/5.0
Slides: 35
Provided by: mbswo
Category:

less

Transcript and Presenter's Notes

Title: Confounding adjustment: Ideas in Action a case study


1
Confounding adjustment Ideas in Action -a case
study
  • Xiaochun Li, Ph.D. Associate Professor Division
    of Biostatistics Indiana University School of
    Medicine

2
Outline
  • Description of the data set
  • Quantity to be estimated
  • Summary of baseline characteristics
  • Approaches to data analyses
  • Results
  • Discussion

3
Simulation Setup
  • Linder Center data described and analyzed in
    Kereiakes et al. (2000)
  • 6 month follow-up data on 996 patients who
  • underwent an initial Percutaneous Coronary
    Intervention (PCI)
  • were treated with usual care alone or usual
    care plus a relatively expensive blood thinner
    (IIB/IIIA cascade blocker
  • has10 variables
  • Y 2 outcomes, mort6mo (efficacy) and cardcost
    (cost)
  • X 1 treatment variable, and 7 baseline
    covariates, stent, height, female, diabetic,
    acutemi, ejecfrac and ves1proc

4
Baseline characteristics
5
The LSIM10K dataset
  • Simulation data set was based on the Linder
    Center data
  • 17 copies of the clustered Lindner data, with
    fudge factors added to ejfract and hgt, and some
    clipping
  • same correlation among covariates, same
    clustering patterns
  • Contains the values of 10 simulated variables for
    10,325 hypothetical patients
  • To simplify analyses, the data contain no missing
    values.
  • Details and dataset available from Bobs website

6
What do we want to estimate?
  • The population average treatment effect (ATE),
    i.e.,
  • E(Y1) - E(Y0)
  • Y1 and Y0 are conterfactual outcomes
  • In plain words what if scenarios
  • The expected response if treatment had been
    assigned to the entire study population minus the
    expected response if control had been assigned to
    the entire study population

7
Baseline covariate balanceassessment
8
Visualizing overall imbalance
Deep blue high values
C
T
9
Analytical Methodsfor confounding adjustment
  • The following methods were applied to lsim10k
  • Outcome regression adjustment (OR)
  • Propensity score (PS) stratification
  • Inverse-probability-treatment-weighted (IPTW)
  • Doubly robust estimation
  • Matching by
  • Mahalonobis distance
  • PS only

10
Analysis of mort6mo
  • OR model for mort6mo
  • treatment indicator (trtm)
  • main effect terms for all seven covariates
  • quadratic terms for both height and ejfract
  • Residual deviance 2410.4 on 10323 degrees
    of freedom
  • PS model
  • saturated model for the five categorical
    covariates (main effects and interaction terms up
    to fifth-order)
  • main effects and quadratic terms for height and
    ejfract

11
Covariates Balance Evaluations based on PS
Quintiles
12
Stent
13
Female
14
Diabetic
15
Acutemi
16
Ves1proc
17
Heightstrata 2 (0.95 cm) and 3 (-1.50cm)
18
Height
  • Existence of residual confounding after adjusting
    for PS quintiles
  • The within-stratum between-group height
    difference
  • mean s.d. p
  • Stratum 2 0.949 0.44 0.032
  • Stratum 3 -1.497 0.43 0.0005

19
Ejfractstrata 1 (0.81), 2 (-1.32) and 3 (-0.72)
20
Ejfract
  • Existence of residual confounding after
    adjusting for PS quintiles
  • The within-strata between-group height
    difference
  • mean s.d. p-value
  • Stratum 1 0.812 0.41 0.0475
  • Stratum 2 -1.322 0.33 7.38e-5
  • Stratum 3 -0.721 0.32 0.025

21
PS Stratification
  • Residual confounding within strata
  • In PS stratification method, height and ejfract
    are further adjusted
  • stratum specific
  • Treatment effect
  • Height, ejfract main effects and their quadratic
    terms

22
Results mort6mo
True ?-0.036

Results of all methods are consistent, providing
evidence of treatment effectiveness at
preventing death at 6 months.
23
Analysis of cardcost
ps model same as before
  • cardcost model
  • treatment indicator (trtm)
  • main effect terms for all seven covariates
  • quadratic terms for both height and ejfract
  • cardcost model of CA with PS stratification
  • stratum specific
  • Treatment effect
  • Height, ejfract main effects and their quadratic
    terms

24
Model checking OR Adjusted R-squared 0.0386
25
Model checking OR (log transformed) Adjusted
R-squared 0.0693
26
Results cardcost

27
IPTW 1 vs 2
28
Discussion
  • All methods give consistent results on the 2
    outcomes
  • All PS based results have similar variance except
    IPTW1
  • IPTWs depend on approx. correct PS model
  • OR depends on approx. correct outcome model
  • DR is a fortuitous combination of OR and IPTW
    depends on one of models being right
  • Nonparametric models of either models may be an
    alternative to parametric models

29
Double Robustness
  • wrong PS model adjust for one covariate
    acutemi only
  • wrong OR model for card cost adjust for the
    treatment indicator trtm and the acutemi
    covariate

By right, we mean approximately.
30
Propensity score estimation
  • The majority applications in literature use a
    parametric logistic regression model that assume
    covariates are linear and additive on the log
    odds scale
  • May include selected interactions and polynomial
    terms
  • Accurate PS estimation is impeded by
  • High dimensional covariates which ones should
    we de-confound?
  • Unknown functional form how do they relate to
    the treatment selection
  • PS model misspecification can substantially bias
    the estimated treatment effect
  • Nonparametric approach is flexible to accommodate
    nonlinear/non-additive relationship of covariates
    to treatment assignment, e.g., trees

31
Nonparametric regression techniques
  • Generalized Boosted Models (GBM) to estimate the
    propensity score function
  • Friedman, 2001 Madigan and Ridgeway, 2004
    McCaffrey, Ridgeway, and Morral, 2004
  • R package twang
  • Regression tree model to predict cardcost
  • Ripley, 1996 Therneau and Atkinson, 1997
  • R package rpart

32
Generalized Boosted Models (GBM)
  • A multivariate nonparametric regression technique
  • Sum of a large set of simple regression trees
    modelling log-odds
  • gbm finds mle of g(x)log(p(x)/(1-p(x)),
    p(x)P(T1x)
  • Predict treatment assignment from a large number
    of pretreatment covariates adaptively choose
    them
  • Nonlinear
  • No need to select variables
  • Can model complex interactions
  • Invariant to monotone transformations of x
  • E.g, same PS estimates whether use age, log(age)
    or age2
  • Outperforms alternative methods in prediction
    error

33
Results cardcostnonparametric approach

34
Future research
  • People try quintiles, deciles for propensity
    score stratification need data driven approach
    (based on bias-variance tradeoff) for number of
    strata
  • Model selection PS model, and outcome model
  • Nonparametric estimation of models may be
    intuitive, but not clear about the properties of
    the causal estimates
  • Nonparametric caveat still need to define a set
    of confounders based on knowledge of causal
    relationship among treatment, outcome and
    covariates rather than conditioning
    indiscriminatly on all covariates that have
    associations with treatment and outcome
Write a Comment
User Comments (0)
About PowerShow.com