Evaluating the ROC performance of markers for future events - PowerPoint PPT Presentation

1 / 33
About This Presentation
Title:

Evaluating the ROC performance of markers for future events

Description:

Data Analysis. Seattle Heart Failure. Kidney Biomarker Study (TPF, FPF) for ... Summarize the current methods of ROC analysis for censored failure time data. ... – PowerPoint PPT presentation

Number of Views:209
Avg rating:3.0/5.0
Slides: 34
Provided by: coe172
Category:

less

Transcript and Presenter's Notes

Title: Evaluating the ROC performance of markers for future events


1
Evaluating the ROC performance of markers for
future events
  • Yuying Jin, Yingye Zheng, Margaret Pepe
  • Department of Biostatistics, University of
    Washington
  • Fred Hutchinson Cancer Research Center
  • Aug 2nd, 2007

2
Application I
  • Acute Kidney Injury
  • Event Patients undergoing major cardiac surgery
    are at high risk of kidney damage. An AKI event
    occurs when serum creatinine level increases by
    25 and sustained for 24 hours.
  • Outcome time from surgery to AKI event.
  • Marker Two new markers measured in urine at
    baseline and at various intervals after surgery
    are under investigation.
  • Goal Determine the numbers of AKI patients be
    diagnosed in advance with new markers, by how
    long and at what cost in terms of false
    diagnoses.
  • Issues Study involves competing risk due to
    non-AKI deaths, no censoring.

3
Application II
  • Seattle Heart Failure Study
  • Event More than 5 million people in United State
    have heart failure. The event of this study is
    defined as death caused by heart failure.
  • Outcome time from entry to death.
  • Marker Linear combinations of predictors derived
    from a cohort (SHF score).
  • Goal Evaluate the performance of the SHF score
    to discriminate people who die in the first 2
    years or not.
  • Issues This study includes censoring, but no
    competing risk events.

4
Overview
  • Definition of ROC for Event Time Outcomes
  • Estimation Approaches
  • Retrospective Methods
  • Model marker distribution conditioning on
    outcome.
  • Prospective Methods
  • Model event time conditioning on marker.
  • Data Analysis
  • Seattle Heart Failure
  • Kidney Biomarker Study

5
(TPF, FPF) for Binary Marker
  • Binary marker Y measured at baseline.
  • TPFt sensitivity (t) P(Y positive event at t)
  • FPF 1-specificity P(Y positive controls)
  • Natural controls exist for some studies, e.g. AKI
    study.
  • If there are no natural controls, controls can be
    determined by outcome time gt t.
  • t is a large landmark time point to define
    controls.
  • FPF 1-specificity P(Y positive Tgt t)
  • Focus on incident True Positive Fraction and
    static False Positive Fraction defined in
    Heagerty and Zheng (2005).

6
ROC curve for Continuous Marker
  • Continuous marker Y measured at baseline.
  • With a specific threshold rule Ygtc,
  • TPFt(c) sensitivity (c,t) P(Ygtc event
    at t),
  • FPF(c) 1-specificity(c) P(Ygtc
    controls).
  • TPFt is a decreasing function of t.
  • ROCt is the plot of TPFt(c) versus FPF(c) for

7
Retrospective Estimation Methods
  • Leisenring et al. (1997) proposed simple binary
    regression for binary marker and no censoring.
  • Etzioni et al. (1999) extended the binary
    regression approach to continuous marker, again
    in the absence of censoring.
  • Cai et al. (2006) offer a comprehensive
    approach, encompassing previous methods and
    extending them to censored failure time data.

8
Leisenring and Etzionis Methods
  • For binary marker without censoring, Leisenring
    et. al. estimate FPF using controls and binary
    regression to estimate TPF(t)
    from cases, where g-1 is link function and
    , is a set of polynomial functions.
  • For continuous marker, in the absence of
    censoring, Etzioni et al. use ROC-GLM to model
    ,
  • where g-1 is link function, g(h(f)) is the
    baseline ROC curve at t0 and fFPF.
  • ROC-GLM is a general regression method of
    modeling ROC curves directly. (CaiPepe (2002),
    JASA Pepe (1997), Biometrika)

9
Cais Method
  • Extended to censored data.
  • Censored subject
  • Control when censored time Xgt t
  • Weighted average of cases and controls
    when Xlt t.
  • Weights are determined by estimating the
    distribution of T using standard failure time
    methods.
  • For continuous biomarker with censoring, Cai et
    al. replace each marker with a series of binary
    records, corresponding to a series of thresholds,
    c1,,cp and adopt ROC-GLM model.

10
Prospective Estimation Methods
  • Prospective model combines risk regression
    techniques with observed predictor distributions
    to calculate TPF and FPF.
  • Heagerty and Zheng (2005) and Song and Zhou (in
    press) employ a Cox model for a baseline marker.
  • Censoring is naturally incorporated.

11
Heagerty and Zhengs Method
  • Cox model for a baseline marker Y
  • For binary marker and denote the risk set at t by
    R(t),

  • is a consistent estimate of TPF(t), follows
    from Xu and OQuigley (2000).
  • is the empirical estimate.
  • With continuous biomarkers, is the empirical
    dist. of Y in controls,

  • where c , generalizes
    .

12
Song and Zhous Method for binary marker
  • Use Bayes theorem to write TPF(t) and FPF, and
    estimate TPF(t) and FPF using estimates from Cox
    model.
  • For binary marker,
  • TPF(t)
  • FPF
  • When

13
Song and Zhous Method for continuous maker
  • With continuous marker, integrals over the
    distribution of Y, F, substitute
    estimated from Cox Model and the empirical
    distribution of Y.
  • ROC Curve estimator is

14
Heagerty and Zhengs versus Song and Zhous
  • Advantages of Song and Zhous method,
  • More efficient, uses maximum partial likelihood
    estimators.
  • Allows censoring to depend on Y.
  • Advantage of Heagerty and Zhengs method,
  • Allows estimation under non-proportional hazards.
  • Current Song and Zhous method is valid only when
    proportional hazards assumption satisfied. But it
    can be extended to non-proportional hazards
    situation with further work.

15
Comparisons of Retrospective and Prospective
methods
  • All are semiparametric methods.
  • Censoring depending on Y is only accommodated by
    SZ.
  • Prospective Methods can only accommodate study
    designs that allow the hazard function and
    population distribution of predictor can be
    calculated, e.g. a simple case control design
    cannot be accommodated by Prospective Methods.
  • Retrospective methods can include disease
    specific covariates, e.g. severity of kidney
    injury.

16
Data Analysis I
  • Seattle Heart Failure
  • A random sample of 1000 observations from
    Val-heft trial
  • Controls are subjects alive at 2 years
  • There are 165 deaths, 375 censoring observations
    before 2 years and 460 subjects remaining alive
    at 2 years.
  • Crude ROC curve
  • Cases are the subjects who have events in the
    interval (t-?,t?).
  • Controls are the subjects whose outcome times are
    aftert.
  • Censored observations are excluded.

17
Seattle Heart Failure Study
Figure 1 SHF scores measured at enrollment in
cases. A box-plot of the SHF score distribution
in known controls.
18
Seattle Heart Failure Study cont.
Figure 2 ROC curves using 4 methods
(a)categorizing T and comparing with known
controls only (b) Cais retrospective method
(c) Heagerty and Zheng with proportional hazards
model and (d) Song and Zhou with a proportional
hazards model.
19
Seattle Heart Failure Study cont.
Table 1 Comparison of estimated ROC curves at
f0.2 and f0.8. 95 confidence intervals in
parentheses are based on the same 200
bootstrapped samples.
  • Conclusion
  • Crude ROC curves have the largest variance
  • Prospective methods have narrower confidence
    intervals than Cais method.
  • Among prospective methods, Song and Zhous method
    is more efficient than Heagerty and Zhengs.


20
Data Analysis II
  • Kidney Biomarker Study
  • Simulated data that approximates the study design
  • There are 1800 subjects in the study. 1440
    patients who has no kidney injury are treated as
    controls. There are 136 severe AKI events, 206
    mild AKI event, and 18 patients died for non
    kidney related causes.
  • True ROC curve
  • This is a simulated study. With extremely large
    data size, we can approximate the true ROC curve.

21
Kidney Biomarkers Study
Figure 3 Baseline AKI (Acute Kidney Injury)
biomarker distributions. Lowess curves for
biomarkers in severe and mild AKI subgroups are
shown.
22
Kidney Biomarkers Study cont.
Figure 4a True and Crude ROC curves for the
baseline AKI biomarker at T 1 and 2 days after
surgery. Solid line is T1 and dashed line is T2.
23
Kidney Biomarkers Study cont.
Figure 4b Estimated ROC curves using Cais and
Song and Zhous methods for the baseline AKI
biomarker at T 1 and 2 days after surgery.
24
Kidney Biomarkers Study cont.
  • For baseline AKI marker, ROC curve estimated from
    Cais method follows the true ROC well.
  • Song and Zhous estimates are not close to the
    true ROC curves comparing to Cais method. It is
    possible due to the violation of proportional
    hazards assumption.

25
Conclusion
  • Summarize the current methods of ROC analysis for
    censored failure time data. In our opinion a
    retrospective analysis is more natural and direct
    .
  • Focus of paper is only ROC curves. Various
    methods exist for estimating AUC (area under ROC
    curve).
  • Slides available at http//www.fhcrc.org/science/l
    abs/pepe/dabs/

26
Discussion
  • ROC curve for event time outcome can be extended
    to longitudinal markers.
  • Other Definitions of True Positive Fraction and
    False Positive Fraction are not covered.
    (Heagerty and Zheng (2005), Biometrics)
  • Questions?

27
References
  • Cai T and Pepe MS (2002) Semi-parametric ROC
    analysis to evaluate biomarkers for disease.
    Journal of the American Statistical Association
    9710991107.
  • Etzioni R, Pepe M, Longton G, Hu C, Goodman G
    (1999) Incorporating the time dimension in
    receiver operating characteristic curves a case
    study of prostate cancer. Medical Decision Making
    19242251.
  • Heagerty PJ, Zheng Y (2005) Survival model
    predictive accuracy and ROC curves. Biometrics
    6192105.
  • Heagerty PJ, Lumley T, Pepe MS (2000)
    Time-dependent ROC curves for censored survival
    data and a diagnostic marker. Biometrics 56
    337344.

28
References
  • Pepe MS (2003) The Statistical Evaluation of
    Medical Tests for Classification and
    Prediction.Oxford University Press. New York.
  • Song X and Zhou XH (in press) A semiparametric
    approach for the covariate specific ROC curve
    with survival outcome. Statistica Sinca.
  • Xu R, OQuigley J (2000) Proportional hazards
    estimate of the conditional survival function.
    Journal of the Royal Statistical Society Series B
    62 667-680.
  • Zheng Y, Heagerty PJ (2004) Semiparametric
    estimation of time-dependent ROC curves for
    longitudinal marker data. Biostatistics 5
    615-632.

29
ROC curve for longitudinal marker
  • Longitudinal marker Y is binary and measured at
    time s.
  • Reset clock of measure time s to zero for Y(s)
    clustered observations.
  • TPF (s, t) Prob (Y(s)1Tst)
  • Sensitivity of marker depends on event time and
    measurement time.
  • FPF (s) Prob (Y(s)1Tgtst)
  • Time dependent ROC
  • ROCt,s(f)Prob(Y(s)gtc(s)Tst) where
    c(s)F-1(1-f)
  • F denotes the cdf for Y(s) in the control group.
  • ROCt,s(f) is TPF(t,s) corresponding to an
    FPF(s)f.

30
Kidney Biomarkers Study (longitudinal)
Figure 5 Biomarker distributions in cases as a
function of the time lag between marker
measurement and event time, t T - s, and in
controls.
31
Kidney Biomarkers Study (longitudinal) cont.
Figure 6a True and Crude ROC curves for the
longitudinally measured AKI biomarker measured at
1 and 2 days prior to clinical diagnosis of AKI
with serum creatinine.
32
Kidney Biomarkers Study (longitudinal) cont.
Figure 6b Estimated ROC curves using Cais and
Song and Zhous methods for the longitudinally
measured AKI biomarker measured at 1 and 2 days
prior to clinical diagnosis of AKI with serum
creatinine.
33
Kidney Biomarkers Study (longitudinal) cont.
  • For longitudinal AKI marker, ROC curve by Cais
    method is close to the crude ROC curve.
  • Song and Zhous method appears to underestimate
    the ROC curve, especially at smaller FPFs.
    Presumably the proportional hazards assumptions
    fails.
Write a Comment
User Comments (0)
About PowerShow.com