• No results found

Adaptation of the weighted Kaplan-Meier method to time-dependent ROC curves

N/A
N/A
Protected

Academic year: 2022

Share "Adaptation of the weighted Kaplan-Meier method to time-dependent ROC curves"

Copied!
11
0
0

Loading.... (view fulltext now)

Full text

(1)

RESEARCH ARTICLE

J.Natn.Sci.Foundation Sri Lanka 2018 46 (1): 11 - 21 DOI: http://dx.doi.org/10.4038/jnsfsr.v46i1.8261

Adaptation of the weighted Kaplan-Meier method to time-dependent ROC curves

Deniz Sigirli1*, Ilker Ercan1, Ozkan Balcin2 & Ekrem Kaya2

1 'HSDUWPHQWRI%LRVWDWLVWLFV)DFXOW\RI0HGLFLQH8OXGDJ8QLYHUVLW\%XUVD7XUNH\

2 'HSDUWPHQWRI*HQHUDO6XUJHU\)DFXOW\RI0HGLFLQH8OXGDJ8QLYHUVLW\%XUVD7XUNH\

Revised: 10 March 2017; Accepted: 17 August 2017

Abstract: This study was aimed at adapting the weighted Kaplan-Meier method to time-dependent ROC curve analysis.

The performances of these two time-dependent ROC curve methods were compared, in which the Kaplan-Meier estimator and weighted Kaplan-Meier estimator were used. An application was presented for pancreatic cancer patients to evaluate the prognostic ability of the CA19-9 antigen. A simulation study was performed for different scenarios to see the performance of the proposed method. In all situations, it is observed that the AUC values that were obtained by the weighted time-dependent ROC (WTDR) curves more closely approximated the real AUC values than the classical time-dependent ROC (TDR) curve method and has got smaller mean square error rates.

Keywords: Censored data, time-dependent ROC curves, weighted Kaplan-Meier.

INTRODUCTION

In classical ROC analysis, disease status is treated as a

¿[HGVWDEOHIHDWXUHRIWKHH[SHULPHQWDOXQLW+RZHYHU

disease status can change over time, and those individuals who are not diseased can develop the disease during the study period. There can be a certain time lag between the time that the diagnostic test is conducted and the onset of the disease. How well a diagnostic test result, having been measured at the beginning of the study, can discriminate between diseased and undiseased individuals at a [0, t]

follow-up time, is the question that must be addressed in such situations.

In literature there are several discussions of time- dependent ROC curves (Etzioni et al., 1999; Heagerty et al., 2000; Slate & Turnbull, 2000; Cai et al., 2003;

2006; Heagerty & Zheng, 2005; Chambless & Diao, 2006; Uno et al., 2007; Hung & Chiang, 2010a;

Martínez-Camblor et al., 2016). Cumulative/dynamic, incident/static and incident/dynamic estimators for WLPHGHSHQGHQW VHQVLWLYLW\VSHFL¿FLW\ DQG UHODWHG 52&

FXUYHV ZHUH GH¿QHG E\ +HDJHUW\ DQG =KHQJ   and also discussed by Cai et al. (2006) and Pepe et al.

(2008). Sensitivity can be estimated using cases that DUH VWUDWL¿HG DFFRUGLQJ WR WKH WLPH DW ZKLFK WKH HYHQW

RFFXUV LQFLGHQWVHQVLWLYLW\ RUXVLQJDOOFDVHVLGHQWL¿HG

up to time t FXPXODWLYHVHQVLWLYLW\ 6SHFL¿FLW\FDQEH

estimated using all individuals who are not cases at time t G\QDPLFVSHFL¿FLW\ RUXVLQJRQO\WKRVHLQGLYLGXDOVZKR

DUH HYHQW IUHH WKURXJK D ¿[HG IROORZXS SHULRG VWDWLF

VSHFL¿FLW\ 7KHLQFLGHQWVWDWLFGH¿QLWLRQRIVHQVLWLYLW\

VSHFL¿FLW\ZDVFRQVLGHUHGE\VHYHUDODXWKRUV (W]LRQLet al., 1999; Slate & Turnbull, 2000; Heagerty & Zheng, 2005; Cai et al., 2006), and two methods were proposed by Etzioni et al   7KH ¿UVW RQH FDOFXODWHV WKH

ROC curve given estimates of the longitudinal model parameters by utilising random-effects models to capture the correlation between within-subject measurements.

The second one is based on estimating the ROC curve at any time of interest, by setting the time covariate to a VSHFL¿FYDOXH6ODWHDQG7XUQEXOO  JDYHGH¿QLWLRQV

RI VHQVLWLYLW\ VSHFL¿FLW\ DQG UHODWHG 52& FXUYH IRU

* Corresponding author ([email protected]; https://orcid.org/0000-0002-4006-3263)

This article is published under the Creative Commons CC-BY-ND License (http://creativecommons.org/licenses/by-nd/4.0/).

This license permits use, distribution and reproduction, commercial and non-commercial, provided that the original work is properly cited and is not changed anyway.

(2)

12 Deniz Sigirli et al.

the longitudinal biomarker data by focusing on fully Bayesian hierarchical models and the latent disease process models. Cai et al. (2006) used generalised linear models to estimate time-dependent ROC (TDR) curve with incident sensitivity in censored data. They modelled the dependence in time by using vectors of polynomial or spline basis functions.

The present study was focused on the cumulative/

G\QDPLF GH¿QLWLRQ ZKHUH HDFK LQGLYLGXDO SOD\V WKH

role of control for times t”X, and then contributes as a case for times t > X, where X is the failure time. Time- dependent ROC curves related to cumulative sensitivity DQG G\QDPLF VSHFL¿FLW\ KDV UHFHQWO\ EHHQ XVHG E\ D

number of authors (Heagerty et al., 2000; Chambless &

Diao, 2006; Uno et al., 2007; Song & Zhou, 2008; Hung

& Chiang, 2010a; Wolf et al., 2011; Blanche et al., 2013a;

Li et al., 2015; Li, 2016; Martínez-Camblor et al., 2016;

Rodríguez-Álvarez et al., 2016). Heagerty et al. (2000) proposed estimators based on the cumulative distribution function of the biomarker and Kaplan-Meier estimator of the survival function. Since this method does not satisfy the condition of monotonicity for the ROC curve, they also proposed an estimator based on the nearest neighbour estimator of Akritas (Akritas et al., 1994), which estimates the bivariate distribution of the marker and the failure time using kernel smoothing techniques.

Two methods were proposed by Chambless and Diao   7KH ¿UVW RQH LV D UHFXUVLYH FDOFXODWLRQ RYHU

the ordered times of events, analogous to the Kaplan- Meier approach to survival function estimation. This method does not guarantee the monotonicity. The second approach uses a regression survival model to estimate the conditional survival probability of the outcome at time t given the marker Y.

 6RQJDQG=KRX  GH¿QHGWKHFRYDULDWHVSHFL¿F

time-dependent ROC curve using both the cumulative and incident sensitivity. Uno et al. (2007) and Hung and Chiang (2010a) have proposed estimators based on inverse probability of censoring weighting (IPCW).

Blanche et al. (2013a) proposed a conditional IPWC PHWKRG ZKLFK LV WKH PRGL¿HG YHUVLRQ RI ,3&: WR

obtain a nonparametric estimator robust to marker- dependent censoring. They gave a detailed review of the time-dependent ROC curve estimators proposed in literature and compared the properties. Wolf et al.

(2011) introduced a method for calculating sensitivity DQGVSHFL¿FLW\IRUFHQVRUHGGDWDEDVHGRQWKH1HOVRQ

Aalen estimator and they used isotonic regression to achieve monotonicity for the ROC curve. Li et al. (2015) proposed a weighting method to estimate cumulative/

G\QDPLF WLPHGHSHQGHQW VHQVLWLYLW\VSHFL¿FLW\ DQG

related ROC curve nonparametrically by using uniform

kernel, which has connections to the methods in Heagerty et al. (2000) and Chambless and Diao (2006). Martínez- Camblor et al. (2016) proposed a methodology, which assigns a probability of belonging to a group by using proportional hazard Cox regression model and Kaplan–

Meier estimator for cumulative/dynamic ROC curve estimation. Rodríguez-Álvarez et al. (2016) proposed nonparametric regression estimators Nadaraya-Watson kernel weights for cumulative/dynamic ROC curves in the presence of covariate-dependent censoring. Also, Li (2016) studied estimation of cumulative/dynamic time- dependent ROC and area under the ROC curve (AUC) for left-truncated and right-censored data.

 7KHDUHDXQGHUWKH52&FXUYH $8& ZDVGH¿QHG

as the probability that the marker value of a randomly selected case exceeds the marker value of a randomly selected control (Hanley & McNeil, 1982; Pepe, 2003).

The extension of AUC to the time-dependent setting has been discussed by many authors (Heagerty & Zheng, 2005; Chambless & Diao, 2006; Chiang & Hung, 2010; Hung & Chiang, 2010b; Cai et al., 2011; Viallon

& Latouche, 2011; Schmid et al., 2015; Lambert &

Chevret, 2016). The construction in their works was to estimate AUC (t) for each t time. Blanche et al. (2013b) presented different estimators of the time-dependent AUC for univariate survival data, longitudinal setting and competing events setting. The time-dependent ROC curve can be calculated by plotting the sensitivity versus

VSHFL¿FLW\IRUDUDQJHRIFXWRIIYDOXHVDQGWKH$8&

can be calculated using standard numerical integration methods such as trapezoid rule (Campbell, 1994;

Chambless & Diao, 2006).

The term borrowing strength is typically used in Bayesian statistics and generally references an attempt to improve precision by using additional information from allied sources.

The weighted likelihood (WL) function has been designed to incorporate information from populations that are relevant, but not of prime inferential interest to the study population (Hu, 1994). The WL function suggested by Hu and Zidek (2002) is based on the result of James and Stein (1961) insofar as, in terms of the sum of the mean-square-error-of-estimation criterion, the sample averages could be improved upon by borrowing information from the other samples – the so-called Stein paradox (James & Stein, 1961; Hu & Zidek, 2002).

Similar to the James-Stein estimator, a WL estimator that facilitates drawing inferences on one sample by using additional information from different populations was suggested by Hu and Zidek (2002).

References

Related documents

Residents are required to pay both an entrance fee and a monthly fee which includes housing, medical help, activities, etc... All Israeli health funds (Kupat Holim)

On December 17, 1997 the NSW Department of Health issued a media release warning the public to cook pipis well before eating them. When subsequent cases reported incubation periods

Within Micropterigidae as sampled here (five genera, node 3), the molecular data strongly support a basal split between the Nearctic Epimartyria (Walsingham) (representing a

A diverse population of birds belonging to 64 families were identified in the riverine areas along the Bhagirathi river (upper Ganges) at different elevations during the survey

Since the railway companies have already developed the areas along their lines as much as they can, the only way in which they can pursue growth is to move outside the areas

In a longer article, Blum and Ames claim that tris-BP is more mutagenic in the bacterial test than several known carcinogens. They too caution however that “the use of the

BTT standards/values for some edible vegetable oils under 2.2: Fats, oils and Fat emulsions as per Food Safety and Standard Act 2006 and food product standards and food

Other things you need to know Using the Periodic Table Qualitative Analysis Notes?. Glossary of terms used in