ABSTRACT
Nonparametric inference of the area under ROC curve (AUC) has been well developed either in the presence of verification bias or clustering. However, current nonparametric methods are not able to handle cases where both verification bias and clustering are present. Such a case arises when a two-phase study design is applied to a cohort of subjects (verification bias) where each subject might have multiple test results (clustering). In such cases, the inference of AUC must account for both verification bias and intra-cluster correlation. In the present paper, we propose an IPW AUC estimator that corrects for verification bias and derive a variance formula to account for intra-cluster correlations between disease status and test results. Results of a simulation study indicate that the method that assumes independence underestimates the true variance of the IPW AUC estimator in the presence of intra-cluster correlations. The proposed method, on the other hand, provides a consistent variance estimate for the IPW AUC estimator by appropriately accounting for correlations between true disease statuses and between test results.
Disclosure statement
No potential conflict of interest was reported by the author(s).
Appendix A. Proof of EquationEquation (4)(4)
(4)
First observe that can be expressed as
Now, we prove that . To this end, we observe that
can be written as
Define Then,
Since , thus it suffices to show
Now, we have
Therefore,
where is the maximum cluster size and
is the minimum sampling probability. This shows that
. Finally, note that
which completes the proof of EquationEquation (4)(4)
(4) .