Abstract
Diagnostic tests are often evaluated by comparison of the areas under receiver operating characteristic (ROC) curves. In this study the authors compared this approach with a more direct method that takes into account consequences of a diagnosis. Data from a prospective study of diagnosis of pulmonary embolism were used for a motivating example. Using multivariable logistic regression analysis, three diagnostic models were built and compared based on their ROC curves. Although model 1 (0,706) and model 2 (0.702) had the same ROC-curve area, they-performed differently when risks and benefits of subsequent decisions were considered by applying the treatment probability threshold. Models 1 and 3 (0.611) had substantially different ROC-curve areas but performed similarly taking into account the therapeutic consequences. This demonstrates that comparison of diagnostic tests using the areas under the ROC curves may lead to erroneous conclusions about therapeutic usefulness. To correspond to daily practice, it would be more appropriate to also consider the clinical implications in evaluating diagnostic tests. This is made feasible by explicit definition and application of a treatment threshold.
Original language | English |
---|---|
Pages (from-to) | 447-454 |
Number of pages | 8 |
Journal | Medical Decision Making |
Volume | 17 |
Issue number | 4 |
DOIs | |
Publication status | Published - 27 Oct 1997 |
Keywords
- Benefits and rises
- Clinical relevance
- Diagnosis
- Diagnostic test evaluation
- ROC curve
- Treatment threshold