TY - JOUR
T1 - GRADE guidance 37
T2 - rating imprecision in a body of evidence on test accuracy
AU - Mustafa, Reem A.
AU - El Mikati, Ibrahim K.
AU - Murad, M. Hassan
AU - Hultcrantz, Monica
AU - Steingart, Karen R.
AU - Yang, Bada
AU - Leeflang, Mariska M.G.
AU - Akl, Elie A.
AU - Dahm, Philipp
AU - Schünemann, Holger J.
N1 - Publisher Copyright:
© 2023
PY - 2024/1
Y1 - 2024/1
N2 - Objectives: To provide guidance on rating imprecision in a body of evidence assessing the accuracy of a single test. This guide will clarify when Grading of Recommendations Assessment, Development and Evaluation (GRADE) users should consider rating down the certainty of evidence by one or more levels for imprecision in test accuracy. Study Design and Setting: A project group within the GRADE working group conducted iterative discussions and presentations at GRADE working group meetings to produce this guidance. Results: Before rating the certainty of evidence, GRADE users should define the target of their certainty rating. GRADE recommends setting judgment thresholds defining what they consider a very accurate, accurate, inaccurate, and very inaccurate test. These thresholds should be set after considering consequences of testing and effects on people-important outcomes. GRADE's primary criterion for judging imprecision in test accuracy evidence is considering confidence intervals (i.e., CI approach) of absolute test accuracy results (true and false, positive, and negative results in a cohort of people). Based on the CI approach, when a CI appreciably crosses the predefined judgment threshold(s), one should consider rating down certainty of evidence by one or more levels, depending on the number of thresholds crossed. When the CI does not cross judgment threshold(s), GRADE suggests considering the sample size for an adequately powered test accuracy review (optimal or review information size [optimal information size (OIS)/review information size (RIS)]) in rating imprecision. If the combined sample size of the included studies in the review is smaller than the required OIS/RIS, one should consider rating down by one or more levels for imprecision. Conclusion: This paper extends previous GRADE guidance for rating imprecision in single test accuracy systematic reviews and guidelines, with a focus on the circumstances in which one should consider rating down one or more levels for imprecision.
AB - Objectives: To provide guidance on rating imprecision in a body of evidence assessing the accuracy of a single test. This guide will clarify when Grading of Recommendations Assessment, Development and Evaluation (GRADE) users should consider rating down the certainty of evidence by one or more levels for imprecision in test accuracy. Study Design and Setting: A project group within the GRADE working group conducted iterative discussions and presentations at GRADE working group meetings to produce this guidance. Results: Before rating the certainty of evidence, GRADE users should define the target of their certainty rating. GRADE recommends setting judgment thresholds defining what they consider a very accurate, accurate, inaccurate, and very inaccurate test. These thresholds should be set after considering consequences of testing and effects on people-important outcomes. GRADE's primary criterion for judging imprecision in test accuracy evidence is considering confidence intervals (i.e., CI approach) of absolute test accuracy results (true and false, positive, and negative results in a cohort of people). Based on the CI approach, when a CI appreciably crosses the predefined judgment threshold(s), one should consider rating down certainty of evidence by one or more levels, depending on the number of thresholds crossed. When the CI does not cross judgment threshold(s), GRADE suggests considering the sample size for an adequately powered test accuracy review (optimal or review information size [optimal information size (OIS)/review information size (RIS)]) in rating imprecision. If the combined sample size of the included studies in the review is smaller than the required OIS/RIS, one should consider rating down by one or more levels for imprecision. Conclusion: This paper extends previous GRADE guidance for rating imprecision in single test accuracy systematic reviews and guidelines, with a focus on the circumstances in which one should consider rating down one or more levels for imprecision.
KW - GRADE
KW - Guideline
KW - Imprecision
KW - Systematic review
KW - Test accuracy
KW - Threshold
UR - http://www.scopus.com/inward/record.url?scp=85179887762&partnerID=8YFLogxK
U2 - 10.1016/j.jclinepi.2023.10.005
DO - 10.1016/j.jclinepi.2023.10.005
M3 - Article
C2 - 38613246
AN - SCOPUS:85179887762
SN - 0895-4356
VL - 165
JO - Journal of Clinical Epidemiology
JF - Journal of Clinical Epidemiology
M1 - 111189
ER -