TY - JOUR
T1 - Automated pelvic MRI measurements associated with urinary incontinence for prostate cancer patients undergoing radical prostatectomy
AU - van den Berg, Ingeborg
AU - Spaans, Robert N
AU - Wessels, Frank J
AU - van der Hoeven, Erik J R J
AU - Nolthenius, Charlotte J Tutein
AU - van den Bergh, Roderick C N
AU - van der Voort van Zyp, Jochem R N
AU - van den Berg, Cornelis A T
AU - van Melick, Harm H E
N1 - Publisher Copyright:
© 2023, The Author(s).
PY - 2024/1/2
Y1 - 2024/1/2
N2 - Background: Pelvic morphological parameters on magnetic resonance imaging (MRI), such as the membranous urethral length (MUL), can predict urinary incontinence after radical prostatectomy but are prone to interobserver disagreement. Our objective was to improve interobserver agreement among radiologists in measuring pelvic parameters using deep learning (DL)-based segmentation of pelvic structures on MRI scans. Methods: Preoperative MRI was collected from 167 prostate cancer patients undergoing radical prostatectomy within our regional multicentric cohort. Two DL networks (nnU-Net) were trained on coronal and sagittal scans and evaluated on a test cohort using an 80/20% train-test split. Pelvic parameters were manually measured by three abdominal radiologists on raw MRI images and with the use of DL-generated segmentations. Automated measurements were also performed for the pelvic parameters. Interobserver agreement was evaluated using the intraclass correlation coefficient (ICC) and the Bland–Altman plot. Results: The DL models achieved median Dice similarity coefficient (DSC) values of 0.85–0.97 for coronal structures and 0.87–0.98 for sagittal structures. When radiologists used DL-generated segmentations of pelvic structures, the interobserver agreement for sagittal MUL improved from 0.64 (95% confidence interval 0.28–0.83) to 0.91 (95% CI 0.84–0.95). Furthermore, there was an increase in ICC values for the obturator internus muscle from 0.74 (95% CI 0.42–0.87) to 0.86 (95% CI 0.75–0.92) and for the levator ani muscle from 0.40 (95% CI 0.05–0.66) to 0.61 (95% CI 0.31–0.78). Conclusions: DL-based automated segmentation of pelvic structures improved interobserver agreement in measuring pelvic parameters on preoperative MRI scans. Relevance statement: The implementation of deep learning segmentations allows for more consistent measurements of pelvic parameters by radiologists. Standardized measurements are crucial for incorporating these parameters into urinary continence prediction models. Key points: • DL-generated segmentations improve interobserver agreement for pelvic measurements among radiologists. • Membranous urethral length measurement improved from substantial to almost perfect agreement. • Artificial intelligence enhances objective pelvic parameter assessment for continence prediction models. Graphical Abstract: [Figure not available: see fulltext.]
AB - Background: Pelvic morphological parameters on magnetic resonance imaging (MRI), such as the membranous urethral length (MUL), can predict urinary incontinence after radical prostatectomy but are prone to interobserver disagreement. Our objective was to improve interobserver agreement among radiologists in measuring pelvic parameters using deep learning (DL)-based segmentation of pelvic structures on MRI scans. Methods: Preoperative MRI was collected from 167 prostate cancer patients undergoing radical prostatectomy within our regional multicentric cohort. Two DL networks (nnU-Net) were trained on coronal and sagittal scans and evaluated on a test cohort using an 80/20% train-test split. Pelvic parameters were manually measured by three abdominal radiologists on raw MRI images and with the use of DL-generated segmentations. Automated measurements were also performed for the pelvic parameters. Interobserver agreement was evaluated using the intraclass correlation coefficient (ICC) and the Bland–Altman plot. Results: The DL models achieved median Dice similarity coefficient (DSC) values of 0.85–0.97 for coronal structures and 0.87–0.98 for sagittal structures. When radiologists used DL-generated segmentations of pelvic structures, the interobserver agreement for sagittal MUL improved from 0.64 (95% confidence interval 0.28–0.83) to 0.91 (95% CI 0.84–0.95). Furthermore, there was an increase in ICC values for the obturator internus muscle from 0.74 (95% CI 0.42–0.87) to 0.86 (95% CI 0.75–0.92) and for the levator ani muscle from 0.40 (95% CI 0.05–0.66) to 0.61 (95% CI 0.31–0.78). Conclusions: DL-based automated segmentation of pelvic structures improved interobserver agreement in measuring pelvic parameters on preoperative MRI scans. Relevance statement: The implementation of deep learning segmentations allows for more consistent measurements of pelvic parameters by radiologists. Standardized measurements are crucial for incorporating these parameters into urinary continence prediction models. Key points: • DL-generated segmentations improve interobserver agreement for pelvic measurements among radiologists. • Membranous urethral length measurement improved from substantial to almost perfect agreement. • Artificial intelligence enhances objective pelvic parameter assessment for continence prediction models. Graphical Abstract: [Figure not available: see fulltext.]
KW - Artificial intelligence
KW - Deep learning
KW - Membranous urethral length
KW - Prostate cancer
KW - Urinary incontinence
UR - http://www.scopus.com/inward/record.url?scp=85181252866&partnerID=8YFLogxK
U2 - 10.1186/s41747-023-00402-4
DO - 10.1186/s41747-023-00402-4
M3 - Article
C2 - 38165522
SN - 2509-9280
VL - 8
JO - European radiology experimental
JF - European radiology experimental
IS - 1
M1 - 1
ER -