TY - JOUR
T1 - Swarm learning with weak supervision enables automatic breast cancer detection in magnetic resonance imaging
AU - Saldanha, Oliver Lester
AU - Zhu, Jiefu
AU - Müller-Franzes, Gustav
AU - Carrero, Zunamys I.
AU - Payne, Nicholas R.
AU - Escudero Sánchez, Lorena
AU - Varoutas, Paul Christophe
AU - Kyathanahally, Sreenath
AU - Laleh, Narmin Ghaffari
AU - Pfeiffer, Kevin
AU - Ligero, Marta
AU - Behner, Jakob
AU - Abdullah, Kamarul A.
AU - Apostolakos, Georgios
AU - Kolofousi, Chrysafoula
AU - Kleanthous, Antri
AU - Kalogeropoulos, Michail
AU - Rossi, Cristina
AU - Nowakowska, Sylwia
AU - Athanasiou, Alexandra
AU - Perez-Lopez, Raquel
AU - Mann, Ritse
AU - Veldhuis, Wouter
AU - Camps, Julia
AU - Schulz, Volkmar
AU - Wenzel, Markus
AU - Morozov, Sergey
AU - Ciritsis, Alexander
AU - Kuhl, Christiane
AU - Gilbert, Fiona J.
AU - Truhn, Daniel
AU - Kather, Jakob Nikolas
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/2/6
Y1 - 2025/2/6
N2 - Background: Over the next 5 years, new breast cancer screening guidelines recommending magnetic resonance imaging (MRI) for certain patients will significantly increase the volume of imaging data to be analyzed. While this increase poses challenges for radiologists, artificial intelligence (AI) offers potential solutions to manage this workload. However, the development of AI models is often hindered by manual annotation requirements and strict data-sharing regulations between institutions. Methods: In this study, we present an integrated pipeline combining weakly supervised learning—reducing the need for detailed annotations—with local AI model training via swarm learning (SL), which circumvents centralized data sharing. We utilized three datasets comprising 1372 female bilateral breast MRI exams from institutions in three countries: the United States (US), Switzerland, and the United Kingdom (UK) to train models. These models were then validated on two external datasets consisting of 649 bilateral breast MRI exams from Germany and Greece. Results: Upon systematically benchmarking various weakly supervised two-dimensional (2D) and three-dimensional (3D) deep learning (DL) methods, we find that the 3D-ResNet-101 demonstrates superior performance. By implementing a real-world SL setup across three international centers, we observe that these collaboratively trained models outperform those trained locally. Even with a smaller dataset, we demonstrate the practical feasibility of deploying SL internationally with on-site data processing, addressing challenges such as data privacy and annotation variability. Conclusions: Combining weakly supervised learning with SL enhances inter-institutional collaboration, improving the utility of distributed datasets for medical AI training without requiring detailed annotations or centralized data sharing.
AB - Background: Over the next 5 years, new breast cancer screening guidelines recommending magnetic resonance imaging (MRI) for certain patients will significantly increase the volume of imaging data to be analyzed. While this increase poses challenges for radiologists, artificial intelligence (AI) offers potential solutions to manage this workload. However, the development of AI models is often hindered by manual annotation requirements and strict data-sharing regulations between institutions. Methods: In this study, we present an integrated pipeline combining weakly supervised learning—reducing the need for detailed annotations—with local AI model training via swarm learning (SL), which circumvents centralized data sharing. We utilized three datasets comprising 1372 female bilateral breast MRI exams from institutions in three countries: the United States (US), Switzerland, and the United Kingdom (UK) to train models. These models were then validated on two external datasets consisting of 649 bilateral breast MRI exams from Germany and Greece. Results: Upon systematically benchmarking various weakly supervised two-dimensional (2D) and three-dimensional (3D) deep learning (DL) methods, we find that the 3D-ResNet-101 demonstrates superior performance. By implementing a real-world SL setup across three international centers, we observe that these collaboratively trained models outperform those trained locally. Even with a smaller dataset, we demonstrate the practical feasibility of deploying SL internationally with on-site data processing, addressing challenges such as data privacy and annotation variability. Conclusions: Combining weakly supervised learning with SL enhances inter-institutional collaboration, improving the utility of distributed datasets for medical AI training without requiring detailed annotations or centralized data sharing.
UR - http://www.scopus.com/inward/record.url?scp=85218187172&partnerID=8YFLogxK
U2 - 10.1038/s43856-024-00722-5
DO - 10.1038/s43856-024-00722-5
M3 - Article
AN - SCOPUS:85218187172
SN - 2730-664X
VL - 5
JO - Communications medicine
JF - Communications medicine
IS - 1
M1 - 38
ER -