TY - GEN
T1 - Multi-scanner Canine Cutaneous Squamous Cell Carcinoma Histopathology Dataset
AU - Wilm, Frauke
AU - Fragoso, Marco
AU - Bertram, Christof A.
AU - Stathonikos, Nikolas
AU - Öttl, Mathias
AU - Qiu, Jingna
AU - Klopfleisch, Robert
AU - Maier, Andreas
AU - Breininger, Katharina
AU - Aubreville, Marc
N1 - Publisher Copyright:
© 2023 Der/die Autor(en), exklusiv lizenziert an Springer Fachmedien Wiesbaden GmbH, ein Teil von Springer Nature.
PY - 2023
Y1 - 2023
N2 - In histopathology, scanner-induced domain shifts are known to impede the performance of trained neural networks when tested on unseen data. Multidomain pre-training or dedicated domain-generalization techniques can help to develop domain-agnostic algorithms. For this, multi-scanner datasets with a high variety of slide scanning systems are highly desirable. We present a publicly available multi-scanner dataset of canine cutaneous squamous cell carcinoma histopathology images, composed of 44 samples digitized with five slide scanners. This dataset provides local correspondences between images and thereby isolates the scanner-induced domain shift from other inherent, e.g. morphology-induced domain shifts. To highlight scanner differences, we present a detailed evaluation of color distributions, sharpness, and contrast of the individual scanner subsets. Additionally, to quantify the inherent scanner-induced domain shift, we train a tumor segmentation network on each scanner subset and evaluate the performance both in - and cross-domain. We achieve a class-averaged in-domain intersection over union coefficient of up to 0.86 and observe a cross-domain performance decrease of up to 0.38, which confirms the inherent domain shift of the presented dataset and its negative impact on the performance of deep neural networks.
AB - In histopathology, scanner-induced domain shifts are known to impede the performance of trained neural networks when tested on unseen data. Multidomain pre-training or dedicated domain-generalization techniques can help to develop domain-agnostic algorithms. For this, multi-scanner datasets with a high variety of slide scanning systems are highly desirable. We present a publicly available multi-scanner dataset of canine cutaneous squamous cell carcinoma histopathology images, composed of 44 samples digitized with five slide scanners. This dataset provides local correspondences between images and thereby isolates the scanner-induced domain shift from other inherent, e.g. morphology-induced domain shifts. To highlight scanner differences, we present a detailed evaluation of color distributions, sharpness, and contrast of the individual scanner subsets. Additionally, to quantify the inherent scanner-induced domain shift, we train a tumor segmentation network on each scanner subset and evaluate the performance both in - and cross-domain. We achieve a class-averaged in-domain intersection over union coefficient of up to 0.86 and observe a cross-domain performance decrease of up to 0.38, which confirms the inherent domain shift of the presented dataset and its negative impact on the performance of deep neural networks.
UR - http://www.scopus.com/inward/record.url?scp=85164943243&partnerID=8YFLogxK
U2 - 10.1007/978-3-658-41657-7_46
DO - 10.1007/978-3-658-41657-7_46
M3 - Conference contribution
AN - SCOPUS:85164943243
SN - 9783658416560
T3 - Informatik aktuell
SP - 206
EP - 211
BT - Bildverarbeitung für die Medizin 2023 Proceedings, German Workshop on Medical Image Computing, Braunschweig
A2 - Deserno, Thomas M.
A2 - Handels, Heinz
A2 - Maier, Andreas
A2 - Maier-Hein, Klaus
A2 - Palm, Christoph
A2 - Tolxdorff, Thomas
PB - Springer Science and Business Media Deutschland GmbH
T2 - Bildverarbeitung für die Medizin Workshop, BVM 2023
Y2 - 2 July 2023 through 4 July 2023
ER -