TY - GEN
T1 - Designing for Qualitative Evaluation of Synthetic Medical Data
AU - Silva, Isabella Barbosa
AU - Oliveira, Elsa
AU - Melo, Ricardo
AU - Rosado, Luís
AU - Gálvez-Barrón, César
AU - Heijink, Irene Bernadet
AU - Hoogteijling, Sem
AU - Gabilondo, Iñigo
N1 - Publisher Copyright:
© 2025 Copyright held by the owner/author(s).
PY - 2025/4/26
Y1 - 2025/4/26
N2 - Machine learning in healthcare often struggles with data access for model training due to privacy restrictions, rare conditions, and high acquisition costs. Synthetic data offers a potential workaround, yet there are no agreed-upon gold standards for evaluating it. As quantitative metrics alone cannot fully assess the desired qualities of generative model outputs, human inspection is a key component of validation, warranting a “Doctor-in-the-loop” approach. However, research is scarce on best practices for interaction and user interface design in such systems. This paper presents preliminary designs for qualitative synthetic medical data evaluation, informed by four participatory workshops with seven doctors and nine machine learning engineers. Spanning tabular, image, and time series data, this study emphasised transparency and clear communication of the synthetic data generation. In addition to presenting the rationale behind the evaluation workflow design, we highlight challenges in the medical domain, including doctors’ limited familiarity and skepticism with synthetic data.
AB - Machine learning in healthcare often struggles with data access for model training due to privacy restrictions, rare conditions, and high acquisition costs. Synthetic data offers a potential workaround, yet there are no agreed-upon gold standards for evaluating it. As quantitative metrics alone cannot fully assess the desired qualities of generative model outputs, human inspection is a key component of validation, warranting a “Doctor-in-the-loop” approach. However, research is scarce on best practices for interaction and user interface design in such systems. This paper presents preliminary designs for qualitative synthetic medical data evaluation, informed by four participatory workshops with seven doctors and nine machine learning engineers. Spanning tabular, image, and time series data, this study emphasised transparency and clear communication of the synthetic data generation. In addition to presenting the rationale behind the evaluation workflow design, we highlight challenges in the medical domain, including doctors’ limited familiarity and skepticism with synthetic data.
KW - Doctor-in-the-Loop (DITL)
KW - Human-Computer Interaction (HCI)
KW - Machine Learning in Healthcare
KW - Participatory Design
KW - Qualitative Evaluation
KW - Synthetic Data (SD)
KW - Synthetic medical data (SMD)
UR - http://www.scopus.com/inward/record.url?scp=105005776125&partnerID=8YFLogxK
U2 - 10.1145/3706599.3720274
DO - 10.1145/3706599.3720274
M3 - Conference contribution
AN - SCOPUS:105005776125
T3 - Conference on Human Factors in Computing Systems - Proceedings
BT - CHI EA 2025 - Extended Abstracts of the 2025 CHI Conference on Human Factors in Computing Systems
T2 - 2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025
Y2 - 26 April 2025 through 1 May 2025
ER -