Designing for Qualitative Evaluation of Synthetic Medical Data

Isabella Barbosa Silva*, Elsa Oliveira, Ricardo Melo, Luís Rosado, César Gálvez-Barrón, Irene Bernadet Heijink, Sem Hoogteijling, Iñigo Gabilondo

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

Machine learning in healthcare often struggles with data access for model training due to privacy restrictions, rare conditions, and high acquisition costs. Synthetic data offers a potential workaround, yet there are no agreed-upon gold standards for evaluating it. As quantitative metrics alone cannot fully assess the desired qualities of generative model outputs, human inspection is a key component of validation, warranting a “Doctor-in-the-loop” approach. However, research is scarce on best practices for interaction and user interface design in such systems. This paper presents preliminary designs for qualitative synthetic medical data evaluation, informed by four participatory workshops with seven doctors and nine machine learning engineers. Spanning tabular, image, and time series data, this study emphasised transparency and clear communication of the synthetic data generation. In addition to presenting the rationale behind the evaluation workflow design, we highlight challenges in the medical domain, including doctors’ limited familiarity and skepticism with synthetic data.

Original languageEnglish
Title of host publicationCHI EA 2025 - Extended Abstracts of the 2025 CHI Conference on Human Factors in Computing Systems
ISBN (Electronic)9798400713958
DOIs
Publication statusPublished - 26 Apr 2025
Event2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025 - Yokohama, Japan
Duration: 26 Apr 20251 May 2025

Publication series

NameConference on Human Factors in Computing Systems - Proceedings

Conference

Conference2025 CHI Conference on Human Factors in Computing Systems, CHI EA 2025
Country/TerritoryJapan
CityYokohama
Period26/04/251/05/25

Keywords

  • Doctor-in-the-Loop (DITL)
  • Human-Computer Interaction (HCI)
  • Machine Learning in Healthcare
  • Participatory Design
  • Qualitative Evaluation
  • Synthetic Data (SD)
  • Synthetic medical data (SMD)

Fingerprint

Dive into the research topics of 'Designing for Qualitative Evaluation of Synthetic Medical Data'. Together they form a unique fingerprint.

Cite this