TY - JOUR
T1 - Sharing Medical Big Data While Preserving Patient Confidentiality in Innovative Medicines Initiative
T2 - A Summary and Case Report from BigData@Heart
AU - Schröder, Megan
AU - Muller, Sam H.A.
AU - Vradi, Eleni
AU - Mielke, Johanna
AU - Lim, Yvonne M.F.
AU - Couvelard, Fabrice
AU - Mostert, Menno
AU - Koudstaal, Stefan
AU - Eijkemans, Marinus J.C.
AU - Gerlinger, Christoph
N1 - Publisher Copyright:
© Megan Schröder et al. 2023; Published by Mary Ann Liebert, Inc.
PY - 2023/12
Y1 - 2023/12
N2 - Sharing individual patient data (IPD) is a simple concept but complex to achieve due to data privacy and data security concerns, underdeveloped guidelines, and legal barriers. Sharing IPD is additionally difficult in big data-driven collaborations such as Bigdata@Heart in the Innovative Medicines Initiative, due to competing interests between diverse consortium members. One project within BigData@Heart, case study 1, needed to pool data from seven heterogeneous data sets: five randomized controlled trials from three different industry partners, and two disease registries. Sharing IPD was not considered feasible due to legal requirements and the sensitive medical nature of these data. In addition, harmonizing the data sets for a federated data analysis was difficult due to capacity constraints and the heterogeneity of the data sets. An alternative option was to share summary statistics through contingency tables. Here it is demonstrated that this method along with anonymization methods to ensure patient anonymity had minimal loss of information. Although sharing IPD should continue to be encouraged and strived for, our approach achieved a good balance between data transparency while protecting patient privacy. It also allowed a successful collaboration between industry and academia.
AB - Sharing individual patient data (IPD) is a simple concept but complex to achieve due to data privacy and data security concerns, underdeveloped guidelines, and legal barriers. Sharing IPD is additionally difficult in big data-driven collaborations such as Bigdata@Heart in the Innovative Medicines Initiative, due to competing interests between diverse consortium members. One project within BigData@Heart, case study 1, needed to pool data from seven heterogeneous data sets: five randomized controlled trials from three different industry partners, and two disease registries. Sharing IPD was not considered feasible due to legal requirements and the sensitive medical nature of these data. In addition, harmonizing the data sets for a federated data analysis was difficult due to capacity constraints and the heterogeneity of the data sets. An alternative option was to share summary statistics through contingency tables. Here it is demonstrated that this method along with anonymization methods to ensure patient anonymity had minimal loss of information. Although sharing IPD should continue to be encouraged and strived for, our approach achieved a good balance between data transparency while protecting patient privacy. It also allowed a successful collaboration between industry and academia.
KW - data transparency
KW - health data research collaborations
KW - patient data privacy
KW - responsible data governance
KW - sharing individual patient data
KW - summary statistics
UR - http://www.scopus.com/inward/record.url?scp=85176381956&partnerID=8YFLogxK
U2 - 10.1089/big.2022.0178
DO - 10.1089/big.2022.0178
M3 - Article
C2 - 37889577
AN - SCOPUS:85176381956
SN - 2167-6461
VL - 11
SP - 399
EP - 407
JO - Big Data
JF - Big Data
IS - 6
ER -