TY - JOUR
T1 - Stratification of Patients with Diabetes Using Continuous Glucose Monitoring Profiles and Machine Learning
AU - Mao, Yinan
AU - Tan, Kyle Xin Quan
AU - Seng, Augustin
AU - Wong, Peter
AU - Toh, Sue Anne
AU - Cook, Alex R.
N1 - Publisher Copyright:
Copyright © 2022 Yinan Mao et al. Exclusive Licensee Peking University Health Science Center. Distributed under a Creative Commons Attribution License (CC BY 4.0).
PY - 2022
Y1 - 2022
N2 - Background. Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making. Methods. In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and k-means to cluster patients' records into one of four glucotypes and analyze cluster membership using multinomial logistic regression. Results. Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels. Conclusions. This pipeline provides a fast automatic function to label raw CGM data without manual input.
AB - Background. Continuous glucose monitoring (CGM) offers an opportunity for patients with diabetes to modify their lifestyle to better manage their condition and for clinicians to provide personalized healthcare and lifestyle advice. However, analytic tools are needed to standardize and analyze the rich data that emerge from CGM devices. This would allow glucotypes of patients to be identified to aid clinical decision-making. Methods. In this paper, we develop an analysis pipeline for CGM data and apply it to 148 diabetic patients with a total of 8632 days of follow up. The pipeline projects CGM data to a lower-dimensional space of features representing centrality, spread, size, and duration of glycemic excursions and the circadian cycle. We then use principal components analysis and k-means to cluster patients' records into one of four glucotypes and analyze cluster membership using multinomial logistic regression. Results. Glucotypes differ in the degree of control, amount of time spent in range, and on the presence and timing of hyper- and hypoglycemia. Patients on the program had statistically significant improvements in their glucose levels. Conclusions. This pipeline provides a fast automatic function to label raw CGM data without manual input.
UR - http://www.scopus.com/inward/record.url?scp=85138464001&partnerID=8YFLogxK
U2 - 10.34133/2022/9892340
DO - 10.34133/2022/9892340
M3 - Article
AN - SCOPUS:85138464001
SN - 2097-1095
VL - 2022
JO - Health Data Science
JF - Health Data Science
M1 - 9892340
ER -