Search for a command to run...
Diabetes is a growing global health concern, affecting 537 million adults worldwide, significantly contributing to mortality and disability. Cluster analysis, a branch of Unsupervised machine learning (ML), has been increasingly used to identify distinct subgroups of patients with diabetes, enabling better disease stratification and personalized treatment strategies. Despite the growing burden of diabetes, clinical management often relies on generalized treatment approaches that may not account for patient heterogeneity. There is a need to better understand the underlying subtypes of diabetes to enable more personalized care. This narrative review evaluates the application of clustering in stratifying diabetes patients across diverse populations based on clinical, biomarker, and molecular characteristics and more. A literature search was conducted using PubMed, and Google Scholar following the Scale for the Assessment of Narrative Review Articles (SANRA) guidelines. A total of 22 articles were finalized to be included in the analysis where different clustering approaches on diabetes patients have been used. This review further classifies the studies into diabetes subgroups focusing specifically on type 2 diabetes patients where Severe Autoimmune Diabetes (SAID), Severe Insulin Deficient Diabetes (SIDD), Mild Obesity-Related Diabetes (MOD), and Mild Age-Related Diabetes (MARD) clusters were most commonly found, and biomarker-based classification such as inflammatory and molecular including only type 2 diabetes patients, where additional two clusters: Severe Insulin Resistant Diabetes with Relative Insulin Insufficiency (SIRD-RII), and Mild Age-Related Diabetes with Relative Insulin Insufficiency (MARD-RII), were found. This review underscores the potential of clustering in diabetes research and identifies gaps in standardization across studies. Future research should focus on refining clustering approaches, mainly introducing soft clustering to identify if patients fall in more than 1 cluster. What is currently known about this topic? Diabetes has distinct subgroups (SAID, SIDD, SIRD, MOD, MARD). Machine Learning clustering can identify T2DM heterogeneity, comorbidity patterns & risk. The Ahlqvist 5-cluster model is largely reproducible across populations. What is the key research question? How does the application of clustering stratify diverse diabetes patient populations for personalized care? What is new? Our review is novel because it introduces the concept of personalized care through clustering to a high burden setting where generalized treatment is failing, as evidenced by the poorly understood link between disease causation and poor outcomes, especially microvascular dysfunction. It also highlights the treatment-stratification gap by discussing newer, disease-modifying agents like SGLT-2 inhibitors (for kidney/cardio protection) and Ozempic (for fat reduction), which represent a massive paradigm shift. Clustering is the novel methodological tool required to prioritize these expensive therapies to the right endophenotypes in Pakistan (e.g., SIRD, MOD) to maximize public health impact. Moreover, soft/fuzzy clustering methods (e.g., in T2DM genetic or metabolic studies) reveal that a significant proportion of patients exhibit overlap between traditional "hard" clusters (like MOD and MARD), moving toward a more realistic continuum of disease. How might this study influence clinical practice? Enables personalized risk stratification and treatment based on T2DM subtype.
Published in: Cardiovascular Diabetology – Endocrinology Reports
Volume 12, Issue 1