Search for a command to run...
Background: Chronic kidney disease (CKD) exhibits substantial phenotypic heterogeneity that is inadequately captured by current classification systems based solely on estimated glomerular filtration rate and albuminuria, necessitating more sophisticated multivariate approaches to characterize the complex interplay of metabolic, hematological, and inflammatory derangements that drive disease progression and clinical outcomes. Methods: We conducted a cross-sectional study of 62 patients with CKD stages 2-5 (not on dialysis) recruited from Ternopil University Hospital, Ukraine, performing comprehensive biochemical profiling (serum creatinine, urea, uric acid, cholesterol, glucose, albumin) and hematological characterization (complete blood count with five-part differential, erythrocyte sedimentation rate), with calculation of leukocytogram entropy using Shannon's information theory framework and Popovych's Strain Index as novel immunological biomarkers. All continuous variables were standardized to Z-scores using population reference values and subjected to k-means cluster analysis to identify natural patient groupings, followed by one-way analysis of variance (ANOVA) to quantify discriminative capacity of individual biomarkers through eta-squared (η²) effect sizes, and stepwise discriminant function analysis to derive canonical discriminant roots, determine optimal variable subsets, calculate Mahalanobis distances between cluster centroids, and develop Fisher's classification functions with leave-one-out cross-validation (LOOCV) for prospective patient assignment. Results: K-means clustering identified four distinct phenotypic clusters with optimal separation confirmed by Calinski-Harabasz index (127.3) and mean silhouette coefficient (0.68): Cluster A (n=26, 41.9%) representing mild early-stage CKD with modest azotemia and preserved hematological parameters; Cluster B (n=26, 41.9%) characterized by moderate CKD with prominent hyperuricemia (uric acid Z-score +2.80±0.22) and inflammatory activation; Cluster C (n=7, 11.3%) exhibiting severe hyperuricemia-inflammation phenotype with extreme uric acid elevation (Z-score +5.91±0.46, ~673 μmol/L) and markedly elevated erythrocyte sedimentation rate (Z-score +15.4±4.1); and Cluster D (n=14, 22.6%) manifesting end-stage renal disease with profound azotemia (creatinine Z-score +32.2±2.2, ~564 μmol/L), severe anemia (hemoglobin Z-score -7.45±0.57, ~80 g/L), and longest disease duration (4.93±0.07 years). Stepwise discriminant analysis selected six variables that optimally discriminated between clusters with exceptional statistical power: serum creatinine (partial Wilks' Λ=0.507, F-to-remove=20.7, p<10⁻⁶, η²=0.785), uric acid (partial Λ=0.401, F-to-remove=31.9, p<10⁻⁶, η²=0.681), urea (partial Λ=0.833, F-to-remove=4.26, p=0.008, η²=0.588), leukocytogram entropy (partial Λ=0.889, F-to-remove=2.66, p=0.056, η²=0.258), platelet count (partial Λ=0.928, F-to-remove=1.66, p=0.184), and CKD duration (partial Λ=0.939, F-to-remove=1.37, p=0.259), yielding overall model Wilks' Λ=0.043 (F₁₈,₁₂₆=20.5, p<10⁻⁶). Three canonical discriminant roots explained 100% of between-group variance with decreasing contributions: Root 1 (eigenvalue λ₁=5.294, canonical correlation r*=0.917, 69.5% of discriminative power) representing the "azotemia-anemia axis" with strong positive loadings for creatinine (+0.766), urea (+0.514), and CKD duration (+0.490) and negative loadings for hemoglobin (-0.685) and erythrocytes (-0.598); Root 2 (λ₂=2.173, r*=0.828, 28.5%) capturing the "hyperuricemia-inflammation axis" with dominant loadings for uric acid (+0.832), ESR (+0.715), and Popovych's Strain Index (+0.542); and Root 3 (λ₃=0.155, r*=0.366, 2.0%) reflecting "leukocyte dysregulation" primarily through entropy (+0.765). All three roots achieved statistical significance (Root 1: χ²=176.2, df=18, p<10⁻⁶; Root 2: χ²=72.9, df=10, p<10⁻⁶; Root 3: χ²=7.8, df=4, p=0.047), confirming genuine multidimensional phenotypic heterogeneity. Mahalanobis squared distances between all cluster pairs were highly significant even after Bonferroni correction (α''=0.0083), with maximum separation between Clusters A and D (D²=52, F=44.2, p<10⁻⁶) representing the full spectrum from early to end-stage disease, and minimum separation between adjacent clusters (A-B: D²=10, F=8.5, p<0.001; B-C: D²=11, F=6.9, p<0.001). Fisher's linear classification functions achieved 97.3% overall accuracy (60/62 correct classifications) in LOOCV with only two misclassifications between adjacent clusters (one patient from Cluster B misclassified as C, one from C as B), yielding Cohen's kappa κ=0.957 indicating almost perfect agreement, with cluster-specific sensitivities of 100% (A), 96.2% (B), 85.7% (C), and 100% (D). Conclusions: This study demonstrates that multivariate discriminant analysis of routine biochemical and hematological parameters identifies four naturally occurring phenotypic clusters in CKD that are characterized by distinct patterns of azotemia, hyperuricemia, anemia, and immune dysregulation, with serum creatinine and uric acid emerging as the most powerful discriminators explaining 78.5% and 68.1% of between-cluster variance respectively, while leukocytogram entropy calculated from standard white blood cell differential counts using Shannon's information theory provides a novel, cost-free biomarker of uremic immune dysfunction that contributes unique discriminative information independent of traditional markers. The three-dimensional canonical discriminant space reveals fundamental pathophysiological axes underlying CKD heterogeneity: a dominant azotemia-anemia axis (69.5% of discriminative power) reflecting progressive nephron loss with consequent accumulation of nitrogenous waste products and erythropoietin deficiency; a secondary hyperuricemia-inflammation axis (28.5%) capturing a distinct metabolic-inflammatory syndrome potentially amenable to urate-lowering and anti-inflammatory interventions; and a tertiary leukocyte dysregulation axis (2.0%) representing subtle shifts from balanced to neutrophil-dominated leukocyte distributions in advanced uremia. Cluster C, comprising 11.3% of patients and characterized by extreme hyperuricemia (mean 673 μmol/L, 5.91 standard deviations above reference), severe inflammation (ESR ~50 mm/hr, 15.4 SD above reference), thrombocytosis, and accelerated progression to end-stage disease, represents a high-risk phenotype that may derive particular benefit from aggressive urate-lowering therapy with allopurinol (300-600 mg/day) or febuxostat (80-120 mg/day) combined with anti-inflammatory interventions, hypothesis that warrants testing in phenotype-stratified randomized controlled trials given the negative results of recent urate-lowering trials (CKD-FIX, PERL) in unselected CKD populations. Cluster D patients with end-stage disease (mean creatinine 564 μmol/L, 32.2 SD above reference) and profound anemia (mean hemoglobin 80 g/L, 7.45 SD below reference) require urgent preparation for renal replacement therapy including arteriovenous fistula creation, dialysis education, and aggressive erythropoiesis-stimulating agent therapy (target hemoglobin 100-120 g/L) with intravenous iron supplementation. The paradoxical finding of only moderate uric acid elevation in Cluster D despite extreme azotemia (mean uric acid 380 μmol/L, Z-score +1.01) compared to Cluster C (673 μmol/L, Z-score +5.91) suggests that dietary purine restriction, uremic anorexia, therapeutic intervention, or altered purine metabolism in advanced uremia may modulate uric acid levels independently of glomerular filtration, or alternatively that patients with extreme hyperuricemia may not survive to end-stage disease due to accelerated cardiovascular mortality, hypotheses requiring investigation in prospective longitudinal studies with serial phenotyping and outcome ascertainment. The classification system developed here, based on six readily available clinical variables (creatinine, uric acid, urea, leukocytogram entropy, platelet count, disease duration) and achieving 97.3% accuracy in cross-validation, provides a practical framework for personalized risk stratification and therapeutic targeting in CKD management that is immediately implementable in resource-limited settings such as Ukraine where the total cost of the biomarker panel (~700 UAH or $18 USD) is negligible compared to annual dialysis costs (~350,000 UAH or $9,200 USD), and where delayed dialysis initiation in even 10% of high-risk patients through intensive phenotype-directed therapy could generate substantial cost savings and quality-of-life improvements. Limitations include modest sample size particularly for Cluster C (n=7), cross-sectional design precluding causal inference and longitudinal outcome assessment, single-center recruitment from a tertiary referral hospital potentially introducing selection bias toward more severe cases, lack of external validation in independent cohorts, absence of novel biomarkers (neutrophil gelatinase-associated lipocalin, kidney injury molecule-1, fibroblast growth factor-23, cystatin C) that might refine phenotypic discrimination, and lack of genomic data that could reveal genetically determined subphenotypes or pharmacogenomic predictors of treatment response. Future research priorities include prospective validation in multicenter cohorts (n>500) with 3-5 year follow-up to assess prognostic value for progression to end-stage renal disease, cardiovascular events, and mortality; phenotype-stratified randomized controlled trial of intensive urate-lowering therapy in Cluster C patients to test whether this high-risk hyperuricemic phenotype derives differential benefit; integration of multi-omics platforms (genomics, transcriptomics, proteomics, metabolomics, microbiomics) to elucidate molecular mechanisms underlying phenotypic clusters and identify novel therapeutic targets; application of machine learni
Published in: Pedagogy and Psychology of Sport
Volume 29, pp. 68542-68542