Search for a command to run...
This paper focuses on an approach to address large-scale data gathered from heterogeneous sources by integrating static and dynamic data in hierarchical clusterization, and its application to the analysis of retail branches. Traditionally, branch clustering analysis has relied on static information and the utilization of statistical measures to extract relevant features from the dynamic data and incorporate them into the static dataset; however, the application of this approach presents several challenges. This research proposes a solution that addresses these disadvantages while aiming to maintain the success achieved when applying unsupervised machine learning algorithms. The paper presents an approach based on the integration of static attributes and time series data in a hierarchical clustering manner that enables the identification of key performance indicators and offers insight into factors that influence branch performance over time. The results show the potential to optimize resource allocation, inventory management, and customer service strategies. The proposed approach is demonstrated using retail shop data from a Spanish telecommunications company (Grupo Masmovil), highlighting its effectiveness in enhancing cluster profiling and offering meaningful insights beyond the prevailing approaches. This method presents significant enrichment for clustering analysis that can be applied to different domains. • Integrated static and time-series data for comprehensive retail branch clustering. • Successfully applied the approach to a Spanish telecommunications company data. • Addressed big data variety by proposing hierarchical clustering for data integration. • The proposed approach extends beyond retail and is applicable to diverse domains.