Revista de Investigación Científica y Tecnológica Alpha Centauri
Introduction
In today’s highly competitive retail environment,
characterized by the massive digitalization of transactions
and the proliferation of customer interaction channels,
understanding and anticipating consumer behavior has
become a strategic necessity. Companies that successfully
interpret purchasing patterns and customer motivations
gain a sustainable competitive advantage, since retaining
customers is more profitable than acquiring new ones and
contributes directly to long-term brand loyalty and
profitability [1], [2].
However, modern consumer behavior is increasingly
dynamic and complex. Digital immediacy, hyper-
personalized services, and omnichannel purchasing
experiences have generated vast amounts of heterogeneous
data that challenge traditional segmentation models [3].
The RFM (Recency, Frequency, Monetary value)
approach, widely used in marketing analytics, remains
effective for identifying high-value customers but fails to
capture behavioral, contextual, and nonlinear relationships
between variables [6], [7], [11]. Recent studies emphasize
that classical RFM-based models lack the flexibility
required to represent customer engagement and evolving
consumption patterns in digital retail environments [13].
To address these limitations, Machine Learning (ML)
provides a more robust and adaptive analytical framework.
ML algorithms can process massive datasets, uncover
hidden behavioral patterns, and predict customer actions
based on historical and contextual information [9], [12].
Specifically, ML-based segmentation models integrate
transactional, demographic, and behavioral data, allowing
the formation of more accurate and operationally
meaningful customer clusters that enhance strategic
decision-making in marketing and customer management.
The model proposed in this research introduces a new
behavioral variable into the traditional RFM framework,
extending it into an RFMC model (Recency, Frequency,
Monetary value, Category/Engagement). This additional
dimension incorporates customer engagement or product
category interaction, capturing behavioral nuances that are
not reflected in spending frequency or transaction volume.
The integration of this variable, together with
dimensionality reduction through Principal Component
Analysis (PCA), significantly increases cluster cohesion
and predictive power without compromising statistical
robustness.
From a technical standpoint, the inclusion of the
“Category” or “Engagement” variable enhances
segmentation granularity, distinguishing customers who
display similar frequency and spending patterns but differ
in product preferences or interaction levels. This
enrichment of traditional metrics expands the analytical
capacity of ML systems, allowing more precise predictions
and the delivery of personalized marketing
recommendations [8], [10].
From a commercial perspective, the implications of the
proposed model are equally relevant. By more accurately
identifying high-value segments and anticipating churn
patterns, companies can design more effective marketing
strategies, optimize resource allocation in promotional
campaigns, and strengthen loyalty programs. The synergy
between predictive analytics and commercial strategy
fosters a transformation toward data-driven marketing,
maximizing both customer satisfaction and organizational
profitability.
In summary, this research demonstrates that an advanced
Machine Learning segmentation model, enhanced with an
additional behavioral variable beyond the classical RFM
framework, provides an effective analytical tool for
predicting customer behavior and strengthening loyalty in
the retail sector. The model aligns technological innovation
with strategic business goals, contributing to
competitiveness, customer retention, and sustainable value
creation.
Literature review
2.1 Artificial Intelligence and Machine Learning in
Customer Management
Artificial Intelligence (AI) has transformed the way
companies manage customer relationships, enabling a
deeper understanding of behaviors, needs, and motivations.
Within this field, Machine Learning (ML) has emerged as
one of the most effective tools for predictive analytics and
service personalization [1], [9].
ML, as a subfield of AI, is based on algorithms that learn
from data and improve their performance without explicit
programming. Unlike traditional statistical models, ML
algorithms do not rely on rigid assumptions about data
distribution, providing greater flexibility to detect
nonlinear and hidden patterns among variables [9]. This
adaptability is especially valuable in the retail sector, where
information is massive, heterogeneous, and continuously
evolving.
According to Kühl et al. [9], ML-driven service systems
have evolved from descriptive models toward adaptive
systems that integrate transactional, demographic, and
contextual data. Similarly, Aguiar-Costa et al. [1]
demonstrated that AI-driven customer service applications
significantly increase satisfaction and loyalty by enabling