Key metrics for monitoring performance variability in edge computing applications
- PMID: 40454233
- PMCID: PMC12125128
- DOI: 10.1186/s13638-025-02469-6
Key metrics for monitoring performance variability in edge computing applications
Abstract
Edge computing is an emerging approach that enables applications to run closer to users, accommodating their specific execution time requirements. Edge computing systems typically consist of heterogeneous processing and networking components, resulting in inconsistent task performance. To improve the consistency of edge computing applications, this study presents a method to identify the factors that affect variability in task execution time. We deploy a set of single-particle analysis algorithms, designed for an electron microscopy use case, running on a Kubernetes cluster monitored by Prometheus. This specific usecase was chosen because it encompasses a diverse set of time-sensitive and privacy-sensitive applications, with a wide range of resource requirements. Our experiments revealed a significant increase in the variability of round-trip time when tasks share resources. The proposed approach identifies the most relevant monitoring metrics from a larger set of collected ones (provided by Prometheus), with correlations up to 87%. This process reduces the number of metrics to 90, achieving a reduction of 80%. As a result, the overhead of the monitoring system is decreased, and the use of these metrics for further processing, such as predictive modeling and scheduling, is simplified. These selected metrics not only help to understand the causes of performance variability, but also possess predictive value, enabling more efficient scheduling. The prediction power of these metrics is shown using SHapley Additive exPlanations analysis.
Keywords: Edge computing; Kubernetes; Monitoring metrics; Performance variability; Prometheus.
© The Author(s) 2025.
Conflict of interest statement
Competing interestsThe authors declare no Conflict of interest.
Figures
References
-
- P. Townend, S. Clement, D. Burdett, R. Yang, J. Shaw, B. Slater, J. Xu, Invited paper: improving data center efficiency through holistic scheduling in kubernetes. In: 2019 IEEE International Conference on Service-Oriented System Engineering (SOSE), pp 156–15610. IEEE. 10.1109/SOSE.2019.00030
-
- Y. Fu, S. Zhang, J. Terrero, Y. Mao, G. Liu, S. Li, D. Tao, Progress-based container scheduling for short-lived applications in a kubernetes cluster. In: 2019 IEEE International Conference on Big Data (Big Data), pp 278–287. IEEE. 10.1109/BigData47090.2019.9006427
-
- A. Kuriata, R.G. Illikkal, Predictable performance for QoS-sensitive, scalable, multi-tenant function-as-a-service deployments. In: Paasivaara, M., Kruchten, P. (eds.) Agile Processes in Software Engineering and Extreme Programming - Workshops vol. 396, pp. 133–140. Springer. 10.1007/978-3-030-58858-8_14 . Series Title: Lecture Notes in Business Information Processing
-
- A. Maricq, D. Duplyakin, R. Stutsman, I. Jimenez, R. Ricci, C. Maltzahn, Taming performance variability, 18
-
- D. Duplyakin, A. Uta, A. Maricq, R. Ricci, On studying CPU performance of CloudLab hardware. In: 2019 IEEE 27th International Conference on Network Protocols (ICNP), pp 1–2. IEEE.10.1109/ICNP.2019.8888128
LinkOut - more resources
Full Text Sources
Research Materials