Efficient Microservice Autoscaling through muOpt

IRIS

Microservices have become the architecture of choice for cloud applications requiring high performance and scalability. Autoscaling, which dynamically adjusts resource allocation based on workload fluctuations, is key to optimizing performance and controlling costs. This paper presents μOpt, a computationally efficient, model-based autoscaler specifically designed for microservices. μOpt leverages a nonlinear optimization problem tied to a fluid approximation of a layered queuing network (LQN) model to determine optimal configurations that maximize key performance metrics—such as throughput, CPU usage, and response time—while minimizing operational costs. On a well-known benchmark application, our numerical experiments show that μOpt achieves fast solution times, enabling responsiveness to dynamic workloads. Compared to a state-of-the-art LQN-based autoscaler employing genetic algorithms, μOpt delivers improved application performance using fewer resources. To demonstrate the robustness and generalizability of our underlying model, we validate its prediction accuracy across ten randomly generated applications with diverse architectures, showing that its performance is a reliable foundation for autoscaling. Finally, it also outperforms Horizontal Pod Autoscaler, a production-ready solution for Kubernetes deployments in Google Cloud Platform, consistently reducing resource usage while more accurately tracking CPU utilization targets across both synthetic and real-world workloads.

Efficient Microservice Autoscaling through muOpt / Incerto, E., Pizziol, R., Tribastone, M.. - In: IEEE TRANSACTIONS ON SERVICES COMPUTING. - ISSN 1939-1374. - (2026), pp. 1-14. [10.1109/TSC.2026.3696863]

Efficient Microservice Autoscaling through muOpt

Incerto Emilio;Pizziol Roberto;Tribastone Mirco

2026

Abstract

Microservices have become the architecture of choice for cloud applications requiring high performance and scalability. Autoscaling, which dynamically adjusts resource allocation based on workload fluctuations, is key to optimizing performance and controlling costs. This paper presents μOpt, a computationally efficient, model-based autoscaler specifically designed for microservices. μOpt leverages a nonlinear optimization problem tied to a fluid approximation of a layered queuing network (LQN) model to determine optimal configurations that maximize key performance metrics—such as throughput, CPU usage, and response time—while minimizing operational costs. On a well-known benchmark application, our numerical experiments show that μOpt achieves fast solution times, enabling responsiveness to dynamic workloads. Compared to a state-of-the-art LQN-based autoscaler employing genetic algorithms, μOpt delivers improved application performance using fewer resources. To demonstrate the robustness and generalizability of our underlying model, we validate its prediction accuracy across ten randomly generated applications with diverse architectures, showing that its performance is a reliable foundation for autoscaling. Finally, it also outperforms Horizontal Pod Autoscaler, a production-ready solution for Kubernetes deployments in Google Cloud Platform, consistently reducing resource usage while more accurately tracking CPU utilization targets across both synthetic and real-world workloads.

Scheda breve

Scheda completa

Scheda completa (DC)

	Anno
	
				2026
			
	Rivista
	
				IEEE TRANSACTIONS ON SERVICES COMPUTING
			
	Codice OpenAlex
	
				W7162321435
			
	Parole chiave
	
				Microservices, Autoscaling, Performance Modeling, Optimization, Layered Queueing Networks
			
	Appare nelle tipologie:
	
				1.1 Articolo in rivista

File in questo prodotto:

File	Dimensione	Formato
Efficient_Microservice_Autoscaling_through_muOpt.pdf embargo fino al 31/12/2026 Descrizione: Efficient Microservice Autoscaling through μOpt Tipologia: Documento in Post-print Licenza: Non specificato Dimensione 1.71 MB Formato Adobe PDF Visualizza/Apri Richiedi una copia	1.71 MB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.11771/41681

Citazioni

ND

ND

0

social impact