Practice of Alibaba Cloud on Elastic Resource Provisioning for Large-scale Microservices Cluster

by   Minxian Xu, et al.

Cloud-native architecture is becoming increasingly crucial for today's cloud computing environments due to the need for speed and flexibility in developing applications. It utilizes microservice technology to break down traditional monolithic applications into light-weight and self-contained microservice components. However, as microservices grow in scale and have dynamic inter-dependencies, they also pose new challenges in resource provisioning that cannot be fully addressed by traditional resource scheduling approaches. The various microservices with different resource needs and latency requirements can create complex calling chains, making it difficult to provide fine-grained and accurate resource allocation to each component while maintaining the overall quality of service in the chain. In this work, we aim to address the research problem on how to efficiently provision resources for the growing scale of microservice platform and ensure the performance of latency-critical microservices. To address the problem, we present in-depth analyses of Alibaba's microservice cluster and propose optimized resource provisioning algorithms to enhance resource utilization while ensuring the latency requirement. First, we analyze the distinct features of microservices in Alibaba's cluster compared to traditional applications. Then we present Alibaba's resource capacity provisioning workflow and framework to address challenges in resource provisioning for large-scale and latency-critical microservice clusters. Finally, we propose enhanced resource provisioning algorithms over Alibaba's current practice by making both proactive and reactive scheduling decisions based on different workloads patterns, which can improve resource usage by 10-15 necessary latency for microservices.


page 1

page 2

page 5

page 7

page 10

page 17


Containers Orchestration with Cost-Efficient Autoscaling in Cloud Computing Environments

Containers are standalone, self-contained units that package software an...

CASH: A Credit Aware Scheduling for Public Cloud Platforms

The public cloud offers a myriad of services which allows its tenants to...

Prism: Revealing Hidden Functional Clusters from Massive Instances in Cloud Systems

Ensuring the reliability of cloud systems is critical for both cloud ven...

Dynamic Resource Allocation in the Cloud with Near-Optimal Efficiency

Cloud computing has motivated renewed interest in resource allocation pr...

A Dynamic, Hierarchical Resource Model for Converged Computing

Extreme dynamic heterogeneity in high performance computing systems and ...

Graph-PHPA: Graph-based Proactive Horizontal Pod Autoscaling for Microservices using LSTM-GNN

Microservice-based architecture has become prevalent for cloud-native ap...

Adaptive Services Function Chain Orchestration For Digital Health Twin Use Cases: Heuristic-boosted Q-Learning Approach

Digital Twin (DT) is a prominent technology to utilise and deploy within...

Please sign up or login with your details

Forgot password? Click here to reset