Scales linearly and horizontally without having to size for peak loads
Relies on the Kubernetes Horizontal Pod Autoscalers (HPAs) to automatically scale pod replicas based on thresholds set on collected metrics and a baselined redundancy allocation
Scales based on standard metrics such as CPU and Memory, and also custom metrics like gRPC requests per second for specific services
Optimises for performance attributes including low latency, throughput or price