Unit 4 - Notes

INT363 6 min read

Unit 4: Economic Benefits of Microservices

1. Scalability Advantages and Cost Implications

One of the primary economic drivers for adopting microservices architecture is the shift from the coarse-grained scaling of monolithic applications to the fine-grained, granular scaling of distributed services.

1.1 Granular Scaling vs. Monolithic Scaling

In a monolithic architecture, if a specific module (e.g., the "Checkout" service) experiences high traffic, the entire application stack must be duplicated to handle the load. This leads to Resource Wastage.

Monolith Scenario: A 10GB RAM server is required to run the app. If load increases, another 10GB server is provisioned, even if the bottlenecked component only requires 500MB.
Microservices Scenario: Each service runs in its own container with defined resource limits. If the "Checkout" service is under load, the orchestrator (e.g., Kubernetes) scales only the "Checkout" containers.
- Economic Benefit: You pay only for the compute resources required to handle the specific load, dramatically reducing over-provisioning.

1.2 Elasticity and Auto-scaling

Microservices are typically stateless and containerized, making them ideal for rapid elasticity.

Scale Out (Horizontal): Adding more instances of a service during peak hours.
Scale In: Removing instances when demand subsides.
Cost Implication: This aligns infrastructure costs directly with revenue/usage curves (Pay-as-you-go). You do not pay for idle capacity during off-peak hours.

1.3 The "Hidden" Costs of Microservices

While resource utilization improves, operational complexity can increase costs if not managed:

Orchestration Overhead: Running the control plane (e.g., Kubernetes master nodes) costs money.
Inter-service Communication: High volume of network traffic between services can lead to increased latency and data transfer costs.
Observability Costs: Storing logs, metrics, and traces for dozens of services increases storage and monitoring tool costs compared to a single monolith log stream.

2. Pricing Models of Major Cloud Providers

Cloud providers offer various pricing strategies that microservices architectures leverage to optimize spending.

2.1 Amazon Web Services (AWS)

On-Demand: Pay for compute capacity by the second/hour with no long-term commitment. Best for spiky, unpredictable microservice workloads.
Savings Plans: A flexible pricing model offering lower prices (up to 72% savings) in exchange for a specific usage commitment (e.g., $10/hour) for a 1 or 3-year term. Applies to EC2, Fargate (Serverless containers), and Lambda.
Reserved Instances (RIs): Significant discount compared to On-Demand. Requires reserving capacity for 1 or 3 years.
Spot Instances: Spare compute capacity available at up to 90% discount.
- Microservices Use Case: Ideal for stateless, fault-tolerant microservices that can handle interruptions.

2.2 Microsoft Azure

Pay-as-you-go: No upfront commitment; cancel anytime.
Azure Reserved Virtual Machine Instances: Up to 72% savings compared to pay-as-you-go prices with 1-year or 3-year terms.
Azure Hybrid Benefit: Allows utilization of existing on-premises Windows Server and SQL Server licenses on the cloud to save costs.
Spot VMs: access unused Azure compute capacity at deep discounts.

2.3 Google Cloud Platform (GCP)

Free Tier: Specific limits on resources (e.g., Cloud Functions, Compute Engine) available for free.
Committed Use Discounts (CUDs): 1-year or 3-year commitment for a specific amount of vCPUs and memory.
Sustained Use Discounts: Automatic discounts applied when specific compute resources are used for a significant portion of the billing month (unique to GCP; no upfront commitment required).
Preemptible VMs / Spot VMs: Short-lived compute instances suitable for batch jobs and fault-tolerant microservices, up to 80% cheaper.

3. Cost Components: Compute, Storage, Network, and Transfer

Microservices decouple these components, allowing for independent optimization, but also introducing fragmented costs.

3.1 Compute Costs

Compute is usually the largest portion of the bill.

Virtual Machines (VMs): Charged by vCPU/RAM per second or hour. The base layer for Kubernetes nodes.
Container Orchestration (Managed): EKS (AWS), AKS (Azure), and GKE (Google) often charge a flat fee per hour for the control plane management, plus the cost of the worker nodes.
Serverless (FaaS): AWS Lambda, Azure Functions, Google Cloud Functions.
- Pricing Model: Charged based on number of requests and duration of execution (GB-seconds).
- Economic Advantage: "Scale to Zero." If a microservice is not called, it costs $0.00.

3.2 Storage Costs

Microservices often use "Polyglot Persistence," meaning different services use different database technologies.

Object Storage (S3, Blob, Cloud Storage): Cheapest tier. Used for static assets, backups, and logs. Pricing based on storage volume (GB/month) and access frequency (API calls).
Block Storage (EBS, Managed Disks): Attached to VMs/Containers. Expensive. Costs accrue even if the VM is stopped but the disk exists.
Managed Databases (RDS, CosmosDB, Cloud SQL):
- Provisioned: Pay for a specific instance size regardless of usage.
- Serverless DBs: Pay for "Read/Write Units" or "Throughput." Cost-effective for microservices with sporadic data access.

3.3 Network and Data Transfer Costs

This is often the area of "Bill Shock" in microservices due to the chatty nature of the architecture.

Ingress (Inbound): Usually free across all major providers. Bringing data into the cloud costs nothing.
Egress (Outbound): Data leaving the cloud (to the internet) is expensive.
Inter-Availability Zone (AZ) Transfer:
- In a High Availability setup, microservices replicas span multiple AZs.
- Traffic between Service A (Zone 1) and Service B (Zone 2) usually incurs a per-GB data transfer cost.
NAT Gateways: Private subnets require NAT Gateways to access the internet (e.g., for pulling docker images). These incur hourly charges plus data processing charges.

Example of Cost Escalation:

Service A calls Service B 1,000 times per second. If they are in different regions, you pay cross-region data transfer rates. If they are in the same region but different zones, you pay inter-zone rates.

4. Cost Monitoring and Optimization Tools

Implementing FinOps (Financial Operations) is essential to manage distributed cloud costs.

4.1 Native Cloud Tools

AWS Cost Explorer & Trusted Advisor: Visualizes spending patterns, forecasts future costs, and identifies underutilized EC2 instances (e.g., "This instance has < 5% CPU utilization for 7 days").
Azure Cost Management + Billing: Offers budget alerts, cost analysis, and recommendations for buying reserved instances.
Google Cloud Billing Reports: customizable dashboards and BigQuery export for deep analysis of billing data.

4.2 Microservice-Specific Tools

Since multiple microservices might run on a single cluster, attributing cost to a specific service is difficult with standard cloud tools.

Kubecost: specifically designed for Kubernetes. It allocates costs to specific deployments, namespaces, or labels. It breaks down cost by:
- Cost per Microservice
- Cost per Namespace (e.g., Dev vs. Prod)
- Idle Cost (Wasted resources inside the cluster)
CloudHealth (VMware): A comprehensive multi-cloud management platform for policy-driven cost optimization.

4.3 Optimization Strategies

Rightsizing: Analyzing metrics to change instance types (e.g., moving from a t3.large to a t3.medium) to match actual performance needs.
Tagging and Allocation: Enforcing a strict tagging policy (e.g., CostCenter: Marketing, Service: Inventory) to ensure every dollar spent can be attributed to a specific business unit.
Auto-scaling Policies: Configuring aggressive scale-down policies for non-production environments (e.g., shutting down Dev environments at 7 PM and weekends).
Spot Instance Adherence: Architecting non-critical or stateless microservices to run specifically on Spot instances to leverage the 70-90% discount.

Unit 3

Unit 5