Amazon EC2 Auto Scaling (original) (raw)

Last Updated : 9 Jun, 2026

Amazon EC2 Auto Scaling is a service that helps you maintain application availability and allows you to automatically add or remove EC2 instances according to conditions you define. You can use the fleet management features of EC2 Auto Scaling to maintain the health and availability of your fleet.

Instead of guessing how many servers you need (Provisioning for peak load and wasting money during low traffic periods.), Auto Scaling ensures you have exactly the right amount of compute power right when you need it.

containerized_applications

Scaling Amazon EC2 means automatically increasing or decreasing EC2 instances based on application demand. It helps maintain performance, ensures enough computing power for users, and reduces cost by using only the required resources.

Core Components of Auto Scaling

To configure Auto Scaling, you need to define three main components:

1. Launch Template

2. Auto Scaling Group (ASG)

An Auto Scaling Group creates and manages a logical group of EC2 instances. It defines the VPC and subnets where the instances will launch.

It also manages capacity limits

desired_capacity

3. Scaling Policies

Scaling Polices decide when EC2 instances should be added or removed.

EC2-Auto-Scaling-2

Capacity-Day of the Week Graph

Features of AWS Auto Scaling

Here are the some most important features of AWS Auto scaling

Types of AWS (Amazon Web Services) Autoscaling

AWS offers several ways to scale your infrastructure:

1. Predictive Scaling

2. Scheduled Scaling

3. Target Tracking Scaling

4. Reactive Scaling

5. Vertical Scaling

6. Horizontal Scaling

types_of_auto_scaling_aws

Advanced Features

Mixed Instances Policy (Cost Optimization)

Mixed Instances Policy helps optimize cost and improve availability by using different EC2 instance types and pricing models.

Health Checks

Health Checks monitor instance and application health to maintain reliability and availability.

Lifecycle Hooks

Lifecycle Hooks allow custom actions during instance launch or termination.

Use Case

Working of AWS Auto Scaling

To know how to create autoscaling refer to Create and Configure the Auto Scaling Group in EC2.

Amazon EC2 Auto Scaling Instance Lifecycle

Every EC2 instance within an auto scaling group follows a distinct lifecycle. This lifecycle begins when the instance is launched and concludes with its termination. Below is an illustration of the various stages an instance goes through during its lifecycle

Amazon-EC2-Auto-Scaling-Instance-Lifecycle

Scaling Instance Lifecycle

Pricing for Amazon EC2 Auto Scaling

Amazon EC2 Auto Scaling does not have any additional service cost. You only pay for the AWS resources used, such as EC2 instances, Load Balancers, and CloudWatch monitoring.

Pricing Component Cost
Auto Scaling Service No additional charge for Auto Scaling
Amazon EC2 Instances Charged based on instance type and region
Amazon EC2 On-Demand Instances Starts around $0.0042 per hour
Amazon EC2 Reserved Instances Up to 72% lower cost than On-Demand
Amazon EC2 Spot Instances Up to 90% lower cost than On-Demand
Amazon EC2 Elastic Load Balancing Charged per hour and data processed
Amazon CloudWatch (Monitoring) Basic monitoring free, detailed monitoring charged separately.
Data Transfer Incoming data free, outgoing internet traffic charged
Elastic IP Addresses First Elastic IP free with running instance

Scaling Plan