7.2 Creating an AMI from a Running Instance

Right, you’ve got an instance humming along perfectly. It’s configured just so, the application is purring, and you’ve finally vanquished that one weird permissions bug that only happened on a Tuesday. This is a beautiful, unique snowflake of a server, and you want to clone it. That’s what an AMI is for: a frozen snapshot of this exact moment in time, so you can launch a hundred more just like it, or keep it as a golden image for disaster recovery.

The process itself is deceptively simple. You click a button or run a command, AWS does its magic, and a little while later, you have a new AMI. But the devil, as always, is in the details. Let’s walk through it without shooting ourselves in the foot.

The “No Downtime” Myth and the API Reality

First, let’s kill a common misconception: creating an AMI from a running instance does not require you to stop it. AWS will do a live snapshot. This is fantastic for minimizing downtime, but you must understand what “live” means. It’s like taking a photo of a moving car. The overall structure is there, but if you were in the middle of writing a file to disk, that file might be corrupted in the image. The photo captures the car, but the driver might be mid-blink.

For a truly consistent filesystem, especially for stateful applications like databases, you should stop the instance first. It’s the equivalent of asking the car to stop and the driver to smile for the camera. But since we’re often dealing with stateless application servers, the live method is the standard. Just be aware of the risk.

Now, the programmatic way to do this is via the create-image CLI command. The web console is just a GUI wrapper for this.

aws ec2 create-image \
    --instance-id i-1234567890abcdef0 \
    --name "My-Perfect-WebServer-2023-10-27" \
    --description "Nginx configured, app deployed, ready for ASG" \
    --no-reboot  # This is the interesting flag.

Look at that --no-reboot flag. This is AWS asking, “How much do you care about filesystem consistency?” By default, if you omit this flag, AWS will attempt to shut down your instance gracefully, take the snapshot, and then restart it. This guarantees consistency but means a brief downtime.

If you include --no-reboot, it does the live snapshot. The instance never stops, but you accept the small risk of inconsistency. For most web servers, --no-reboot is fine. For your company’s main Oracle database? Probably not.

What Actually Gets Captured? (Spoiler: Not Everything)

This is where people get tripped up. The AMI creation process snapshots every EBS volume attached to the instance at that moment. This includes the root volume and any additional drives you’ve attached.

Crucially, it does NOT capture:

Instance-specific data: The AMI gets a new UUID for the machine ID. Your new instances won’t have the same hostname, SSH host keys, or /etc/machine-id as the original. This is good! You don’t want clones sharing SSH host keys.
Data on instance store volumes: If your instance uses ephemeral (instance store) disks, that data is gone. AMIs only capture EBS volumes. This is a feature, not a bug—instance store is meant for truly temporary data.
Your AWS credentials: I hope this is obvious, but please, for the love of all that is holy, never bake your IAM access keys or secret keys into an AMI. Use IAM roles attached to the instance. If you have a .aws/credentials file on that box, delete it before imaging.

The Waiting Game and the No-Touch Zone

You’ve run the command. It returns an AMI ID, say ami-0a1b2c3d4e5f60001. Do not, I repeat, DO NOT immediately start terminating your old instance or launching 50 new ones from this AMI.

The AMI creation process has two distinct phases:

Creating the AMI metadata: This is fast. It’s just an entry in a database.
Creating the underlying EBS snapshots: This can take several minutes to hours, depending on the size of your volumes and how much data has changed since the last snapshot. The AMI status will be pending.

If you try to launch an instance from a pending AMI, it will fail. You must wait until its status changes to available. You can check this with:

aws ec2 describe-images --image-ids ami-0a1b2c3d4e5f60001

Look for "State": "available" in the output. Only then is your image ready for prime time.

Best Practices for a Sane Life

Simplify Before You Snap: Before creating the AMI, run a package update (apt-get upgrade or yum update), clean up temporary files, and remove any application-specific logs or state that shouldn’t be cloned. You want a clean, generic starting point.
Name with Purpose: Use a descriptive naming convention like app-role-date-version. MyServer is useless. wordpress-prod-20231027-v2 tells you everything. The description field is your friend; use it to note what’s installed and why this image exists.
Deregister the Old: AMIs cost money. The metadata entry costs a trivial amount, but the underlying EBS snapshots absolutely do. When you create a new golden image, make a script to deregister the old AMI and delete its associated snapshots. Hoarding old AMIs is a fantastic way to get a surprisingly large bill for storage you’d forgotten about.

# First, get the snapshot ID from the AMI description
aws ec2 describe-images --image-ids ami-old12345 --query 'Images[].BlockDeviceMappings[].Ebs.SnapshotId'

# Then, deregister the AMI
aws ec2 deregister-image --image-id ami-old12345

# Finally, delete the snapshot(s)
aws ec2 delete-snapshot --snapshot-id snap-abcdef12345