32.2 Envelope Encryption: Encrypting Data Keys with a CMK

Alright, let’s talk about envelope encryption. It sounds fancy, but the concept is brilliantly simple and solves a massive problem: performance. Imagine you have a 500GB database backup file. Encrypting that entire thing by making a call over the network to KMS for every block of data would be painfully, unusably slow. We’re talking minutes or hours, not milliseconds.

So, we cheat. Wisely. Here’s the gambit: we use a super-fast encryption algorithm (like AES-256) to encrypt your data locally. But what do we use for the key for that algorithm? We can’t just hardcode a key in our source code; that’s like locking a vault and then taping the combination to the door. This is where KMS waltzes in. We generate a unique, high-quality data key locally, use that to encrypt our massive file, and then we immediately turn around and encrypt that data key with a Customer Master Key (CMK) from KMS. We then store the now-encrypted data key right alongside our encrypted data.

The unencrypted data key? We annihilate it from memory as soon as we’re done. It exists for mere milliseconds. The only way to get it back is to call KMS with the encrypted copy and our CMK to decrypt it. We’ve effectively “sealed” our high-performance data key inside a “envelope” that only KMS can open. Hence the name.

The Two-Step Dance: GenerateDataKey

This is the core operation. You don’t just generate a plaintext key yourself and then call encrypt. You ask KMS to generate a new data key and immediately encrypt it under your specified CMK, all in one atomic operation. It returns both the plaintext and ciphertext versions of the data key. This is crucial for security—the plaintext key never exists on AWS’s infrastructure, only in memory on your client machine in the response.

Let’s see it in action with the AWS CLI. Notice the --key-spec option specifying the length.

# This command does the magic two-step
aws kms generate-data-key \
    --key-id alias/my-app-key \
    --key-spec AES_256 \
    --output json \
    --profile my-profile

# The response will look like this:
{
    "CiphertextBlob": "AQEDAHhJb4...aLongBase64String...FwAGg5Example=",
    "Plaintext": "aLongBase64String...Represents16Bytes...ForAES256=",
    "KeyId": "arn:aws:kms:us-east-1:123456789012:key/abcd1234-56ab-7890-cdef-1234567890ab"
}

Your job, in your application code, is to:

Use the Plaintext to encrypt your data (right now, in memory).
Store the CiphertextBlob somewhere (S3 metadata, a database column next to the encrypted data, etc.).
Immediately obliterate the Plaintext value from your application’s memory. Seriously. Null it out. Don’t let it get swapped to disk. Its purpose has been served.

Why This is So Damn Secure

The beauty of this model is that your most sensitive secret—the plaintext data key—has an incredibly short lifespan and is never persisted anywhere. The encrypted data (your database backup) and the encrypted key (the CiphertextBlob) can be stored together on a less-trusted system, like S3. Without access to the CMK in KMS, that CiphertextBlob is just a useless pile of random-looking bytes. An attacker could steal your entire S3 bucket and get nowhere. The security is centralized on the KMS service and the access controls (IAM policies, key policies) you’ve defined on your CMK.

The Decryption Reverse

To get your data back, you simply reverse the process. You grab the CiphertextBlob from wherever you stored it, and you ask KMS to decrypt it.

aws kms decrypt \
    --ciphertext-blob fileb://<(echo "AQEDAHhJb4...aLongBase64String...FwAGg5Example=" | base64 -D) \
    --output text \
    --query Plaintext \
    --profile my-profile | base64 -D

Heads up: The decrypt command doesn’t require you to specify the KeyId. This is a feature, not an oversight. The CiphertextBlob isn’t just encrypted data; it contains metadata that tells KMS which CMK was used to encrypt it. KMS then checks if you have permission to use that key for decryption. This is fantastic because it means your application code doesn’t need to track key ARNs everywhere.

Common Pitfalls and How to Avoid Them

Storing the Plaintext Key: I’m saying it again because it’s the number one rookie mistake. If you log, persist, or otherwise fail to securely erase the Plaintext response from GenerateDataKey, you’ve completely defeated the purpose of envelope encryption. Your data is now only as secure as wherever you left that plaintext key.
Assuming Decrypt Implies Allow: Just because you can call decrypt on a CiphertextBlob doesn’t mean you’ll get the plaintext back. The operation will only succeed if your IAM principal has the kms:Decrypt permission granted on the key policy of the CMK that was used. Permission on the principal isn’t enough; the key’s policy must allow it too. This dual-control system is powerful but trips people up.
Ignoring Encryption Context: This is a advanced, but critical, feature. When calling GenerateDataKey or Encrypt, you can provide an additional key-value string map called an encryption context. You must provide the exact same context when decrypting. KMS cryptographically binds this context to the ciphertext. It’s a brilliant data integrity check and a powerful tool for authorization (e.g., using "context": {"service": "prod-payment-db"} in the key policy to restrict decryption to only services that can provide that context). Start using it early.