9.6 Autoencoders for Dimensionality Reduction

Right, so you’ve slogged through PCA, marveled at the weird, clumpy art of t-SNE, and felt the clean, topological embrace of UMAP. They’re all fantastic, but they share a common trait: they’re projection methods. They take your high-dimensional data and squash it down onto a lower-dimensional plane for you to look at. Useful, but a bit like a tourist taking a photo of a city—you get a nice 2D view, but you can’t really go and build a new building there.

Enter the autoencoder. This is a different beast entirely. It’s not a projection; it’s a learned compression algorithm. Think of it as teaching a neural network to be a master summarizer of your data. We force it to squeeze the data through a narrow bottleneck (our low-dimensional latent space) and then reconstruct the original data from that summary as best it can. The magic isn’t in the reconstruction itself—we usually throw that away—it’s in that tight bottleneck layer. That’s our new, reduced-dimensionality representation.

The Architecture: An Identity Crisis with a Purpose

An autoencoder has two main parts, cleverly named: the encoder and the decoder. The encoder, f, takes your input x (e.g., a 100-dimensional vector) and maps it to a compressed latent representation z (e.g., 2 dimensions). The decoder, g, takes z and tries to spit out x', a reconstruction of the original x.

The entire network is trained to minimize the difference between x and x'—the reconstruction loss. We’re literally training it to copy the input to the output. This seems absurdly pointless until you remember the bottleneck. The network is forced to develop a smart, efficient way to represent the most important features of the data in z because that’s all the decoder has to work with. It has to learn what information is truly essential to reconstruct a plausible version of the input.

A Bare-Knuckled Keras Example

Let’s get concrete. Say we’re working with the MNIST dataset (28x28 images, so 784 dimensions) and we want to crush it down to a 2D latent space. Here’s how we’d build a simple autoencoder.

from tensorflow import keras
from tensorflow.keras import layers

# This is the size of our encoded representations (the bottleneck)
encoding_dim = 2

# Define the encoder
input_img = keras.Input(shape=(784,))
encoded = layers.Dense(128, activation='relu')(input_img)
encoded = layers.Dense(64, activation='relu')(encoded)
encoded = layers.Dense(encoding_dim, activation='relu')(encoded)  # This is our latent vector z

# Define the decoder
decoded = layers.Dense(64, activation='relu')(encoded)
decoded = layers.Dense(128, activation='relu')(decoded)
decoded = layers.Dense(784, activation='sigmoid')(decoded)  # Squash outputs to [0,1] like original images

# This model maps an input to its reconstruction
autoencoder = keras.Model(input_img, decoded)

# Let's also create a separate encoder model for later use
encoder = keras.Model(input_img, encoded)

Now, we compile and train it. We use 'binary_crossentropy' as our loss because our pixel values are between 0 and 1.

autoencoder.compile(optimizer='adam', loss='binary_crossentropy')

# Load MNIST, normalize the data, and flatten the images
(x_train, _), (x_test, _) = keras.datasets.mnist.load_data()
x_train = x_train.astype('float32') / 255.
x_test = x_test.astype('float32') / 255.
x_train = x_train.reshape((len(x_train), 784))
x_test = x_test.reshape((len(x_test), 784))

# Train the autoencoder. We're using the images as both input AND target.
history = autoencoder.fit(x_train, x_train,
                epochs=50,
                batch_size=256,
                shuffle=True,
                validation_data=(x_test, x_test))

After training, you can use that encoder model we made to transform your high-dimensional data into your slick new 2D space: encoded_imgs = encoder.predict(x_test).

Why This Works and Where It Gets Weird

The reason this is so powerful is its flexibility. Unlike PCA, which can only learn linear transformations, the non-linear activation functions (like relu) in the autoencoder allow it to learn highly non-linear manifolds. It can model much more complex shapes in your data.

But this flexibility is also its biggest pitfall. An autoencoder will learn whatever you teach it, including the noise. If you give it enough parameters and train it for too long, it will simply learn to perfectly memorize the input data by pushing values through the bottleneck without learning a meaningful representation. This is overfitting in its purest form. Your reconstruction loss will be near zero, but your latent space will be a garbled mess. You combat this by keeping the model small relative to your data, using regularization (like dropout or L2 weight regularization), and adding noise to the inputs (a variant called a Denoising Autoencoder).

Best Practices from the Trenches

Start Simple: Don’t immediately jump to a 10-layer deep monster. A shallow autoencoder like the one above is a great baseline and often all you need.
Tune the Bottleneck: The size of encoding_dim is your most important knob. Too small, and it can’t represent the data well (high loss). Too large, and it will start memorizing. Plot the reconstruction loss vs. latent dim size to find the elbow point.
Visualize, Visualize, Visualize: After you encode your data to 2D or 3D, plot it! Color the points by their class. See if the structure makes sense. This is the ultimate test.
Consider a Variational Approach (VAE): For a truly well-structured latent space where interpolation makes sense (e.g., smoothly morphing a ‘1’ into a ‘0’), you’ll want to look into Variational Autoencoders. They add a probabilistic twist, forcing the latent space to conform to a nice distribution (like a Gaussian), which is a whole other beautiful can of worms.

Autoencoders are workhorses. They’re not as flashy or plug-and-play as UMAP, but they give you a learned, parametric function for dimensionality reduction. You can take that trained encoder and use it to compress any new data point you get in the future, which is something most other methods can’t do. It’s not just a picture of the city; it’s the architect’s blueprint.