72.4 Structured Logging with structlog

Right, let’s talk about making your logs actually useful. You’ve probably been there: staring at a text file that looks like a frantic, unstructured diary entry written by a machine on three cups of espresso. Timestamp, log level, some vague message… good luck finding the one error in that mess. The default logging module is fine for telling you that something happened, but it’s terrible at telling you the story of why it happened. That’s where structlog comes in. It’s not just a library; it’s a philosophy for turning your logs from a liability into a debuggable, queryable asset.

Think of it this way: instead of writing a paragraph of text, you’re building a dictionary of context. Every log entry becomes a structured event, packed with key-value pairs that you can actually search, filter, and analyze later. This is a game-changer when you’re trying to trace a specific user’s journey through a complex system or figure out which specific database query is murdering your performance.

The Core Idea: Processors and Context

The genius—and, frankly, the initial head-scratcher—of structlog is its use of processors. These are little functions that your log data gets passed through on its way to its final destination (a file, the console, a network service). This assembly line approach is what gives you ultimate flexibility.

The other core concept is the context. This is a dictionary that structlog automatically carries around for the lifetime of your logger, attaching its contents to every single log call. It’s the perfect place to stash information that’s global to a request, like a user ID, a request ID, or a session token.

Let’s get a basic setup running. You’ll want to install it first (pip install structlog), and then here’s a configuration that gives you colorful, useful logs in development:

import structlog

# Configure structlog's settings
structlog.configure(
    processors=[
        # Add the log level and a timestamp to the event dictionary.
        structlog.processors.add_log_level,
        structlog.processors.TimeStamper(fmt="iso"),
        # Add callsite parameters (module, funcName, lineno) for debugging.
        structlog.processors.CallsiteParameterAdder(
            {
                structlog.processors.CallsiteParameter.FILENAME,
                structlog.processors.CallsiteParameter.FUNC_NAME,
                structlog.processors.CallsiteParameter.LINENO,
            }
        ),
        # Render the final event as a colored console message.
        structlog.dev.ConsoleRenderer(),
    ],
    wrapper_class=structlog.make_filtering_bound_logger(structlog.levels.INFO),
    context_class=dict,
    logger_factory=structlog.PrintLoggerFactory(),
)

# Get a logger. You'd typically do this once per module.
log = structlog.get_logger()

# And use it!
log.info("user_login_attempt", username="alice", user_ip="192.168.1.42")
log.error("database_connection_failed", retry_count=3, error="Connection timeout")

Run that. See the difference? It’s not just a string; it’s a structured event. You get color-coding by level, and all your key-value pairs (username, user_ip) are right there, easy to read. Now imagine grepping a massive log file for user_ip="192.168.1.42". You can actually do that.

Binding Context for Powerful Tracing

The real power unlocks when you start binding context. This is how you avoid repeating yourself in every log call. Let’s simulate a web request:

# At the start of a request, you get a user and a unique request ID.
def handle_request(request_id, user_id):
    # Create a logger with context bound to this specific request.
    request_log = log.bind(request_id=request_id, user_id=user_id)

    request_log.info("request_started", path="/api/v1/profile")
    # ... some application logic ...
    try:
        item = get_user_profile(user_id)
        request_log.info("profile_retrieved", item_id=item.id)
    except Exception as e:
        # This error log will automatically include request_id and user_id!
        request_log.error("operation_failed", error=str(e))
    request_log.info("request_completed")

# Later, in a deep, dark function far away...
def get_user_profile(user_id):
    # You get a logger here too. It will inherit the bound context!
    log.debug("fetching_user_from_db", user_id=user_id)
    # ... actual DB code ...
    return {"id": 42, "name": "Alice"}

Now every log message from within the context of handle_request carries the request_id and user_id. When your distributed system is having a meltdown, you can filter logs for a single request_id and see its entire path, which is worth its weight in gold. It turns a haystack into a single, very visible needle.

Production Configuration: JSON and Performance

The colorful console output is great for humans at a terminal, but it’s useless for machines. In production, you want JSON. Machines eat JSON for breakfast. Luckily, switching is trivial because of the processor pipeline. You just swap out the renderer.

structlog.configure(
    processors=[
        structlog.processors.add_log_level,
        structlog.processors.TimeStamper(fmt="iso"),
        structlog.processors.JSONRenderer()  # That's it. Changed one line.
    ],
    wrapper_class=structlog.make_filtering_bound_logger(structlog.levels.INFO),
    context_class=dict,
    logger_factory=structlog.WriteLoggerFactory(
        file=open("/var/log/myapp.json", "a")
    ),
)

Now your logs are written as JSON objects, ready to be sucked into tools like Elasticsearch, Logstash, Kibana (ELK), Grafana Loki, or any other log aggregator that can make sense of them.

A quick pro-tip on performance: binding context is cheap, but if you’re in a super hot path and doing log.debug("thing", data=massive_dict), you might pay the cost of building that dictionary even if debug logging is disabled. structlog is pretty smart, but it’s not clairvoyant. To avoid this, use the log.debug("thing", **expensive()) pattern sparingly. A better way is to use lazy evaluation:

# Instead of this (evaluated immediately):
log.debug("data_load", data=expensive_serialization_function())

# Do this (only evaluated if debug logging is on):
log.debug("data_load", data=structlog.processors.ExceptionPrettyPrinter())

This ensures the expensive function is only called if the log message is actually going to be output. It’s a small thing, but in high-throughput applications, these small things matter.