27.7 Structured Logging Best Practices for Kubernetes Apps

Look, let’s be honest: your application’s logs are a firehose of misery. In the old days, you’d SSH into a server, tail -f a file, and hope for the best. In Kubernetes, that single server is now a teeming swarm of ephemeral, constantly rescheduled pods. Your logs are scattered across nodes, stored in containers that vanish the moment you need them most. Grepping a thousand text files isn’t just impractical; it’s a form of career self-sabotage.

This is why we need to get smart. We need to stop writing logs for humans to read and start writing them for machines to parse. We need structured logging. This isn’t just a “best practice”; it’s your only hope of staying sane. The goal is to turn a chaotic stream of text into a queryable, filterable, actionable dataset.

The Absolute Basics: JSON, Not Poetry

Forget log lines that look like this: [ERROR] Something bad happened in module foo for user bar!

A machine looks at that and sees a string blob. It has no idea what “module foo” or “user bar” is. Now, look at this:

{
  "timestamp": "2023-10-27T14:23:01.123Z",
  "severity": "ERROR",
  "message": "Failed to process user request",
  "module": "user-auth",
  "user_id": "a1b2c3d4",
  "http.status_code": 500,
  "error": "Database connection timeout"
}

See the difference? Every piece of contextual information is a separate, queryable field. Your logging system (e.g., Loki, Elasticsearch) can now instantly show you all errors for user_id: 'a1b2c3d4' or all 500 errors from the user-auth module. This is a superpower.

Instrument Your Code Properly

This isn’t about wrapping print statements. You need a proper logging library. For Go, it’s slog (now in the standard library) or zap. For Python, it’s structlog. For Node.js, pino. These libraries are built for this.

Here’s a naive example in Go, followed by the right way:

// Don't do this. This is how you end up crying.
log.Printf("Error fetching pod %s in namespace %s: %v", podName, namespace, err)

// Do this. This is how you become the hero your cluster needs.
import "log/slog"

logger := slog.Default()
// ...
logger.Error("Error fetching pod",
    "pod", podName,
    "namespace", namespace,
    "error", err,
)

The slog library automatically handles structuring your key-value pairs. The output, when formatted as JSON, is exactly what we want.

Context, Context, Context

The single most important piece of advice I can give you is to add context early and often. A log event should tell a complete story. Don’t make your future self (or worse, your on-call colleague) play detective across a dozen different log lines.

Imagine a function that handles an HTTP request. Instead of this:

func HandleRequest(w http.ResponseWriter, r *http.Request) {
    logger.Info("Started request")
    userID := r.Header.Get("X-User-ID")
    // ... do work ...
    logger.Info("Finished request")
}

Do this. Attach context to the logger itself:

func HandleRequest(w http.ResponseWriter, r *http.Request) {
    userID := r.Header.Get("X-User-ID")
    // Create a child logger with request-scoped context
    requestLogger := logger.With(
        "http.method", r.Method,
        "http.path", r.URL.Path,
        "user_id", userID,
        "request_id", generateRequestID(),
    )
    
    requestLogger.Info("request_started")
    // ... do work ...
    requestLogger.Info("request_finished", "duration_ms", duration)
}

Now, every single log message emitted from this request, even those buried ten function calls deep (if you pass requestLogger down), will be stamped with the method, path, user, and a unique request ID. You can trace the entire lifecycle of a single request instantly. This is a game-changer.

The Devil’s in the Details (The Pitfalls)

Cardinality Explosion: This is the big one. Don’t use high-cardinality values as standalone labels or keys if you’re sending logs to a system like Loki or Grafana that indexes them. Using user_id="a1b2c3d4" as a label is fine if you have 100 users. It will melt your entire logging infrastructure if you have 10 million. High-cardinality data belongs in the log message itself, not as an indexed label. Use labels for low-cardinality, high-value fields like namespace, severity, app_name, http_method.
Sensitive Information: The log aggregator is now the easiest place to mine for secrets. Your structured logs will be ingested and stored by a third-party system. You must have a process to scrub them. Never log passwords, API keys, or tokens. Be cautious with PII like emails and addresses. Use your logging library’s hooks to filter or mask these values before they ever leave your application.
Standardize Your Schema: Decide on a common set of field names across all your services. Will you use user_id or userId? http_status or statusCode? Pick one and stick to it. This consistency is what turns a bunch of individual app logs into a cohesive, company-wide signal. Create a small shared library or document to enforce this.
Watch Your Output: In development, it’s nice to see pretty, colored, human-readable logs. In production, you must output in JSON format. This is usually configured via an environment variable (e.g., LOG_FORMAT=json). This ensures your production logs are consistently structured for your machines to parse, while your local terminal remains readable for you.