Java Structured Logging (JSON/logfmt + Correlation + Safety)

Scope

In scope

Out of scope

Every log record should be a single JSON object with stable keys.

•timestamp (ISO-8601)
•level (TRACE/DEBUG/INFO/WARN/ERROR)
•logger (class/category)
•thread
•message
•service.name
•service.version
•deployment.environment (dev/staging/prod)
•traceId (if tracing enabled)
•spanId (if tracing enabled)
•requestId (if available)
•correlationId (if you use a custom correlation id)
•http.method, http.path, http.status (for request logs)
•event.name (semantic event label; stable)
•error.type, error.message (safe), error.stack (internal-only, never to clients)
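
As a rough illustration of the record shape, the sketch below builds a record from a subset of the fields above and writes it as one single-line JSON object. The class name `LogRecordSketch`, its helpers, and the sample values are hypothetical; in a real service the JSON encoder of the logging framework would normally do this work.

```java
import java.time.Instant;
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper: shows the record shape only, not a production encoder.
class LogRecordSketch {

    // Escapes characters that must not appear raw inside a JSON string.
    static String escape(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"': sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) sb.append(String.format("\\u%04x", (int) c));
                    else sb.append(c);
            }
        }
        return sb.toString();
    }

    // Serializes string fields in insertion order, so key order stays stable.
    static String toJson(Map<String, String> fields) {
        StringBuilder sb = new StringBuilder("{");
        boolean first = true;
        for (Map.Entry<String, String> e : fields.entrySet()) {
            if (!first) sb.append(',');
            first = false;
            sb.append('"').append(escape(e.getKey())).append("\":\"")
              .append(escape(e.getValue())).append('"');
        }
        return sb.append('}').toString();
    }

    // Fills the fields every record carries; values marked "sample" are made up.
    static Map<String, String> baseRecord(String level, String logger, String message) {
        Map<String, String> r = new LinkedHashMap<>();
        r.put("timestamp", Instant.now().toString()); // ISO-8601 in UTC
        r.put("level", level);
        r.put("logger", logger);
        r.put("thread", Thread.currentThread().getName());
        r.put("message", message);
        r.put("service.name", "orders");            // sample
        r.put("deployment.environment", "dev");     // sample
        return r;
    }

    public static void main(String[] args) {
        Map<String, String> r = baseRecord("INFO", "com.example.OrderService", "order created");
        r.put("event.name", "order.created");
        System.out.println(toJson(r));
    }
}
```

The sketch only handles string values; numeric fields such as http.status would need their own serialization path.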

•Do not rename fields without a migration plan.
•Prefer adding new fields over changing the meaning of existing ones.
•Avoid high-cardinality fields unless explicitly needed:
  - Never log raw user identifiers; hash or redact them.
  - Never log full request/response bodies by default.
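
One possible way to hash a user identifier before logging it is a keyed hash, sketched below with HMAC-SHA-256 from the JDK. The class `UserIdHashing`, the method `hashUserId`, the 16-character truncation, and the way the key is supplied are all assumptions made for illustration; key storage and rotation are outside this sketch.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

// Hypothetical helper for logging a stable pseudonym instead of the raw id.
class UserIdHashing {

    // Returns the first 8 bytes of HMAC-SHA-256(userId) as 16 lowercase hex chars.
    static String hashUserId(String userId, byte[] secretKey) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(secretKey, "HmacSHA256"));
            byte[] digest = mac.doFinal(userId.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder();
            for (int i = 0; i < 8; i++) {
                hex.append(String.format("%02x", digest[i]));
            }
            return hex.toString();
        } catch (GeneralSecurityException e) {
            // HmacSHA256 is a standard JDK algorithm, so this is not expected.
            throw new IllegalStateException("HmacSHA256 unavailable", e);
        }
    }
}
```

The same input and key always give the same output, so records for one user can still be grouped without exposing the identifier. A keyed hash is used here rather than a plain digest because short identifiers are easy to brute-force when no secret is involved.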

•Prefer traceId/spanId (from tracing context).
•Also include a requestId (gateway id or generated at ingress).
•For async flows, propagate correlation through message headers:
  - include traceparent (W3C) and optional correlationId.
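
On the consuming side of an async flow, the traceparent header has to be parsed before its ids can be attached to log records. The sketch below handles only the version 00 layout of the W3C header (`00-<32 hex trace id>-<16 hex parent id>-<2 hex flags>`) and rejects all-zero ids. The class name `TraceParent` and its fields are hypothetical; a tracing SDK, where available, would normally do this parsing.

```java
import java.util.Optional;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical parser for the version 00 traceparent layout only.
class TraceParent {
    private static final Pattern FORMAT =
        Pattern.compile("^00-([0-9a-f]{32})-([0-9a-f]{16})-([0-9a-f]{2})$");

    final String traceId;
    final String parentSpanId;

    TraceParent(String traceId, String parentSpanId) {
        this.traceId = traceId;
        this.parentSpanId = parentSpanId;
    }

    // Returns empty for a missing, malformed, or all-zero header.
    static Optional<TraceParent> parse(String header) {
        if (header == null) return Optional.empty();
        Matcher m = FORMAT.matcher(header.trim());
        if (!m.matches()) return Optional.empty();
        String traceId = m.group(1);
        String parentSpanId = m.group(2);
        boolean zeroTrace = traceId.chars().allMatch(ch -> ch == '0');
        boolean zeroSpan = parentSpanId.chars().allMatch(ch -> ch == '0');
        if (zeroTrace || zeroSpan) return Optional.empty();
        return Optional.of(new TraceParent(traceId, parentSpanId));
    }
}
```

A consumer could call `TraceParent.parse(headerValue)` on each message and fall back to generating a fresh correlationId when the result is empty.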

•Use MDC (Mapped Diagnostic Context) to attach correlation fields per thread/request.
•At request entry:
  - extract incoming traceparent / requestId if present
  - otherwise generate requestId
  - set MDC keys: traceId, spanId, requestId
•Ensure MDC cleanup at request end.
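
The entry and cleanup steps above can be sketched as follows. To keep the example dependency-free, the nested `Mdc` class is a small stand-in for `org.slf4j.MDC` (which offers `put`, `get`, and `clear` with the same shapes); `RequestContextSketch`, the `x-request-id` header name, and the lowercase header map are assumptions for illustration.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.UUID;
import java.util.function.Supplier;

class RequestContextSketch {

    // Stand-in for org.slf4j.MDC so the sketch runs without dependencies.
    static final class Mdc {
        private static final ThreadLocal<Map<String, String>> CTX =
            ThreadLocal.withInitial(HashMap::new);
        static void put(String key, String value) { CTX.get().put(key, value); }
        static String get(String key) { return CTX.get().get(key); }
        static void clear() { CTX.get().clear(); }
    }

    // Sets correlation fields, runs the handler, and always clears the context.
    static <T> T handle(Map<String, String> headers, Supplier<T> handler) {
        String requestId = headers.get("x-request-id"); // header name is an assumption
        if (requestId == null || requestId.isEmpty()) {
            requestId = UUID.randomUUID().toString();   // generate when absent
        }
        Mdc.put("requestId", requestId);

        String traceparent = headers.get("traceparent");
        if (traceparent != null) {
            String[] parts = traceparent.split("-");
            if (parts.length == 4 && parts[1].length() == 32 && parts[2].length() == 16) {
                Mdc.put("traceId", parts[1]);
                // This is the caller's span id; a tracing SDK would start a new span.
                Mdc.put("spanId", parts[2]);
            }
        }
        try {
            return handler.get();
        } finally {
            // Pooled threads are reused, so stale ids must not leak into the next request.
            Mdc.clear();
        }
    }
}
```

Placing the cleanup in `finally` means the context is cleared even when the handler throws, which is the case that is easiest to miss.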

•MDC is thread-local by default. For executors:
  - wrap Runnable/Callable to copy MDC context
  - or use framework or OpenTelemetry context propagation when available.
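
A minimal sketch of the Runnable wrapper: the context is captured on the submitting thread and installed on the worker thread for the duration of the task. The nested `Mdc` class is again a stand-in that mirrors `getCopyOfContextMap` and `setContextMap` from `org.slf4j.MDC`; `MdcPropagationSketch` and `withContext` are hypothetical names.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;

class MdcPropagationSketch {

    // Stand-in for org.slf4j.MDC so the sketch runs without dependencies.
    static final class Mdc {
        private static final ThreadLocal<Map<String, String>> CTX =
            ThreadLocal.withInitial(HashMap::new);
        static void put(String key, String value) { CTX.get().put(key, value); }
        static String get(String key) { return CTX.get().get(key); }
        static void clear() { CTX.get().clear(); }
        static Map<String, String> getCopyOfContextMap() { return new HashMap<>(CTX.get()); }
        static void setContextMap(Map<String, String> m) { CTX.set(new HashMap<>(m)); }
    }

    // Captures the caller's context now; installs it when the task later runs.
    static Runnable withContext(Runnable task) {
        Map<String, String> captured = Mdc.getCopyOfContextMap();
        return () -> {
            Map<String, String> previous = Mdc.getCopyOfContextMap();
            Mdc.setContextMap(captured);
            try {
                task.run();
            } finally {
                Mdc.setContextMap(previous); // restore the worker thread's own context
            }
        };
    }

    public static void main(String[] args) throws Exception {
        ExecutorService pool = Executors.newSingleThreadExecutor();
        try {
            Mdc.put("requestId", "req-123");
            pool.submit(withContext(
                () -> System.out.println("worker sees requestId=" + Mdc.get("requestId")))).get();
        } finally {
            Mdc.clear();
            pool.shutdown();
        }
    }
}
```

With the real `org.slf4j.MDC`, `getCopyOfContextMap()` may return null when nothing has been set, so a wrapper built on it would need a null check and a call to `MDC.clear()` in that case.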

•INFO: business milestones (state transitions), start/stop, essential events.
•WARN: recoverable anomalies, retries, degraded mode.
•ERROR: failures requiring attention, request failed.
•DEBUG/TRACE: development-only, guarded by config; never enable globally in prod.

This creates noise and cost, and it hides signals.

•Log error class and sanitized message.
•Store full stack traces only in internal logs with restricted access, or sample them.
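
One way to produce the error.type and error.message fields is sketched below. The masking pattern, the 200-character limit, and the names `ErrorFieldsSketch`, `sanitize`, and `errorFields` are assumptions for illustration; real redaction rules would come from the project's own policy.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.regex.Pattern;

class ErrorFieldsSketch {
    private static final int MAX_LEN = 200; // arbitrary limit for this sketch

    // Hypothetical rule: mask the value of key=value pairs with a sensitive-looking key.
    private static final Pattern SECRET_KV =
        Pattern.compile("(?i)\\b(password|token|secret|authorization)=\\S+");

    // Masks secrets, flattens line breaks, and truncates long messages.
    static String sanitize(String message) {
        if (message == null) return "";
        String s = SECRET_KV.matcher(message).replaceAll("$1=[REDACTED]");
        s = s.replaceAll("[\\r\\n\\t]+", " "); // keep the record on one line
        return s.length() > MAX_LEN ? s.substring(0, MAX_LEN) + "..." : s;
    }

    // Builds the client-safe error fields; the stack trace is deliberately left out.
    static Map<String, String> errorFields(Throwable t) {
        Map<String, String> fields = new LinkedHashMap<>();
        fields.put("error.type", t.getClass().getName());
        fields.put("error.message", sanitize(t.getMessage()));
        return fields;
    }
}
```

For example, `sanitize("login failed password=hunter2\nretry")` yields `login failed password=[REDACTED] retry`.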

•For high-volume endpoints:
  - sample INFO request logs (e.g., 1%)
  - always keep WARN/ERROR
•Rate limit repetitive error logs to avoid log storms.
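
Both controls can be sketched in a few lines: a level-aware sampler and a fixed-window rate limiter keyed by, for example, event.name. The names `LogVolumeSketch`, `shouldLog`, and `RateLimiter` are hypothetical, and the fixed-window scheme is just one simple choice; the map of keys is never evicted here, which a production limiter would need to handle.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.concurrent.ThreadLocalRandom;

class LogVolumeSketch {

    // Always keeps WARN/ERROR; keeps other levels with probability sampleRate.
    static boolean shouldLog(String level, double sampleRate) {
        if ("WARN".equals(level) || "ERROR".equals(level)) return true;
        return ThreadLocalRandom.current().nextDouble() < sampleRate;
    }

    // Fixed-window limiter: at most maxPerWindow records per key per window.
    static final class RateLimiter {
        private final int maxPerWindow;
        private final long windowMillis;
        // key -> {windowStartMillis, countInWindow}
        private final Map<String, long[]> state = new HashMap<>();

        RateLimiter(int maxPerWindow, long windowMillis) {
            this.maxPerWindow = maxPerWindow;
            this.windowMillis = windowMillis;
        }

        synchronized boolean allow(String key, long nowMillis) {
            long[] s = state.get(key);
            if (s == null || nowMillis - s[0] >= windowMillis) {
                s = new long[] {nowMillis, 0}; // start a new window for this key
                state.put(key, s);
            }
            s[1]++;
            return s[1] <= maxPerWindow;
        }
    }
}
```

A caller might write `if (shouldLog(level, 0.01)) { ... }` for request logs and `if (limiter.allow("db.timeout", System.currentTimeMillis())) { ... }` around a repetitive error log.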

•docs/logging.md (schema + policies)
•logging/log-schema.json (optional JSON schema)
•logging/redaction-rules.md
•Code changes:
  - ingress filter/middleware for MDC
  - JSON encoder configuration
  - unit tests for redaction and MDC cleanup

•Symptom: cannot correlate logs with traces -> Fix: include traceId and propagate context; add MDC.
•Symptom: log storms during incidents -> Fix: add sampling + rate limiting; reduce noisy INFO logs.
•Symptom: leaked secrets -> Fix: redaction filters; code review checklist; scanners.