Instrument Backend Observability

Name: instrument-backend-observability
Rating: 92
Author: willyu1007

Purpose

Make backend services diagnosable in production by standardizing logs, error tracking, metrics, and tracing.

Use this skill when you are:

• Unknown errors MUST be captured by an error tracker (or equivalent) with context.
• Logs MUST be structured and SHOULD include a correlation/request ID.
• Sensitive data MUST NOT be logged (tokens, passwords, secrets, raw PII beyond what is required).
• Observability MUST NOT change business behavior (instrumentation should be side-effect free).
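
The logging rules above can be sketched with the Python standard library. This is a minimal illustration, not the skill's own template: the names `correlation_id`, `SENSITIVE_KEYS`, `redact`, and `JsonFormatter`, and the convention of passing extra fields under a `fields` key, are all assumptions made for this example.

```python
import contextvars
import json
import logging

# Hypothetical names for illustration; adapt to your service's conventions.
correlation_id = contextvars.ContextVar("correlation_id", default="-")
SENSITIVE_KEYS = {"password", "token", "secret", "authorization"}


def redact(fields: dict) -> dict:
    """Replace values of sensitive keys so they never reach the log output."""
    return {
        key: "[REDACTED]" if key.lower() in SENSITIVE_KEYS else value
        for key, value in fields.items()
    }


class JsonFormatter(logging.Formatter):
    """Emit one JSON object per log record, including the correlation ID."""

    def format(self, record: logging.LogRecord) -> str:
        extra_fields = getattr(record, "fields", {})
        payload = {
            **redact(extra_fields),
            # Core keys are written last so extra fields cannot overwrite them.
            "level": record.levelname,
            "message": record.getMessage(),
            "correlation_id": correlation_id.get(),
        }
        return json.dumps(payload, default=str)
```

A handler would use it via `handler.setFormatter(JsonFormatter())`, and call sites would log with `logger.info("login attempt", extra={"fields": {"user_id": "u1"}})`. Redaction by key name is only a first line of defense; it does not catch secrets embedded in free-text messages.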

• Errors
  - rate of 5xx
  - rate of domain-specific 4xx (for detecting client issues or abuse)
• Latency
  - p50/p95/p99 per endpoint
• Saturation
  - CPU, memory, DB connection pool utilization
• Traffic
  - request volume per endpoint
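
To make the error, latency, and traffic signals concrete, the sketch below derives them per endpoint from a batch of request samples. In production these numbers normally come from a metrics backend rather than in-process lists; the `Sample` type and `summarize` function are hypothetical names used only for this example, and saturation (CPU, memory, pool utilization) is not covered because it comes from host or driver metrics.

```python
from dataclasses import dataclass
from statistics import quantiles


@dataclass
class Sample:
    """One completed request (hypothetical shape for this example)."""
    endpoint: str
    status: int
    duration_ms: float


def summarize(samples: list[Sample]) -> dict[str, dict[str, float]]:
    """Compute traffic, error rates, and latency percentiles per endpoint."""
    by_endpoint: dict[str, list[Sample]] = {}
    for sample in samples:
        by_endpoint.setdefault(sample.endpoint, []).append(sample)

    summary = {}
    for endpoint, group in by_endpoint.items():
        durations = [s.duration_ms for s in group]
        if len(durations) >= 2:
            # n=100 yields 99 cut points; index 49 is p50, 94 is p95, 98 is p99.
            cuts = quantiles(durations, n=100, method="inclusive")
            p50, p95, p99 = cuts[49], cuts[94], cuts[98]
        else:
            # quantiles() needs at least two data points.
            p50 = p95 = p99 = durations[0]
        total = len(group)
        summary[endpoint] = {
            "requests": total,
            "rate_5xx": sum(s.status >= 500 for s in group) / total,
            "rate_4xx": sum(400 <= s.status < 500 for s in group) / total,
            "p50_ms": p50,
            "p95_ms": p95,
            "p99_ms": p99,
        }
    return summary
```

Keeping the percentiles per endpoint, rather than service-wide, matters because a slow low-traffic endpoint is otherwise hidden by fast high-traffic ones.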

• Ensure a request/correlation ID exists for every request.
• Add structured logs at key boundaries:
  - request start/end (method, path, status, duration)
  - key domain actions (entity IDs, operation names)
• Capture exceptions with context:
  - endpoint name
  - user/tenant identifiers (redacted as needed)
  - correlation ID
• Add metrics for:
  - request duration
  - error counts
• Define alerts for:
  - sustained 5xx rate
  - sustained latency regression
• Verify by simulating:
  - a known operational error
  - an unknown exception
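
The steps above can be sketched as a framework-agnostic wrapper around a request handler. Everything named here is an assumption for illustration: `OperationalError` stands for a known, expected failure, `capture_exception` and the `captured` list stand in for a real error tracker client, and `X-Request-ID` is one common header convention for correlation IDs rather than something this skill prescribes.

```python
import logging
import time
import uuid

logger = logging.getLogger("app.request")


class OperationalError(Exception):
    """Hypothetical known failure (validation, not found) with an HTTP status."""

    def __init__(self, message: str, status: int = 400):
        super().__init__(message)
        self.status = status


captured: list[dict] = []  # stand-in for an error tracker's event store


def capture_exception(exc: Exception, context: dict) -> None:
    """Stand-in for an error tracker call; records the exception with context."""
    captured.append({"type": type(exc).__name__, **context})


def observe(endpoint: str, handler):
    """Wrap a handler with a correlation ID, boundary logs, and error capture."""

    def wrapped(method: str, path: str, headers: dict):
        # Reuse the caller's ID when present so traces join across services.
        cid = headers.get("X-Request-ID") or str(uuid.uuid4())
        context = {"endpoint": endpoint, "correlation_id": cid}
        start = time.perf_counter()
        logger.info("request.start",
                    extra={"fields": {**context, "method": method, "path": path}})
        try:
            status, body = handler(method, path, headers)
        except OperationalError as exc:
            # Known error: return its status, no error tracker event needed.
            status, body = exc.status, str(exc)
        except Exception as exc:
            # Unknown error: capture with context, hide details from the client.
            capture_exception(exc, context)
            status, body = 500, "internal error"
        duration_ms = (time.perf_counter() - start) * 1000
        logger.info("request.end",
                    extra={"fields": {**context, "method": method, "path": path,
                                      "status": status,
                                      "duration_ms": round(duration_ms, 2)}})
        return status, body, cid

    return wrapped
```

Verification then amounts to wrapping one handler that raises `OperationalError` and one that raises an arbitrary exception, and checking that only the second produces a captured event and a 500 response. The `duration_ms` and `status` fields logged at request end are also the natural inputs for the duration and error-count metrics.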

• Templates: ./templates/ includes recommended log fields and exception capture patterns.
• Examples: ./examples/ includes incident triage checklists.