Investigate Alert

Quick investigation of a firing Prometheus alert. For deep investigation, use debug_prod directly.

Kubeconfig Rules

NEVER copy kubeconfig files. Use the correct file per environment:

File	Environment
`~/.kube/config.s`	Stage
`~/.kube/config.p`	Production

Use --kubeconfig=~/.kube/config.s or config.p with kubectl/oc.

Input	Type	Default	Purpose
`environment`	string	required	`stage`, `production`, or `prod`
`namespace`	string	main	`main` or `billing`
`alert_name`	string	-	Specific alert to investigate
`auto_escalate`	bool	true	Auto-run debug_prod if critical

Load incident persona (prometheus, alertmanager, kibana, k8s tools).

From config.json → namespaces:

•k8s_namespace_health(namespace, environment) or kubectl_get_pods(namespace, environment)
•kubectl_top_pods(namespace, environment) — CPU/memory usage
•kubectl_get_events(namespace, environment, field_selector="type=Warning")
•prometheus_query_range — error rate trend over last hour

•kibana_search_logs(query="error OR exception OR critical", namespace, environment, limit=10)
•kibana_get_errors(namespace, environment)

•memory_read("learned/patterns") — match alerts/pod issues against error_patterns

•code_search(query=alert_name, project="automation-analytics-backend", limit=5)
•knowledge_query(project="automation-analytics-backend", persona="devops", section="gotchas")

Report with: alerts count, pod health, resource usage, events, Kibana errors, pattern matches, silence recommendation, escalation status.