AgentSkillsCN

readiness-probe-fail

在启动看似成功但健康检查却失败的情况下,解决就绪探针故障。

SKILL.md
--- frontmatter
name: readiness-probe-fail
description: Resolve readiness probe failures when startup appears successful but health checks fail.
triggers:
  - readiness_probe_fail
  - readiness
  - ready.flag
  - probe fail

Readiness Probe Recovery

Execution Modes

  • Manual runbook: follow the shell steps below.
  • Executable helper path: run the skill code directly.

Skill helper files:

  • diagnose.py
  • remediate.py
  • skill.py (exports diagnose and remediate)

Examples:

  • TARGET_URL=http://127.0.0.1:15000 TARGET_CONTAINER=openhands-gepa-demo python .agents/skills/readiness-probe-fail/diagnose.py
  • TARGET_URL=http://127.0.0.1:15000 TARGET_CONTAINER=openhands-gepa-demo python .agents/skills/readiness-probe-fail/remediate.py
  • cd .agents/skills/readiness-probe-fail && python -c "from skill import diagnose, remediate; print(diagnose()); print(remediate())"

Diagnostic Workflow

  1. Verify failure with curl -i http://127.0.0.1:15000
  2. Check readiness artifacts (e.g., /tmp/ready.flag)
  3. Confirm readiness conditions in service code

Remediation Workflow

  1. Restore required readiness artifact/state
  2. Restart only if readiness is initialized at boot

Verification

  1. curl -i http://127.0.0.1:15000
  2. Confirm HTTP 200 and healthy readiness signal