Translator Run Audit (serverless-translator)

Scope

Use this skill only for:

Do not use for other repositories.

For each translation execution:

•Step Functions total duration + key task durations (SplitEpub, TranslateWorkspace, AggregateEpub)
•Total Gemini API request count
•Per-request details: batch_size, attempt, split_depth, latency_ms, outcome
•Failure chain summary: failed requests, retries, split-to-success path
•Effective runtime knobs: target_input_tokens, concurrency
•Persisted local audit files under docs/audit/ (one file per execution)

After each audit run, always write Markdown report files to:

If folder does not exist, create it first.

Use one file per execution, and include date + total time + file size in filename:

YYYY-MM-DD__<execution-name>__<total-seconds>s__<source-bytes>B.md

Example:

2026-02-10__s3-source-test-1770745084__118.97s__311926B.md

Notes:

•YYYY-MM-DD uses execution start date (local timezone in report body is allowed).
•total-seconds uses workflow total duration from Step Functions.
•source-bytes uses S3 source object size from s3://<source-bucket>/<source-key> via head-object.
•If source size cannot be resolved, use unknownB.

•
Resolve execution
- •If only name is provided, map to ARN via list-executions.
•
Pull Step Functions timing
- •describe-execution
- •get-execution-history (derive per-task durations)
•
Pull Translator Lambda logs
- •locate request stream via REPORT line in execution time window
- •parse request lines (batch/done/failed)
•
Summarize runtime knobs
- •from Gemini dynamic batching prepared (preferred)
- •fallback to Lambda env vars if needed
•Output fixed report format (below)

Prefer new format:

•Gemini dynamic batching prepared: segments=... batches=... target_input_tokens=... concurrency=...
•Gemini API request batch: batch_index=... request_no=... batch_size=... attempt=... split_depth=...
•Gemini API request done: batch_index=... request_no=... batch_size=... status=ok latency_ms=...
•Gemini API request failed: batch_index=... request_no=... batch_size=... attempt=... split_depth=... error=...

Backward compatibility:

•Gemini API request batch: request_no=... batch_size=... attempt=... split_depth=...
•Gemini API request done: request_no=... batch_size=... status=ok latency_ms=...
•Gemini API request failed: request_no=... batch_size=... attempt=... split_depth=... error=...

•
list failed requests with:
- •request seq, batch size, attempt, error
•
show recovery path:
- •retried attempts
- •split depth and final successful child batches

•
ordered list (by start time), each line contains:
- •seq, chapter (if available), batch_index, request_no
- •batch_size, attempt, split_depth
- •start/end, latency_ms, outcome, error

Before finalizing report, verify:

•request counts are internally consistent (batch == done + failed for terminal attempts)
•no mixed invocation pollution (filter by Lambda RequestId)
•if compare mode is used, both executions use same source key/provider/target when claiming perf delta

text

Use translator-run-audit for execution s3-source-test-1770745084

text

Use translator-run-audit for execution A and compare with execution B